SQLFlow Introduction
SQLFlow is a tool that helps you collect data lineage information by analyzing the SQL scripts in a governed data environment. It scans SQL code, reverse engineers its logic, and discovers how data moves from its source to its destination through the changes and hops along the way. The result is an understanding of how data changes and which data serves as input for calculating other data.
The input to SQLFlow is a SQL statement, or a file that contains SQL statements.
CREATE VIEW vsal
AS
  SELECT a.deptno                  "Department",
         a.num_emp / b.total_count "Employees",
         a.sal_sum / b.total_sal   "Salary"
  FROM   (SELECT deptno,
                 Count(*)  num_emp,
                 SUM(sal)  sal_sum
          FROM   scott.emp
          WHERE  city = 'NYC'
          GROUP  BY deptno) a,
         (SELECT Count(*)  total_count,
                 SUM(sal)  total_sal
          FROM   scott.emp
          WHERE  city = 'NYC') b
;
The output is table-level and column-level metadata that describes each change and hop the data makes on its way from source to target.
| source_db | source_schema | source_table | source_column | target_db | target_schema | target_table | target_column | relation_type | effectType |
|---|---|---|---|---|---|---|---|---|---|
|  | scott | scott.emp | sal |  |  | vsal | "Salary" | fdd | create_view |
|  | scott | scott.emp | deptno |  |  | vsal | "Department" | fdd | create_view |
|  | scott | scott.emp | PseudoRows |  |  | vsal | "Employees" | frd | create_view |
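Lineage rows like these can be stored in an ordinary relational table and queried. The following is a minimal sketch using Python's built-in `sqlite3` module, not SQLFlow's own storage format. It assumes each row carries the ten columns of the header, with blank cells where no database or target schema is reported; the table name `lineage` and the helper names are hypothetical.

```python
import sqlite3

# Column names copied from the output header above.
COLUMNS = (
    "source_db", "source_schema", "source_table", "source_column",
    "target_db", "target_schema", "target_table", "target_column",
    "relation_type", "effectType",
)

# Example rows; empty strings stand for blank cells (an assumption of this sketch).
ROWS = [
    ("", "scott", "scott.emp", "sal", "", "", "vsal", '"Salary"', "fdd", "create_view"),
    ("", "scott", "scott.emp", "deptno", "", "", "vsal", '"Department"', "fdd", "create_view"),
    ("", "scott", "scott.emp", "PseudoRows", "", "", "vsal", '"Employees"', "frd", "create_view"),
]


def load_lineage(rows):
    """Create an in-memory table (hypothetical name: lineage) and insert the rows."""
    conn = sqlite3.connect(":memory:")
    column_defs = ", ".join(f"{name} TEXT" for name in COLUMNS)
    conn.execute(f"CREATE TABLE lineage ({column_defs})")
    placeholders = ", ".join("?" for _ in COLUMNS)
    conn.executemany(f"INSERT INTO lineage VALUES ({placeholders})", rows)
    conn.commit()
    return conn


def sources_of(conn, target_table, target_column):
    """Return (source_table, source_column, relation_type) for one target column."""
    cursor = conn.execute(
        "SELECT source_table, source_column, relation_type FROM lineage "
        "WHERE target_table = ? AND target_column = ? "
        "ORDER BY source_table, source_column",
        (target_table, target_column),
    )
    return cursor.fetchall()


if __name__ == "__main__":
    connection = load_lineage(ROWS)
    print(sources_of(connection, "vsal", '"Salary"'))
```

Keeping the rows in a table like this lets you join lineage against the rest of your metadata with plain SQL.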
Once the lineage metadata is ready, SQLFlow renders a clean graph that shows where the data came from, what transformations it underwent along the way, and which other data items are derived from it.
What SQLFlow can do for you
- Provide a clean diagram that lets end users understand the data lineage quickly.
- Incorporate the lineage metadata decoded from complex SQL scripts into your own metadata database for further processing.
- Visualize the metadata that already exists in your database to get more value from your data.
- Perform impact analysis and root-cause analysis by tracing lineage backwards or forwards with a few mouse clicks.
- Process SQL scripts from more than 20 major database vendors.
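Tracing lineage forwards (impact analysis) or backwards (root-cause analysis) amounts to walking a directed graph of source-to-target edges. The sketch below illustrates the idea with a breadth-first walk; it is not SQLFlow's implementation. The edge list is derived from the example view, and the `table.column` node naming and the function names are assumptions made for this illustration.

```python
from collections import defaultdict, deque

# (source, target) column pairs taken from the example lineage output.
EDGES = [
    ("scott.emp.sal", 'vsal."Salary"'),
    ("scott.emp.deptno", 'vsal."Department"'),
    ("scott.emp.PseudoRows", 'vsal."Employees"'),
]


def build_index(edges):
    """Return two adjacency maps: source -> targets and target -> sources."""
    downstream = defaultdict(set)
    upstream = defaultdict(set)
    for source, target in edges:
        downstream[source].add(target)
        upstream[target].add(source)
    return downstream, upstream


def trace(start, neighbours):
    """Breadth-first walk; return every node reachable from start, excluding start."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in neighbours.get(node, ()):
            if nxt != start and nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


if __name__ == "__main__":
    downstream, upstream = build_index(EDGES)
    # Impact analysis: what is affected if scott.emp.sal changes?
    print(trace("scott.emp.sal", downstream))
    # Root-cause analysis: where does vsal."Salary" come from?
    print(trace('vsal."Salary"', upstream))
```

The same `trace` function serves both directions; only the adjacency map passed in changes.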
How to use SQLFlow
- Open the official SQLFlow website and paste your SQL script or metadata to get a clean lineage diagram.
- Call the SQLFlow RESTful API from your own code to get the data lineage metadata that SQLFlow decodes from the SQL script.
- Install the on-premise version of SQLFlow on your own server to keep your data safer.
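Calling a REST API from your own code generally means sending the SQL text in an HTTP request and parsing the JSON response. The sketch below shows that shape with Python's standard library only. The URL and the payload field names (`sql`, `vendor`) are placeholders invented for this illustration, not SQLFlow's actual endpoint or parameters; take the real values from the SQLFlow API documentation.

```python
import json
import urllib.request

# Placeholder endpoint: NOT the real SQLFlow URL.
API_URL = "https://sqlflow.example.com/api/lineage"


def build_lineage_request(sql_text, db_vendor, api_url=API_URL):
    """Build a JSON POST request; the field names 'sql' and 'vendor' are hypothetical."""
    payload = json.dumps({"sql": sql_text, "vendor": db_vendor}).encode("utf-8")
    return urllib.request.Request(
        api_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_lineage(request, timeout=30):
    """Send the request and decode the JSON body of the response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    request = build_lineage_request("SELECT deptno FROM scott.emp", "oracle")
    # Only inspect the request here; call fetch_lineage(request) once a real endpoint is configured.
    print(request.get_method(), request.full_url)
```

Separating request construction from sending makes it easy to check the payload before any network call is made.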