Code Monkey home page Code Monkey logo

sqlflow_public's Introduction

SQLFlow Introduce

The SQLFlow is a tool helps you collect data lineage information by analying the SQL scripts in a governed data environment. It scans SQL code to understand all the logic and reverse engineer it, to discover the data flow/movement from its source to destination via various changes and hops on its way, to build an understanding of how data changes and which data serves as input for calculating other data.

The input of the SQLFlow is SQL statement or file that includes the SQL statement.

CREATE VIEW vsal 
AS 
  SELECT a.deptno                  "Department", 
         a.num_emp / b.total_count "Employees", 
         a.sal_sum / b.total_sal   "Salary" 
  FROM   (SELECT deptno, 
                 Count()  num_emp, 
                 SUM(sal) sal_sum 
          FROM   scott.emp 
          WHERE  city = 'NYC' 
          GROUP  BY deptno) a, 
         (SELECT Count()  total_count, 
                 SUM(sal) total_sal 
          FROM   scott.emp 
          WHERE  city = 'NYC') b 
;

The output is the metadata of the table/column representing the changes and hops during the transition of the data.

source_db source_schema source_table source_column target_db target_schema target_table target_column relation_type effectType
scott scott.emp sal vsal "Salary" fdd create_view
scott scott.emp deptno vsal "Department" fdd create_view
scott scott.emp PseudoRows vsal "Employees" frd create_view

Once the metadata of the data lineage is ready, SQLFlow presents a nice clean graph to you that tells where the data came from, what transformations it underwent along the way, and what other data items are derived from this data value.

SQLFlow Introduce

What SQLFlow can do for you

  • Provide a nice cleam diagram to the end-user to understand the data lineage quickly.
  • Incorporate the lineage metadata decoded from the complex SQL script into your own metadata database for further processing.
  • Visualize the metadata already existing in your database to release the power of data.
  • Perform impact analysis and root-cause analysis by tracing lineage backwards or forwards with several mouse click.
  • Able to process SQL script from more than 20 major database vendors.

How to use SQLFlow

  • Open the official website of the SQLFlow and paste your SQL script or metadata to get a nice clean lineage diagram.
  • Call the Restful API of the SQLFlow in your own code to get data lineage metadata decoded by the SQLFlow from the SQL script.
  • The on-premise version of SQLflow enables you to use it on your own server to keep the data safer.

The price plan

Restful APIs

SQLFlow architecture

User manual and FAQ

sqlflow_public's People

Contributors

cnfree avatar isfd avatar ktdynamic avatar lake2 avatar shenhuan2021 avatar sqlparser avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.