Code Monkey home page Code Monkey logo

flow's Introduction

img

Flow is a PHP based, strongly typed ETL (Extract Transform Load), asynchronous data processing library with constant memory consumption.

Latest Stable Version Latest Unstable Version License Test Suite

Supported PHP versions

  • Supported PHP Version

Installation

This package is a monorepo that should not be directly installed in your project. Please check below packages and select only those that you are going to use:

Contributing

In order to run tests locally please make sure you have docker up and running. You also need PHP 8.1 and composer to be available from your CLI.

For the code coverage, please install pcov.

Prepare Project:

cp docker-compose.yml.dist docker-compose.yml
composer install 
docker compose up -d

Run Test Suite

composer test

Run Static Analyze

composer static:analyze

Test everything

This command will execute exactly the same tests as we run at Github Actions before PR can get merged. If it passes locally, you are good to open pull request.

composer build 

Usage

In order to understand how Flow works, please read documentation

Features

  • low and constant memory consumption
  • asynchronous data processing
  • reading from any data source
  • writing to any data source
  • rich collection of data transformation functions
  • direct access to remote filesystems
  • partitioning
  • grouping & aggregating
  • remote files processing
  • joins
  • sorting
  • displaying datasets as ASCII table
  • validation against schema
  • caching

Asynchronous Processing

Building blocks

  • DataFrame - Lazy data processing frame.
  • Rows - Immutable colllection of Row objects.
  • Row - Immutable, strongly typed collection of Entry objects.
  • Entry - Immutable, strongly typed object representing cell in a row.
  • Extractor (Reader) - Memory safe, Data Source returning \Generator, yielding Rows to the Pipeline
  • Transformer - Data transformer receiving and returning Rows (in most cases transformer), one instance of Rows at once.
  • Loader (Writer) - Memory safe representation of Data Sink, responsibility of Loader is to write Rows into destination storage, one at time.
  • Pipeline - Interface representing ETL process, each received Rows instanced is pased through all Pipes, also responsible for error handling.
  • Pipe - Loader of Transformer instance existing in Pipes collection.

Supported PHP versions

  • 8.1 - โœ…

Sponsors

Flow PHP is sponsored by:

  • Blackfire - the best PHP profiling and monitoring tool!

flow's People

Contributors

dependabot[bot] avatar aeon-automation avatar norberttech avatar tomaszhanc avatar dawidsajdak avatar stloyd avatar github-actions[bot] avatar norbertmwk avatar peter279k avatar wirone avatar jpiatko avatar scyzoryck avatar mleczakm avatar drupol avatar szepeviktor avatar wiktor6 avatar owsiakl avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.