Topic: data-processing Goto Github
Some thing interesting about data-processing
Some thing interesting about data-processing
data-processing,Data and tools for generating and inspecting OLMo pre-training data.
Organization: allenai
Home Page: https://allenai.github.io/dolma/
data-processing,Super fast list of dicts to pre-formatted tables conversion library for Python 2/3
Organization: alttch
data-processing,Apache Wayang(incubating) is the first cross-platform data processing system.
Organization: apache
Home Page: https://wayang.incubator.apache.org/
data-processing,Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
Organization: asyml
data-processing,Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Organization: asyml
Home Page: https://asyml.io
data-processing,Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Organization: asyml
Home Page: https://asyml.io
data-processing,Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
User: benibela
Home Page: http://www.videlibri.de/xidel.html
data-processing,Substation is a security analytics and data pipeline toolkit for the cloud (AWS) and more.
Organization: brexhq
Home Page: https://substation.readme.io
data-processing,Python Stream Processing
Organization: bytewax
Home Page: https://docs.bytewax.io/
data-processing,All-in-one text de-duplication
User: chenghaomou
data-processing,Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning
User: colasgael
data-processing,Harmonious distributed data analysis in Rust.
Organization: constellation-rs
Home Page: https://constellation.rs/amadeus
data-processing,Concurrent and multi-stage data ingestion and data processing with Elixir
Organization: dashbitco
Home Page: https://elixir-broadway.org
data-processing,PHP - ETL (Extract Transform Load) data processing library
Organization: flow-php
Home Page: https://flow-php.com
data-processing,Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Organization: googlecloudplatform
data-processing,Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Organization: googlecloudplatform
Home Page: http://cloud.google.com/dataflow
data-processing,Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
Organization: helmholtz-analytics
Home Page: https://heat.readthedocs.io/
data-processing,HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Organization: hstreamdb
Home Page: https://hstream.io/
data-processing,ιη¨δΊι«ζ§θ½η³»η»ηε€θΏη¨θ§£εηΌ©θ½―δ»Ά(A multiprocess decompression software for high-performance system)
User: hxz393
data-processing,A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
User: iam-mhaseeb
data-processing,A list about Apache Kafka
User: infoslack
data-processing,convtools is a python library to declaratively define conversions for processing collections, doing complex aggregations and joins.
Organization: itechart
Home Page: https://convtools.readthedocs.io
data-processing,Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
User: johnkerl
Home Page: https://miller.readthedocs.io
data-processing,πΎ~ music, eternal ~ πΎ
User: kousun12
Home Page: https://eternal.rob.computer
data-processing,Catmandu - a data processing toolkit
Organization: librecat
Home Page: https://librecat.org
data-processing,A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud βοΈπ
Organization: lithops-cloud
Home Page: http://lithops.cloud
data-processing,Clojure Command-line Data Processor for JSON, YAML, EDN, XML and more
User: markus-wa
data-processing,Python Adaptive Signal Processing
User: matousc89
data-processing,Machine Learning notebooks for refreshing concepts.
User: maykulkarni
data-processing,π¦Ύ Main repository for the Mech programming language. Start here!
Organization: mech-lang
Home Page: http://mech-lang.org
data-processing,Large-scale pretraining for dialogue
Organization: microsoft
data-processing,Large-scale pretrained models for goal-directed dialog
Organization: microsoft
Home Page: http://aka.ms/GODEL
data-processing,Production-ready data processing made easy and shareable
Organization: ml6team
Home Page: https://fondant.ai/en/stable/
data-processing,Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
User: msamogh
data-processing,Kubernetes-native platform to run massively parallel data/streaming jobs
Organization: numaproj
Home Page: https://numaflow.numaproj.io
data-processing,A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Organization: nvidia
Home Page: https://docs.nvidia.com/deeplearning/dali/user-guide/docs/index.html
data-processing,A collection of handy Bash One-Liners and terminal tricks for data processing and Linux system maintenance.
User: onceupon
Home Page: http://onceupon.github.io/Bash-Oneliner/
data-processing,Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Organization: polyaxon
data-processing,Extract Transform Load for Python 3.5+
Organization: python-bonobo
Home Page: https://www.bonobo-project.org/
data-processing,Manipulating VASP files with Python.
User: pytlab
Home Page: https://pypi.python.org/pypi/vaspy/
data-processing,Remote Sensing and GIS Software Library; python module tools for processing spatial data.
Organization: remotesensinginfo
Home Page: https://www.rsgislib.org
data-processing,Public tracker for Scramjet Cloud Platform, a platform that bring data from many environments together.
Organization: scramjetorg
Home Page: https://www.scramjet.org
data-processing,Advanced and Fast Data Transformation in R
User: sebkrantz
Home Page: https://sebkrantz.github.io/collapse/
data-processing,ESA Earth Observation Toolbox and Java Development Platform
Organization: senbox-org
Home Page: http://step.esa.int
data-processing,Elastic data processing with Apache Pulsar and Apache Flink
Organization: streamnative
data-processing,A pure Python implementation of Apache Spark's RDD and DStream interfaces.
User: svenkreiss
Home Page: https://pysparkling.readthedocs.io
data-processing,Collection of Data Processing Agreement (DPA) and GDPR compliance resources
Organization: tollwerk
Home Page: https://tollwerk.github.io/data-processing-agreements/
data-processing,Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
User: tomwright
Home Page: https://daseldocs.tomwright.me
data-processing,A light-weight, flexible, and expressive statistical data testing library
Organization: unionai-oss
Home Page: https://www.union.ai/pandera
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.