wutao0914
Type: User
webSpoon is a web-based graphical designer for Pentaho Data Integration with the same look & feel as Spoon
Collects data lineage through the Kettle plugin mechanism, capturing the SQL executed at runtime and related information
RoaringBitmap extension for PostgreSQL
xxhash functions for PostgreSQL
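A data source service built on a plugin architecture, with a unified interface for operating on different types of data sources
The presto hbase connector component is implemented against the Presto Connector interface specification and adds HBase querying to Presto. Compared with other open-source HBase connectors, ours is 10 to 100 times faster or more.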
Some useful user-defined functions (UDFs) for both PrestoSQL and TrinoDB
Demo of a Presto aggregation plugin
The official home of the Presto distributed SQL query engine for big data
Qualitis is a one-stop data quality management platform that supports quality verification, notification, and management for various data sources. It is used to solve data quality problems caused by data processing. https://github.com/WeBankFinTech/Qualitis
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
re_data - fix data issues before your users & CEO would discover them 😊
Redisson - Redis Java client with features of In-Memory Data Grid. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Publish / Subscribe, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, MyBatis, RPC, local cache ...
Open data platform based on Flink and Kubernetes. Supports web UI click-and-drop data integration with SeaTunnel on Flink, and manages Flink JAR jobs on both YARN and Kubernetes. Scaleph is now working on a Flink SQL online editor.
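This project runs clustering and association analysis on university students' campus-card spending data, library borrowing records, and library access-control data in a Spark cluster environment. The clustering produces results such as students' spending levels, daily routines, and study intensity; FPGrowth association analysis is then applied to those results to find associations between the student clusters. The project is written in Scala and uses Spark SQL together with Hive for the analysis.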
Distributed scheduled job framework
Microservice task scheduling framework
Public runnable examples of using John Snow Labs' OCR for Apache Spark.
Data Lineage Tracking And Visualization Solution
Document, sample code and other materials for SQLFlow
SQL Lineage Analysis Tool powered by Python
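Frontend for sqllineage
Big data platform: distributed task scheduling system
Uses a DAG to describe task dependencies, controlling the execution order of tasks on the xxl-job client side
Adds DAG-based task scheduling on top of xxl-job 2.3.0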
TiDB is an open source distributed HTAP database compatible with the MySQL protocol
Baidu Tieba crawler (based on Scrapy and MySQL)
Supports agile DataOps based on DataX and Flink-CDC, with a web UI
Trino UDFs Plugin to encrypt/decrypt values with a password
Forked from https://github.com/analysys/presto-hbase-connector