Ron's Projects
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
SQL-based streaming analytics platform at scale
Mirror of Apache Bahir Flink
Apache Beam is a unified programming model for Batch and Streaming
Mirror of Apache Calcite
Define and run multi-container applications with Docker
An implementation of differential dataflow using timely dataflow on Rust.
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
DuckDB is an in-process SQL OLAP Database Management System
Mirror of Apache Flink
Change Data Capture (CDC) Connectors for Apache Flink
Docker packaging for Apache Flink
Apache Flink Kubernetes Operator
flink learning blog. http://www.54tianzhisheng.cn/tags/Flink/
Remote Shuffle Service for Flink
Apache Flink Website
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join
基于flink的分布式同步工具
手把手撕LeetCode题目,扒各种算法套路的裤子。English version supported! Crack LeetCode, not only how, but also why.
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Apache Hadoop
Upserts, Deletes And Incremental Processing on Big Data.
Apache Iceberg
Apache IoTDB
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
极客时间专栏《Java业务开发常见错误100例》源码