Cancai Cai's Projects
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache Arrow DataFusion SQL Query Engine
Official Rust implementation of Apache Arrow
Fast key-value DB in Go.
Apache Beam is a unified programming model for Batch and Streaming data processing.
Config files for my GitHub profile.
Apache Calcite
Apache Calcite Avatica
ClickHouse® is a free analytics DBMS for big data
CMU 15-445/645: Intro to Database Systems (Fall 2022). A course on the design and implementation of database management systems.
📊 Cube — The Semantic Layer for Building Data Applications
Implementation of Apache ORC file format use Apache Arrow in-memory format
阿里云数据库内核月报分类整理(定时更新)。
Readings in Databases
DuckDB is an in-process SQL OLAP Database Management System
egg is a flexible, high-performance e-graph library
Apache Flink
Apache Flink Kubernetes Operator
Hybrid in-memory and disk cache in Rust
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
Apache Hive
Scalable datastore for metrics, events, and real-time analytics
JVM readings
Mirror of Apache Kafka
Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
A tutorial of building an LSM-Tree storage engine in a week!
CMU-DB's Cascades optimizer framework