codingcat Goto Github PK
Name: Nan Zhu
Type: User
Company: Pinterest
Location: Seattle
Blog: http://codingcat.me/
Name: Nan Zhu
Type: User
Company: Pinterest
Location: Seattle
Blog: http://codingcat.me/
Achieving Real-time Data Analytics with Spark and EventHubs
REST job server for Spark
Spark SQL listener to record lineage information
Dashboard to aid in Spark pull request reviews
Spark SQL Macros provides a mechanism similar to Spark User-Defined function registration; with the key enhancement being that custom code gets compiled to equivalent Catalyst Expressions at macro define time.
Examples showing how streaming events can be persisted to Azure blob, Hive table and Azure SQL Table through Spark.
spark_based_bbnp
Distributed Neural Networks for Spark
some test code for spark development
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
Statistical Workload Injector for MapReduce - Project at UC Berkeley AMP Lab
Tensorflow wrapper for DataFrames on Apache Spark
Terraform module to provision a fully managed AWS EKS Node Group
test_timestamp
本项目是July的《程序员编程艺术》的电子书版本
TLAPlusCourseProject
Port of TPC-DS dsdgen to Java
TPC-DS benchmark kit with some modifications/additions
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Spark with minimal hand tuning
fast tree inference
bring deep learning workloads to bare metal
start learning typescript
A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
The repo to host all the web data including images for documents in dmlc projects.
Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.
a benchmark to test scalability of xgboost4j-spark and relevant projects
repo containing XGBoost-based ML project for various purposes
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.