Shuo Ouyang's Projects
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets
The Chinese version of numpy's official document
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
A lightweight parameter server interface
An Open Source Machine Learning Framework for Everyone
TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
Tutorial code on how to build your own Deep Learning System in 2k Lines
TensorRT Hackathon 2022 Final Competition