chizhang118
Name: Chi Zhang
Type: User
Company: Bytedance
Bio: CMU MCDS graduate focused on distributed systems. Ex-Googler, now working at Bytedance.
Location: San Jose
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
Web-based monitoring and management for Ceph
Ceph is a distributed object, block, and file storage platform
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Production-Grade Container Scheduling and Management
Tensors and Dynamic neural networks in Python with strong GPU acceleration
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
A high-throughput and memory-efficient inference and serving engine for LLMs