Name: bjmsong
Type: User
Bio: [email protected]
Caffe: a fast open framework for deep learning.
Code for the Manning book Concurrency in Python with Asyncio
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
CUDA Library Samples
CUDA Templates for Linear Algebra Subroutines
E-book and companion code for "Deep Learning from Scratch: Theory and Implementation in Python" (《深度学习入门:基于Python的理论与实现》).
Example models using DeepSpeed
An implementation of a deep learning recommendation model (DLRM)
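A pure C++, cross-platform LLM acceleration library with Python bindings. ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile devices.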
Performance of the C++ interface of flash attention, flash attention v2 and self quantized decoding attention in large language model (LLM) inference scenarios.
Matrix Multiplication Optimization on GPU/CPU
This repository contains the results and code for the MLPerf™ Inference v3.1 benchmark.
Material for cuda-mode lectures
Standalone Flash Attention v2 kernel without libtorch dependency
Inference Llama 2 in one file of pure C
LLM training in simple, raw C/CUDA
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
The simplest, fastest repository for training/finetuning medium-sized GPTs.
C++ library based on TensorRT integration
Development repository for the Triton language and compiler
A high-throughput and memory-efficient inference and serving engine for LLMs
A lightweight llama2 inference framework. It can run llama2-7b inference at 166+ tokens/s on a single 4090.