
bjmsong's Projects

caffe

Caffe: a fast open framework for deep learning.

cutlass

CUDA Templates for Linear Algebra Subroutines

dlrm

An implementation of a deep learning recommendation model (DLRM)

fastllm

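A pure C++ cross-platform LLM acceleration library, callable from Python. ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile devices.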

flash_attention_inference

Performance of the C++ interface of flash attention, flash attention v2 and self quantized decoding attention in large language model (LLM) inference scenarios.

libflash_attn

Standalone Flash Attention v2 kernel without libtorch dependency

llama2.c

Inference Llama 2 in one file of pure C

llm.c

LLM training in simple, raw C/CUDA

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

nanogpt

The simplest, fastest repository for training/finetuning medium-sized GPTs.

triton

Development repository for the Triton language and compiler

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

xllm

A lightweight llama2 inference framework. It can run llama2-7b inference at 166+ tokens/s on a single 4090.
