
llm_learning's Introduction

LLMs

Personal repository for building a deeper understanding of LLMs.

Contents

MinGPT

A PyTorch re-implementation of GPT, both training and inference.
minGPT tries to be small, clean, interpretable and educational.

The minGPT library is three files:

  • mingpt/model.py contains the actual Transformer model definition
  • mingpt/bpe.py contains a mildly refactored Byte Pair Encoder that translates between text and sequences of integers exactly like OpenAI did in GPT
  • mingpt/trainer.py is (GPT-independent) PyTorch boilerplate code that trains the model
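To make the role of a Byte Pair Encoder concrete, here is a minimal sketch of the core idea: repeatedly merge the most frequent adjacent pair of tokens into a new token. This is a toy illustration, not the code in mingpt/bpe.py; the function names (`train_toy_bpe`, `merge_pair`, `decode`) are hypothetical, and the real GPT-2 encoder adds pre-tokenization rules and a fixed, pretrained merge table.

```python
from collections import Counter


def most_frequent_pair(ids):
    """Return the adjacent pair of token ids that occurs most often, or None."""
    pairs = Counter(zip(ids, ids[1:]))
    return pairs.most_common(1)[0][0] if pairs else None


def merge_pair(ids, pair, new_id):
    """Replace every non-overlapping occurrence of `pair` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_toy_bpe(text, num_merges):
    """Learn up to `num_merges` merges over the UTF-8 bytes of `text`."""
    ids = list(text.encode("utf-8"))
    merges = {}  # (left_id, right_id) -> new_id, in the order learned
    for step in range(num_merges):
        pair = most_frequent_pair(ids)
        if pair is None:
            break
        new_id = 256 + step  # ids 0..255 are reserved for raw bytes
        merges[pair] = new_id
        ids = merge_pair(ids, pair, new_id)
    return merges, ids


def decode(ids, merges):
    """Expand merged tokens back into bytes and decode them as UTF-8."""
    expand = {new_id: pair for pair, new_id in merges.items()}
    stack, out = list(reversed(ids)), bytearray()
    while stack:
        tok = stack.pop()
        if tok in expand:
            left, right = expand[tok]
            stack.append(right)  # pushed first so that `left` is popped first
            stack.append(left)
        else:
            out.append(tok)
    return out.decode("utf-8")
```

Calling `train_toy_bpe("aaabdaaabac", 3)` returns the learned merges and a shorter id sequence; passing both to `decode` recovers the original string.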

Then there are a number of demos and projects that use the library, most of them in the projects folder:

  • projects/adder --> trains a GPT from scratch to add numbers
  • projects/chargpt --> trains a GPT to be a character-level language model on some input text file
  • demo.ipynb --> shows a minimal usage of the GPT and Trainer in a notebook format on a simple sorting example
  • generate.ipynb --> shows how one can load a pretrained GPT2 and generate text given some prompt
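As a rough illustration of what "character-level language model" means for the training data, the sketch below maps each character to an integer and builds (input, target) pairs in which the target is the input shifted by one position. It is an assumption about the general technique, not the code in projects/chargpt; `build_vocab`, `make_examples`, and `block_size` as used here are hypothetical names.

```python
def build_vocab(text):
    """Assign one integer id to each distinct character, in sorted order."""
    chars = sorted(set(text))
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for ch, i in stoi.items()}
    return stoi, itos


def make_examples(text, stoi, block_size):
    """Slide a window over the text: x holds block_size ids, y is x shifted by one."""
    ids = [stoi[ch] for ch in text]
    examples = []
    for start in range(len(ids) - block_size):
        chunk = ids[start : start + block_size + 1]
        examples.append((chunk[:-1], chunk[1:]))
    return examples


stoi, itos = build_vocab("hello")
examples = make_examples("hello", stoi, block_size=3)
```

Each pair asks the model to predict, at every position, the character that follows the prefix seen so far, which is why a single window yields `block_size` training signals.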

NanoGPT

Simple codebase for training/finetuning medium-sized GPTs.
It is a rewrite of minGPT that prioritizes teeth over education.

GigaGPT

We present gigaGPT, the simplest implementation for training large language models with tens or hundreds of billions of parameters. This work was inspired by Andrej Karpathy's nanoGPT. However, while nanoGPT is designed to train medium-sized models up to around the 1B parameter range, gigaGPT leverages Cerebras hardware to use a single simple model definition and training loop to scale to GPT-3-sized models run across exaflop-scale clusters.

GPT-Fast

(PyTorch Official)
Simple and efficient pytorch-native transformer text generation.

A minimalistic, PyTorch-only decoding implementation loaded with best practices: int8/int4 quantization, speculative decoding, tensor parallelism, etc. Boosts the "clock speed" of LLM OS by 10x with no model change!
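For intuition about int8 quantization, the following sketch shows one common scheme: symmetric quantization with a single scale per tensor, chosen from the largest absolute weight. It is pure Python and illustrative only; it is not GPT-Fast's implementation, which may use different granularity (for example per-channel or group-wise scales), and the function names are hypothetical.

```python
def quantize_int8(weights):
    """Map floats to integers in [-127, 127] using one shared scale."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / 127 if max_abs > 0 else 1.0  # avoid dividing by zero
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize_int8(q, scale):
    """Recover approximate float weights from the int8 values."""
    return [v * scale for v in q]


weights = [0.5, -1.0, 0.25, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)
```

Each restored weight differs from the original by at most about half of `scale`, which is the price paid for storing one byte per weight instead of two or four.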

LIT-GPT

(PyTorch Lightning Official)

LLM-Visualization

llm_learning's People

Contributors

pablorr100, pablorrapple
