Quick Start Guide to Large Language Models

Get your copy today!

Welcome to the GitHub repository for the "Quick Start Guide to Large Language Models" book. This repository contains the code snippets and notebooks used in the book, demonstrating various applications of Transformer models.

Repository Structure

Directories

  • notebooks: This directory contains Jupyter notebooks for each chapter in the book.
  • data: Contains the datasets used in the notebooks.
  • images: Contains images and graphs used in the notebooks.

Notebooks

Here are some of the notebooks included in the notebooks directory:

Part I - Introduction to Large Language Models

  • 2_semantic_search.ipynb: An introduction to semantic search using OpenAI and open-source models.
  • 3_prompt_engineering.ipynb: A guide to effective prompt engineering for instruction-aligned LLMs.
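
As a rough illustration of the idea behind semantic search, the sketch below ranks documents by cosine similarity between a query vector and document vectors. It is not taken from the notebook: the function names and the three-dimensional "embeddings" are invented for illustration, whereas a real system would obtain vectors from an embedding model.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def semantic_search(query_vec, doc_vecs, top_k=2):
    """Return the top_k (doc_id, score) pairs, highest similarity first."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]

# Toy 3-dimensional "embeddings", invented for illustration only.
DOC_VECS = {
    "cats": [0.9, 0.1, 0.0],
    "dogs": [0.8, 0.3, 0.1],
    "stocks": [0.0, 0.2, 0.9],
}

# A hypothetical query vector that points in roughly the same direction as "cats".
results = semantic_search([1.0, 0.2, 0.0], DOC_VECS)
print(results)
```

The ranking depends only on vector direction, not length, which is why cosine similarity is a common default for comparing text embeddings.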

Part II - Getting the Most Out of LLMs

  • 4_fine_tuned_classification.ipynb: Learn how to perform text classification by fine-tuning OpenAI models.
  • 5_adv_prompt_engineering.ipynb: Advanced techniques for prompt engineering, including k-shot, semantic k-shot, chain-of-thought prompting, chaining, and building a retrieval-augmented generation (RAG) chatbot with GPT-4.
  • 5_VQA.ipynb: Introduction to prompt chaining and Visual Question Answering (VQA) with open-source LLMs.
  • 6_recommendation_engine.ipynb: Building a recommendation engine using custom fine-tuned LLMs.
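
To make the k-shot idea concrete, here is a minimal sketch of assembling a prompt from an instruction, k labeled examples, and a new input. The function name, the "Review:/Sentiment:" layout, and the example data are all hypothetical and not taken from the notebooks. This version simply takes the first k examples; a semantic k-shot variant would instead pick the examples most similar to the query.

```python
def build_k_shot_prompt(instruction, examples, query, k=2):
    """Assemble a prompt from an instruction, the first k labeled examples, and a new input."""
    lines = [instruction, ""]
    for text, label in examples[:k]:
        lines.append(f"Review: {text}")
        lines.append(f"Sentiment: {label}")
        lines.append("")
    # The final block is left unlabeled so the model completes it.
    lines.append(f"Review: {query}")
    lines.append("Sentiment:")
    return "\n".join(lines)

# Invented examples for illustration only.
EXAMPLES = [
    ("Loved every minute of it.", "positive"),
    ("A complete waste of time.", "negative"),
    ("It was fine, I guess.", "neutral"),
]

prompt = build_k_shot_prompt(
    "Classify the sentiment of each review.",
    EXAMPLES,
    "Surprisingly good!",
    k=2,
)
print(prompt)
```

The resulting string can be sent to any text-completion or chat model; the model call itself is left out because it depends on the provider and client library in use.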

Part III - Advanced LLM Usage

  • 7_constructing_a_vqa_system.ipynb: Step-by-step guide to constructing a Visual Question Answering system using open-source GPT2 and the Vision Transformer.
  • 7_using_our_vqa.ipynb: A notebook to use the VQA system we built in the previous notebook.
  • 7_rl_flan_t5_summaries.ipynb: Using Reinforcement Learning (RL) to produce more neutral and grammatically correct summaries with the FLAN-T5 model.
  • 8_latex_gpt2.ipynb: Fine-tuning GPT-2 to generate LaTeX formulas.
  • 8_anime_category_classification_model_freezing.ipynb: Fine-tuning a BERT model to classify anime categories with a comparison between freezing model layers and keeping the model unfrozen.
  • 8_optimizing_fine_tuning.ipynb: Best practices for optimizing the fine-tuning of transformer models: dynamic padding, gradient accumulation, mixed precision, and more.
  • 8_sawyer_1_instruction_ft.ipynb: Fine-tuning the instruction model for the SAWYER bot.
  • 8_sawyer_2_train_reward_model.ipynb: Training a reward model for the SAWYER bot from human preferences.
  • 8_sawyer_3_rl.ipynb: Using Reinforcement Learning from Human Feedback (RLHF) to further align the SAWYER bot.
  • 8_sawyer_4_use_sawyer.ipynb: Using the finished SAWYER bot.
  • 9_distillation.ipynb: An exploration of knowledge distillation techniques for transformer models.
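
Of the fine-tuning optimizations named above, dynamic padding is the easiest to show in isolation. The sketch below pads each batch only to the length of its own longest sequence rather than to a global maximum. It is a standalone illustration with hypothetical function names and invented token IDs, not the notebook's code; in practice a library data collator would usually handle this step.

```python
def pad_batch(batch, pad_id=0):
    """Pad every sequence in one batch to that batch's longest length."""
    longest = max(len(seq) for seq in batch)
    return [seq + [pad_id] * (longest - len(seq)) for seq in batch]

def dynamic_batches(sequences, batch_size, pad_id=0):
    """Yield batches padded per batch instead of to a global maximum length."""
    for start in range(0, len(sequences), batch_size):
        yield pad_batch(sequences[start:start + batch_size], pad_id)

# Toy token-id sequences, invented for illustration only.
SEQUENCES = [[5, 6], [7], [8, 9, 10, 11, 12], [13, 14, 15]]

batches = list(dynamic_batches(SEQUENCES, batch_size=2))

# Compare total tokens processed with per-batch padding vs. global padding.
padded_tokens = sum(len(row) for batch in batches for row in batch)
global_tokens = len(SEQUENCES) * max(len(seq) for seq in SEQUENCES)
print(batches)
print(padded_tokens, global_tokens)
```

In this toy case per-batch padding processes fewer tokens than padding everything to the longest sequence, and the saving generally grows when sequence lengths vary widely.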

We will continue to add notebooks exploring topics like fine-tuning, advanced prompt engineering, combining transformers, and various use cases. Stay tuned!

How to Use

To use this repository, clone it to your local machine, navigate to the notebooks directory, and open the Jupyter notebook of your choice. Note that some notebooks may require specific datasets, which can be found in the data directory.

Please ensure that you have the necessary libraries installed and that they are up to date. This can usually be done by running pip install -r requirements.txt in the terminal.
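
Before opening a notebook, it can help to check which required packages are missing from the current environment. The sketch below is one way to do that with the standard library. The function names are hypothetical, the sample requirements text is invented and is not the repository's actual file, and the parser is deliberately simple: it ignores version constraints and does not handle URL or VCS requirements.

```python
import re
from importlib import metadata

def requirement_names(requirements_text):
    """Extract bare package names, skipping blanks, comments, and pip options."""
    names = []
    for raw in requirements_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if not line or line.startswith("-"):  # skip blanks and options like "-r"
            continue
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0))
    return names

def missing_packages(names):
    """Return the names that importlib.metadata cannot find in this environment."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing

# Hypothetical requirements text, not the repository's actual file.
SAMPLE = """
# example only
transformers>=4.30
torch==2.0.1  # pinned
-r other.txt
"""

names = requirement_names(SAMPLE)
print(names)
print(missing_packages(names))
```

To check the real file, read it with `pathlib.Path("requirements.txt").read_text()` and pass the result to `requirement_names`.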

Contributing

Contributions are welcome! Feel free to submit a pull request if you have any additions, corrections, or enhancements.

Disclaimer

This repository is for educational purposes and is meant to accompany the "Quick Start Guide to Large Language Models" book. Please refer to the book for in-depth explanations and discussions of the topics covered in the notebooks.
