a-general-framework-to-prompt-tuning-llm-model's Introduction

Large Language Model Prompt Tuning Framework for classification

Welcome to our Large Language Model Prompt Tuning Framework, a solution designed for training large language models efficiently, even with limited resources. The framework builds on Hugging Face's peft and accelerate libraries and includes job scripts for SLURM (srun), giving researchers and developers a streamlined workflow.

Features

  • Resource Efficiency: Uses parameter-efficient prompt tuning (via peft) to train large language models with significantly reduced computational requirements.
  • Hugging Face Integration: Built with Hugging Face's peft and accelerate, ensuring compatibility with state-of-the-art model training practices.
  • Scalable and Flexible: Adapt to various scales of resources, from small local setups to large distributed systems.
  • SLURM Support: Includes scripts for srun, making it easy to deploy on SLURM-managed clusters.
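As a rough illustration of what prompt tuning with peft looks like for classification, here is a minimal sketch. It is not taken from this repository's training code: the function names, the `gpt2` checkpoint, the label count, and the number of virtual tokens are all assumptions chosen for the example.

```python
def prompt_param_count(num_virtual_tokens: int, hidden_size: int) -> int:
    """Trainable soft-prompt parameters: one embedding vector per virtual token."""
    return num_virtual_tokens * hidden_size


def build_prompt_tuned_classifier(model_name: str = "gpt2",
                                  num_labels: int = 2,
                                  num_virtual_tokens: int = 20):
    """Wrap a sequence-classification model with a prompt-tuning adapter.

    Hypothetical helper: model_name, num_labels and num_virtual_tokens are
    placeholder defaults, not values used by this framework.
    """
    # Imported here so the helper above can be used without the GPU stack installed.
    from peft import PromptTuningConfig, TaskType, get_peft_model
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    # GPT-style tokenizers often have no pad token; reuse EOS so batches can be padded.
    if tokenizer.pad_token is None:
        tokenizer.pad_token = tokenizer.eos_token

    model = AutoModelForSequenceClassification.from_pretrained(
        model_name, num_labels=num_labels
    )
    model.config.pad_token_id = tokenizer.pad_token_id

    # The base model weights are frozen; the learned virtual-token embeddings are trained.
    config = PromptTuningConfig(
        task_type=TaskType.SEQ_CLS,
        num_virtual_tokens=num_virtual_tokens,
    )
    peft_model = get_peft_model(model, config)
    peft_model.print_trainable_parameters()
    return peft_model, tokenizer
```

For a sense of scale, `prompt_param_count(20, 768)` gives 15360 soft-prompt parameters for a model with a hidden size of 768. This counts only the soft prompt; a classification head, if it is also trained, adds to that number.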

Getting Started

Follow these instructions to set up and run the framework on your system. We recommend Python 3.10. Install the dependencies with:

pip install -r requirements.txt

Prerequisites

Ensure you have the following installed:

  • Hugging Face transformers
  • Hugging Face accelerate
  • SLURM (for cluster management)

Usage

  1. Submit the provided batch script if you are using a SLURM cluster, or launch training directly from your terminal:

    sbatch GPTmodel_huggingface_multigpu.sh
    # or
    accelerate launch -m train
  2. A CPU version is also provided for evaluating the model.

  3. The code is provided for reference; feel free to modify it to suit your needs.
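The choice between the two launch commands in step 1 could be scripted along these lines. This is only a sketch: the `launch_command` helper and its `has_slurm` parameter are hypothetical and not part of the repository; the script and module names are the ones shown above.

```python
import shutil


def launch_command(script="GPTmodel_huggingface_multigpu.sh",
                   module="train",
                   has_slurm=None):
    """Return the launch command as an argument list.

    If has_slurm is None, SLURM is assumed to be available when the
    `sbatch` executable is found on PATH.
    """
    if has_slurm is None:
        has_slurm = shutil.which("sbatch") is not None
    if has_slurm:
        # Submit the batch script to the SLURM scheduler.
        return ["sbatch", script]
    # Otherwise run the training module locally through accelerate.
    return ["accelerate", "launch", "-m", module]
```

The returned list can be passed to `subprocess.run(launch_command(), check=True)` to start training.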

a-general-framework-to-prompt-tuning-llm-model's People

Contributors

biuyeaf
