The llm-tuning from uvanlp

llm-tuning's Introduction

This package is to provide a codebase to easily fine-tune some open-source LLMs and reproduce the results.

So far, the codebase supports

Fine-tuning algorithms offered by the PEFT package
- LoRA
Open-source LLM families from Hugging Face
- Llama2
- Mistral
- Falcon
- Flan-T5
- Gemma
- Pythia
In-context learning

For detailed instruction, please refer to the Notion site: LLM Fine-tuning

A Simple Example

Here is a simple example of using the package for fine-tuning or prediction on the SST2 dataset. You can also find this demo code in demo.py.

import fire

from lora import Llama_Lora


def main(
    task: str = "eval",
	base_model: str = "meta-llama/Llama-2-7b-hf",
):
    m = Llama_Lora(
        base_model=base_model,
    )
    if task == "train":
        m.train(
            train_file = "data/sst2/train.json",
            val_file = "data/sst2/val.json",
            output_dir = "./ckp_sst_llama2_lora",
            train_batch_size = 32,
            num_epochs = 1,
        )
    elif task == "eval":
        m.predict(
            input_file = "data/sst2/val.json",
            max_new_tokens = 32,
            verbose = True,
        )
    else:
        raise ValueError(f"Unrecognized task: {task}")


if __name__ == "__main__":
    fire.Fire(main)

For prediction, please use

python demo.py

For fine-tuning, please use

python demo.py --task train

Contributors

In addition, this package has been tested by many members from the UVA ILP group via their research projects.

This project was inspired by the Alpaca-LoRA project, from which we learned a lot!

llm-tuning's People

Contributors

Stargazers

Watchers

llm-tuning's Issues

cannot import name 'prepare_model_for_int8_training' from 'peft'

prepare_model_for_int8_training is deprecated in the new version peft. I think we should change the code to be
self.model = prepare_model_for_kbit_training(self.model) on line 128, in llm_lora.py, and also change the import statement to be
from peft import ( LoraConfig, PeftModel, get_peft_model, get_peft_model_state_dict, prepare_model_for_kbit_training, set_peft_model_state_dict )

Recommend Projects

uvanlp / llm-tuning Goto Github PK

llm-tuning's Introduction

A Simple Example

Contributors

llm-tuning's People

Contributors

Stargazers

Watchers

Forkers

llm-tuning's Issues

cannot import name 'prepare_model_for_int8_training' from 'peft'

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent