Can you introduce the computing resources needed for the experiment

Can I fine tune GPT-Neo-XT-Chat-Base-20B with 8 A100? about openchatkit HOT 10 OPEN

newcolour1994 commented on June 30, 2024 10

Can I fine tune GPT-Neo-XT-Chat-Base-20B with 8 A100?

from openchatkit.

Comments (10)

encmps commented on June 30, 2024 1

Similar question, what is the minimum VRAM requirement to finetune the model? How about 4*4090?

from openchatkit.

zhongtao93 commented on June 30, 2024

Similar question, how to infer the model on 4x V100?

from openchatkit.

riatzukiza commented on June 30, 2024

I would really like to know this too, it should probably be in the readme. I have 1 3090 to stand this up with before I can ask for more resources. If its really big, I might try to scale the model down and submit a request for a mini model to do sanity checks on local systems and such.

from openchatkit.

Southpika commented on June 30, 2024

Similar question, what is the minimum requirement to finetune the model if I want to add my own docs?

from openchatkit.

csris commented on June 30, 2024

We train this model on 8x A100 80GB GPUs. I'll update the README.

I... submit a request for a mini model to do sanity checks on local systems and such

This is a great idea! Will keep this issue open to track adding such a model.

from openchatkit.

Southpika commented on June 30, 2024

Can I train it on a single or fewer A100 80GB GPUs? Maybe it takes more time or it cannot run?

from openchatkit.

puppet101 commented on June 30, 2024

Can I finetune the model on 8X V100 32GB GPUS with a smaller batch size?

from openchatkit.

raihan0824 commented on June 30, 2024

Can I train it on a single or fewer A100 80GB GPUs? Maybe it takes more time or it cannot run?

from openchatkit.

joydchh commented on June 30, 2024

We train this model on 8x A100 80GB GPUs. I'll update the README.

I... submit a request for a mini model to do sanity checks on local systems and such

This is a great idea! Will keep this issue open to track adding such a model.

how long it takes to train on 8*A100?

from openchatkit.

csris commented on June 30, 2024

About an hour per 100 steps. Usually, we fine-tune for a couple days.

from openchatkit.

Recommend Projects