Code Monkey home page Code Monkey logo

Comments (10)

encmps avatar encmps commented on June 30, 2024 1

Similar question, what is the minimum VRAM requirement to finetune the model? How about 4*4090?

from openchatkit.

zhongtao93 avatar zhongtao93 commented on June 30, 2024

Similar question, how to infer the model on 4x V100?

from openchatkit.

riatzukiza avatar riatzukiza commented on June 30, 2024

I would really like to know this too, it should probably be in the readme. I have 1 3090 to stand this up with before I can ask for more resources. If its really big, I might try to scale the model down and submit a request for a mini model to do sanity checks on local systems and such.

from openchatkit.

Southpika avatar Southpika commented on June 30, 2024

Similar question, what is the minimum requirement to finetune the model if I want to add my own docs?

from openchatkit.

csris avatar csris commented on June 30, 2024

We train this model on 8x A100 80GB GPUs. I'll update the README.

I... submit a request for a mini model to do sanity checks on local systems and such

This is a great idea! Will keep this issue open to track adding such a model.

from openchatkit.

Southpika avatar Southpika commented on June 30, 2024

Can I train it on a single or fewer A100 80GB GPUs? Maybe it takes more time or it cannot run?

from openchatkit.

puppet101 avatar puppet101 commented on June 30, 2024

Can I finetune the model on 8X V100 32GB GPUS with a smaller batch size?

from openchatkit.

raihan0824 avatar raihan0824 commented on June 30, 2024

Can I train it on a single or fewer A100 80GB GPUs? Maybe it takes more time or it cannot run?

up

from openchatkit.

joydchh avatar joydchh commented on June 30, 2024

We train this model on 8x A100 80GB GPUs. I'll update the README.

I... submit a request for a mini model to do sanity checks on local systems and such

This is a great idea! Will keep this issue open to track adding such a model.

how long it takes to train on 8*A100?

from openchatkit.

csris avatar csris commented on June 30, 2024

About an hour per 100 steps. Usually, we fine-tune for a couple days.

from openchatkit.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.