Comments (17)

PhoebusSi commented on May 18, 2024

Try training on multiple GPUs, or increase gradient_accumulation_steps.
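
For anyone unsure what gradient accumulation does: it lets a single GPU mimic a larger batch by summing gradients over several small micro-batches before each weight update. Below is a minimal pure-Python sketch on a toy one-parameter regression, not the alpaca-cot training loop. The function names, the toy data, and the starting weight are all made up for illustration.

```python
def mse_gradient(w, batch):
    """Gradient of the mean squared error for y ~ w * x over one batch."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)


def accumulated_gradient(w, data, micro_batch_size):
    """Accumulate gradients over equal-sized micro-batches before one update."""
    micro_batches = [
        data[i:i + micro_batch_size]
        for i in range(0, len(data), micro_batch_size)
    ]
    steps = len(micro_batches)  # plays the role of gradient_accumulation_steps
    total = 0.0
    for micro_batch in micro_batches:
        # Dividing by the number of steps keeps the result equal to the
        # gradient of the mean loss over the whole effective batch.
        total += mse_gradient(w, micro_batch) / steps
    return total


# Hypothetical toy data: (x, y) pairs roughly following y = 2x.
data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.2)]
w = 0.5

full_batch = mse_gradient(w, data)
accumulated = accumulated_gradient(w, data, micro_batch_size=1)
print(full_batch, accumulated)
```

With equal-sized micro-batches the two values agree up to floating-point rounding, which is why raising the accumulation steps is a memory-cheap substitute for a larger batch on a single card.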

from alpaca-cot.

PhoebusSi commented on May 18, 2024

That's strange. Are you training on a single GPU?

yuanluw commented on May 18, 2024

Yes, a single 3090.

PhoebusSi commented on May 18, 2024

I've messaged you privately.

raihan0824 commented on May 18, 2024

I'm encountering the same problem. Can you help me?

raihan0824 commented on May 18, 2024

I'll try that. But what about tweaking the generation parameters? Do you know which ones I should adjust to make the responses truthful and less repetitive? I tried changing the temperature to 0.8 and setting greedy decoding to True.

raihan0824 commented on May 18, 2024

Or should I just train longer by adding more epochs?

PhoebusSi commented on May 18, 2024

  1. Raising the temperature slightly reduces repetition: a higher temperature flattens the predicted distribution over the vocabulary, so tokens that are rarely picked have a greater chance of being sampled.
  2. If you set greedy to True, there is no need to configure the temperature, because greedy decoding always takes the most likely token.
  3. The main reason is that the model is not well trained. Using more data can help. If the dataset is small (around 50k instructions), training for 3 epochs is recommended. If it is large (0.5M+), 1 epoch is enough.
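
To make points 1 and 2 concrete, here is a small pure-Python sketch of temperature scaling, assuming the usual softmax-over-logits formulation. The logits and the helper names are hypothetical and not taken from the alpaca-cot code.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw logits into probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_pick(logits):
    """Greedy decoding: index of the largest logit, no sampling involved."""
    return max(range(len(logits)), key=lambda i: logits[i])


# Hypothetical logits for a four-token vocabulary.
logits = [4.0, 2.0, 1.0, 0.5]

for temperature in (0.5, 0.8, 1.5):
    probs = softmax_with_temperature(logits, temperature)
    print(temperature, [round(p, 3) for p in probs])

# Dividing by a positive temperature never changes which logit is largest,
# so the greedy choice is the same at every temperature.
print(greedy_pick(logits), greedy_pick([x / 0.8 for x in logits]))
```

As the temperature rises, the top token's probability shrinks and the tail grows, which is where the reduced repetition comes from. The greedy pick ignores all of that, so setting a temperature alongside greedy decoding has no effect.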

raihan0824 commented on May 18, 2024

By 50k, do you mean the number of instructions? My data is currently 26MB.

PhoebusSi commented on May 18, 2024

Yes, I mean the number of instructions. 26MB of data is fairly large. Is the data quality poor?

raihan0824 commented on May 18, 2024

Well, my data is 26MB and contains 50k instructions. The quality should be similar to the Alpaca data, since I only translated it into another language using ChatGPT.

PhoebusSi commented on May 18, 2024

I see. This may be due to BLOOM's weak ability in your target language. You can collect more data, try larger BLOOM models, or try an LLM that performs well in the target language.

raihan0824 commented on May 18, 2024

Do you think adding more epochs will solve the problem?

PhoebusSi commented on May 18, 2024

Yes, it should be better.

PhoebusSi commented on May 18, 2024

How many epochs did you set? 3 epochs is suitable for 50k instructions.

raihan0824 commented on May 18, 2024

Currently only one, since my GPU usage is limited. Three epochs would take about 10 hours of training (on an A100 80GB).

raihan0824 commented on May 18, 2024

I'll switch the model to LLaMA and see how it goes.
