Comments (6)

fredzannarbor commented on July 17, 2024

I tried GPT-3.5 today on two small runs, one of 10 examples and the other of 30. They yielded 2 and 25 valid examples, respectively. I had to switch to gpt-3.5-turbo-16k.

I was doing this because I was trying to isolate the out-of-memory problem that is preventing me from using gpt-llm-trainer successfully, and I was tired of spending money on gpt-4 runs. ;-)

from gpt-llm-trainer.

Afo92 commented on July 17, 2024

I used model="gpt-3.5-turbo", and it ran correctly. I can't comment on the quality of the GPT output because I don't actually need it; I only ran it as a test.

TanmayDoesAI commented on July 17, 2024

@fredzannarbor @Afo92 Thank you for the confirmation.

TanmayDoesAI commented on July 17, 2024

I am facing this issue with the default model:
OSError: NousResearch/llama-2-7b-chat-hf is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
If this is a private repository, make sure to pass a token having permission to this repo with use_auth_token or log in with huggingface-cli login and pass use_auth_token=True.

Any idea what should be done?
I tried my own sharded model, but that gave me a different error.

Afo92 commented on July 17, 2024

I used model_name = "meta-llama/Llama-2-7b-chat-hf" and then

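from transformers import AutoModelForCausalLM

# bnb_config and device_map are assumed to be defined earlier in the script
access_token = "hf_YOUR_TOKEN_HERE"  # replace with your own Hugging Face access token
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    use_auth_token=access_token,
    quantization_config=bnb_config,
    device_map=device_map
)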
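
A minimal sketch of the same idea that keeps the token out of the source file. It assumes the token is stored in an environment variable called HF_TOKEN (a name picked here for illustration), and the helpers build_load_kwargs and load_gated_model are hypothetical, not part of gpt-llm-trainer or transformers. device_map="auto" is also an assumption and needs the accelerate package installed. The transformers import is deferred into the loader so the argument-building step can be checked without downloading anything.

```python
import os


def build_load_kwargs(token=None, env_var="HF_TOKEN"):
    """Return keyword arguments for from_pretrained, including the access token.

    The token comes from the argument if given, otherwise from the
    environment variable named by env_var (an assumed name).
    """
    token = token or os.environ.get(env_var)
    if not token:
        raise ValueError(f"No Hugging Face token given and {env_var} is not set")
    return {
        "use_auth_token": token,  # newer transformers releases prefer token=...
        "device_map": "auto",     # assumption: let accelerate place the weights
    }


def load_gated_model(model_name="meta-llama/Llama-2-7b-chat-hf", token=None):
    """Load a gated model; imports transformers lazily and needs network access."""
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        model_name, **build_load_kwargs(token)
    )
```

Calling load_gated_model() only works once the account behind the token has been granted access to the gated repository on its Hugging Face model page. Recent transformers releases deprecate use_auth_token in favor of token, so the dictionary key may need renaming depending on the installed version.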

TanmayDoesAI commented on July 17, 2024

@Afo92 Gotcha, thank you!
