Code Monkey home page Code Monkey logo

Comments (6)

LiuAmber avatar LiuAmber commented on June 26, 2024

I also encountered similar problems
I ran the following command multiple times:

alpaca_eval --model_outputs "Mistral-7B+RAHF-DUAL+LoRA.json" --output_path "Mistral-7B+RAHF-DUAL+LoRA_gpt4" ----annotators_config "weighted_alpaca_eval_gpt4_turbo" --reference_outputs "reference_outputs.json" --is_reapply_parsing True

But the result is that only more than 200 examples are evaluated.

from alpaca_eval.

matenglearn avatar matenglearn commented on June 26, 2024

I also encountered similar problems I ran the following command multiple times:

alpaca_eval --model_outputs "Mistral-7B+RAHF-DUAL+LoRA.json" --output_path "Mistral-7B+RAHF-DUAL+LoRA_gpt4" ----annotators_config "weighted_alpaca_eval_gpt4_turbo" --reference_outputs "reference_outputs.json" --is_reapply_parsing True

But the result is that only more than 200 examples are evaluated.

Can you access OpenAI successfully?
Unfortunately, I only have intermittent access to it.
"INFO:root:Sleeping 2 before retrying to call openai API..."
I don't know if this is the reason for the incomplete evaluation.

from alpaca_eval.

YannDubs avatar YannDubs commented on June 26, 2024

Can either of you reproduce this result using OpenAI rather than through a proxy server? In my experience (from previous gh issues) the problem always comes from the proxy server and I can't help for that.

For example, many proxy servers don't return logprobs. Can you share the warnings/errors that you see during annotation?

from alpaca_eval.

YannDubs avatar YannDubs commented on June 26, 2024

E.g. @LiuAmber the fact that you could perform AlpacaEval 1 here: #307, strongly suggests to me that the error comes from the logprobs (necessary for AlapcaEval 2 but not 1) and that this comes from your proxy server.

from alpaca_eval.

YannDubs avatar YannDubs commented on June 26, 2024

I'm assuming that the issue is the fact that your proxy doesn't have access to logprobs. If so try:

alpaca_eval evaluate_from_model --model_configs '...' --annotators_config 'alpaca_eval_clf_gpt4_turbo'

Feel free to reopen if your issue is different

from alpaca_eval.

matenglearn avatar matenglearn commented on June 26, 2024

Can either of you reproduce this result using OpenAI rather than through a proxy server? In my experience (from previous gh issues) the problem always comes from the proxy server and I can't help for that.

For example, many proxy servers don't return logprobs. Can you share the warnings/errors that you see during annotation?

Thank you. I solved this problem. The problem is caused by proxy servers and the setting of annotators_config.

from alpaca_eval.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.