Code Monkey home page Code Monkey logo

Comments (15)

githubgtl avatar githubgtl commented on June 10, 2024

I change it returns 4 variables,the progress can run normally but loss = nan and f1=0

from knowprompt.

githubgtl avatar githubgtl commented on June 10, 2024

maybe the true label is compeletly different from predicted labels, loss is very big

from knowprompt.

njcx-ai avatar njcx-ai commented on June 10, 2024

Thanks for your attention. Perhaps you can try starting with Chinese BERT and simultaneously check for any discrepancies between the logits and labels.

from knowprompt.

githubgtl avatar githubgtl commented on June 10, 2024

Thanks for your attention. Perhaps you can try starting with Chinese BERT and simultaneously check for any discrepancies between the logits and labels.

I have change the model but the loss is still nan,should I change the relation in the dataset into english

from knowprompt.

njcx-ai avatar njcx-ai commented on June 10, 2024

Change the relation in the dataset into English would result in a semantic mismatch. You can utilize the debug mode of an IDE to troubleshoot and identify the issue.

from knowprompt.

githubgtl avatar githubgtl commented on June 10, 2024

Change the relation in the dataset into English would result in a semantic mismatch. You can utilize the debug mode of an IDE to troubleshoot and identify the issue.

thank you for your reply, I fix the question which loss is nan but the best is only 40%

from knowprompt.

zxlzr avatar zxlzr commented on June 10, 2024

Hi, if you are using only few-shot samples, you should try to run experiment multiple times to obtain the average results, and the hyperparameter and data selection is is also very sensitive to the performance.

You can also try the code here https://github.com/zjunlp/LREBench, in which we have already tried the Chinese datasets.

from knowprompt.

githubgtl avatar githubgtl commented on June 10, 2024

thanks, I just download the LRE projects and run it , I found almost every epoch is f1=0.00ckpt, does your data appear this performance?

from knowprompt.

githubgtl avatar githubgtl commented on June 10, 2024

by the way ,the loss gets smaller and smaller,but eval_f1 is 0.00

from knowprompt.

xxupiano avatar xxupiano commented on June 10, 2024

Hello, what's the dataset and the dataset size you used? Maybe the dataset size is such small that the model can not study them well.

from knowprompt.

githubgtl avatar githubgtl commented on June 10, 2024

I use the same dataset as your readme refers

from knowprompt.

githubgtl avatar githubgtl commented on June 10, 2024

but train dataset is 6000 picked by your dataset

from knowprompt.

xxupiano avatar xxupiano commented on June 10, 2024

Can you provide which dataset you used and which script you ran?

from knowprompt.

zxlzr avatar zxlzr commented on June 10, 2024

Have you solved your issues?

from knowprompt.

githubgtl avatar githubgtl commented on June 10, 2024

Have you solved your issues?

sorry , I just see this email, I solve it , thank you very much

from knowprompt.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.