I want to change English roberta model to chinese roberta，the data processed module s

chinese fields about knowprompt HOT 15 CLOSED

zjunlp commented on June 10, 2024

chinese fields

from knowprompt.

Comments (15)

githubgtl commented on June 10, 2024

I change it returns 4 variables,the progress can run normally but loss = nan and f1=0

from knowprompt.

githubgtl commented on June 10, 2024

maybe the true label is compeletly different from predicted labels, loss is very big

from knowprompt.

njcx-ai commented on June 10, 2024

Thanks for your attention. Perhaps you can try starting with Chinese BERT and simultaneously check for any discrepancies between the logits and labels.

from knowprompt.

githubgtl commented on June 10, 2024

Thanks for your attention. Perhaps you can try starting with Chinese BERT and simultaneously check for any discrepancies between the logits and labels.

I have change the model but the loss is still nan,should I change the relation in the dataset into english

from knowprompt.

njcx-ai commented on June 10, 2024

Change the relation in the dataset into English would result in a semantic mismatch. You can utilize the debug mode of an IDE to troubleshoot and identify the issue.

from knowprompt.

githubgtl commented on June 10, 2024

Change the relation in the dataset into English would result in a semantic mismatch. You can utilize the debug mode of an IDE to troubleshoot and identify the issue.

thank you for your reply, I fix the question which loss is nan but the best is only 40%

from knowprompt.

zxlzr commented on June 10, 2024

Hi, if you are using only few-shot samples, you should try to run experiment multiple times to obtain the average results, and the hyperparameter and data selection is is also very sensitive to the performance.

You can also try the code here https://github.com/zjunlp/LREBench, in which we have already tried the Chinese datasets.

from knowprompt.

githubgtl commented on June 10, 2024

thanks, I just download the LRE projects and run it , I found almost every epoch is f1=0.00ckpt, does your data appear this performance?

from knowprompt.

githubgtl commented on June 10, 2024

by the way ,the loss gets smaller and smaller,but eval_f1 is 0.00

from knowprompt.

xxupiano commented on June 10, 2024

Hello, what's the dataset and the dataset size you used? Maybe the dataset size is such small that the model can not study them well.

from knowprompt.

githubgtl commented on June 10, 2024

I use the same dataset as your readme refers

from knowprompt.

githubgtl commented on June 10, 2024

but train dataset is 6000 picked by your dataset

from knowprompt.

xxupiano commented on June 10, 2024

Can you provide which dataset you used and which script you ran?

from knowprompt.

zxlzr commented on June 10, 2024

Have you solved your issues?

from knowprompt.

githubgtl commented on June 10, 2024

Have you solved your issues?

sorry , I just see this email, I solve it , thank you very much

from knowprompt.

Recommend Projects

chinese fields about knowprompt HOT 15 CLOSED

Comments (15)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent