Comments (2)
你好,
请问为什么robert.sh 用的是run_bert.py呢
是因为roberta的模型可以在bert上finetune吗?
另外我尝试用albert的预训练模型在run_bert.py上跑,结果显示torch
size 不匹配, 一个是[21128, 2048],一个是(21128, 128)
1.roberta和bert是同一模型,不同的初始化参数,所以都是用run_bert.py
2.albert和bert是不同的模型,所以没法用run_bert.py
from ccf-bdci-sentiment-analysis-baseline.
好的,我明白了,谢谢
from ccf-bdci-sentiment-analysis-baseline.
Related Issues (16)
- RuntimeError: set_storage is not allowed on Tensor created from .data or .detach() HOT 4
- bert输出层对截断部分怎么融合 HOT 2
- 我看分类的demo上,是在bert之后接了一个lstm层吗, HOT 6
- 咨询一个问题:这些模型无显卡的笔记本能跑嘛? HOT 1
- 可否共享一下数据? 谢谢. 现在网站无法下载了 HOT 1
- /home/ming/anaconda3/lib/python3.7/site-packages/sklearn/metrics/classification.py:1439: UndefinedMetricWarning: F-score is ill-defined and being set to 0.0 in labels with no true samples. 'recall', 'true', average, warn_for) test 0.06457949662369551 HOT 5
- 如果不想切分文本,想整个都放进去,应该怎么改,是不是只改参数就好了?
- 您好,既然test集的全是假标签0为什么还要导进去dataloader里面? HOT 5
- 请教文章截成的k段,在哪块代码出可以看出是分别输入模型处理? HOT 3
- 运行robera-english报错的问题
- XLNet_zh_Large上的效果 HOT 1
- 可以用cpu么,怎么设置 HOT 2
- Roberta-large 与Roberta-mid的差别 HOT 3
- 我不理解这里的cls是什么,还有这里的return super(BertTokenizer, cls)._from_pretrained(pretrained_model_name_or_path, *inputs, **kwargs)
- run_bert.py eval_loss计算错误
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ccf-bdci-sentiment-analysis-baseline.