Comments (11)
I understand. I'll try to get the statistics soon.
from docee.
Thanks a lot!
from docee.
The number of gold arguments in PTPCG is the same as other baselines that use ChFinAnn.
You can download the original data from here and get the statistics.
from docee.
Hi there, does my response answer your questions? I'd like to close this issue if there's no further discussion.
from docee.
Hi @Spico197 after training the model on ChFinAnn, the test data arguments TP+FN = 28,545 but when I count the arguments from the original test data, it is 29,345.
I traced the missing arguments and found that they are dropped during the truncation of sentences and documents. Can you confirm?
Thanks.
from docee.
Yes. The default setting of the number of sentences in a document is 64, while the max sequence length is 128, so some documents are trucated. Doc2EDAG, GIT, PTPCG use the same setting. It may be potentially unfair if you use other settings.
from docee.
I didn't check the exact numbers yet, but do you mean arguments instead of mentions or entities?
from docee.
yes, I mean the arguments in event tables
from docee.
@Spico197 Hi! Would you be able to share the model predictions for ChFinAnn and DuEE-fin dev? I really appreciate your valuable time.
from docee.
Hi there, sorry for the late response. Things been busy these days.
The attachment below contains:
- PTPCG test evaluation results on ChFinAnn Epoch=57 (you can calculate the number of arguments from TP, FP and FNs in overall/overall) and middle prediction outputs.
- PTPCG dev evaluation results and middle prediction outputs on DuEE-Fin Epoch=99
from docee.
In case of any inconvenience for your analysis, I updated the PTPCG task dump trained on DuEE-Fin.
You can find it here: https://github.com/Spico197/DocEE/releases/tag/tasks-ptpcg-dueefin
from docee.
Related Issues (20)
- 实验结果 HOT 4
- 相似度的一些问题 HOT 8
- 分布式训练 HOT 3
- importance分数 HOT 15
- deppn模型F1只有33 HOT 2
- "pred_results"中的classification得分 HOT 25
- 分句 (uncommon sentence cutoff in DuEE-fin) HOT 14
- Duee_Fin预测结果 HOT 2
- 测试集结果 HOT 2
- 单事件&多事件 HOT 3
- Greedy-Dec模型如何运行? HOT 6
- Evaluation Metric HOT 11
- similarity calculation HOT 1
- pretrained model weight HOT 1
- 多事件 HOT 1
- 使用o2m格式的数据时,需要修改那些代码呢 HOT 1
- Potential performance issue: plotting slow in matplotlib == 3.3.0 HOT 1
- 请问老师怎么在自己的数据集上进行训练呢? HOT 14
- 关于ptpcg论文的一些问题 HOT 4
- 论文中的一个问题 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from docee.