nlpcl-lab / bert-event-extraction

PyTorch solution for the event extraction task using BERT on the ACE 2005 corpus

License: MIT License

Python 100.00%

ace2005 bert event-extraction pytorch

bert-event-extraction's Issues

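Do the predicted arguments use the ground truth?

Reading the code, I found that argument prediction depends on the ground truth. When I print the predicted arguments, start_idx and end_idx are both identical to the ground truth. Am I using it incorrectly?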

input of a sentence

Hi @bowbowbow, I am doing my end-of-year check-up.
The code processes the preprocessed data, but some people may want to use it on
a sentence of their own.

Is it possible to input a sentence and see the events extracted from it?
If so, please add a description of how to do this to the README.

Thanks, yours sincerely, Yang

Hi @bowbowbow, I am doing my end-of-year check-up.
As I see it, you tried some new approaches
to extracting events with BERT.
I believe the new approach could be captured in a diagram of the NN architecture.

If possible, please add such a diagram so that people can grasp the new approach at a glance.

Thanks, yours sincerely, Yang

have you tried to use bert to improve the performance of JMEE?

Hi,
Thank you for sharing.
I would like to know whether you have tried using BERT to improve the performance of JMEE.
I tried to reproduce JMEE, but I can't reach the results reported in the paper.

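What parameter configuration was used to reach a trigger score of 67?

I trained for 50 epochs, and the best trigger classification F1 was only 23. I don't know what parameters the author used to reach 67.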

mislabeled data

Hi, I'm trying to reproduce your model, but my results are low. I checked the labels my model predicted and found many tokens that it assigned an event subtype (something other than "O") but that are tagged "O" in the dataset. As a result, my precision drops (I only get precision = 62%). Did you encounter this issue? If so, how did you tackle it? Did you fix the wrong labels in the test and dev sets, or keep the original data when computing these scores?
I hope to see your answer soon. Thank you so much!
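
To make the effect described above concrete, here is a minimal sketch of trigger-level precision, recall and F1. It assumes, purely for illustration, that triggers are compared as (sentence index, token index, event type) tuples; the function name `trigger_prf` and the toy data are hypothetical and are not taken from the repository. Every token predicted as an event type but tagged "O" in the gold data counts as a false positive, which lowers precision without affecting recall.

```python
def trigger_prf(gold, pred):
    """Micro-averaged precision, recall and F1 over trigger tuples.

    gold, pred: iterables of (sentence_idx, token_idx, event_type) tuples.
    Tokens tagged 'O' are simply absent from both collections.
    """
    gold, pred = set(gold), set(pred)
    true_pos = len(gold & pred)
    precision = true_pos / len(pred) if pred else 0.0
    recall = true_pos / len(gold) if gold else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Toy data (hypothetical): two gold triggers, three predicted triggers.
gold = [(0, 3, "Conflict:Attack"), (1, 5, "Life:Die")]
pred = [(0, 3, "Conflict:Attack"), (1, 5, "Life:Die"),
        (2, 1, "Movement:Transport")]  # gold tag here is 'O' -> false positive

p, r, f = trigger_prf(gold, pred)
print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

In this toy case the single extra prediction leaves recall at 1.0 but pulls precision down to 2/3, which is the pattern the report describes.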

Is this model a joint method or pipeline method?

I'm a newcomer to event extraction and have been studying your code. In train.py, I think this is multitask learning, because the loss is the sum of the trigger loss and the argument loss. So I don't know: is this model a joint method or a pipeline method?
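
The observation above, that the training loss is the sum of the trigger loss and the argument loss, can be illustrated with a small plain-Python sketch. The helper `nll` and the toy probabilities are hypothetical; the repository itself uses PyTorch, and this is not its code. The sketch shows only what summing two task losses means, and does not by itself settle whether the model counts as joint or pipeline.

```python
import math


def nll(probs, gold):
    """Mean negative log-likelihood of the gold labels.

    probs: list of per-item probability distributions (lists summing to 1).
    gold:  list of gold label indices, one per item.
    """
    return -sum(math.log(p[g]) for p, g in zip(probs, gold)) / len(gold)


# Toy predictions (hypothetical numbers).
trigger_probs = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]  # 2 tokens, 3 trigger labels
trigger_gold = [0, 1]
argument_probs = [[0.6, 0.4]]                       # 1 candidate, 2 role labels
argument_gold = [0]

trigger_loss = nll(trigger_probs, trigger_gold)
argument_loss = nll(argument_probs, argument_gold)

# A single scalar objective: minimizing it optimizes both tasks at once,
# including any parameters the two classifiers share.
loss = trigger_loss + argument_loss
print(round(loss, 4))
```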

I'd like to know what environment everyone is using.

pip install pytorch==1.0 pytorch_pretrained_bert==0.6.1 numpy
The download keeps failing and I'm at my wits' end. Could someone experienced please help me?

After removing entities and POS; looking forward to a reply, thanks

TypeError: new() received an invalid combination of arguments - got (NoneType, int), but expected one of:
 * (*, torch.device device)
      didn't match because some of the arguments have invalid types: (!NoneType!, !int!)
 * (torch.Storage storage)
 * (Tensor other)
 * (tuple of ints size, *, torch.device device)
 * (object data, *, torch.device device)

What is head_indexes_2d used for?

x is the BERT representation, with shape [batch_size, SEQ_LEN, 768].
There is this piece of code:
for i in range(batch_size):
    x[i] = torch.index_select(x[i], 0, head_indexes_2d[i])
What is this doing?
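
One possible reading of the loop above, offered as an assumption rather than a confirmed description of the repository: BERT splits words into word pieces, and `head_indexes_2d[i]` lists the position of the first word piece of each word in sentence `i`, so the selection keeps one vector per original word. `torch.index_select(t, 0, idx)` returns the rows of `t` at the positions given in `idx`; the sketch below mimics that with plain Python lists. The tokenization, the helper `index_select_rows`, and the padding with index 0 are all hypothetical.

```python
def index_select_rows(rows, indexes):
    """Plain-Python analogue of torch.index_select(rows, 0, indexes)."""
    return [rows[i] for i in indexes]


# Hypothetical word-piece tokenization of "He was extradited".
pieces = ["[CLS]", "He", "was", "ex", "##tra", "##dit", "##ed", "[SEP]"]
# Stand-in for the BERT output of one sentence: one small vector per word piece.
x_i = [[float(pos), float(pos) * 10] for pos in range(len(pieces))]

# Position of the first piece of each word, padded with 0 up to SEQ_LEN so the
# result has as many rows as x_i (needed for the assignment x[i] = ...).
head_indexes = [0, 1, 2, 3, 7, 0, 0, 0]

selected = index_select_rows(x_i, head_indexes)
# The first five rows now correspond to whole words, not word pieces.
print([pieces[i] for i in head_indexes[:5]])
```

Under this reading, the rows for the continuation pieces ("##tra", "##dit", "##ed") are dropped, and each word is represented by the vector of its first piece.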

How do I use the trained model to run a simple event extraction task?

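A question about the results

I don't know whether you have read this paper: "Exploring Pre-trained Language Models for Event Extraction and Generation".

It pushed trigger identification accuracy straight up to 80%.

That said, this code was my entry point into NLP and helped me a great deal, but I would still like to point out two problems in it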

Originally posted by @mzh1996 in #15 (comment)

Two approaches to improve the performance

Hi,
I read your code and found two problems that hold back performance.
First, as far as I know, previous papers use the head words of entity mentions as candidate arguments, but you use the whole word sequence of each entity mention, which hurts argument-level performance a lot.
Second, during training you train the argument-level classifier on predicted triggers. I believe it should instead be trained on the golden triggers.

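How can I obtain the ACE2005 dataset?

Hello, how can I obtain the ACE2005 dataset? Is a license required? Once I have the license, where can I download or apply for the data? Many thanks.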

TypeError: new() received an invalid combination of arguments - got (NoneType, int), but expected one of: * (*, torch.device device)

Traceback (most recent call last):
  File "D:/pythonProject/bert-event-extraction-master/train.py", line 80, in <module>
    model = Net(
  File "D:\pythonProject\bert-event-extraction-master\model.py", line 18, in __init__
    self.entity_embed = MultiLabelEmbeddingLayer(num_embeddings=entity_size, embedding_dim=entity_embedding_dim, device=device)
  File "D:\pythonProject\bert-event-extraction-master\model.py", line 129, in __init__
    self.matrix = nn.Embedding(num_embeddings=num_embeddings,
  File "D:\anaconda2020\lib\site-packages\torch\nn\modules\sparse.py", line 109, in __init__
    self.weight = Parameter(torch.Tensor(num_embeddings, embedding_dim))
TypeError: new() received an invalid combination of arguments - got (NoneType, int), but expected one of:
 * (*, torch.device device)
      didn't match because some of the arguments have invalid types: (!NoneType!, !int!)
 * (torch.Storage storage)
 * (Tensor other)
 * (tuple of ints size, *, torch.device device)
 * (object data, *, torch.device device)
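
The traceback above shows `nn.Embedding` receiving `None` as `num_embeddings`, which suggests that `entity_size` was `None` when `Net` was constructed. The report does not say why it was `None`. A minimal sketch of a guard that fails earlier with a clearer message follows; the helper `require_positive_int` is hypothetical and is not part of the repository or of PyTorch.

```python
def require_positive_int(name, value):
    """Return value if it is a positive int, else raise a descriptive error."""
    # bool is a subclass of int, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, int) or value <= 0:
        raise ValueError(
            f"{name} must be a positive int before building an embedding, "
            f"got {value!r}"
        )
    return value


# Hypothetical call site: validate sizes before constructing the model.
entity_size = None  # mimics the value seen in the traceback
try:
    require_positive_int("entity_size", entity_size)
    message = ""
except ValueError as err:
    message = str(err)
print(message)
```

A check like this, placed before the model is built, would point directly at the argument that is missing instead of surfacing as a tensor constructor error deep inside the embedding layer.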

loss is 'nan'

Where is eval.py?

Thanks for sharing! But where is eval.py?