Comments (2)
I had never come into this situation before. Maybe there is something wrong with the offset mapping list, you could check out whether there is a problem with predicted token spans. If not, the problem might come from the mapping progress from token spans to char spans. You need to debug by yourself.
from tplinker-joint-extraction.
Hi mate, I haven't dived into it that you mentioned yet. I've found the issue is highly related to max_seq_len; the smaller max_seq_len is, the more entities with those outranged spans appear. The issue has been solved since I adjust the value of max_seq_len to 512 (max input len for BERT), which is the default value that you set.
However, there are still some entities with empty spans. I'll check it out few days later when I'm free.
Many thanks
from tplinker-joint-extraction.
Related Issues (20)
- 请问tplink plus的输入数据的格式和tplink输入数据的格式有什么区别? HOT 3
- 针对不连续的实体抽取是否有解决方案 HOT 1
- 代码可能有个地方写错了,decode_rel函数中循环增加offset,当序列长度比较大的时候,会出现超出token长度的token_span数值。 HOT 1
- 我的非
- 我的f1值一直是0怎么回事呢? HOT 2
- tplinker_plus.py 中的decode_rel有错误
- 您的github中readme部分分享的链接已失效, 希望您可以将 NYT* NYT WebNLG* WebNLG 四个数据集的参数再次分享一下 HOT 1
- 请问tplinker对于无标注数据怎么处理呢? HOT 1
- 关于TPLinker在SCIERC数据集的实验 HOT 2
- seq_len = seq_hiddens.size()[-2] HOT 1
- datasets HOT 2
- The process was killed!
- from tplinker import (HandshakingTaggingScheme,DataMaker4Bert, DataMaker4BiLSTM, TPLinkerBert, TPLinkerBiLSTM,MetricsCalculator) HOT 1
- eval问题 HOT 2
- 自己的数据集如何进行标注,训练呢?用什么工具进行标注,能详细讲讲吗 HOT 5
- 没有预测脚本吗,这个训练完之后,怎么进行推理呢? HOT 3
- 滑动窗口问题
- Prepare my own text dataset
- shaking_type中的cln和cln_plus
- tok_span_error
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tplinker-joint-extraction.