Comments (2)
BTW, if I use the provided code, I got F-1 91.5 on CORD dataset, which closely matches the numbers in the paper.
If I include the unmatched ground truth as false negatives, the F-1 is 83.6.
from donut.
Hi, thank you for reporting the issue. No, it is not intended. It seems that this issue has occurred during the recent refactoring to 1.0.6. donut-python>=1.0.7
will resolve the issue (pip install donut-python --upgrade
).
Regarding the different scores, we have tested various versions/types of F1, and the current implementation in this repository is the most rigid one. For example, it counts the case where the key is exact-matched but the value is not (e.g., "menu.nm":"Cake" & "menu.nm":"cake") as both FP and FN. Note that this might be too severe for all methods in an end-to-end setting (regardless of the OCR dependency. Also, this is a reason for using a TED-based metric). For instance, if a single character is missed in the OCR, there is no chance for the conventional tagging-based models. For the various versions, the overall trends among the methods were the same in our analyses. The current version will return a score of around 84.
python test.py --dataset_name_or_path naver-clova-ix/cord-v2 --pretrained_model_name_or_path naver-clova-ix/donut-base-finetuned-cord-v2
100%|█████████████| 100/100 [00:35<00:00, 2.80it/s]
Total number of samples: 100, Tree Edit Distance (TED) based accuracy score: 0.9374973163596032, F1 accuracy score: 0.8406020841373987
The current script (donut-python>=1.0.7
) is simple and easy to understand, and we further improved and fixed our script during the refactoring. With the latest evaluation script, we updated the scores at https://arxiv.org/abs/2111.15664. See https://github.com/clovaai/donut#test also. Hope this helps :)
from donut.
Related Issues (20)
- Does synthdog data has MiT or afl-3.0 license? HOT 1
- Error "A configuraton of type donut cannot be instantiated because not both `encoder` and `decoder` sub-configurations are passed" when run inference after finetuned docvqa without pushing to hugging face? HOT 1
- custom json schema - ASAP HOT 2
- Multi GPU support for fine tuning
- How to extract complete text from the document? HOT 3
- confidence 값의 공식적인 지원
- Classification inference
- Update donut-python Python Package to be compatible with latest versions of transformers
- donut inference시 sub task가 변경?
- Not getting prediction correctly using the model trained on the custom dataset (similar format as CORD-V2 dataset) HOT 6
- not work this app.py
- Can synthdog insert text for a specified bbox? HOT 1
- Where is the fine-tuned model?
- Why is the output of the intermediate verification empty after training?
- Donut generate ONLY <s><s>...<s></s> HOT 7
- Performance of the model HOT 1
- How to improve OCR accuracy for Japanese characters? HOT 2
- Early Stopping
- How many documents(invoices) are required for training model for document information extraction?
- What should be the configuration of the machine to train the model?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from donut.