Comments (5)
Use the generated trained data from the data directory data/digitsmodel.traineddata
from tesstrain.
@vijayrajasekaran Thank you very much for your hint! @kiransab Still an issue?
from tesstrain.
@wrznr @kiransab I have the same problem, did you manage to find what the solution is? At the end of the training, output was something like that:
2 Percent improvement time=975, best error was 3.143 @ 5376
At iteration 6351/10000/10000, Mean rms=0.492%, delta=0.284%, char train=1.013%, word train=3.54%, skip ratio=0%, New best char error = 1.013 wrote best model:data/checkpoints/pts1.013_6351.checkpoint wrote checkpoint.
Finished! Error rate = 1.013
lstmtraining \
--stop_training \
--continue_from data/checkpoints/pts_checkpoint \
--traineddata data/pts/pts.traineddata \
--model_output data/pts.traineddata
Loaded file data/checkpoints/pts_checkpoint, unpacking...
I have tried both data/pts.traineddata and data/pts/pts.traineddata, but got the same output:
TesseractError: (1, "Failed to load any lstm-specific dictionaries for lang pts!! Failed loading language 'pts' Tesseract couldn't load any languages! Could not initialize tesseract.")
I'm using it with pytesseract and other .traineddata files work well for the same code.
from tesstrain.
Mates, I found what the issue is for me. I was using tesseract in "--oem 2 (legacy+lstm)" mode, but realized that the code in this repo cannot produce ".box" files (it produces, but only emty files), so it doesn't work for legacy mode. It can work only in "--oem 1 (only lstm)" mode.
from tesstrain.
Many thanks for your helpful solution!
from tesstrain.
Related Issues (20)
- fine tuning arabic traineddata to solve extended words issue HOT 2
- Error while compiling tesseract within tesstrain HOT 2
- Maths OCR
- Can't open lstm.train despite (probably) having all training tools HOT 1
- Training a model from scratch with own imgs + txts? HOT 1
- Trying to train Tesseract for a different font, unable to get CER under 50%
- File not found - *.gt.txt HOT 3
- Error fine tuning new font for Thai Language
- What if my ground truth includes characters not found in a *.unicharset?
- Error generate text2image using khm.training_text HOT 1
- make training not building traineddata file HOT 1
- `make lists -j32` doesn't seem to be honoring the thread count. (Also happens when calling `make training -j32`) HOT 3
- deu_latf wordfile HOT 4
- unicharset_extractor stuck HOT 1
- How to train captcha? HOT 4
- winget install GnuWin32.Make error HOT 10
- make tesseract-langdata error HOT 7
- A question about missing dependency warnings when compiling and installing tesseract on centos using source code HOT 1
- How to train Chinese tradtional vertical in Tesseract 5? HOT 1
- "Compute CTC targets failed for xyz.lstmf!" for custom NET_SPECs HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tesstrain.