Comments (36)
@egorpugin Please delete the deprecated info regrading compiling on windows.
@HotDenim Latest Windows binaries with training tools can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki
Though, for training, I would recommend using Linux.
from tessdoc.
Done.
from tessdoc.
Thanks!
from tessdoc.
Done.
The new instructions do not work. I followed then, and while SW
was executing, it stopped with an error:
[2718/3758] generate: "C:/Users/User/.sw/storage/pkg/1f/e0/12aa/obj/bld/179681/bd/gobject/glib-enumtypes.c"
Exception in file D:/dev/cppan2/client2/src/sw/builder/command.cpp:764, function execute1: When executing: generate: "C:/Users/User/.sw/storage/pkg/1f/e0/12aa/obj/bld/179681/bd/gobject/glib-enumtypes.c"
command failed: exit code = -1073741515 (0xC0000135): The specified module could not be found.
Total errors: 1
This happened for the 4.1
Branch and the Master
branch. (I would like to compile the 4.1
branch).
from tessdoc.
Add -k 1000
options, run.
Then add --trace 2> log.txt
options, run and send log.
from tessdoc.
Add
-k 1000
options, run.
Then add--trace 2> log.txt
options, run and send log.
Here is the Log.txt
file.
from tessdoc.
Try to run sw build org.sw.demo.google.tesseract.training.text2image-master -sfc --trace 2> log.txt
If there are errors:
- Post log.
- In
.sw\rsp
run the latest.bat
file (sort by date desc) by double click and post screenshot of error message. There will be dll name that is missing.
from tessdoc.
No errors displayed (nothing output to screen), but there is in the log file (attached).
log.txt
from tessdoc.
Oh, the missing component is on MS side, not sw.
Can you show winver
command output? Win+R, type winver
, Enter.
from tessdoc.
Can you send me that .bat
file you've run also?
from tessdoc.
Try to install VS2015 redist. It is strange, because you have VS and WinSDK installed.
from tessdoc.
Winver:
BAT file (renamed to .TXT to enable attaching):
?
from tessdoc.
installed VS2015 redist. Then tried again (SW Build .
) , but still error.
from tessdoc.
I see. It is Win7. This complicates things a bit.
Possible solutions (except for building on win10):
- Try to install Win7 WinSDK (6.1 or 6.1A or smth like this).
- I can install win7 on VM to try to repro this, but this is time consuming, not sure that I have time for this.
- Find installed WinSDKs
C:\\Program Files (x86)\\Windows Kits
and sendtree
command output. I need detailed layout of installed SDKs to possibly tune sw to use them.
from tessdoc.
Also it would be nice if you send tree
output of C:\\Program Files (x86)\\Visual Studio 2015
and C:\\Program Files (x86)\\Microsoft Visual Studio 12.0
.
from tessdoc.
OK will do for next post. Also for sw build org.sw.demo.google.tesseract.training.text2image-master -sfc --trace 2> log.txt
:
1461694303201023788.bat.txt
8802815074161298208.bat.txt
from tessdoc.
- Win7 WinSDK (6.1 or 6.1A
This ?:
https://www.microsoft.com/en-gb/download/details.aspx?id=8442
It is version 7.1
also 32-bit distribution or 64-bit (I wish to build both).
from tessdoc.
This ?
Yes, try those installers.
from tessdoc.
Attached are the Tree output files (refer to the filenames). (I am using Visual Studio 2019)
Also, the VS 2015 redist does not include the api-ms-win-core-io-l1-1-1.dll file. I downloaded this file separately and installed it, after that, the .BAT file now refers to more .DLL errors (api-ms-win-core-synch-l1-2-1.dll). I think these .DLL files are Windows 10 only files ? (see https://docs.microsoft.com/en-us/uwp/win32-and-com/win32-apis )
Program Files (x86), Microsoft Visual Studio 10.0.txt
Program Files (x86), Microsoft Visual Studio.txt
Program Files (x86), Window Kits.txt
Program Files, Microsoft SDK's.txt
Program Files (x86), Microsoft SDK's.txt
Also, I installed Window 7 64-Bit, new installation, then installed :
- Cmake
- Git
- SW
- Visual Studio 2019
- VS 2015 Redist
- Windows 7 SDK 7.1 (as found in previous post) - 64-Bit
And the same error is present.
from tessdoc.
I think these .DLL files are Windows 10 only files ?
Yes, something like this.
The problem is that sw does not use your host config (host - win7 SDK) and uses win10 sdk. Therefore the binaries cannot be started.
I'll try to do quick fixes in sw, maybe we'll be able to get it working.
Can you send me set
command ouput?
from tessdoc.
Can you send me
set
command ouput?
Here:
ALLUSERSPROFILE=C:\ProgramData
APPDATA=C:\Users\Tester\AppData\Roaming
CommonProgramFiles=C:\Program Files\Common Files
CommonProgramFiles(x86)=C:\Program Files (x86)\Common Files
CommonProgramW6432=C:\Program Files\Common Files
COMPUTERNAME=X
ComSpec=C:\Windows\system32\cmd.exe
FP_NO_HOST_CHECK=NO
HOMEDRIVE=C:
HOMEPATH=\Users\Tester
LOCALAPPDATA=C:\Users\Tester\AppData\Local
LOGONSERVER=\\VIR
NUMBER_OF_PROCESSORS=4
OS=Windows_NT
Path=C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0\;C:\Program Files\Microsoft Windows Performance Toolkit\;C:\Program Files\Git\cmd;C:\Program Files\CMake\bin;C:\Program Files\Microsoft SQL Server\130\Tools\Binn\
PATHEXT=.COM;.EXE;.BAT;.CMD;.VBS;.VBE;.JS;.JSE;.WSF;.WSH;.MSC
PROCESSOR_ARCHITECTURE=AMD64
PROCESSOR_IDENTIFIER=Intel64 Family 6 Model 58 Stepping 9, GenuineIntel
PROCESSOR_LEVEL=6
PROCESSOR_REVISION=3a09
ProgramData=C:\ProgramData
ProgramFiles=C:\Program Files
ProgramFiles(x86)=C:\Program Files (x86)
ProgramW6432=C:\Program Files
PROMPT=$P$G
PSModulePath=C:\Windows\system32\WindowsPowerShell\v1.0\Modules\
PUBLIC=C:\Users\Public
SESSIONNAME=Console
SystemDrive=C:
SystemRoot=C:\Windows
TEMP=C:\Users\Tester\AppData\Local\Temp
TMP=C:\Users\Tester\AppData\Local\Temp
USERDOMAIN=X
USERNAME=Tester
USERPROFILE=C:\Users\Tester
VS100COMNTOOLS=C:\Program Files (x86)\Microsoft Visual Studio 10.0\Common7\Tools\
windir=C:\Windows
windows_tracing_flags=3
windows_tracing_logfile=C:\BVTBin\Tests\installpackage\csilogfile.log
from tessdoc.
Ok, I'm sending a new binary with some updates. Unpack and replace old sw.exe. Run the command and post logs.
First, we'll try to check if winsdk7 is detected by sw.
After that we need to provide correct host settings to sw.
from tessdoc.
Ok, I'm sending a new binary with some updates. Unpack and replace old sw.exe. Run the command and post logs.
First, we'll try to check if winsdk7 is detected by sw.
After that we need to provide correct host settings to sw.
Which Command, and which logs ?.
from tessdoc.
The command for building tess like you did before.
sw build org.sw.demo.google.tesseract.training.text2image-master -sfc --trace 2> log.txt
from tessdoc.
Log File:
from tessdoc.
Can you email me to solve this faster?
egor.pugin at gmail.com
from tessdoc.
Try this binary, send log.
from tessdoc.
Can you email me to solve this faster?
egor.pugin at gmail.com
Problems with email , at the moment.
from tessdoc.
Try this binary, send log.
Log File:
from tessdoc.
Try this binary, send log.
sw.client.sw-1.0.0.exe.zip
from tessdoc.
I've checked some things further, it seems we can't fix this issue on win7 right now. (Next sw version is in progress - it should fix the issue.)
So, you have to build it on win10.
from tessdoc.
When will next version be released?
from tessdoc.
How do I build it in Windows 10., the commands ?), for 32-Bit & 64-Bit, and Static & Dynamic Libraries ?. And version 4.1!
Also do I need to download the source via GIT first ?
Also; Can you update the Tessract Documention page with all info.
from tessdoc.
When will next version be released?
After some core algorithms is developed.
How do I build it in Windows 10
Same instructions.
- Download sources.
sw build -platform x86,x64 -static -shared
Combine flags as you wish.- Output files will be in
.sw\out\some hash
from tessdoc.
When will next version be released?
After some core algorithms is developed.
E.g. ?, 1 Month, 2 Months ?. 6 Months ?
How do I build it in Windows 10
Same instructions.
- Download sources.
sw build -platform x86,x64 -static -shared
Combine flags as you wish.- Output files will be in
.sw\out\some hash
So for Tesseract 4.1 I will need to download from GIT the source code. first ?, for 4.1 ?. Yes?, and does SW also work with 4.1 ?.
from tessdoc.
and does SW also work with 4.1 ?.
We'll see. In general, it should.
from tessdoc.
Related Issues (20)
- Unlisted GUI HOT 1
- OCR convertion Issue
- Broken links in documentation HOT 1
- tesseract ocr HOT 1
- Alto versions HOT 1
- PDF workflow issues HOT 2
- Debian repository notesalexp.org down HOT 1
- Getting really bad results?
- Assertion failure when using the legacy ara.traineddata in latest tesseract version. HOT 2
- Regenerate language data for tessdata_best
- Method missing HOT 2
- `image_to_data` result doesn't match `image_to_string` HOT 4
- document C++17 req HOT 1
- Reading library spine covers HOT 1
- Issue with TESSDATA_PREFIX and Symbolic Links on macOS Ventura Using Homebrew HOT 3
- Usage of unicharsets that don't seem to be attached to a trained model
- What dataset is base tesseract 5 trained on? HOT 1
- Wrong Issue
- How to check which language are covered by scripts? HOT 1
- What should be the norm_mode for different languages?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tessdoc.