Comments (5)
I've tested even bigger images, panorama images, and everything worked well.
Do you do any image preprocessing before recognition?
It seems like there is just a lot of noise in the bigger image, which is blurred during scaling.
You could simply check it, scaling down an image and scaling up it back before recognition.
from tesseract-ocr-ios.
Thanks for the reply Cyril - The only preprocessing I was doing was just a black and white conversion. But you’re right, the photo must have been too noisy, scaling down and back to the original size seems to do the trick, I get a better recognition rate as well on strings that have fractions (such as 1/4). anyway let me know if there is (or if you have) a special list of special image transforms I should apply to enhance the recognition. thanks again for the help!
On Sep 1, 2014, at 6:43 AM, Cyril [email protected] wrote:
I've tested even bigger images, panorama images, and everything worked well.
Do you do any image preprocessing before recognition?
It seems like there is just a lot of noise in the bigger image, which is blurred during scaling.
You could simply check it, scaling down an image and scaling up it back before recognition.—
Reply to this email directly or view it on GitHub.
from tesseract-ocr-ios.
Take a look here: #42.
In my case, I try to filter images to remove as much noise as possible. Simple combination of Adaptive Threshold Filter and Median filter removes the small noise and makes an images in binary format (this step is also named as binarization).
The second step, I would like to implement, is to find a text areas in the images to remove shadows and other quiete big noise around the text, which could produce some comma and dot output. It could be possible (perhaps, at least I've read some papers about it) using combinations of Opening and Closening morphological operations. But so far I didn't have a luck to implement the second phase, which could operation on any image.
from tesseract-ocr-ios.
ah! excellent! Doing an adaptive Threshold Filter did the trick. Lower res photos have issues (I have a couple that are 600px wide) but changing the contrast, some adaptive threasholding and most of the camera taken photos get great recognition rate now. Thanks for the help!
On Sep 3, 2014, at 5:14 AM, Cyril [email protected] wrote:
Take a look here: #42.
In my case, I try to filter images to remove as much noise as possible. Simple combination of Adaptive Threshold Filter and Median filter removes the small noise and makes an images in binary format (this step is also named as binarization).
The second step, I would like to implement, is to find a text areas in the images to remove shadows and other quiete big noise around the text, which could produce some comma and dot output. It could be possible (perhaps, at least I've read some papers about it) using combinations of Opening and Closening morphological operations. But so far I didn't have a luck to implement the second phase, which could operation on any image.—
Reply to this email directly or view it on GitHub.
from tesseract-ocr-ios.
I guess, it could be closed.
from tesseract-ocr-ios.
Related Issues (20)
- .
- User-Words file not loaded: Error: failed to load user-words HOT 5
- doesn't work with 64bpp images
- pod failed HOT 1
- Does not detects images 100%
- What's means with *.cube.* ? e.g. eng.cube.fold
- Why does the testsdata folder exist in the project? HOT 2
- Why don't we upgrade to the latest Tesseract?
- cocoapods 5.0.1 error
- Xcode 12.0.1 build is failing when trying to run it on the simulator. HOT 2
- RecognitionQuestion
- Apple Silicon (arm64; M1) support HOT 2
- Thread 1: EXC_BAD_ACCESS (code=1, address=0x163940000)
- Framework not found for Mac OS HOT 1
- 你好 这句一定闪退
- PDF creation not working
- self.tesseract!.recognize() is crashing in Acuant MRZ
- How to update Tesseract version
- Target 'TesseractOCRiOS' (project 'Pods') has copy command HOT 1
- cannot load language 'eng' HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tesseract-ocr-ios.