
Comments (4)

GoogleCodeExporter avatar GoogleCodeExporter commented on September 18, 2024
This is a known problem.

The FindLines code assumes that each rectangle given to it contains text of a
single size and that the baseline, although it may be curved, does not shift
suddenly. It will often succeed when these rules are broken, but there is a
much higher probability that the text will be lost or just wrong.

The problem in the example you give is more or less unique to forms processing,
and although it may be fixed in a future version, it will most likely be a
distant future version. In the meantime, you could try to cut out rectangles of
similar-sized characters...
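
One way to act on that workaround, sketched under assumptions: if you already
have bounding boxes for the text regions on the form (from your own knowledge of
the layout, for example; this is not something FindLines provides), you can
group them by height so that each group holds roughly one text size, then crop
and OCR each group separately. The function name, the box format, and the 25%
tolerance below are illustrative choices, not part of Tesseract.

```python
from typing import List, Tuple

# Hypothetical box format: (left, top, width, height) in pixels.
Box = Tuple[int, int, int, int]


def group_boxes_by_height(boxes: List[Box], tolerance: float = 0.25) -> List[List[Box]]:
    """Group boxes so that every box in a group is at most `tolerance`
    (relative) taller than the shortest box in that group."""
    groups: List[List[Box]] = []
    # Process boxes from shortest to tallest so each group starts with
    # its shortest member.
    for box in sorted(boxes, key=lambda b: b[3]):
        if groups and box[3] <= groups[-1][0][3] * (1 + tolerance):
            groups[-1].append(box)
        else:
            groups.append([box])
    return groups
```

Each resulting group can then be cut out of the page and passed to the OCR
engine on its own, so that no single rectangle mixes text sizes.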

Original comment by [email protected] on 19 Jul 2007 at 3:34

  • Changed state: Accepted

from tesseract-ocr.

GoogleCodeExporter avatar GoogleCodeExporter commented on September 18, 2024
I solved this the same way I solved the "digits" problem (issue #164).
Since you're parsing a form, you probably know where each element to recognize
is located on it.
My method is to:
- Rotate the form by an automatically detected angle (I use the black areas
around the scan to determine two corner points, then take the atan of the slope
between them, which gives me the angle).
- Crop the garbage generated by the rotation all around the picture (easy if you
know the angle and at least 3 corners of the document; I first shear it so that
the angle formed by the 3 points is 90°, then crop).
- Determine the "type" of your form, if you're processing many types. I do it
with colorimetry and placemark analysis.
- Extract each element, not with absolute coordinates but with coordinates
relative to the document size (basically, each x/y pair is a percentage of the
document's width/height).
- Voilà. You just extract the pieces and parse each one individually with
Tesseract.
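
Two of the steps above reduce to a little arithmetic, which could be sketched as
follows. This is a minimal illustration, not the code used in the comment: the
function names and the (left, top, right, bottom) box layout are assumptions,
and `atan2` is used in place of a plain atan on the slope so that a vertical
edge does not cause a division by zero.

```python
import math
from typing import Tuple

Point = Tuple[float, float]


def skew_angle_degrees(p1: Point, p2: Point) -> float:
    """Angle, in degrees, of the edge p1 -> p2 relative to the x axis.

    p1 and p2 are assumed to be two detected corners lying on the same
    document edge (for instance the two top corners)."""
    dx = p2[0] - p1[0]
    dy = p2[1] - p1[1]
    return math.degrees(math.atan2(dy, dx))


def relative_to_pixel_box(
    rel_box: Tuple[float, float, float, float], width: int, height: int
) -> Tuple[int, int, int, int]:
    """Convert a box given as fractions of the document size
    (left, top, right, bottom, each in 0..1) into pixel coordinates."""
    left, top, right, bottom = rel_box
    return (
        round(left * width),
        round(top * height),
        round(right * width),
        round(bottom * height),
    )
```

Because the field positions are stored as fractions, the same form template
works for scans of any resolution. The pixel box returned here uses the
(left, upper, right, lower) order that, for example, Pillow's `Image.crop`
expects.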

Hope that helps,
Pierre.

Original comment by [email protected] on 4 Apr 2010 at 11:49


GoogleCodeExporter avatar GoogleCodeExporter commented on September 18, 2024

Original comment by [email protected] on 20 May 2010 at 6:53

  • Changed state: Look-here-for-help


GoogleCodeExporter avatar GoogleCodeExporter commented on September 18, 2024
Reference to this issue was posted in FAQ

Original comment by [email protected] on 2 Jan 2013 at 12:44

  • Changed state: No-longer-an-issue

