Comments (5)
---------_--------------
I am getting result like this
---------_--------------
Hey Helene
So whyls Telegram cool7
Looky unlike other messaging
systems Telegram uses
decentralized server infrastructire
all over the world to connect
people via servers close to them
ltls based on a customebuild
protocol which delivers our
message in minimum bytes
possible and is reliable even on
the weakest connections
Helenem
Who were you talking to7
Sorryi Telegram is cool because
it has kittens and dogs on the
background
---------_--------------
But I need to get it as
---------_--------------
Hey Helene
So whyls Telegram cool7
Looky unlike other messaging systems Telegram uses decentralized server infrastructire all over the world to connect people via servers close to them ltls based on a customebuild protocol which delivers our message in minimum bytes possible and is reliable even on the weakest connections.
Helenem
Who were you talking to7
Sorryi Telegram is cool because it has kittens and dogs on the background
from tesseract-ocr-ios.
Pls, refer to the Tesseract API. This issue is not related to the iOS port.
As an option for you, you are able to find a text regions in the image corresponding to one single buble. Cut that region from the whole image and send the only region to the tesseract. After getting the recognized text, just remove all the endlines since all that text is from only one rext bubble.
from tesseract-ocr-ios.
To add on to what @ws233 suggested, assuming your images are this consistent, you could use the white and green colors used for the backgrounds of the bubbles to isolate where each paragraph occurs in the image.
This example project might give you some ideas about how to use OpenCV to detect color regions in an image: https://github.com/lukagabric/iOS-OpenCV
The GPUImage library should allow you to do this as well: https://github.com/BradLarson/GPUImage
Here's a color tracking example project that uses GPUImage: https://github.com/BradLarson/GPUImage/tree/master/examples/iOS/ColorObjectTracking
from tesseract-ocr-ios.
This one also could be closed, I think, since there is a function
- (NSArray *)confidencesByIteratorLevel:(G8PageIteratorLevel)pageIteratorLevel;
which returns the blocks of text for the G8PageIteratorLevelParagraph.
from tesseract-ocr-ios.
I agree. @Swaroop1231, feel free to reopen this issue or start a new one if you have any more questions.
from tesseract-ocr-ios.
Related Issues (20)
- .
- User-Words file not loaded: Error: failed to load user-words HOT 5
- doesn't work with 64bpp images
- pod failed HOT 1
- Does not detects images 100%
- What's means with *.cube.* ? e.g. eng.cube.fold
- Why does the testsdata folder exist in the project? HOT 2
- Why don't we upgrade to the latest Tesseract?
- cocoapods 5.0.1 error
- Xcode 12.0.1 build is failing when trying to run it on the simulator. HOT 2
- RecognitionQuestion
- Apple Silicon (arm64; M1) support HOT 2
- Thread 1: EXC_BAD_ACCESS (code=1, address=0x163940000)
- Framework not found for Mac OS HOT 1
- 你好 这句一定闪退
- PDF creation not working
- self.tesseract!.recognize() is crashing in Acuant MRZ
- How to update Tesseract version
- Target 'TesseractOCRiOS' (project 'Pods') has copy command HOT 1
- cannot load language 'eng' HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tesseract-ocr-ios.