Code Monkey home page Code Monkey logo

saram's Introduction

Saram - Image/PDF OCR detection system

Get OCR in txt form from an image or pdf extension supporting multiple files from directory using pytesseract with support for rotation in case of wrong orientation along.

Currently in beta state

Follow: Demo run

Saram features

Note: Make sure you have a OCR tool like tesseract and certain data value for comparing OCR, eg tesseract-data-eng along with Pillow and Wand for image conversion and loading which will be fetched during pip install.

For using in python: Refer to the py-module branch

Installation

Install using PIP:

$ pip install saram
$ saram <dirname>

else

Clone the source locally:

$ git clone https://github.com/aryaminus/saram
$ cd saram
$ git checkout py-module
$ python main.py <dirname>

Todo

  • Add support for PDF by PDF -> Image -> Txt with converted image deletion after processing
  • Double check for orientation in case of image and PDF
  • Make a PIP package
  • Add NLP to process the most repeated frequent characters to filer content
  • Add Cloud Vision support for effective character recognization
  • Suppot for GUI using tkinter

Reference

  1. pdf-to-txt
  2. ocr-convert-image-to-text
  3. fix-image-rotation
  4. python-packaging

Contributing

  1. Fork it (https://github.com/aryaminus/saram/fork)
  2. Create your feature branch (git checkout -b feature/fooBar)
  3. Commit your changes (git commit -am 'Add some fooBar')
  4. Push to the branch (git push origin feature/fooBar)
  5. Create a new Pull Request

Enjoy!

saram's People

Contributors

aryaminus avatar mirmire avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

saram's Issues

Does it runs on WIndows?

I tried it using on Windows 10 and the installation was unsuccessful. In the end it throws an error:
image

Any tips to make it run on Windows ?

Use argparse instead of sys.argv

Using argparse would have at least the following benefits:

  • auto generated help and usage message
  • much better handling of positional and optional args (future-proof?)
  • elimination of the exception raising class

Request: write extracted text to .jpg or .pdf

Hey,

i'm not sure if you accept requests, but i guess it doesn't harm to ask :)

I was wondering if it is possible to rewrite image filename with extracted text data from pytesseract to .jpg

So for example i have an image with the filename 'abc.jpg' and text on the image 'this is an apple' , OCR the image and then the image and filename becomes 'this is an apple.jpg

Really hope you can implement this on saram :)

Thank you for reading my request

Getting this error while executing the code.

OCR tool: <module 'pyocr.tesseract' from '/home/btp/anaconda2/lib/python2.7/site-packages/pyocr/tesseract.pyc'>
OCR selected language: ENG (available: osd, equ, eng)
Traceback (most recent call last):
  File "main.py", line 183, in <module>
    s.main(path) # Def main to path
  File "main.py", line 129, in main
    degrees = self.get_rotation_info(image_file_name)
  File "main.py", line 90, in get_rotation_info
    stdoutdata = subprocess.getoutput('tesseract' + arguments % filename)
AttributeError: 'module' object has no attribute 'getoutput'

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.