Opticat Character Recognition

About

NOT COMPLETED PROJECT for OCR

Description

In this repo there is an attempt to create an OCR model for the first time

Project Structure

├── Interface
│   ├── google_app
│   ├── interface
├── data
├── notebooks
├── src
│   ├── dataset
│   ├── text encoding
│   ├── text processing
│   ├── training
│   └── inference
└── utils

The repository is organized as follows:

Interface/: This directory is the django project and contains google_app as the django app.
data/: This directory contains the dataset used for training and evaluation. It includes both the raw data and preprocessed versions, if applicable.
notebooks/: This directory contains Jupyter notebooks that provide step-by-step explanations of the data exploration, preprocessing, model training, and evaluation processes. Explore if you are feeling adventurous
src/: This directory contains the source code for the project, including data preprocessing scripts, model training scripts, and evaluation scripts.
utils/: This directory contains utility functions and helper scripts used throughout the project.

Getting Started

It is recommended to set up a virtual environment for this project using python 3.8.16

To get started with the project, follow these steps:

Clone the repository:

git clone https://github.com/Aylore/blnk-OCR.git

Change directory into the repository:
```
cd blnk-OCR
```
Install the required dependencies:
```
make install
```

Pipeline

Text Processing

I tired various methods for text encoding , none of them seemed to get good results so I just setteled for simple encoding
Image Processing

The images were resized to 50 * 200 I extracted the text only in new images with all black background but seemed to confuse the model for some reason
Training

A CRNN (CNN + RNN ) model for used for seuquence generation
Django Integration

After training I used django to create and easy to use not too fancy GUI.

Running The Project

 make run

Accuracy

Contributing

If you would like to contribute to this project, Feel Free to make a pull request

Future Work

Extract characters one by one using CV and start classifying them by a pretrained model
Use attention based model

aylore / blnk-ocr Goto Github PK

blnk-ocr's Introduction

Opticat Character Recognition

About

Description

Project Structure

Getting Started

Pipeline

Text Processing

Image Processing

Training

Django Integration

Running The Project

Accuracy

Contributing

Future Work

blnk-ocr's People

Contributors

Watchers

Recommend Projects

Recommend Topics

Recommend Org