dun933 / ocr_cnn_-ctc

This project is forked from mengjiexu/ocr_cnn_-ctc

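Whole-line Chinese text recognition with a fully convolutional neural network (CNN), reaching 90-94% accuracy. A friend later switched the backbone to DenseNet and reached over 99% accuracy.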

Python 1.06% Jupyter Notebook 98.94%

OCR_CNN_-CTC

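Whole-line Chinese text recognition with a fully convolutional neural network (CNN), reaching 90-94% accuracy. This project is a sub-project of my graduation project and is mainly used to recognize text in architectural drawings.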

train

requirements:

python 3.x
tensorflow 1.13.1
opencv
matplotlib

to train the model:

add some .ttf font files to ttf/
use myutils/useTtfToGenImgs.ipynb to generate images; you can use your own word files and .ttf files
run `python3 train.py`; you can edit train.py to change the model architecture
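The image-generation step can be pictured with a small sketch. This is not the code in myutils/useTtfToGenImgs.ipynb; it assumes Pillow is installed (it is not in the requirements list above), and the function name `render_line`, its parameters, and the font path in the comment are hypothetical.

```python
from PIL import Image, ImageDraw, ImageFont


def render_line(text, font, char_size=24, height=32, pad=4):
    """Render one line of text as a grayscale image (black text on white).

    Assumption: every character fits in a square cell of char_size pixels,
    so the canvas width is computed from the character count instead of
    measuring the rendered text.
    """
    width = 2 * pad + len(text) * char_size
    img = Image.new("L", (width, height), color=255)  # white canvas
    draw = ImageDraw.Draw(img)
    draw.text((pad, pad), text, fill=0, font=font)    # black text
    return img


# Hypothetical usage with a font file placed in ttf/:
#   font = ImageFont.truetype("ttf/some_font.ttf", 24)
#   render_line("建筑图纸", font).save("sample.png")
```

Looping this over many fonts and many text lines gives a synthetic dataset; a real generator would likely also add noise, blur, or small distortions.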

Results

The training data consists of 2 million images similar to the following:
Test image:
Test result:

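Training data

The training data was generated from "最新字典3_只含常用字.csv", which contains 1806 characters: commonly used Chinese characters plus characters common in the architecture profession. The fonts are a collection of more than 60 font files, which are included in the project.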
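A character dictionary like this is typically turned into integer labels for CTC training. The sketch below is an assumption about how that could be done, not the project's own code; `build_charset` and `encode` are hypothetical names. It reserves the largest index for the CTC blank, which matches the default convention of `tf.nn.ctc_loss` in TensorFlow 1.x.

```python
def build_charset(chars):
    """Map each distinct character to an integer id.

    Returns (char_to_id, blank_id). The blank id is the largest index,
    one past the last real character.
    """
    unique = sorted(set(chars))
    char_to_id = {ch: i for i, ch in enumerate(unique)}
    blank_id = len(unique)
    return char_to_id, blank_id


def encode(text, char_to_id):
    """Convert a text line into a list of integer labels."""
    return [char_to_id[ch] for ch in text]


# With the 1806-character dictionary described above, the network would
# output 1807 classes per time step (1806 characters + 1 blank).
char_to_id, blank_id = build_charset("建筑图纸建筑")
labels = encode("图纸", char_to_id)
```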

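Code

The training and testing code is all in fcnn_ocr_v2.ipynb. It is written as a Jupyter notebook because that makes it convenient to test and to keep a record of the output. I have been busy lately; after a while I will refactor the code, which will make it much easier to call.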

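Model

The model was trained on a GTX 1080 Ti and uses a CNN+CTC architecture. I tried CRNN+CTC before, but it was maddeningly slow: training needed at least 2 days to converge, so I gave it up. After testing several model structures I chose the current one. It is much, much faster than CRNN and works fairly well on printed text. The drawback of CNN+CTC is just as obvious: because the receptive field is limited, characters that are too wide cannot be recognized. Fortunately, Chinese characters are square.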
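Two ideas in this section can be illustrated with a short sketch, under stated assumptions. The first function is a standard greedy CTC decode (collapse repeats, then drop blanks) applied to per-column class ids. The second computes the receptive field of a stack of convolution or pooling layers, which is what limits how wide a character the network can see. The layer list in the example is hypothetical and is not the architecture used in this project.

```python
def ctc_greedy_decode(frame_ids, blank_id):
    """Collapse repeated ids, then remove blanks (greedy CTC decoding)."""
    decoded = []
    prev = None
    for idx in frame_ids:
        if idx != prev and idx != blank_id:
            decoded.append(idx)
        prev = idx
    return decoded


def receptive_field(layers):
    """Receptive field width, in input pixels, of stacked conv/pool layers.

    layers is a list of (kernel_size, stride) pairs along the width axis.
    """
    field, jump = 1, 1
    for kernel, stride in layers:
        field += (kernel - 1) * jump  # each layer widens the view
        jump *= stride                # stride spreads output positions apart
    return field


# Per-column argmax ids with blank id 4; the blank between the two 0s
# keeps them as separate characters.
ids = ctc_greedy_decode([0, 0, 4, 0, 1, 1, 4, 4, 2], blank_id=4)

# Hypothetical stack: 3x3 conv, 2x2 pool (stride 2), 3x3 conv.
width_seen = receptive_field([(3, 1), (2, 2), (3, 1)])
```

If a character is wider than the receptive field of one output column, no single column sees the whole character, which is consistent with the limitation described above.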

My email: [email protected]. Feel free to get in touch.
