Code Monkey home page Code Monkey logo

number_detection_recognition's Introduction

number detection and recognition based on AdvancedEast and CRNN


Detection and Crop:


Recognition:

  • 22046298859.jpg : 22046298859
  • 97785067838.jpg : 97785067838
  • 84999825604.jpg : 84999825604
  • 99851544924.jpg : 99851544924
  • 28510715459.jpg : 28510715459
  • 12233418739.jpg : 12233418739
  • 41679405336.jpg : 41679405336
  • 37774346979.jpg : 37774346979

limitations: When the two models are test on their respective validation sets , they can reach an acc of about 0.9. However, the number of the training data for recognizer I generated is horizontal, and the number in the crop image after the detection result introduces the rotation and other factors, resulting in poor results when used in combination.


Detection

training

  • prepare training data, data format refer to ICPR
  • modify params in cfg.py
  • run python preprocess.py to resize image and generator .npy training files
  • run python label.py
  • run python train.py, train the network

testing

  • modify your images' dir in predict.py, and run python predict.py, then we will get three outputs: bounding box on origin images, the cropped image, and coordinates(txt file).

more details please refer to AdvancedEast


Recognition

training

  • prepare training data, data format refer to MJSynth data
  • modify params in cfg.py
  • modify input_shape=(None, 50,7,512) in train.py line 55, the input_shape is refer to your bn_shape = bn4.get_shape() in network.py
  • run python train.py

testing

  • modify your images' dir in predict.py, then run python predict.py

number_detection_recognition's People

Contributors

bobo-y avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.