Code Monkey home page Code Monkey logo

qieti's Introduction

QieTi

基于图像处理,文字识别,自然语言处理的试卷图像题目切分系统

浙江省服务外包大赛获奖

环境准备

当前系统及库版本

anaconda: Anaconda navigator 1.9.2

opencv: Opencv3 3.1.0

AipOcr: Newest

re: Common

python:Python 3.7.0

环境安装

1.下载anaconda:https://www.anaconda.com 1.1anaconda安装spyder等基本开发环境

2.anaconda命令行创建虚拟环境(python建议3.7): conda create -n your_env_name python=X.X

3.激活虚拟环境 source activate your_env_name

4.安装opencv(opencv-python的版本号可能不同,不同版本号可能与一些系统不兼容) pip install opencv-python==3.4.2.16 -i https://pypi.mirrors.ustc.edu.cn/simple/ 或者 anaconda里面用包管理器安装

项目版本更新:

main3.29-00.00 :接通了百度api文本识别接口,实现了基本的自然语言处理(对基本题号和题目情感倾向的处理)

main3.31-18.46 :优化了自然语言处理过程(增加了对题目选项的语意识别),增加了新的工具函数

main4.2-00.00 :优化了自然语言处理效果(增加了对注意事项等无关内容的排除),实现了基本的切题功能

main4.14-00:00 :解决了自然语言处理的bug,实现了基本题型的高成功率识别,解决了图片切割范围判定逻辑的问题

main4.14-23:00 :实现了题号识别不出‘.’时,依然可以根据题号顺序规律将其划分出来。

main4.26-21:40 :实现了基本的GUI图形界面,构建了基本的批量识别图片的功能。

main4.28-13:25 :修改了题号识别不准确情况下题目的判定模块,修复了图形化界面删除图片后顺序错乱的bug

main4.28-16:00 :修复了图形化界面删除图片的bug(最后一张的情况),修改了一些交互

未来研究:

考虑使用随机森林法总结题干/题目内容的正则化匹配规则,或使用神经网络训练正则化匹配规则(用神经网络数据集去哪找啊) 考虑使用CNN系列算法对图像进行处理和预切分,以提高识别准确性 考虑多科目多套算法以应对不同科目试卷的特性 other

qieti's People

Contributors

popopoi avatar blue-sky-sea avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.