Code Monkey home page Code Monkey logo

Comments (3)

hiroi-sora avatar hiroi-sora commented on May 18, 2024

update:v1.3.2 版的多国语言扩展包已加入纯英文库,可避免此问题。有大量英文ocr需求的用户下载扩展包切换到英文即可。

(楼下指的是v1.3.1的多国语言包,当时还未加入英文。)


我测试了一下,空格识别率低似乎是简体中文模型库的专属毛病,换用繁中/日文/韩文/法文模型库均无此问题。

image

from umi-ocr.

lioArther avatar lioArther commented on May 18, 2024

我有下载你整理的多语言库。也是同样的结果。
image
必须在这里更改模型库噢

from umi-ocr.

hiroi-sora avatar hiroi-sora commented on May 18, 2024

空格丢失现象跟图像关系很大。

我测试了很多英文文章截图,对常规样式的文字,并不会频繁出现空格丢失。换用非简中模型库(如繁中)后会进一步减少该现象,几乎可以忽略。
如果画质模糊 / 字体空格间距较小 / 除文本外含有多余的图形 / 多种不同大小、粗细的文本混排(如你贴的例图),确实会增加空格丢失或者识别不准的概率。

目前我不认为这是一个严重的问题,PPOCR引擎在大部分场景中表现良好。如果你还有更多例图,证实在常见场景中,使用非简中库也会频繁出现空格丢失,请贴出来,我再调查一下。

如果确实要改,我还想到一些解决方案,比如使用wordninja等英文分词库进行后处理,补充缺失的空格。以后的版本可能会考虑安排上。

from umi-ocr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.