Comments (4)
Ps. A couple more things: the PDF looks like it’s all images, it sounds like images of words, which are always the most ‘expensive’. Also, you can always try running it directly against ocrmypdf yourself to see
from paperless-ngx.
I’m not sure there’s much for us to do here, as noted in the template (link is there too), and you can see in the logs the OCR process is handled by ocrmypdf. In general I have no idea how receptive they are to this kind of report (ie they might just say “OCR is resource-intensive”) but could still be worth opening an issue there, certainly seems unexpected. That said, I think you would need to supply a PDF.
Feel free to link to the issue back here if you do, in general we don’t use our issues to track upstream problems so we’ll close this.
If another team member notices something I didn’t here they can always re-open.
thanks
from paperless-ngx.
Thanks for the advice, that's probably what it is.
from paperless-ngx.
This issue has been automatically locked since there has not been any recent activity after it was closed. Please open a new discussion or issue for related concerns.
from paperless-ngx.
Related Issues (20)
- [BUG] App Config does not enforce permissions HOT 1
- [BUG] Crop gets not respected correctly when creating previews HOT 2
- [BUG] Documents no longer open HOT 25
- [BUG] Demo is offline HOT 1
- [BUG] Preview fails with Factur-X / EN 16931 HOT 5
- [BUG] consume fails since 2.4.0
- [BUG] Filtering using "Created" or "Added" specifying a date range with after/before not working HOT 5
- [BUG] Possible Crash on Database Migration when updating from 2.4.2 to 2.4.3 HOT 9
- [BUG] No matching manifest for linux/arm/v7, when installing on Rasperry Pi 4 HOT 4
- [BUG/ISSUE] Import procedure fails with "The manifest file refers to...which does not appear to be in the source directory", similar to #4485 HOT 7
- [BUG] Unknown tesseract error, returns non-zero HOT 2
- [BUG] [Errno 2] No such file or directory: '/tmp/paperless/tmp2qif925m' HOT 3
- [BUG] Multi-language documents produce unstable language metadata HOT 3
- [BUG] Paperless-ngx is freezing in incognito browser and memory overflow on PC
- [BUG] Files are renamed on disk but not in database HOT 3
- [BUG] Task Workers/Consuming bugged HOT 2
- [BUG] Adding a custom field with the same name of an existing field leads to internal server error HOT 6
- [BUG] Error loop after entering invalid formated value for a custom field HOT 6
- [BUG] External Auth for API is enabled by default even in 2.4.3 HOT 7
- [BUG] Version check running into Github rate limit
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from paperless-ngx.