Comments (3)
The error indicates that:
- the file was damaged
- libqpdf tried to recover the damaged file, but gave up/exhausted its recovery tools
In my experience this usually happens when a file is truncated. Sometimes you can do manual forensic recovery and extract some content, but it all depends how the original was structured.
You should get an exception and that's expected behavior. If you got a crash, meaning the Python interpreter aborted with a segfault or some other error, I'd like to look at the files.
from pikepdf.
Yeh, this is why I closed the issue ...... the documents were extremely damaged..... However, i used a try\catch to skip those docs.
The attachments contain 4 samples.
documents.zip
from pikepdf.
All 4 of these files appear to be truncated. At a glance the first few pages of text/images might be recoverable from the first two, but that's definitely in the realm of forensic data recovery, not what we're trying to do here.
Thanks for your submission.
from pikepdf.
Related Issues (20)
- First tutorial example is unclear: new_pdf?
- The password param in method βopenβ HOT 1
- PackagesNotFoundError in Conda HOT 2
- image and its caption HOT 1
- Deterministic PDF generation HOT 1
- How to access OCR-data? HOT 1
- Import/Export annotations to FDF/JSON format HOT 1
- cannot install
- Accessibility Settings Issue HOT 1
- [ISSUE] Missing XMP Metadata HOT 1
- Traverse PDF layers and classes HOT 1
- Instalation errors
- How to replace the value of an Object mapped to a variable HOT 1
- pikepdf, libjpeg, l HOT 1
- 8.11.0: pytest requires `conftest` HOT 5
- Add 2 overlaid images to page with transparency mask HOT 2
- Add type checking for setting XMP metadata HOT 1
- RDF/XMP generated by pikepdf is incorrect?
- Question on file compression. HOT 1
- Re-enable 32-bit wheels on Windows HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pikepdf.