Code Monkey home page Code Monkey logo

Comments (1)

jbarlow83 avatar jbarlow83 commented on June 12, 2024

PDF is a very complex format. In this case, the PDFs you have are being used to store data custom application specific data. There is a sort of standard way to do layers. I don't know what "classes" would be in PDF internal data structures - there are many ways one could express that.

If you're familiar with HTML, it's sort of like looking at a block of text and wondering how it got its particular formatting - maybe there are some CSS rules that select it, maybe Javascript dynamically modified the rules, maybe there's inline CSS, maybe the text is rendered with SVG or Canvas. Maybe it's an iframe. Without getting into the details of how a specific HTML application works, you can't answer that question. And a different application that looks identical to the user may have an entirely different technical implementation.

You could use a tool like iText RUPS to inspect the structure of the PDF, and the PDF reference manual, and see if you can find where the data you want to access is located in the PDF. Then pikepdf gives you an efficient way to retrieve that information.

from pikepdf.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.