Comments (2)
@lusamino Do we really need a Page table? I'm wondering, could we not refer to a page using the following uid: [source_uid] + "#" + page_nbr? No strong feelings about this; either works.
from pdf2data.
@c-jordi Do you mean keeping the information about pages somewhere else? We need to store some information about each individual page somewhere, such as: the set it belongs to (train or test), whether it is labelled, notes written down during annotation, the date of annotation, and the mean entropy of the predicted labels (for choosing the most suitable pages to label). There are perhaps more entries in the table, but I cannot remember them right now. For these reasons, I believe it makes sense to have such a table.
Regarding the uid, the one you mention sounds great! [source_uid] + "#" + page_nbr
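To make the two ideas concrete, here is a minimal sketch of how a page record and the proposed uid could fit together. It assumes Python, and every name in it (`Page`, `make_page_uid`, `split_page_uid`, and the field names) is hypothetical; it is not taken from the pdf2data code, only from the fields listed above.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional, Tuple


def make_page_uid(source_uid: str, page_nbr: int) -> str:
    """Build the page uid as [source_uid] + "#" + page_nbr."""
    return f"{source_uid}#{page_nbr}"


def split_page_uid(page_uid: str) -> Tuple[str, int]:
    """Recover (source_uid, page_nbr) from a page uid.

    Splits on the last "#", so a source_uid that itself contains "#"
    is still handled.
    """
    source_uid, _, page_nbr = page_uid.rpartition("#")
    return source_uid, int(page_nbr)


@dataclass
class Page:
    """One row of a hypothetical Page table (field names are assumptions)."""
    source_uid: str
    page_nbr: int
    subset: str = "train"                    # "train" or "test"
    is_labelled: bool = False
    notes: str = ""                          # notes written during annotation
    annotation_date: Optional[date] = None
    mean_entropy: Optional[float] = None     # mean entropy of predicted labels

    @property
    def uid(self) -> str:
        # The uid is derived from the other fields, not stored separately.
        return make_page_uid(self.source_uid, self.page_nbr)


page = Page(source_uid="doc42", page_nbr=3, subset="test")
print(page.uid)
```

With this layout the uid never needs its own column, since it can always be rebuilt from `source_uid` and `page_nbr`, while the per-page annotation metadata still has a place to live.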