Comments (6)
Could you provide more informations, are there any logs of the error stack trace ?
from pdfalto.
The alto schema version didn't change, version 3.1 is used since the first pdfalto release : https://github.com/kermitt2/pdfalto/blob/master/schema/alto.xsd
from pdfalto.
Earlier the schemain the alto xml was:
xmlns="http://www.loc.gov/standards/alto/ns-v3#",
but now I get:
xmlns="http://www.loc.gov/standards/alto/v3/alto.xsd"
from pdfalto.
this was updated because the first link is wrong, it's not pointing to the schema.
from pdfalto.
@Aazhar Schema-location and Namespace URL don't have to be identical.
xmlns should be http://www.loc.gov/standards/alto/ns-v3# (see targetNamespace="http://www.loc.gov/standards/alto/ns-v3#" in http://www.loc.gov/standards/alto/v3/alto.xsd)
For schema location, you can use something like
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/standards/alto/v3/alto.xsd"
from pdfalto.
Added xsi:schemaLocation
with d49bf77
<alto xmlns="http://www.loc.gov/standards/alto/ns-v3#" xsi:schemaLocation="http://www.loc.gov/standards/alto/v3/alto.xsd">
from pdfalto.
Related Issues (20)
- export `ROTATION` attribute for TextBlock. HOT 1
- XML to PDF HOT 1
- Is there an option to output ALTO XML to STDOUT? HOT 3
- heap-buffer-overflow found?
- empty image / svg
- compile error on RHEL 8.6 (Ootpa): /usr/bin/ld: cannot find -lstdc++ HOT 1
- Error case with invalid characters mapping
- Segmentation fault with pdf with comments
- Soft hyphens omitted HOT 3
- PDF to XML conversion time out for some files in server mode but run the pdfalto_server cmd in shell is fast and returns ok. HOT 1
- xpdf version 4.04
- ARM binaries for the Apple M1 HOT 3
- Cannot run pdfalto HOT 5
- PDF cause a crash with annotation option
- Building on arm64 Ubuntu Server 22.04 fails HOT 1
- Building for Apple Silicon failed due to missing directories (with manual fix) HOT 1
- Wrong characters / difference between extraction and display HOT 1
- [Suggestion] Reporting the byte location of images HOT 2
- Compilation error on arch linux HOT 1
- Error case, missing digits HOT 10
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pdfalto.