tboenig Goto Github PK
Name: Matthias Boenig
Type: User
Company: Berlin-Brandenburgischen Akademie der Wissenschaften (BBAW)
Location: Berlin
Name: Matthias Boenig
Type: User
Company: Berlin-Brandenburgischen Akademie der Wissenschaften (BBAW)
Location: Berlin
A Google book downloader with proxy support.
A repository containing scripts and output for Google OCR of sample files.
Ground truth files from Biodiversity Heritage Library (BHL)
Ground truth files from Polish Digital Libraries (PSNC)
Ground truth files from Spanish Digital Libraries (BNE/BVC)
Ground truth alignement tool
OCR-D guidelines for Ground Truth production
Metadata tool for Ground Truth datasets
OCR-D-Level-Rules can be created automatically with gt-MufiLevelRules from the encodings published by MUFI: The Medieval Unicode Font Initiative.
XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).
TEST GT
This repo provides a collection of ground truth data. The collection was compiled under different aspects (complexity of the layouts and use of the fonts). The individual data are also characterized by metadata. The metadata is based on the labeling scheme of OCR-D/PrimaLab.
Convert between Tesseract hOCR and ALTO XML 2.0/2.1 using XSL stylesheets
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
Ground Truth Resources for the HTR of patrimonial documents
A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS
Offer of different keyboards for transcription software (Aletheia, Transkribus, LAREX, QURATOR-neat, eScriptorium)
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
With makeAletheia_mets you can create a METS file (collection file) for the Aletheia Ground Truth software. It is an alternative way without the Aletheia software.
An implementation of the IIIF Presentation API v2 based on XSLT
OCR Ground Truth Resources
Presentations, tutorials and data for the OCR workshop at LMU
Data for layout analysis and HTR.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.