The bookreader from aeschylus

aeschylus / bookreader Goto Github PK

View Code? Open in Web Editor NEW

The Internet Archive BookReader

License: GNU Affero General Public License v3.0

CSS 11.05% JavaScript 85.26% HTML 3.69%

bookreader's People

Contributors

Watchers

bookreader's Issues

Make hyperlinks clickable, opening the corresponding Wayback Machine archived resource.

Thanks @mekarpeles for getting the ball rolling on this. I've copied over relevant portions of your spec here.

Context
There are tens of millions of semantic entities in books stored on the internet archive, many of them with corresponding web resources. This spike aims to enable interaction with URL entities that have a wayback machine resource available.

Proposal & Constraints
The proposed solution (version 1) is to extend the Internet Archive BookReader with a new "Semantic" plugin which, on page-load:

Pulls a page of region-labeled, OCR’d + text using the https://api.archivelab.org/books/{identifier}/pages/{page}/ocr?mode=words API

Hits a new entities endpoint which identifies urls (and later other semantic entities) which returns a list of:

type: e.g. url

location: (x, y, w, h)

value: e.g. https://archive.org

Highlights the corresponding region on the book containing the link and makes the region clickable to a Save Page Now version of the link

I.e., once clicked, capture the webpage if we don’t already have it, or, in either case, bring the patron to a viewable version of this url

Recommend Projects

aeschylus / bookreader Goto Github PK

bookreader's People

Contributors

Watchers

Forkers

bookreader's Issues

Make hyperlinks clickable, opening the corresponding Wayback Machine archived resource.

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent