Material for improving OCR output
This repository contains various lists of OCR errors and their corrections.
The files are:
- English: English-language words and their corrections.
- Googles: Misreadings of the Google signature at the bottom of Google-scanned books. No correction is given.
- Latin: A small selection of Latin words, mainly derived from statutes.
- Names: Personal names, both forenames and surnames.
- Places: One-word place names, mainly English and Welsh.
Unless stated otherwise, each file is in the public domain and may be used freely, without any encumbrance.
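
As a rough illustration of how such a list might be applied, here is a minimal Python sketch. It assumes each line of a list holds an OCR error and its correction separated by a tab; this README does not specify the file layout, so check the actual files and adjust the parsing. The function names `parse_corrections` and `apply_corrections` are hypothetical and are not part of this repository.

```python
import re


def parse_corrections(lines):
    """Build an error -> correction mapping.

    ASSUMPTION: one "error<TAB>correction" pair per line. Lines without
    a tab (for example, entries that carry no correction) are skipped.
    """
    corrections = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip() or "\t" not in line:
            continue
        error, correction = line.split("\t", 1)
        corrections[error.strip()] = correction.strip()
    return corrections


def apply_corrections(text, corrections):
    """Replace whole words found in the mapping; leave all other text unchanged."""
    def fix(match):
        word = match.group(0)
        return corrections.get(word, word)

    # \w+ matches runs of letters, digits and underscores, so an error
    # that contains punctuation would need a different matching strategy.
    return re.sub(r"\w+", fix, text)


# Hypothetical sample entries, not taken from the repository's files.
sample = ["tbe\tthe\n", "aud\tand\n"]
mapping = parse_corrections(sample)
print(apply_corrections("tbe cat aud tbe dog.", mapping))
```

Matching whole words rather than substrings avoids rewriting letters inside a longer, correctly recognised word.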