Assyrian Dictionaries

Collection of Assyrian dictionaries in digital format.

Oraham's Dictionary of the Stabilized and Enriched Assyrian Language and English
by Alexander Joseph Oraham
Bio: http://www.atour.com/people/20010702c.html
Published: 1943
Development: 25 years
Words: 21,000
Description: Text search available for English (using Adobe PDF reader). Syriac characters are not searchable at this time.

A Dictionary of the Dialects of Vernacular Syriac
by Arthur John Maclean
Bio: https://en.wikipedia.org/wiki/Arthur_Maclean
Published: 1901
Development: ? years
Words: ?
Description: Text search available for English (using Adobe PDF reader). Syriac characters are not searchable at this time.
Changelog (2018/09/11): Added English text recognition for searchability. Created content bookmarks.

Colloquial Syriac As Spoken In The Assyrian Levies
by Lieut. R. Hart, MBE
Bio: ?
Published: 1926
Development: ? years
Words: ~805+
Description: Text search available for English (using Adobe PDF reader). Syriac characters are not searchable at this time. The word count above covers only the English base words in the vocabulary section (50 pages). Words in the other sections have not been counted; a rough estimate is 350-450 in total.

Plans

The ultimate goal of this repository is to provide a digital dataset of Assyrian words/definitions in a programmatically consumable format, such as JSON/CSV.

All the fields that one would expect from a typical dictionary should be included, along with some additional data when available.

  • Syriac spelling (Eastern and Western)
  • Pronunciation in translated language
  • Type of word
  • Definition in translated language
  • Definition in Assyrian itself (both Eastern and Western)
  • Example usage (multiple sentences)
  • Verb tenses
  • Source (the dictionary from which the data has been adapted)
  • Possible place of origin
  • Possible language of origin (Akkadian, Aramaic, borrowed from a known language, etc.)
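
As a rough illustration of what one record could look like, the sketch below models the fields listed above as a Python dataclass and serializes a list of records to JSON. Every field name (`syriac_eastern`, `part_of_speech`, and so on) is an assumption made for this example, not a schema defined by this repository, and the sample values are placeholders rather than real dictionary data.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import Dict, List, Optional


@dataclass
class Entry:
    """One dictionary entry. All field names are hypothetical."""
    syriac_eastern: str
    syriac_western: Optional[str] = None
    pronunciation: Optional[str] = None
    part_of_speech: Optional[str] = None
    definition: Optional[str] = None
    definition_assyrian_eastern: Optional[str] = None
    definition_assyrian_western: Optional[str] = None
    examples: List[str] = field(default_factory=list)
    verb_tenses: Dict[str, str] = field(default_factory=dict)
    source: Optional[str] = None
    place_of_origin: Optional[str] = None
    language_of_origin: Optional[str] = None


def to_json(entries: List[Entry]) -> str:
    """Serialize entries to a JSON array."""
    # ensure_ascii=False keeps Syriac characters readable instead of \uXXXX escapes.
    return json.dumps([asdict(e) for e in entries], ensure_ascii=False, indent=2)


if __name__ == "__main__":
    # Placeholder values only; no real dictionary content is shown here.
    sample = Entry(
        syriac_eastern="<Eastern spelling>",
        part_of_speech="noun",
        definition="<English definition>",
        source="<source dictionary>",
    )
    print(to_json([sample]))
```

Fields that a source does not provide simply stay `None` (or empty), so entries from different dictionaries could share one structure.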

The first step is to collect dictionaries (in digital format) that are available for free.

Approach

Use technology to extract data from digital formats and adapt it into JSON/CSV datasets. Manual entry of data should not begin until available sources have been completely parsed.
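
Once text has been extracted from a source (for example, by OCR), the adaptation step might look something like the sketch below. It assumes, purely for illustration, that each extracted line has the form `headword<TAB>part of speech<TAB>definition`. Real dictionary layouts will differ and will need their own parsing rules. The function names and column names are hypothetical.

```python
import csv
import io

# Hypothetical column names for the output dataset.
COLUMNS = ["headword", "part_of_speech", "definition", "source"]


def parse_lines(lines, source):
    """Turn tab-separated lines into row dicts.

    Returns (rows, skipped), where skipped holds the 1-based numbers of
    lines that did not match the assumed three-column layout.
    """
    rows = []
    skipped = []
    for number, line in enumerate(lines, start=1):
        parts = [p.strip() for p in line.rstrip("\n").split("\t")]
        if len(parts) != 3 or not parts[0]:
            skipped.append(number)
            continue
        headword, part_of_speech, definition = parts
        rows.append({
            "headword": headword,
            "part_of_speech": part_of_speech,
            "definition": definition,
            "source": source,
        })
    return rows, skipped


def to_csv(rows):
    """Render row dicts as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Recording the skipped line numbers, rather than silently dropping those lines, leaves a list of spots to revisit once automated parsing of a source is finished.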

Outcome

Once the datasets are available, mobile and web developers will be able to build Assyrian dictionary, thesaurus, and other language-based apps. Many developers are evidently interested in building an Assyrian dictionary app, but the data does not appear to be available in a programmatically consumable form.

At the time of writing, a couple of mobile app developers are trying to produce Assyrian dictionary apps; however, their apps depend on an internet connection because they must rely on the available web service APIs (most of which depend on the same underlying web service).

Standards

This repository should not include dictionaries that are not free. No potential harm should be caused to any author or publisher. The aim is not only to respect copyrights and avoid affecting their income, but also to avoid discouraging authors from producing dictionaries.

Available dictionary web services should not be scraped to collect data. Scraping would harm not only the authors and publishers of the web-based dictionary but also the language itself, since it could discourage further development of the affected dictionaries.

Notes

The most well-developed web-based Assyrian dictionary has been under continuous development for over a decade and contains almost 40,000 words. With that in mind, this dataset is not going to be developed overnight.
