Code Monkey home page Code Monkey logo

apertium-zab's Introduction

Dizhsa / Western Tlacolula Valley Zapotec: apertium-zab

This is an Apertium monolingual language package for Dizhsa, or Western Tlacolula Valley Zapotec, with a focus on the variety of San Lucas Quiaviní.

What you can use this language package for:

  • Morphological analysis of Western Tlacolula Valley Zapotec
  • Morphological generation of Western Tlacolula Valley Zapotec
  • Part-of-speech tagging of Western Tlacolula Valley Zapotec

Estaremos traduciendo esta página al Dizhsa y al español.

Responsibilities

Guelaguetza clause: While licensed under a free/open-source license that permits commercial uses, it is expected that anything created using this resource be made available to the community of San Lucas Quiaviní free of charge. This is consistent with the community's practice of guelaguetza, a complex system of reciprocity and exchange of goods and labor.

Requirements

To compile these resources, you will need the following software installed:

  • lttoolbox (>= 3.3.0)
  • apertium (>= 3.3.0)
  • vislcg3 (>= 0.9.9.10297)
  • hfst (>= 3.8.2)
  • lexd (>= 1.0.0)

If this does not make any sense, we recommend you look at: apertium.org. Especially have a look at https://wiki.apertium.org/wiki/Installation.

Compiling

Given the requirements being installed, you should be able to just run:

$ ./configure
$ make

You can use ./autogen.sh instead of ./configure if you're compiling from source.

If you're doing development, you don't have to install the data, you can use it directly from this directory.

If you are installing this language package as a prerequisite for an Apertium translation pair, then do (typically as root / with sudo):

$ make install

You can use a --prefix with ./configure to install as a non-root user, but make sure to use the same prefix when installing the translation pair and any other language packages.

Testing

If you are in the source directory after running make, the following commands should work for analysis:

$ echo "Yseidylayu naa Dizhsa." | apertium -d . zab-morph
^Yseidylayu/seidy<v><tv><pol><irre>+yu<prn><pers><p2><frm><sg>/seidy<v><dtv><pol><irre>+yu<prn><pers><p2><frm><sg>$ ^naa/naa<prn><pers><p1><sg>$ ^Dizhsa/Dizhsa<n>$^../..<sent>$

$ echo "Yseidylayu naa Dizhsa." | apertium -d . zab-tagger
^Yseidylayu/Seidy<v><dtv><pol><irre>+yu<prn><pers><p2><frm><sg>$ ^naa/naa<prn><pers><p1><sg>$ ^Dizhsa/Dizhsa<n>$^../..<sent>$

There are also orthography-specific generation modes:

$ echo "^becw<n><px>+u<prn><pers><p2><infm><sg>$" | apertium -d . -f none zab-gener
xyecwu

$ echo "^becw<n><px>+u<prn><pers><p2><infm><sg>$" | apertium -d . -f none zab_Phon-gener
x꞉yèeʼcwùuʼ

Furthermore, there are orthography conversion modes:

$ echo "Yseidylayu naa Dizhsa." | apertium -d . zab_Simp-zab_Phon
Ysèiʼdylàaʼyuuʼ nàaʼ Dìiʼzhsah.

$ echo "Ysèiʼdylàaʼyuuʼ nàaʼ Dìiʼzhsah." | apertium -d . zab_Phon-zab_Simp
Yseidylayu naa Dizhsa.

Files and data

For more information

Citing

When referencing this work in an academic publication, we ask that you cite the following source:

  • Washington, Jonathan N., Felipe H. Lopez, and Brook Danielle Lillehaugen (2021). “Towards a morphological transducer and orthography converter for Western Tlacolula Valley Zapotec”. In: Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP), pp. 185–193. URL: https://www.aclweb.org/anthology/2021.americasnlp-1.21.

Help and support

If you need help using this language pair or data, you can contact:

  • Mailing list: [email protected]
  • IRC: #apertium on irc.oftc.net (irc://irc.oftc.net/#apertium)

See also the file AUTHORS, included in this distribution.

apertium-zab's People

Contributors

jonorthwash avatar mr-martian avatar tinodidriksen avatar unhammer avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.