Code Monkey home page Code Monkey logo

apertium-sah's Introduction

Sakha: apertium-sah

This is an Apertium monolingual language package for Sakha. What you can use this language package for:

  • Morphological analysis of Sakha
  • Morphological generation of Sakha
  • Part-of-speech tagging of Sakha

Requirements

You will need the following software installed:

  • lttoolbox (>= 3.3.0)
  • apertium (>= 3.3.0)
  • vislcg3 (>= 0.9.9.10297)
  • hfst (>= 3.8.2)

If this does not make any sense, we recommend you look at: apertium.org

Compiling

Given the requirements being installed, you should be able to just run:

$ ./configure
$ make

You can use ./autogen.sh instead of ./configure if you're compiling from source.

If you're doing development, you don't have to install the data, you can use it directly from this directory.

If you are installing this language package as a prerequisite for an Apertium translation pair, then do (typically as root / with sudo):

$ make install

You can give a --prefix to ./configure to install as a non-root user, but make sure to use the same prefix when installing the translation pair and any other language packages.

Testing

If you are in the source directory after running make, the following commands should work:

  • Morphological analysis:

     $  echo "Бу сахалыы морфологическай ырытыы." | apertium -d . sah-morph
     ^Бу/бу<prn><dem><nom>/бу<mod>$ ^сахалыы/саха<n>+лыы<post>/сахалыы<adv><cop><aor><p3><sg>/сахалыы<adv>$ ^морфологическай/морфологическай<adj><cop><aor><p3><sg>/морфологическай<adj>$ ^ырытыы/ырытыы<n><nom><cop><aor><p3><sg>/ырытыы<n><nom>/ырытыы<n><attr>/ырыт<v><tv><ger><nom><cop><aor><p3><sg>/ырыт<v><tv><ger><nom>$^./.<sent>$
  • Tagging (analysis + disambiguation):

     $  echo "Бу сахалыы морфологическай ырытыы." | apertium -d . sah-tagger
     ^Бу/Бу<prn><dem><nom>$ ^сахалыы/саха<n>+лыы<post>$ ^морфологическай/морфологическай<adj>$ ^ырытыы/ырытыы<n><nom><cop><aor><p3><sg>$^./.<sent>$
  • Morphological generation:

     $  echo "^бу<prn><dem><nom>$ ^саха<n>+лыы<post>$ ^морфологическай<adj>$ ^ырытыы<n><nom>+э<cop><aor><p3><sg>$" | apertium -d . -f none sah-gener
     бу сахалыы морфологическай ырытыы

Files and data

For more information

Citing

When referencing this work in an academic publication, we ask that you cite the following source:

  • Ivanova, Sardana, Jonathan N. Washington, and Francis M. Tyers (2022). A Free/Open-Source Morphological Analyser and Generator for Sakha. Presented at LREC 2022. Poster, Paper.

The transducer also appeared as the following:

Help and support

If you need help using this language pair or data, you can contact:

  • Mailing list: [email protected]
  • IRC: #apertium on irc.oftc.net (irc://irc.oftc.net/#apertium)

See also the file AUTHORS, included in this distribution.

apertium-sah's People

Contributors

ftyers avatar ilnarselimcan avatar jonorthwash avatar mr-martian avatar sushain97 avatar tinodidriksen avatar unhammer avatar varie avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

apertium-sah's Issues

forms that aren't being analysed

@varie, could you provide some information on each of these forms? They're coming up as common unanalysed forms.

  • буолкалах
  • буолааччы
  • буоланнар
  • буоллун
  • буолуохтааҕын
  • баргыа
  • үөскүүр
  • сыллардаахха
  • суруллубут
  • оҥоһуллубут

Most of them look like inflected forms of stems we have in the transducer, but they could also be derived forms. Either way, let's look at each one carefully and think about how to add to the transducer.

verb forms with эбит

Sakha has "complex verb forms" with эбит.

Кэлбит эбиттэр "Они, оказывается, пришли"

Many other Turkic languages have this.

In Turkish, the etymologically identical and semantically similar (dub instead of evid) form is "gelmiş imişler"/"gelmişmişler":

"<gelmişmişler>"
	"gel" v iv past dub
		"i" cop aor p3 pl

In Kazakh, it's equivalent morphologically and semantically to "келген екен":

"<келген екен>"
	"кел" v iv past evid p3 pl

And Kyrgyz

"<келген экен>"
	"кел" v iv past evid p3 pl

There are a number of strategies for dealing with this sort of construction, and they were discussed to some extent on the apertium-turkic mailing list in 2014. A Turkic-wide solution should be found. In the meantime, I'm partial to doing it like Kazakh and Kyrgyz, and fixing everything all at once when a different strategy is decided on.

Missing verb form - aspect: Semelfactive (SEM)

көр + ө +түс, суруйа + -а +оҕус; оҥор + -а(-ыы)+тарт; куттан + -ан+ыл, тыс + гын, малай + с гын, ходьой + х гын; дьүккүй + к гын

some unanalysed passive forms

Which of the following are passive forms? / What is the analysis of these forms, and the lemma?

  • билиниллэр - билин ‘confess’ (?)
    • билиниллибит - (past)
    • билиниллэллэр - бил
  • сиэниллэр - сиэ
    • past сиэниллибит, plural сиэниллибиттэр
  • ыраастаныллар - ырастаа
  • өйдөнүллэр - өйдөө
  • өрдөнүллэр -
  • ирдэниллэр - ирдээ
  • хааччахтаныллар -хаччахтаа
  • тарҕаныллыбыт - тарҕаа
  • бааныллар - баай

number in 3rd person possessive forms of gerunds

We originally had just a <px3sp> in the lexicon, but it looks like there are distinct singular and plural forms, e.g.
барбытыгар/бар<v><iv><ger_past><px3sg><dat>
барбыттарыгар/бар<v><iv><ger_past><px3pl><dat>

However, some recent tests have forms like
барыахтарыгар/бар<v><iv><ger_fut><px3sp><dat>

Is барыаҕыгар a singular form to this plural form? Are there any possessives that don't have a distinction? Alternatively, are singular and plural readings ever ambiguous?

analysis of forms in {B}{I}т{I}н

There are a number of what appear to be verb forms in {B}{I}т{I}н in the corpus. What do these mean / how are these used?

     58 ^буолбутун/*буолбутун$
     32 ^оҥорбутун/*оҥорбутун$
     21 ^кэлбитин/*кэлбитин$
     16 ^киирбитин/*киирбитин$
     15 ^тахсыбытын/*тахсыбытын$
     15  ^суруллубутунан/*суруллубутунан$
     15 ^барбытын/*барбытын$
     14 ^үөскээбитин/*үөскээбитин$
     14  ^сылдьыбытын/*сылдьыбытын$
     11  ^ылбытын/*ылбытын$
     11 ^хаалбытын/*хаалбытын$
     11 ^суруйбутун/*суруйбутун$
      9 ^суруйбутунан/*суруйбутунан$
      9 ^олорбутун/*олорбутун$
      9 ^биэрбитин/*биэрбитин$
      8 ^эстибитин/*эстибитин$
      8  ^элбээбитин/*элбээбитин$
      8 ^тутуллубутун/*тутуллубутун$
      8 ^төрөөбүтүн/*төрөөбүтүн$
      8 ^талбытынан/*талбытынан$

Note the ones with an extra {A}н.

unanalysed forms

occurrences (news, NT, WP) form correct analysis reason for absence in transducer fixed?
3775, 60 эбитэ э<cop><past><p3><sg>, э<cop><ger_past><px3sg><nom> not in lexc fce02d2
3462, 75, 332 илдьэ <adv> (for now) not in lexc fce02d2
2878 ардыларынааҕы ардыларынааҕы<adj> not in lexc fce02d2
2760 уолаттар уол<n><pl><nom> irregular 37493d6, 32367e7 (analysis only!)
2690, 55, 543 этилэр э<cop><ifi><p3><pl> not in lexc fce02d2
2235 көрсөн
2144 иги no such form?
2079 сиринээҕи ардыларынааҕы<adj> not in lexc fce02d2
1981 министрэ phonology 90dd096
1943, , 287 оҥоһуллубут #11
1872 буолуой
1806 Оччолорго
1714 бүгүҥҥү morphology 90dd096
1646 управлениетын phonology? efe537a
1562, , 178 оҥоһуллар #11
1528 чемпиона phonology?
1500 спорка phonology 4a8fb6b
1467 диэни
1460 көрсө
1376 оччолорго
1350 иһитиннэрдэ
1349 начальнига phonology? efe537a
1310 хайдаҕый
1309 инникитин
1292, , 190 күнүнээҕи
1270 кимиэхэ
1266 буоланнар
1225 мунньаҕар
1184 суумалаах
1180 сорохтор
1179, , 277 ардыларыгар
1161 ААУо
1142 бюджет missing stem 90dd096
1087 коммунальнай missing stem 027788a
1083 Бүгүҥҥү morphology 90dd096
1059 Быһата
1046, , 165 спордун phonology efe537a, 4a8fb6b
1030 министрин phonology 90dd096
1015 ОДьКХ
1010 уолаттара irregular 37493d6, 32367e7 (analysis only!)
1006 киґи
и.н.д.
и.н.к.
о. д. а.
, 92 илгэлээх
, 75 пророк
, 60 аньыылаах
, 55 киэр
, 49 кимиэхэ
, , 403 сыллаахтан
, , 275 мэдиссиинэ
, , 245 эрэйиллэр
, , 240 диэни
, , 229 буоланнар
, , 214 сыллахха
, , 205 рейтинигэр
, , 195 диэҥҥэ

Confusion between Pluperfect and Resultative past tense Perfective + missing forms (tense)

  1. According to Gavril Torotoev's table forms like "барбыт этим" are called Pluperfect (PLPF), such form is missing in the transducer.

  2. According to Gavril Torotoev's table forms like "барбыппын" and "баран турабын" are called Resultative past tense Perfective 1st and 2nd respectively. Apertium-sah analyses form "барбыппын" as past and ger_past ⋅ px1sg ⋅ acc, "баран турабын" form is missing and those two words are analysed as separate tokens.

Conjugations of NPs

Conjugated NPs are not recognised by the transducer:
Иван is analysed,
but Ивана is no not.

analysis of forms in {I}лл{I}б{I}т

There are a lot of what appear to be {I}лл{I}б{I}т forms in the corpus:

    799  ^Ылыллыбыт/*Ылыллыбыт$
    158 ^оҥоһуллубут/*оҥоһуллубут$
    134 ^ыҥырыллыбыт/*ыҥырыллыбыт$
    133 ^суруллубут/*суруллубут$
    112 ^тэриллибит/*тэриллибит$
     91  ^ыытыллыбыт/*ыытыллыбыт$
     75  ^Туһаныллыбыт/*Туһаныллыбыт$
     75 ^иҥэриллибит/*иҥэриллибит$
     67 ^көмүллүбүт/*көмүллүбүт$
     48 ^туттарыллыбыт/*туттарыллыбыт$
     47 ^иитиллибит/*иитиллибит$
     44 ^тутуллубут/*тутуллубут$
     40 ^ылыллыбыт/*ылыллыбыт$
     38 ^бэриллибит/*бэриллибит$
     24 ^туруоруллубут/*туруоруллубут$
     24 ^бигэргэтиллибит/*бигэргэтиллибит$
     22 ^айыллыбыт/*айыллыбыт$
     20 ^ылыныллыбыт/*ылыныллыбыт$

There are also a bunch that have an extra {A} at the end:

    179  ^тэриллибитэ/*тэриллибитэ$
     73 ^иҥэриллибитэ/*иҥэриллибитэ$
     58  ^биллибитэ/*биллибитэ$
     55  ^ыытыллыбыта/*ыытыллыбыта$
     49 ^тутуллубута/*тутуллубута$
     37 ^оҥоһуллубута/*оҥоһуллубута$
     35  ^аһыллыбыта/*аһыллыбыта$
     29 ^арыллыбыта/*арыллыбыта$
     24 ^киллэриллибитэ/*киллэриллибитэ$
     21 ^уһуллубута/*уһуллубута$
     21 ^суруллубута/*суруллубута$
     21 ^бэриллибитэ/*бэриллибитэ$

What do these forms mean? They look almost like passive followed by the past {B}{I}т, but they seem to have an extra vowel. Is this all they are? And the ones with {A} are possessives of the past gerund of the passive?

Generator errors identified through the shared task

This is a list of generator errors that @varie identified while working on the error analysis for the shared task. These forms were produced by the version of the generator from ~March 2021.

Incorrect inflections:

Generated form Correct form Fixed?
относительностьбор:относительность<n><px1sg><dat> относительностьпар 4aa441f
уһулар:уһул<v><tv><aor><p3><sg> устар 35a13d8
вулканмыт:вулкан<n><px1pl><nom> вулкаммыт 02cdb41
вулканмытын:вулкан<n><px1pl><acc> вулкаммытын 02cdb41
вулканмытыныын:вулкан<n><px1pl><com> вулканмытыныын 02cdb41
вулканмытыттан:вулкан<n><px1pl><abl> вулкаммытыттан 02cdb41
гепатитиҥ:гепатит<n><px2sg><nom> гепатитыҥ efe537a
иҥммиккит:иҥин<v><iv><past><p2><pl> иҥнибиккит a9b0be8
иҥммитиҥ:иҥин<v><iv><past><p2><sg> иҥнибитиҥ a9b0be8
иҥммитэ:иҥин<v><iv><past><p3><sg> иҥнибитэ a9b0be8
киоскунан:киоск<n><ins> киосканан
коранмын:коран<n><px1sg><acc> кораммын 02cdb41
коранмыныын:коран<n><px1sg><com> кораммыныын 02cdb41
коранмытына:коран<n><px1pl><par> кораммытына 02cdb41
коранмытынааҕар:коран<n><px1pl><comp> кораммытынааҕар
корреспонденинэн:корреспондент<n><ins> корреспонденынан
корреспонденмыт:корреспонден<n><px1pl><nom> корреспондеммыт 02cdb41
корреспонденмытына:корреспонден<n><px1pl><par> корреспондеммытына 02cdb41
корреспонденмытыттан:корреспонден<n><px1pl><abl> корреспондеммытыттан 02cdb41
латынмытын:латын<n><px1pl><acc> латыммытын 02cdb41
латынмытыныын:латын<n><px1pl><com> латыммытыныын 02cdb41
латынмытыттан:латын<n><px1pl><abl> латыммытыттан 02cdb41
оонньоохтоох:оонньоо<v><tv><nec><p3><sg> оонньуохтаах
оонньоохтоохпут:оонньоо<v><tv><nec><p1><pl> оонньуохтаахпыт
оонньоохтоохтор:оонньоо<v><tv><nec><p3><pl> оонньуохтаахтар
ороскуоткына:ороскуот<n><px2sg><par> ороскуоккына
ороскуоткыныын:ороскуот<n><px2sg><com> ороскуоккынын
ороскуоткытын:ороскуот<n><px2pl><acc> ороскуоккытын
ороскуоткытынан:ороскуот<n><px2pl><ins> ороскуоккытынан
ороскуотпар:ороскуот<n><px1sg><dat> ороскуоппар a016378
ороскуотпыт:ороскуот<n><px1pl><nom> ороскуотум a016378
ороскуотпытыгар:ороскуот<n><px1pl><dat> ороскуоппытыгар
ороскуотпытыттан:ороскуот<n><px1pl><abl> ороскуоппытыттан
парольто:пароль<n><par> парольда
пиньиньнэн:пиньинь<n><ins> пиньининэн
румынмытына:румын<n><px1pl><par> румыммытына 02cdb41
румынмытыттан:румын<n><px1pl><abl> румыммытыттан 02cdb41
спортооҕор:спорт<n><comp> спортааҕар 4a8fb6b
уһунма:уһун<v><iv><neg><imp><p2><sg> устума 5d459da
уһунматыбыт:уһун<v><iv><neg><ifi><p1><pl> устубатыбыт 5d459da
уһунматыҥ:уһун<v><iv><neg><ifi><p2><sg> устубатыҥ 5d459da
шалфейинэн:шалфей<n><ins> шалфейынан efe537a
эргилиниэҕэ:эргилин<v><iv><fut><p3><sg> эргиллиэҕэ 124643d
тонмытыныын:тон<n><px1pl><com> @varie suggests томмутунуун, but her consultant suggests туоммутунуун
автомобильэ:автомобиль<n><px3sg><nom> @varie suggests автомобилэ but has doubts

Other suggestions from @varie's consultants

автомобиль<n> -- suggested to change the lemma to аптамабыыл
гол<n> -- suggested to change the lemma to гуол
наследиенан:наследие<n><ins> -- suggested наследиянан

verb forms: figure out which forms are possible in which roles

Which of these forms can be used in which ways? Complete the chart:

suffix form example forms finite? subst/ger? advl/gna? attr/gpr? with aux / prc?
-{А}р барар, саныыр <aor> yes no <gpr_aor> (with copula)
-{B}{А}т барбат, санаабат <neg><aor> yes no <neg><gpr_aor> (with copula)
-{B}{I}т барбыт, санаабыт yes yes no yes (with copula)
-{I}{A}х барыах, буолуох <fut> <dat>, <acc>, <abl>, <ins>, <comp>, remnant <gen> with <post> no <gpr_fut> (with copula)
-{I}{A} барыа, буолуо <nom> only no no (with copula)
-{I}{A}н барыан, буолуон <nom> only no no no
-{D}{A}х бардах, бүтэрдэх yes <dat>, <abl>, <acc>, remnant <gen> no no no
-{A}{A}чч{I} барааччы, үүнээччи yes yes (except <px><par> ?) no no no
-{A} илик бара илик, үөрэнэ илик <unac> yes no <gpr_unac> no
-{A}н баран, киирэн no yes no yes
-{A} бара, ыы no yes no yes
-{I}м{I}н{A} барымына, ылымына yes no yes no no
-{B}{A}кк{A} барбакка, тэһийбэккэ yes? no yes no no
-{а}{а}р{I} бараары, диэри no yes no no

-дээх

I'm finding a lot of forms with -дээх, mostly after №, like this:

27 №-дээх оскуолаҕа

What is this form?

Error: Invalid dictionary (hint: the left side of an entry is empty)

Dear Fran,
After (almost) completing the Uzbek tagging, I started to work on two other Turkic languages: Tatar and Yakut. A rather strange error message, however, appears:
www-data@unesco1:/tmp$ apertium -d /usr/local/share/apertium sah-tagger sah_ar1600y_b2_0.h2~ >sah_ar1600y_b2_0.i2~
Error: Invalid dictionary (hint: the left side of an entry is empty)
I firstly suspected a bug in the lexicon file, but in both lexicons at the same time? Any suggestions?
Have a healthy New Year,
Vlado B, 10:30

Add some more moods

from https://www.jehsmith.com/1/2021/03/verbal-mood-in-sakha-a-comprehensive-analysis.html

"Affirmative mood"

Foorms like барыыһы+copula endings, 3 negative forms: барымыыһы+copula endings, барыа суох+copula endings, and барыа+~endings суох.

"Debitive mood"

  • Aр>DAAx (should be realised at moment of speaking; must be realised after moment of speech)
  • IAх>TAAх (must be completed in the future)
  • Aр>DAAx этэ / IAх>TAAх этэ (should have been completed in the past but was not)

"Possible mood"

  • ((I)м)>AAйA / ((I)м)>AAрAй
  • should have similar tags to Kyrgyz бараармын/барбаасмын

<px3sg> and <px3pl> syncretism

Opening an issue per @ftyers' request.

I see a lot of syncretic forms in noun paradigms:

тириэрдиилэринээҕэр:тириэрдии<n><pl><px3pl><comp>
тириэрдиилэринээҕэр:тириэрдии<n><px3pl><comp>
тириэрдиилэринээҕэр:тириэрдии<n><pl><px3sg><comp>

Are <px3sg> and <px3pl> always expressed with the same morpheme? If yes, why are they annotated separately and not with <px3sp> like in Tuvan?

verbs like сырыт/сылдьар

  • ыарый/ыалдьар

  • оҕус/охсор

  • араҕыс/арахсар

  • сайын/сайдар

  • умун/умнар

  • үтүгүн/үтүктэр

  • эбилин/эбиллэр

  • бэрилин/бэриллэр

  • иҥэрилин/иҥэриллэр

  • иһилин/иһиллэр

  • талылын/талыллар

  • туһанылын/туһаныллар

  • төнүн/төннөр

  • кытын/кыттар

  • орт/ордор

  • тарт/тардар

  • тириэрт/тириэрдэр

  • сыһын/сыстар

<pii> forms

What do the <pii> forms correspond to? The generator gives me

барабыт:бар<v><iv><pii><p1><pl>
бараҕыт:бар<v><iv><pii><p2><pl>
бараллара:бар<v><iv><pii><p3><pl>
барарым:бар<v><iv><pii><p1><sg>
барарыҥ:бар<v><iv><pii><p2><sg>
барара:бар<v><iv><pii><p3><sg>

Most forms match the Ubriatova book paradigm for imperfect past (прошедшее незаконченное время, "unfinished past", pp. 228, 315):

Screen Shot 2021-03-01 at 8 21 25 PM

But the generator has барабыт:бар<v><iv><pii><p1><pl> instead of барарбыт and бараҕыт:бар<v><iv><pii><p2><pl> instead of бараргыт. Not sure if this is an error or it is actually a different inflection.

forms of -TAх

I'm trying to understand the range of things -TAх does. As we discussed on IRC, the information on pp. 236 to 237 of Убрятова suggests there are two main "moods" that use this suffix: второе условное and предположительное. There are also verbal adjective and verbal noun uses.

So my questions are:

  1. Are второе условное and предположительное moods both finite?
  2. Are there some good examples of each?
  3. Could we see a full paradigm of each?

буолуон

What's буолуон mean? Is this a form of буол? If so, how is the suffix {I}{A}н used?

Installed modes are missing files

modes.xml includes some modes with install="yes", but the required
files aren't installed.

Some generic suggestions:

  • -lexc and -twol modes probably aren't useful to users

  • -spell modes should depend on --enable-ospell

  • .deps files are never installed, so any modes using them shouldn't be
    installed.

  • Messages for package app-dicts/apertium-sah-9999:

  • Failed to find '/usr/share/apertium/apertium-sah/.deps/sah.twol.hfst' in install image.

  • QA: missing files required for mode sah-twol.

  • Failed to find '/usr/share/apertium/apertium-sah/.deps/sah.LR.lexc.hfst' in install image.

  • QA: missing files required for mode sah-lexc.

-AAт forms

There are verb forms in -{A}{A}т that are identified by Johanson (2021) as a type of converb. Examples: биэрээт, түһээт, ылаат.

(An English gloss of these forms appears to be "as soon as X Y-ed" or "upon having Y-ed"? In Kyrgyz these semantics are expressed with a VN+possessive ending followed by a postposition, e.g. берээрим менен - "as soon as I gave". So no help to the following questions.)

I guess we need a verbal adverb label (gna_???) for these. Aside from the question of what the label should be, I'm wondering if these might behave as infinitives too (that is, occur with auxiliaries, prc_???). @varie?

do BIт gerunds have bare <nom> forms?

Currently forms like барбыт are analysed as <ger_past><nom>, but it's not clear that they can be used as bare verbal nouns. Forms like барбыта (<ger_past><px3sp><nom>) are fine.

how to deal with илик

Meaning, distribution

The form "илик" only occurs in verb forms and adjusts the meaning to something like "not yet":

Оҥоро иликпин "Я ещё не сделал"
Оҥоро иликкин "Ты ещё не сделал"
Оҥоро иликпит "Мы ещё не сделали"

Aside from its use in finite verb complexes, it can also be attributive (verbal adjective):

бара илик киһи "A person who hasn't gone yet"

Similar things in other languages

Kyrgyz is the only language where we've implemented something similar:

"<бара элекмин>"
	"бар" v iv notyet p1 sg
"<бара элек>"
	"бар" v iv gpr_notyet
"<киши>"
	"киши" n nom

There's also a gerund reading, in constructions like:

"<бара электе>"
	"бар" v iv ger_notyet loc

and

"<бара элегимде>"
	"бар" v iv ger_notyet px1sg loc

бара?

The form бара appears in the transducer as a noun; investigate where this came from and see if it needs to be removed or left.

Multiple <neg><imp><p2><pl> forms produced by generator

I'm using .deps/sah.RL.hfst to generate inflected verbs. It generates several <neg><imp><p2><pl> forms that differ only by one vowel. Is this behavior intended?

булкуспааҥ:булкус<v><iv><neg><imp><p2><pl>
булкуспаеҥ:булкус<v><iv><neg><imp><p2><pl>
булкуспаоҥ:булкус<v><iv><neg><imp><p2><pl>
булкуспаөҥ:булкус<v><iv><neg><imp><p2><pl>
булкуспаэҥ:булкус<v><iv><neg><imp><p2><pl>
булкуспаяҥ:булкус<v><iv><neg><imp><p2><pl>

generator error for `prc_impf` after `ы`-final verb(s)

$ echo "^сылдьы<v><iv><prc_impf>$" | apertium -d . -f none sah-gener
сылдьыа/сылдьые/сылдьыи/сылдьыо/сылдьыу/сылдьыы/сылдьыэ/сылдьыя/сылдьыү/сылдьыө

rename numbered finite forms

  • <ger1> (-IAн) should be <ger_cond> or similar
    • көрүөн, кэлиэн
    • compare to көрүө (<ger_pot>) and көрүөх (<ger_fut>)
  • Figure out <ger2> (#14) - maybe <ger>? ("act of ...")
    • барааһын, көрөөһүн, кэлээһин

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.