apertium-ara's Introduction

Apertium

Requirements

This package needs the package lttoolbox-3.5 installed in the system, as well as libxml and libpcre.

See https://apertium.org and https://wiki.apertium.org for more information on installing.

Description

When building, this package generates, among others, the following modules:

apertium-deshtml, apertium-desrtf, apertium-destxt Deformatters for html, rtf and txt document formats.
apertium-rehtml, apertium-rertf, apertium-retxt Reformatters for html, rtf and txt document formats.
apertium Translator program. Execute without parameters to see the usage.

Quick Start

There are binaries available for Debian, Ubuntu, Fedora, CentOS, OpenSUSE, Windows, and macOS. We package both nightly builds and releases. See https://wiki.apertium.org/wiki/Installation for more information. Only build from source if you either want to change this tool's behavior, or are on a platform we don't yet package for.

Download the packages for lttoolbox-VERSION.tar.gz and apertium-VERSION.tar.gz and linguistic data

Note: If you are using the translator from GitHub, run ./autogen.sh before running ./configure in all cases.
Unpack lttoolbox and do ('#' means 'do that with root privileges'):

   $ cd lttoolbox-VERSION
   $ ./configure
   $ make
   # make install

Unpack apertium and do:

   $ cd apertium-VERSION
   $ ./configure
   $ make
   # make install

Unpack linguistic data (LING_DATA_DIR) and do:

   $ cd LING_DATA_DIR
   $ ./configure
   $ make
   and wait for a while (minutes).

Use the translator

   USAGE: apertium [-d datadir] [-f format] [-u] <direction> [in [out]]
    -d datadir       directory of linguistic data
    -f format        one of: txt (default), html, rtf, odt, docx, wxml, xlsx, pptx,
                     xpresstag, html-noent, latex, latex-raw
    -a               display ambiguity
    -u               don't display marks '*' for unknown words
    -n               don't insert period before possible sentence-ends
    -m memory.tmx    use a translation memory to recycle translations
    -o direction     translation direction using the translation memory,
                     by default 'direction' is used instead
    -l               lists the available translation directions and exits
    direction        typically, LANG1-LANG2, but see modes.xml in language data
    in               input file (stdin by default)
    out              output file (stdout by default)


   Sample:

   $ apertium -f txt es-ca <input >output

apertium-ara's People

Contributors

Stargazers

Watchers

apertium-ara's Issues

Issues with the README file

I am facing a couple of issues with using the repository.
My main aim was to use the POS tagger.

Firstly, I needed to use the following commands to build the bin files:

aclocal
autoconf
automake --add-missing 
./configure

Most of the repos have an autogen.sh script, however this repo doesn't.

Additionally, when I am using this line to test the tagger, I am getting errors:

$ echo "السلام" | apertium -d . ara-tagger

Error: Mode ara-tagger does not exist. Try one of:                                                                        
  ara-disam-debug                                                                                                         
  ara-disam                                                                                                               
  ara-gener                                                                                                               
  ara-morph

I tried using the morphological analyser and it's working:

$ echo "السلام" | lt-proc ara.automorf.bin 

^السلام/ال<det><def>+سلام<n><m><nn><sg><nom>/ال<det><def>+سلام<n><m><nn><sg><gen>/ال<det><def>+سلام<n><m><nn><sg><nom><cstr>/ال<det><def>+سلام<n><m><nn><sg><nom><artic>/ال<det><def>+سلام<n><m><nn><sg><gen><cstr>/ال<det><def>+سلام<n><m><nn><sg><gen><artic>/ال<det><def>+سلام<n><m><nn><sg><acc><cstr>/ال<det><def>+سلام<n><m><nn><sg><acc><artic>$

Recommend Projects