Northern Sami and German: apertium-sme-deu

This is an Apertium language pair for translating between Northern Sami and German. You can use this language package for:

  • Translating between Northern Sami and German
  • Morphological analysis of Northern Sami and German
  • Part-of-speech tagging of Northern Sami and German

For information on the latter two points, see the section "For more information" below.

Requirements

You will need the following software installed:

  • autoconf
  • automake
  • pkg-config
  • lttoolbox (>= 3.5.1)
  • apertium (>= 3.6.1)
  • vislcg3 (>= 1.3.1)
  • hfst (>= 3.15.1)
  • giella-sme
  • apertium-deu
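
Before compiling, it can help to confirm that the tools are on your PATH. The following is a minimal sketch, not part of this package: `check_tools` is a hypothetical helper, and `lt-comp`, `cg-comp` and `hfst-lookup` are assumed to be representative executables of lttoolbox, vislcg3 and hfst respectively. It does not check version numbers, and it does not cover giella-sme or apertium-deu, which are language data rather than commands.

```shell
#!/bin/sh
# Report which of the given commands are not found on PATH.
# Prints one line per command; the return status is the number missing.
check_tools() {
    missing=0
    for tool in "$@"; do
        if command -v "$tool" >/dev/null 2>&1; then
            echo "found:   $tool"
        else
            echo "missing: $tool"
            missing=$((missing + 1))
        fi
    done
    return "$missing"
}

# Assumption: one representative executable per required package.
check_tools autoconf automake pkg-config lt-comp apertium cg-comp hfst-lookup \
    || echo "Install the missing tools before running ./configure."
```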

If this does not make any sense, we recommend you look at: apertium.org.

Compiling

With the requirements installed, you should be able to run:

$ autoreconf -fvi
$ ./configure
$ make
# make install

When compiling from a source checkout, you can run ./autogen.sh in place of autoreconf and ./configure. If you installed any prerequisite language packages using a --prefix with ./configure, make sure to pass the same --prefix to ./configure here.
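
As an illustration of the --prefix advice, here is a sketch of a build into a non-default location. The helper name `build_with_prefix` and the prefix `$HOME/local` are hypothetical; substitute whatever prefix you used for the prerequisite language packages.

```shell
#!/bin/sh
# Configure, build and install into the given prefix.
# Stops at the first step that fails.
build_with_prefix() {
    prefix=$1
    autoreconf -fvi &&
    ./configure --prefix="$prefix" &&
    make &&
    make install
}

# Example call (hypothetical prefix; must match the one used for
# giella-sme and apertium-deu if those were installed with --prefix):
# build_with_prefix "$HOME/local"
```

Installing under a prefix you own, such as a directory in your home, means `make install` does not need root privileges.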

If any of this doesn't make sense or doesn't work, see https://wiki.apertium.org/wiki/Install_language_data_by_compiling

Testing

If you are in the source directory after running make, the following commands should work:

$ echo "dállu" | apertium -d . sme-deu
der Haus

$ echo "Haus" | apertium -d . deu-sme
dállu

After installing to a location on your $PATH, you should be able to run, for example:

$ echo "dállu" | apertium sme-deu
der Haus
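
The checks above can also be scripted. The sketch below assumes you are in the source directory after running make; `check_translation` is a hypothetical helper, not something shipped with this package, and the expected strings in the example calls are simply the outputs shown above.

```shell
#!/bin/sh
# Translate one input with the given mode and compare against an
# expected string. Returns 0 on a match and 1 otherwise.
check_translation() {
    mode=$1
    input=$2
    expected=$3
    actual=$(printf '%s\n' "$input" | apertium -d . "$mode")
    if [ "$actual" = "$expected" ]; then
        echo "ok:   $mode: $input -> $actual"
    else
        echo "FAIL: $mode: $input -> $actual (expected: $expected)"
        return 1
    fi
}

# Example calls, using the inputs and outputs from this section:
# check_translation sme-deu "dállu" "der Haus"
# check_translation deu-sme "Haus" "dállu"
```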

Files and data


For more information

Help and support

If you need help using this language pair or data, you can contact:

See also the file AUTHORS, included in this distribution.

Citing

If you use this in an academic context, please cite it as:

Flammie Pirinen and Linda Wiechetek. 2022. Building an Extremely Low Resource Language to High Resource Language Machine Translation System from Scratch. In Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022), pages 150–155, Potsdam, Germany. KONVENS 2022 Organizers.

In LaTeX you can use the following BibTeX entry:

@inproceedings{pirinen-wiechetek-2022-building,
    title = "Building an Extremely Low Resource Language to High Resource Language Machine Translation System from Scratch",
    author = "Pirinen, Flammie  and
      Wiechetek, Linda",
    booktitle = "Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022)",
    month = "12--15 " # sep,
    year = "2022",
    address = "Potsdam, Germany",
    publisher = "KONVENS 2022 Organizers",
    url = "https://aclanthology.org/2022.konvens-1.18",
    pages = "150--155",
}