(FST) pan-Ashaninka language morphological analyzer / Analizador morfológico para el idioma pan-Asháninka
TeX Shell Java Makefile C++ Perl Python

Files

FreeLingModules
ServerMode
freeling_lexicon
reduplication
spellchecking
useful_files
.gitignore
AUTHORS
LICENSE
Makefile
README.md
adbl.py
advroot.prq.foma
aroot.prq.foma
ashaninkamorph.sh
asheninka.guesser.script
asheninka.script
bash-ini-parser
bibliography.bib
extractEntries.pl
ideo.adbl.foma
ideo.prq.foma
ideoredupsyl.prq.foma
interjection.prq.foma
interrogative.prq.foma
lookup.script
n-p-1.foma
n-s-abs.foma
n-s-adv.foma
n-s-aug.foma
n-s-cl.foma
n-s-cntr.foma
n-s-dem.foma
n-s-dim.foma
n-s-dub.foma
n-s-emph.foma
n-s-excl.foma
n-s-expect.foma
n-s-loc.foma
n-s-pl.foma
n-s-poss.foma
n-s-question.foma
n-s-superl.foma
n-s-temp.foma
n-s-tmp.foma
n-vroot.prq.group1.foma
neg.prq.foma
noun.cmp.prq.foma
noun.suffix.prq.script
noun.transitions.prq.script
nroot.en.foma
nroot.es.foma
nroot.es.freeling.00.foma
nroot.es.freeling.01.foma
nroot.es.freeling.02.foma
nroot.es.freeling.03.foma
nroot.prq.foma
nroot.pt.foma
nroot.qu.foma
numeral.prq.foma
oroot.prq.foma
orthography.foma
place.names.foma
prndem.prq.foma
prnpers.prq.foma
prnposs.prq.foma
punctuation.foma
tokenize.pl
typo.matrix
v-nroot.prq.group1.foma
v-p-1.foma
v-p-2.foma
v-s-adj.foma
v-s-ant1.foma
v-s-applben.foma
v-s-applcaussoc.foma
v-s-applgen.foma
v-s-applint.foma
v-s-applpres.foma
v-s-applreas.foma
v-s-applref.foma
v-s-applsep.foma
v-s-appr.foma
v-s-aug.foma
v-s-certainty.foma
v-s-cmpl.foma
v-s-cntf.foma
v-s-cond.foma
v-s-degree.foma
v-s-dext.foma
v-s-dir.foma
v-s-dist.foma
v-s-dlmt.foma
v-s-dub.foma
v-s-dur.foma
v-s-early.foma
v-s-emph.foma
v-s-excl.foma
v-s-frus.foma
v-s-hab.foma
v-s-icpl.foma
v-s-impp.foma
v-s-inch.foma
v-s-mir.foma
v-s-ncfoc.foma
v-s-neg.foma
v-s-nmz.foma
v-s-ofoc.foma
v-s-opt.foma
v-s-pl.foma
v-s-possind.foma
v-s-prf.foma
v-s-prog.foma
v-s-question.foma
v-s-rcp.foma
v-s-realitystatussystem.foma
v-s-regr.foma
v-s-rel.foma
v-s-rev.foma
v-s-semel.foma
v-s-sensory.foma
v-s-sim.foma
v-s-stat.foma
v-s-sub.foma
v-s-urgent.foma
verb.classifier.prq.foma
verb.composition.len.prq.foma
verb.composition.prq.foma
verb.suffix.prq.script
verb.transitions.exp1.script
verb.transitions.prq.script
vroot.es.foma
vroot.prq.foma
vroot.redup.foma

README.md

Ashaninka-Morph

This is a morphological analyzer for pan-Ashaninka, written using Xerox finite-state technology. It has a multilingual lexicon (Ashaninka, English, Spanish and Portuguese); a smaller number of entries in Quechua and Italian are also provided.

Pan-Ashaninka is the general term for a cluster of Arawak dialects spoken in Peru and in Acre, Brazil. The cluster comprises Ashéninka Pichis (Pi), Alto Perené (Pe), Ashéninka Pajonal (Paj), Ucayali-Yurua Ashéninka (U-Y), Ajyininka (or Ashéninka) Apucurayali (Apu), and Ashaninka (Asha).

Pan-Ashaninka is a polysynthetic and head-marking language spoken in the central adjoining Amazonian regions between Peru and Brazil (Acre State). It is spoken by approximately 70,000 people (2002).

Polysynthetic

Pan-Ashaninka is polysynthetic because a single word can often combine several stems to express a very specific meaning, through noun incorporation and verbal classifiers.

Noun incorporation: tsapya 'river.bank'

apaani asheninka isaikatsapyaatziro inkaare
=apaani *** =asheninka *** i- =saik -a =tsapya -atz -i =ro *** =inkaare 
=one *** =man *** 3m.A- =to.live -EP =river.bank -PROG -IRR =3n.m.O *** =lake 
EN: 'a man who lived near a lake'; Lit.: 'one man who lived in the lake bank'
ES: 'un hombre que vivía cerca de una cocha' 
Verbal classifier: ha 'liquid'

katsinkajari / katsinkahari
=katsinka -ha -ri 
=to.be.cold -cl:liquid -rel 
EN: 'cold.water; lit.: liquid.that.is.cold'
ES: 'agua.fría; lit.: líquido.que.está.frío' 
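
The two glossed examples above use the same layout: one line of segmented morphemes, one line of glosses with the same number of whitespace-separated tokens, and `***` between words. As a minimal sketch (the function name `pair_gloss` is ours and is not part of this repository), the two lines can be aligned morpheme by morpheme like this:

```python
def pair_gloss(segmented, gloss, word_sep="***"):
    """Align a segmented line with its gloss line, grouping pairs by word.

    Assumes both lines have the same number of whitespace-separated
    tokens and that words are separated by `word_sep` on both lines.
    """
    morphs = segmented.split()
    glosses = gloss.split()
    if len(morphs) != len(glosses):
        raise ValueError("segmentation and gloss lines differ in length")

    words, current = [], []
    for morph, meaning in zip(morphs, glosses):
        if morph == word_sep:
            # A separator closes the current word and starts a new one.
            words.append(current)
            current = []
        else:
            current.append((morph, meaning))
    words.append(current)
    return words


classifier_example = pair_gloss("=katsinka -ha -ri",
                                "=to.be.cold -cl:liquid -rel")
for morph, meaning in classifier_example[0]:
    print(f"{morph:12} {meaning}")
```

Applied to the noun-incorporation example, the same function returns four words, the third of which contains the incorporated noun `=tsapya` paired with `=river.bank`.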

Head-marking

Ashaninka has extensive agreement, or cross-referencing. Heads such as verbs and nouns agree with properties of their arguments. For instance, gender markers on the verb indicate whether the subject and the object are masculine (+m.) or not masculine (+n.m.).

Verbal reduplication

Verbal reduplication indicates urgency (1), intensity (2), or repetition (3).

1) ma 'to.do' -> ma~ma 'to.do.quickly' 

Our analysis of the collected text corpus shows that Ashaninka has both partial reduplication, or 'bounded copy' (2), and total reduplication, or 'unbounded copy' (1, 3), as productive morphological operations.

2) kov 'to.want' -> ko~kov 'to.prefer.strongly' 

3) koniha 'to.appear' -> koniha~koniha 'to.appear.again-and-again'

There are particular cases where verbal roots with prefixes are reduplicated in both partial and total modes.
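
Examples (1) to (3) can be reproduced with a small Python sketch. This is only an illustration, not how the transducer in this repository implements reduplication. It assumes that the partial copy consists of the root's initial consonants plus its first vowel (which matches `kov` -> `ko~kov`), and that the vowel set is `aeiou`; both are assumptions made for the example, and the function names are ours.

```python
import re

VOWELS = "aeiou"  # assumption for this sketch, not taken from the grammar


def total_reduplication(root):
    """Unbounded copy: repeat the whole root, e.g. ma -> ma~ma."""
    return f"{root}~{root}"


def partial_reduplication(root):
    """Bounded copy: prefix the initial consonants plus the first vowel.

    Assumes the copied material is the first (C)V sequence of the root,
    e.g. kov -> ko~kov.
    """
    match = re.match(rf"[^{VOWELS}]*[{VOWELS}]", root)
    if match is None:
        raise ValueError(f"no vowel found in root {root!r}")
    return f"{match.group(0)}~{root}"


print(total_reduplication("ma"))
print(partial_reduplication("kov"))
print(total_reduplication("koniha"))
```

The sketch covers bare roots only; the prefixed roots mentioned above would need the prefix to be handled separately.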

Normalization process

To make the analyzer more robust, every lexicon entry and every affix is written in a normalized version of the alphabet developed by Elena Mihas. In addition, normalization rules have been implemented: every letter of an input word is mapped to its equivalent in the normalized alphabet before the word is analyzed.
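
The idea of letter-by-letter normalization can be sketched in Python as follows. The rule table here is hypothetical: its single entry, `j` -> `h`, is only inferred from the spelling pair katsinkajari / katsinkahari shown earlier in this README, and lowercasing the input is also an assumption. The actual rules live in the repository's foma sources and may differ.

```python
# Hypothetical rule table for illustration only; the real normalization
# rules are defined in the repository's finite-state sources.
HYPOTHETICAL_RULES = {
    "j": "h",  # inferred from katsinkajari / katsinkahari
}


def normalize(word, rules=HYPOTHETICAL_RULES):
    """Map each letter of `word` to its normalized equivalent.

    Letters without a rule are kept unchanged. Lowercasing is an
    assumption made for this sketch.
    """
    return "".join(rules.get(letter, letter) for letter in word.lower())


print(normalize("katsinkajari"))
print(normalize("katsinkahari"))
```

Both spellings map to the same normalized string, so a single lexicon entry can serve either input.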

Compiling with XFST or FOMA

# XFST 
$ xfst -f asheninka.script 
$ echo "ashaninka" | lookup asheninka.bin -flags cnKv29TT
0%>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>100%

  *****  LEXICON LOOK-UP  *****

ashaninka	[a-][NPers][1PL.poss+][--][=shani][VRoot][=to.be.of.the.same.group][--][-nka][NS][+NMZ.QLTY]nka
ashaninka	[a-][NPers][1PL.poss+][--][=shani+m.][NRoot][=anteater (ES: oso.hormiguero; sci.nm.: myrmecophaga.tridactyla)][--][-nka][NS][+NMZ.QLTY][=abstract.qlty.noun]
ashaninka	[=ashaninka][NRoot][=indigenous.person.that.lives.in.the.in.the.central.adjoining.Amazonian.regions.between.Peru-and-Brazil]


LOOKUP STATISTICS (success with different strategies):
strategy 0:	1 times 	(100.00 %)
not found:	0 times 	(0.00 %)

corpus size:	1 word
execution time:	0 sec
speed:		1 word/sec

  *****  END OF LEXICON LOOK-UP  *****

# FOMA
$ foma -f asheninka.script 
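
In the lookup output shown above, each analysis line holds the surface form, a tab, and then a sequence of bracketed tags. As a minimal sketch for post-processing that output (the function names are ours, and the tab separator is assumed from the sample above), the analysis lines can be split into tags like this:

```python
import re

TAG = re.compile(r"\[([^\]]*)\]")


def parse_analysis(line):
    """Split one analysis line into (surface form, list of tags).

    Assumes the surface form and the analysis are separated by a tab.
    Text outside square brackets in the analysis is ignored.
    """
    surface, _, analysis = line.rstrip("\n").partition("\t")
    return surface, TAG.findall(analysis)


def parse_lookup_output(text):
    """Collect all analyses per surface form, skipping banner and statistics lines."""
    results = {}
    for line in text.splitlines():
        surface, _, analysis = line.partition("\t")
        if not analysis.startswith("["):
            continue  # not an analysis line
        results.setdefault(surface, []).append(TAG.findall(analysis))
    return results


sample = "ashaninka\t[a-][NPers][1PL.poss+][--][=shani][VRoot]"
surface, tags = parse_analysis(sample)
print(surface, tags)
```

Lines such as the progress bar or the `LOOKUP STATISTICS` block do not have a bracketed tag right after the first tab, so `parse_lookup_output` skips them.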

How to download the source code

  • Using wget
$ wget https://github.com/hinantin/AshaninkaMorph/archive/master.zip 
  • Cloning this repository
$ git clone https://github.com/hinantin/AshaninkaMorph

Software prerequisites

In order to run AshaninkaMorph (this finite state transducer), you will need either Foma or XFST, the download links for these are provided below:

Software on-line testing

Don't know how to install it, or don't want to?

You can test the morphological analyzer on-line at https://hinant.in/

Spell-checking

Test the spell-checker on-line: https://hinant.in/ckeditor/samples/api.html
