monolingual make rules and sample corpus for apertium-tagger .prob training #25

unhammer · 2017-04-12T15:13:15Z

We should have some make rules for the unigram tagger, along with a tiny sample corpus of the right format (e.g. in the subdir corpus). Something like

LANG.prob: corpus/LANG.corpus
	apertium -d . tagger-training-pipeline <$< >$@

(Someone with an idea what the tagger training pipeline looks like will have to fill that out.)

The text was updated successfully, but these errors were encountered:

ftyers · 2017-04-12T15:21:55Z

$(LANG1).prob: corpus/$(LANG1).tagged
        apertium-tagger -s 0 -u 2 $@ $<

ftyers · 2017-04-12T15:26:08Z

Ok, this is implemented in apertium-kaz

unhammer closed this as completed in f72b4ba Apr 12, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

monolingual make rules and sample corpus for apertium-tagger .prob training #25

monolingual make rules and sample corpus for apertium-tagger .prob training #25

unhammer commented Apr 12, 2017

ftyers commented Apr 12, 2017

ftyers commented Apr 12, 2017

monolingual make rules and sample corpus for apertium-tagger .prob training #25

monolingual make rules and sample corpus for apertium-tagger .prob training #25

Comments

unhammer commented Apr 12, 2017

ftyers commented Apr 12, 2017

ftyers commented Apr 12, 2017