Annotated text to CoNLL dep parse input converter

Path pimlico.modules.malt.conll_parser_input
Executable yes

Converts word annotations to CoNLL format, ready for input to the Malt parser. Annotations must contain words and POS tags. If they also contain lemmas, those are used; otherwise the word is repeated as the lemma.
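
The conversion described above can be sketched roughly as follows. This is an illustrative sketch, not the module's actual code: the function name `to_conll_lines`, the dict-based token representation, and the choice of the ten-column CoNLL-X layout (with the POS tag copied into both tag columns and unknown columns left as `_`) are all assumptions made for the example.

```python
def to_conll_lines(sentence):
    """Render one sentence as tab-separated CoNLL-X style rows.

    `sentence` is a list of dicts with 'word' and 'pos' keys and an
    optional 'lemma' key (a hypothetical representation, chosen for
    illustration only).
    """
    lines = []
    for index, token in enumerate(sentence, start=1):
        word = token["word"]
        pos = token["pos"]
        # Fall back to the word itself when no lemma is available
        lemma = token.get("lemma") or word
        # Assumed column order: ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS,
        # HEAD, DEPREL, PHEAD, PDEPREL; the parser fills in the rest
        columns = [str(index), word, lemma, pos, pos, "_", "_", "_", "_", "_"]
        lines.append("\t".join(columns))
    return lines


sentence = [
    {"word": "Dogs", "pos": "NNS", "lemma": "dog"},
    {"word": "bark", "pos": "VBP"},  # no lemma: the word is reused
]
print("\n".join(to_conll_lines(sentence)))
```

In the second token, the missing lemma is filled with the word form, which mirrors the fallback described above.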

Inputs

Name Type(s)
annotations WordAnnotationCorpus <pimlico.datatypes.word_annotations.WordAnnotationCorpus> with 'word' and 'pos' fields

Outputs

Name Type(s)
conll_data CoNLLDependencyParseInputCorpus <pimlico.datatypes.parse.dependency.CoNLLDependencyParseInputCorpus>