ValueError: [E177] Ill-formed IOB input detected #5753

Which version of spaCy are you using? You don't have to use it for training, but try the latest v2 (currently v2.3.1) for the conversion. With it, I get this error:

File "/path/spacy/cli/converters/conllu2json.py", line 69, in read_conllx
    id_, word, lemma, pos, tag, morph, head, dep, _1, iob = parts
ValueError: not enough values to unpack (expected 10, got 9)

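This line is malformed: the head and dependency columns are fused into "7ccomp" with no tab between them, so it has 9 fields instead of the expected 10: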

13	зафиксирована	_	VERB	_	Aspect=Perf|Gender=Fem|Number=Sing|Tense=Past|Variant=Short|VerbForm=Part|Voice=Pass	7ccomp	_	O

I suspect your best option is to use the tools from Universal Dependencies to validate the CoNLL-U format.
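
As a quick first pass before running a full validator, a column-count check can point at lines like the one above. This is only a minimal sketch, not the Universal Dependencies validator: the function name find_malformed_lines is hypothetical, and the expected count of 10 tab-separated columns is an assumption taken from the unpacking shown in the traceback.

```python
from typing import Iterable, List, Tuple

# Assumption: the converter unpacks 10 tab-separated fields per token line,
# as shown in the traceback (id_, word, lemma, pos, tag, morph, head, dep, _1, iob).
EXPECTED_COLUMNS = 10


def find_malformed_lines(
    lines: Iterable[str], expected: int = EXPECTED_COLUMNS
) -> List[Tuple[int, int, str]]:
    """Return (line_number, column_count, line) for token lines with the wrong column count."""
    problems = []
    for line_number, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        # Blank lines separate sentences; lines starting with "#" are comments.
        if not line.strip() or line.startswith("#"):
            continue
        columns = line.split("\t")
        if len(columns) != expected:
            problems.append((line_number, len(columns), line))
    return problems
```

Passing an open file handle (for example one opened with encoding="utf8") returns every offending line along with its line number and the number of columns found. It only catches column-count problems, so a proper validation run is still worthwhile.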

Answer selected by ines
Labels
feat / training Feature: Training utils, Example, Corpus and converters
This discussion was converted from issue #5753 on December 11, 2020 00:10.