Added python script for parsing xml -> tsv or json. #898
Added a Python script that converts XML to TSV or JSON.
First argument: from, the input file (example: dict.xml)
Second argument: to, the output file (example: dict.json)
Third argument: mode, the output format (example: json)
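
The three positional arguments could be handled roughly as follows. This is a minimal sketch and not the code from this PR: the function name `parse_args`, the `MODES` tuple, and the error messages are hypothetical, and the mode names `tsv` and `json` are assumed from the description above.

```python
import sys

# Assumed mode names, taken from the PR description (not from the script itself).
MODES = ("tsv", "json")


def parse_args(argv):
    """Return (source, target, mode) from exactly three positional arguments."""
    if len(argv) != 3:
        raise SystemExit("usage: <script> <from> <to> <mode>")
    source, target, mode = argv
    if mode not in MODES:
        raise SystemExit(f"unknown mode: {mode!r} (expected one of {MODES})")
    return source, target, mode


if __name__ == "__main__":
    # sys.argv[0] is the script name, so the arguments start at index 1.
    print(parse_args(sys.argv[1:]))
```

Rejecting an unknown mode up front avoids parsing a large XML file only to fail at the output step.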
Sample format of tsv (' ' means space):
id \t root \t data \t extra \n
#header \t dictionary \t version \t revision \n
OpenCorpora \t dictionary \t <version_from_xml> \t \n
#lemmas \t lemma \t variants \t empty \n
[ \t ' ' <';'.join(attributes)> \t ' ' <';'.join(attributes)> [, ' ' <';'.join(attributes)>]* \t \n]*
#gramemes \t parent \t alias \t description \n
[ \t \t \t \n]*
#links \t from \t to \t type \n
[ \t \t \t \n]*
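
To illustrate how a consumer might read this layout back, here is a hedged sketch that groups rows under the most recent `#name` row. It assumes that fields are separated by a single tab, that each section starts with a row whose first field begins with `#`, and that rows before the first such row (the column header line) can be skipped. The function name `split_sections` and the returned dictionary shape are hypothetical, not part of this PR.

```python
def split_sections(lines):
    """Group tab-separated rows under the most recent '#name' header row."""
    sections = {}
    current = None
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if fields[0].startswith("#"):
            # A row such as "#links\tfrom\tto\ttype" opens a new section.
            current = fields[0][1:]
            sections[current] = {"columns": fields[1:], "rows": []}
        elif current is not None:
            # Data rows belong to the section opened most recently.
            sections[current]["rows"].append(fields)
        # Rows before the first '#' row (the column header line) are skipped.
    return sections
```

Usage would look like `split_sections(open("dict.tsv", encoding="utf-8"))`, where the file name is only an example.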
The TSV format requires much less space.
This script is a sample; it requires a .sh wrapper.