Parallelize UDPipe processing with Python's multiprocessing module.
Requires: UDPipe, xz, xzcat, UDPipe model
Example:
./mudpipe.py \
--input-xz \
--output-xz \
--arg='--tokenize' \
--arg='--tag' \
--arg='--parse' \
--model=[some_model].udpipe \
--path_dir=[data_dir].*.txt.xz \
--workers=4
UDPipe is developed by the Institute of Formal and Applied Linguistics. For more information, see the UDPipe project page.
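The parallelization idea can be sketched roughly as follows. This is a minimal illustration, not the actual mudpipe.py code: the function names (build_pipeline, run_pipeline, process_all) and the ".conllu.xz" output suffix are hypothetical, and it assumes udpipe, xzcat, and xz are on the PATH and that udpipe reads from standard input when no input file is given.

```python
import multiprocessing
import shlex
import subprocess


def build_pipeline(path, model, udpipe_args, out_suffix=".conllu.xz"):
    """Build a shell pipeline: decompress, run UDPipe, recompress.

    The output suffix is an assumption made for this sketch.
    """
    quoted_args = " ".join(shlex.quote(arg) for arg in udpipe_args)
    return (
        f"xzcat {shlex.quote(path)} | "
        f"udpipe {quoted_args} {shlex.quote(model)} | "
        f"xz > {shlex.quote(path + out_suffix)}"
    )


def run_pipeline(command):
    """Run one pipeline in a shell and return the shell's exit status."""
    return subprocess.run(command, shell=True).returncode


def process_all(paths, model, udpipe_args, workers=4):
    """Process files in parallel, one pipeline per input file."""
    commands = [build_pipeline(p, model, udpipe_args) for p in paths]
    with multiprocessing.Pool(workers) as pool:
        # Each worker process launches its own xzcat | udpipe | xz pipeline.
        return pool.map(run_pipeline, commands)
```

Calling process_all(paths, "[some_model].udpipe", ["--tokenize", "--tag", "--parse"], workers=4) from under an `if __name__ == "__main__":` guard would then return one exit status per input file. Because UDPipe processes each file independently, splitting the work per file needs no coordination between workers.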
Yulia Spektor & Kyle Gorman