Skip to content

Latest commit

 

History

History
27 lines (19 loc) · 1.16 KB

README.md

File metadata and controls

27 lines (19 loc) · 1.16 KB

Measurement Tagger

Build Status codecov

A dependency parse based measurement tagger.

Text to be tagged should be stored in the text/ directory. The file to tag is specific with -t.

Run in mode (-m):

Mode Measurement
d Distance
t Time
m Mass
e Energy
v Volume

By default, tags measurements then convertes them to their standard unit. Unconverted measurements can be returned if run with the --return_unconverted flag. The maximum n-gram to search for measurement units, i.e. nautical miles, can be set with the --max_gram flag.

Run pipenv install && python -m spacy download en && python -m nltk.downloader wordnet to setup, then test by running main.py -m d -t wiki.txt.

TO DO

  • Fix --parallel flag
  • Improve handling of n-gram measurement units