Skip to content

ggordonhall/measurement_tagger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Measurement Tagger

Build Status codecov

A dependency parse based measurement tagger.

Text to be tagged should be stored in the text/ directory. The file to tag is specific with -t.

Run in mode (-m):

Mode Measurement
d Distance
t Time
m Mass
e Energy
v Volume

By default, tags measurements then convertes them to their standard unit. Unconverted measurements can be returned if run with the --return_unconverted flag. The maximum n-gram to search for measurement units, i.e. nautical miles, can be set with the --max_gram flag.

Run pipenv install && python -m spacy download en && python -m nltk.downloader wordnet to setup, then test by running main.py -m d -t wiki.txt.

TO DO

  • Fix --parallel flag
  • Improve handling of n-gram measurement units