Inconsistency in data_utils.py #6

Harold-Lee-PKU · 2018-03-03T18:10:13Z

In data_utils.py, according to line 134, the tokes are expected to be "not annoying_number_word(...)", so the annoying_number_word() should return True when the toke should be ignored. Yet at line 116, the annoying_number_word() returns True when the toke should not be ignored. I think this bug may affect the generation model as well as the extraction model.

swiseman · 2018-03-03T22:11:42Z

Yes, thanks for pointing this out; looks like this results in numeric words (e.g., "thirty four") largely being ignored, though of course numbers are extracted. And I agree we should get results for including numeric words as well.

swiseman closed this as completed Mar 3, 2018

swiseman mentioned this issue Apr 16, 2018

A bug regarding text numbers in data_utils.py #13

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inconsistency in data_utils.py #6

Inconsistency in data_utils.py #6

Harold-Lee-PKU commented Mar 3, 2018

swiseman commented Mar 3, 2018

Inconsistency in data_utils.py #6

Inconsistency in data_utils.py #6

Comments

Harold-Lee-PKU commented Mar 3, 2018

swiseman commented Mar 3, 2018