There are a few named-entity (NE) sequences in the files in data/ that start with I- instead of B- (as most do). For example, the following appears in data/digitoday.2014.train.csv:
520 I-PRO O
. O O
Lumia I-PRO O
520:n I-PRO O
osuus O O
käytössä O O
(I'd be happy to dig out the details for the rest if there's interest in addressing this issue.)
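A minimal sketch of how such sequences could be located, assuming the tags of one annotation column have already been read into a list in file order. The function name `find_orphan_i_tags` is hypothetical and not part of the repository; an I- tag is flagged when the preceding tag is neither B- nor I- of the same entity type.

```python
def find_orphan_i_tags(tags):
    """Return indices of I- tags that do not continue an entity of the same type."""
    orphans = []
    prev = "O"  # assumption: the start of a sequence counts as outside any entity
    for i, tag in enumerate(tags):
        if tag.startswith("I-"):
            entity_type = tag[2:]
            # A valid I-X must follow B-X or I-X
            if prev not in ("B-" + entity_type, "I-" + entity_type):
                orphans.append(i)
        prev = tag
    return orphans


# First tag column of the excerpt above, starting at the "." row
tags = ["O", "I-PRO", "I-PRO", "O", "O"]
print(find_orphan_i_tags(tags))  # [1]
```

Only the first tag of a badly started entity is reported, so each index corresponds to one place where B- would be expected.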
Thank you for noticing this! I found eight BIO annotation errors in total where an entity mistakenly starts with I-E instead of B-E. In addition, one B-ORG tag was missing. These have now been fixed in the files digitoday.2014.csv, digitoday.2014.train.csv and digitoday.2015.test.csv.