why a lot of @@ in the data #12

yeliu918 · 2020-05-31T17:58:53Z

Hi,

I notice that there a lot of @@ in the data. For example, "Gut@@ ach : Incre@@ ased safety for pedestri@@ ans". It seems like that "Incre@@ ased" means "Increased". Should we revise the file such that deleting the @@ and combine two tokens to one token? I think for the preprocess.py ignore such a problem. And create the dictionary that contains a lot of words that have "@@".

Best,
Ye

yeliu918 closed this as completed Jun 1, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

why a lot of @@ in the data #12

why a lot of @@ in the data #12

yeliu918 commented May 31, 2020

why a lot of @@ in the data #12

why a lot of @@ in the data #12

Comments

yeliu918 commented May 31, 2020