Hi, Shamil Chollampatt.
In the Model and Training Details section of your paper, you said that "Each of the source and target vocabularies consists of 30K most frequent BPE tokens from the source and target side of the parallel data, respectively." However, according to this line in the preprocessing script (i.e., training/preprocess.sh), it seems that you only use the target-side data to learn the BPE codes, and then apply them to both the source and target data.
The BPE model is trained with 30,000 merge operations on the target side of the training data, as in the line you pointed to. Separately, the source and target vocabularies for the encoder-decoder model consist of the 30,000 most frequent subwords (i.e., BPE-segmented tokens) from the source and target sides of the parallel data, respectively (see line).
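To make the distinction between the two steps concrete, here is a minimal shell sketch, assuming the subword-nmt toolkit and hypothetical file names (train.src, train.tgt); it is an illustration of the procedure described above, not the exact contents of training/preprocess.sh:

```sh
# Step 1: learn 30K BPE merge operations from the TARGET side only
# (assumes the subword-nmt package is installed; file names are hypothetical)
subword-nmt learn-bpe -s 30000 < train.tgt > bpe.codes

# Step 2: apply the same codes to BOTH sides of the parallel data
subword-nmt apply-bpe -c bpe.codes < train.src > train.src.bpe
subword-nmt apply-bpe -c bpe.codes < train.tgt > train.tgt.bpe

# Step 3: build separate 30K vocabularies, one per side, by counting
# subword frequencies over the BPE-segmented source and target data
tr ' ' '\n' < train.src.bpe | sort | uniq -c | sort -rn | head -n 30000 > vocab.src
tr ' ' '\n' < train.tgt.bpe | sort | uniq -c | sort -rn | head -n 30000 > vocab.tgt
```

So the codes are learned from the target side only, but each vocabulary is still built from its own side of the BPE-segmented parallel data, which is consistent with the statement in the paper.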