Conversion from slow to fast for BPE spm vocabs contained an error. #10120

- There is only 1 test currently (tokenizers + slow) that used the modified path and it's reformer, which does not contain any ids modification so the bug was silent for now. - The real issue is that vocab variable was overloaded by SentencePieceExtractor, leading to Slow specific vocab oddities to be completely ignored - The bug was reported here huggingface#9518 - Ran the complete tokenization test suite with slow without error (`RUN_SLOW=1 pytest -sv tests/test_tokenization_*`)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Conversion from slow to fast for BPE spm vocabs contained an error. #10120

Conversion from slow to fast for BPE spm vocabs contained an error. #10120

Commits on Feb 10, 2021