Skip to content

Conversation

@patil-suraj
Copy link
Contributor

What does this PR do?

Currently MBart50Tokenizer, MBart50TokenizerFast can't be loaded using AutoTokenizer because they use the MBartConfig which is associated with MBartTokenizer.

This PR enables loading MBart50Tokenizer(Fast) by adding them to the NO_CONFIG_TOKENIZER list. I've also added the tokenizer_type argument in the respective models' config file on the hub.

cc @Narsil

Copy link
Collaborator

@sgugger sgugger left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot!

@Narsil
Copy link
Contributor

Narsil commented Mar 15, 2021

Very nice !

@patil-suraj patil-suraj merged commit fcf1021 into huggingface:master Mar 15, 2021
Iwontbecreative pushed a commit to Iwontbecreative/transformers that referenced this pull request Jul 15, 2021
* enable auto tokenizer for mbart50 tokenizers

* fix imports
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants