Can't load DeBERTa-v3 tokenizer #70

Closed
maiiabocharova opened this issue Nov 20, 2021 · 4 comments

Comments

@maiiabocharova commented Nov 20, 2021

from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("microsoft/deberta-v3-base")

This gives me an error:
ValueError: This tokenizer cannot be instantiated. Please make sure you have sentencepiece installed in order to use this tokenizer.
But sentencepiece is already installed.
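(A quick check for this situation, as a sketch: pip sometimes installs into a different environment than the one the notebook interpreter is using, so it is worth confirming the running interpreter can actually see the package.)

import importlib.util
# None here means the running interpreter cannot import sentencepiece,
# e.g. it was installed into another environment or the runtime has not
# been restarted since installation.
print(importlib.util.find_spec("sentencepiece"))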

I also tried:

!pip install deberta
from DeBERTa import deberta
vocab_path, vocab_type = deberta.load_vocab(pretrained_id='base-v3')
tokenizer = deberta.tokenizers[vocab_type](vocab_path)

This gives me:
TypeError: stat: path should be string, bytes, os.PathLike or integer, not NoneType
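(That TypeError suggests load_vocab returned None for vocab_path, i.e. the vocabulary for that pretrained_id was not found or downloaded. A defensive sketch, assuming the DeBERTa package API used above, that fails with a clearer message:)

from DeBERTa import deberta

vocab_path, vocab_type = deberta.load_vocab(pretrained_id='base-v3')
# Guard against the opaque stat() TypeError by checking for None first.
if vocab_path is None:
    raise RuntimeError("load_vocab returned no vocab_path for 'base-v3'; "
                       "the vocabulary may have failed to download")
tokenizer = deberta.tokenizers[vocab_type](vocab_path)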

Please help: how can I use the tokenizer for deberta-v3-base?

@chrischowfy

from transformers import DebertaV2Tokenizer, DebertaV2Model
tokenizer = DebertaV2Tokenizer.from_pretrained("microsoft/deberta-v3-base")
This works for me.
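(An equivalent route through AutoTokenizer, as a sketch: passing use_fast=False skips the fast-tokenizer path that raised the original error and selects the slow, sentencepiece-backed class directly.)

from transformers import AutoTokenizer
# use_fast=False falls back to the slow DebertaV2Tokenizer.
tokenizer = AutoTokenizer.from_pretrained("microsoft/deberta-v3-base", use_fast=False)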

@maiiabocharova (Author)

> from transformers import DebertaV2Tokenizer, DebertaV2Model
> tokenizer = DebertaV2Tokenizer.from_pretrained("microsoft/deberta-v3-base")
> This works for me.

Thank you, I was able to initialize the tokenizer, but it then gives me an error when I pass text to it:
tokenizer("Some text")
TypeError: 'NoneType' object is not callable

@chrischowfy commented Nov 28, 2021

That's weird. Maybe the text you passed to the tokenizer wasn't processed properly.

@maiiabocharova (Author)

Hello, the issue was that I was using Colab and the tokenizer needed sentencepiece to be installed. The solution was to install sentencepiece and then restart the runtime. (I didn't restart it at first.)

Thank you for sharing the model!
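(For future readers, the complete fix as it would look in Colab, as a sketch; the runtime restart is the step that matters:)

# Cell 1: install the missing dependency.
!pip install sentencepiece
# ...then restart the runtime (Runtime -> Restart runtime) so the
# freshly installed package is picked up by the interpreter.

# Cell 2, after the restart: loading and calling the tokenizer now works.
from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("microsoft/deberta-v3-base")
print(tokenizer("Some text"))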
