Exception: You're trying to run a `Unigram` model but you're file was trained with a different algorithm #9871

jiyanbio · 2021-01-28T13:27:18Z

Environment info

transformers version: 4.2.2
Platform: Linux-3.10.107-1-tlinux2_kvm_guest-0049-x86_64-with-glibc2.10
Python version: 3.8.5
PyTorch version (GPU?): 1.7.1 (False)
Tensorflow version (GPU?): not installed (NA)
Using GPU in script?: no
Using distributed or parallel set-up in script?: no

Who can help

Information

Model I am using (Bert, XLNet ...):

The problem arises when using:

[1 ] the official example scripts: (give details below)
my own modified scripts: (give details below)

The tasks I am working on is:

an official GLUE/SQUaD task: (give the name)
my own task or dataset: (give details below)

To reproduce

Steps to reproduce the behavior:

open https://github.com/agemagician/ProtTrans/blob/master/Embedding/PyTorch/Basic/ProtAlbert.ipynb
when run the code 'tokenizer = AutoTokenizer.from_pretrained("Rostlab/prot_albert", do_lower_case=False )'
report errors as the follow:
Downloading: 100%|█████████████████████████████████████████████████████████████████| 505/505 [00:00<00:00, 516kB/s]
Downloading: 100%|██████████████████████████████████████████████████████████████| 238k/238k [00:03<00:00, 77.0kB/s]
Traceback (most recent call last):
File "", line 1, in
File "/home/anaconda3/envs/prottrans/lib/python3.8/site-packages/transformers/models/auto/tokenization_auto.py", line 385, in from_pretrained
return tokenizer_class_fast.from_pretrained(pretrained_model_name_or_path, *inputs, **kwargs)
File "/home/anaconda3/envs/prottrans/lib/python3.8/site-packages/transformers/tokenization_utils_base.py", line 1768, in from_pretrained
return cls._from_pretrained(
File "/home/anaconda3/envs/prottrans/lib/python3.8/site-packages/transformers/tokenization_utils_base.py", line 1841, in _from_pretrained
tokenizer = cls(*init_inputs, **init_kwargs)
File "/home/anaconda3/envs/prottrans/lib/python3.8/site-packages/transformers/models/albert/tokenization_albert_fast.py", line 136, in init
super().init(
File "/home/anaconda3/envs/prottrans/lib/python3.8/site-packages/transformers/tokenization_utils_fast.py", line 89, in init
fast_tokenizer = convert_slow_tokenizer(slow_tokenizer)
File "/home/anaconda3/envs/prottrans/lib/python3.8/site-packages/transformers/convert_slow_tokenizer.py", line 659, in convert_slow_tokenizer
return converter_class(transformer_tokenizer).converted()
File "/home/anaconda3/envs/prottrans/lib/python3.8/site-packages/transformers/convert_slow_tokenizer.py", line 349, in converted
tokenizer = self.tokenizer(self.proto)
File "/home/anaconda3/envs/prottrans/lib/python3.8/site-packages/transformers/convert_slow_tokenizer.py", line 335, in tokenizer
raise Exception(
Exception: You're trying to run a Unigram model but you're file was trained with a different algorithm

Expected behavior

The text was updated successfully, but these errors were encountered:

agemagician · 2021-01-28T15:43:10Z

Use "AlbertTokenizer" rather than "AutoTokenizer", this should solve your issue.
Please, check the updated notebook version.

github-actions · 2021-04-14T15:04:43Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

arkhan19 · 2022-01-11T12:39:20Z

Prot_albert tokenizer is returning none type, what changed?

github-actions bot closed this as completed Apr 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Exception: You're trying to run a `Unigram` model but you're file was trained with a different algorithm #9871

Exception: You're trying to run a `Unigram` model but you're file was trained with a different algorithm #9871

jiyanbio commented Jan 28, 2021

agemagician commented Jan 28, 2021

github-actions bot commented Apr 14, 2021

arkhan19 commented Jan 11, 2022

Exception: You're trying to run a Unigram model but you're file was trained with a different algorithm #9871

Exception: You're trying to run a Unigram model but you're file was trained with a different algorithm #9871

Comments

jiyanbio commented Jan 28, 2021

Environment info

Who can help

Information

To reproduce

Expected behavior

agemagician commented Jan 28, 2021

github-actions bot commented Apr 14, 2021

arkhan19 commented Jan 11, 2022

Exception: You're trying to run a `Unigram` model but you're file was trained with a different algorithm #9871

Exception: You're trying to run a `Unigram` model but you're file was trained with a different algorithm #9871