Error: IndexError: piece id is out of range. #1
Comments
Could you provide the complete error output, please? A screenshot would also be acceptable.
I use transformers 4.32.0.
The output only shows that there is an out-of-range situation for the … Have you made any modifications to the script? If so, please provide the complete script. If not, that's really confusing... 🤯
I only modified the batch size. Or can you tell me the tokenizer version?
The tokenizer and model both come from model_path="OFA-Sys/gsm8k-rft-llama7b-u13b" (https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b/tree/main), which has only one version, so it can't be a version issue. Additionally, I realized that you mentioned the issue only arises when
Sorry for the late response. In fact, tokenizer.pad_token_id = 0 is already set in LlamaTokenizer, so the error still exists.
Update: I found that this error happens when I use OFA-Sys/gsm8k-rft-llama7b2-u13b. There is no error with the OFA-Sys/gsm8k-rft-llama7b-u13b checkpoint.
Hi, when I set the batch size to more than 1, the following error occurs: piece id is out of range.
Could you help me fix it?
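For anyone debugging the same message, here is a minimal sketch of one way to check whether a batch of generated token ids contains values the tokenizer cannot decode. It rests on an assumption that this thread does not confirm: that with a batch size above 1, shorter sequences are padded with an id outside the tokenizer's vocabulary, which then fails at decode time. The helper names, the sample ids, and the vocabulary size of 32000 are all hypothetical; in real use the size would come from the tokenizer and the ids from the model output.

```python
from typing import Iterable, List, Tuple


def find_out_of_range_ids(
    batch: Iterable[Iterable[int]], vocab_size: int
) -> List[Tuple[int, int, int]]:
    """Return (row, position, token_id) for every id outside [0, vocab_size)."""
    bad = []
    for row, seq in enumerate(batch):
        for pos, tok in enumerate(seq):
            if tok < 0 or tok >= vocab_size:
                bad.append((row, pos, tok))
    return bad


def drop_out_of_range_ids(
    batch: Iterable[Iterable[int]], vocab_size: int
) -> List[List[int]]:
    """Drop ids that fall outside the vocabulary, keeping the remaining order."""
    return [[t for t in seq if 0 <= t < vocab_size] for seq in batch]


# Hypothetical values for illustration only: a 32000-piece vocabulary and a
# batch whose second row ends with an id one past the end of that vocabulary.
vocab_size = 32000
batch = [[1, 15043, 2], [1, 15043, 32000]]

bad = find_out_of_range_ids(batch, vocab_size)
clean = drop_out_of_range_ids(batch, vocab_size)
print(bad)
print(clean)
```

If the check reports offending ids only in the padded rows, that would point toward the padding id rather than the script itself. Dropping the ids before decoding is a workaround for inspection, not a fix for the underlying mismatch between the checkpoint and its tokenizer.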