Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When DPO Yi-34B Assertion srcIndex < srcSelectDimSize failed #240

Closed
victorShawFan opened this issue Mar 13, 2024 · 7 comments
Closed

When DPO Yi-34B Assertion srcIndex < srcSelectDimSize failed #240

victorShawFan opened this issue Mar 13, 2024 · 7 comments

Comments

@victorShawFan
Copy link

when i use train_dpo_llama_34b.sh to dpo Yi-34B-Chat, there will be an "Array out of bounds" kind of problem
企业微信截图_367fcfdf-f47b-464d-a470-62defe645e51
企业微信截图_8f07f54a-2332-4201-9b1a-d4b86101487a
企业微信截图_0d17d8f0-1969-44bb-8a13-8f50fc216840
i use huggingface Yi-34B-Chat ckpt and tokenizer
i didn't modify any code about tokenizers, please help

@hijkzzz
Copy link
Collaborator

hijkzzz commented Mar 13, 2024

Thank you for your feedback, we will reproduce and fix it as soon as possible.

@hijkzzz
Copy link
Collaborator

hijkzzz commented Mar 13, 2024

Please set use_fast=False in the openrlhf/utils/utils.py tokenizer = AutoTokenizer.from_pretrained(pretrain, trust_remote_code=True, use_fast=False, **sp_tokens)

@hijkzzz
Copy link
Collaborator

hijkzzz commented Mar 13, 2024

now you can use the option "--disable_fast_tokenizer" in train_dpo.py

@hijkzzz hijkzzz closed this as completed Mar 13, 2024
@victorShawFan
Copy link
Author

still don't work, same bug, what problem could that be?

@hijkzzz
Copy link
Collaborator

hijkzzz commented Mar 14, 2024

still don't work, same bug, what problem could that be?

use "--disable_fast_tokenizer"

@victorShawFan
Copy link
Author

problem solved, now it worked, appreciate
image
btw, is this loss normal ?

@hijkzzz
Copy link
Collaborator

hijkzzz commented Mar 14, 2024

It makes more sense to look at acc mean

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants