Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can support the new model about Llama3 #3263

Open
1737686924 opened this issue Apr 19, 2024 · 7 comments
Open

Can support the new model about Llama3 #3263

1737686924 opened this issue Apr 19, 2024 · 7 comments

Comments

@1737686924
Copy link

Can support the new model about Llama3

@namespace-Pt
Copy link

+1 here. Especially the chat template

@March-7
Copy link

March-7 commented Apr 19, 2024

Guys, is it the same as llama2?

@1737686924
Copy link
Author

伙计们,它和 llama2 一样吗?

I have not tested it myself, but I saw the evaluation results on meta, which are better and more advanced than llama2, and I heard that the open source Llama for 400B is being prepared, which should be better than GPT4

@sohelzerdoumi
Copy link

+1

They are 3 pending pull request
#3257 #3256 #3259

@March-7
Copy link

March-7 commented Apr 20, 2024

Guys, when will the conversation template for new models like llama3 be updated to main?

@hongyinjie
Copy link

hongyinjie commented Apr 22, 2024

Change the file tokenizer_config.json:
eos_token: end_of_text ==> eot_id

it work!

@Oscarjia
Copy link

@hongyinjie What change are you referring to? does this can fix llama3 can't stop problem? https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct/discussions/4

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants