
Art/llama3 support #1

Closed

Conversation

@nivibilla commented Apr 29, 2024

Hey,

I was having a look at generalising this to Llama 3 70B as well. I found that if we pre-convert the safetensors version to a PyTorch pickle version, the model converts fine given the correct config.

And with your tokeniser changes, generation should be good too. Please have a look.

I've converted and uploaded llama-3-8b-instruct and llama-3-70b-instruct in the native pytorch_model_n_of_n.bin format for testing.
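
For reference, here's a minimal sketch of that pre-conversion step, assuming Hugging Face-style sharded checkpoints (the `model-*.safetensors` glob and the output shard names are illustrative, not the exact files used):

```python
# Hypothetical sketch: convert sharded .safetensors weights into
# PyTorch pickle shards (pytorch_model-xxxxx-of-xxxxx.bin).
# The shard glob and output naming are assumptions, not a gpt-fast script.
import glob
import torch
from safetensors.torch import load_file

shards = sorted(glob.glob("model-*.safetensors"))
for i, shard in enumerate(shards, start=1):
    state_dict = load_file(shard)  # returns a plain dict[str, torch.Tensor]
    out = f"pytorch_model-{i:05d}-of-{len(shards):05d}.bin"
    torch.save(state_dict, out)    # native PyTorch pickle format
    print(f"wrote {out} ({len(state_dict)} tensors)")
```

A complete converter would presumably also rewrite the weight-map index (`model.safetensors.index.json` to `pytorch_model.bin.index.json`) so loaders can locate each tensor's shard.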

@nivibilla (Author)

This extends the PR to support both Llama 3 8B and 70B.

@Artyom17 (Collaborator)

It is great to see that the 70B model works as well! Thanks for that! A couple of notes:

  1. In your converted llama-3-8b-instruct and llama-3-70b-instruct, the name of the sub-dir "original" is misspelled (it is "orignal" in your case), so the conversion script complains that it can't find 'tokenizer.model'.
  2. I'm afraid we can't make gpt-fast dependent on a third-party model.

Here is what I think should be done:

  1. My original PR gets landed first (I really hope it happens soon).
  2. It would be nice if HuggingFace adopted your conversion and released those .bin files properly, or
  3. we need to add a .safetensors-to-.bin conversion script to gpt-fast.
  4. Finally, you create a proper PR in the gpt-fast repo that adds 70B model support.

What do you think?

@nivibilla closed this by deleting the head repository Apr 29, 2024