
The saved embed_tokens is empty #21

Closed · merlinarer opened this issue Nov 20, 2023 · 5 comments

merlinarer commented Nov 20, 2023

Hello, I am trying to run this code with LLaMA-1 7B, but I find that the saved embed_tokens is empty and fails to load after training. Have you encountered this problem?

(Pdb) param_name
'model.embed_tokens.weight'
(Pdb) param
tensor([], dtype=torch.bfloat16)
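
A quick way to see what actually got written to disk is to open the saved safetensors file(s) and print the shape of the embedding tensor. The sketch below assumes a standard Hugging Face checkpoint layout; the checkpoint path is hypothetical.

# Sketch: inspect the saved embedding weights without loading the full model.
# Requires the `safetensors` package; the directory name is an assumption.
import glob
from safetensors import safe_open

ckpt_dir = "output/llama-7b-finetuned"  # hypothetical path

for path in sorted(glob.glob(f"{ckpt_dir}/*.safetensors")):
    with safe_open(path, framework="pt") as f:
        for name in f.keys():
            if "embed_tokens" in name:
                t = f.get_tensor(name)
                # A healthy LLaMA-7B embedding should be roughly [32000+, 4096],
                # not the empty tensor shown above.
                print(path, name, tuple(t.shape), t.dtype)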
AkariAsai (Owner) commented

I haven't seen this issue on my side, so I am not sure if I can help here... I had some issues a while ago when I was using LLaMA-1 and adding special tokens, as back then (April or May) HF Transformers' LLaMA-1 support was somewhat unstable. Using Llama 2 instead, or upgrading HF Transformers, might help, although I am not sure...

merlinarer (Author) commented

Solved! It seems the given scripts save both the split safetensors shards (model-00001-of-00003.safetensors ...) and a consolidated model.safetensors, which leads to the loading error. Deleting model.safetensors solves it.
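
For reference, a rough sketch of the workaround (the checkpoint path is hypothetical): if the checkpoint directory contains both the sharded files with their index and a stray consolidated model.safetensors, removing the consolidated file lets from_pretrained fall back to the shards.

# Sketch of the workaround, assuming the standard HF sharded checkpoint layout.
import os

ckpt_dir = "output/llama-7b-finetuned"  # hypothetical path
stray = os.path.join(ckpt_dir, "model.safetensors")
index = os.path.join(ckpt_dir, "model.safetensors.index.json")

# Only remove the consolidated file if the sharded checkpoint (index + shards) exists.
if os.path.exists(stray) and os.path.exists(index):
    os.remove(stray)

# Afterwards, loading should pick up the shards, e.g.:
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained(ckpt_dir)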

merlinarer (Author) commented

BTW, I cannot find the 13B training script. I tried modifying the 7B script for 13B and hit a CUDA OOM even with batch size 1 on each 80GB device. Maybe I should reduce the input_length?

AkariAsai (Owner) commented

Yes we reduce the input_length for 13B due to the OOM issue, as mentioned in the paper. Let me upload the final 13B script and push it.
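
For anyone reproducing this, reducing the input length usually just means truncating at tokenization time. The sketch below is illustrative only; the length 1024 and the model path are placeholders, not the settings from the released 13B script.

# Illustrative sketch: truncate training inputs to a shorter maximum length
# to reduce activation memory for the 13B run. 1024 is an arbitrary example.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("path/to/llama-13b")  # hypothetical path
batch = tokenizer(
    ["an example training instance ..."],
    max_length=1024,   # reduced from the 7B setting
    truncation=True,
    return_tensors="pt",
)
print(batch["input_ids"].shape)  # at most (1, 1024) after truncation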

AkariAsai (Owner) commented

I uploaded the 13B training script: script_finetune_13b.sh

This issue was closed.