
Struggle with training LLaMA with a single GPU using both PT v1 and v2 #14

Closed
linhduongtuan opened this issue Mar 11, 2023 · 4 comments

@linhduongtuan

Hi,
I love your code base and want to try training LLaMA with a single GPU. The code I am using is here: https://github.com/juncongmoo/pyllama/blob/main/llama/model_single.py.
However, I am running into an error. The message shown is:
"
self.tok_embeddings = nn.Embedding(params.vocab_size, params.dim)
File "/home/linh/anaconda3/envs/a/lib/python3.9/site-packages/torch/nn/modules/sparse.py", line 139, in init
self.weight = Parameter(torch.empty((num_embeddings, embedding_dim), **factory_kwargs))
RuntimeError: Trying to create tensor with negative dimension -1: [-1, 512]
"
Can you help me fix/test this code?

Thanks in advance.
Linh

@mldevorg
Collaborator

Guess your torch version is too old?

@linhduongtuan
Author

No. I tested both PT v1 and v2 (updated very recently).

Repository owner deleted a comment from linhduongtuan Mar 12, 2023
@juncongmoo
Owner

@linhduongtuan Can you please post your environment info, such as OS, torch version, and model file checksums? (I cannot reproduce your issue.)

juncongmoo self-assigned this Mar 12, 2023
@linhduongtuan
Author

@juncongmoo,
I use PT v2 nightly (and also PT 1.13) on Ubuntu 20.04 with CUDA 11.7 and LLaMA 7B. Instead of loading the model checkpoint, I want to train the model from scratch.
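
In case it helps, here is roughly how I construct the model (a minimal sketch; the Tokenizer import path and the exact ModelArgs fields are my assumptions, based on the reference LLaMA code that model_single.py follows). Because I skip the checkpoint-loading path, I suspect vocab_size stays at its -1 default, which would explain the [-1, 512] shape in the error:

from llama.model_single import ModelArgs, Transformer  # module path assumed
from llama.tokenizer import Tokenizer                   # SentencePiece wrapper, path assumed

# Small config for a single GPU; ModelArgs defaults vocab_size to -1, and it is
# normally overwritten from the tokenizer/checkpoint before the model is built.
tokenizer = Tokenizer(model_path="./tokenizer.model")
params = ModelArgs(dim=512, n_layers=8, n_heads=8)
params.vocab_size = tokenizer.n_words  # without this line, nn.Embedding receives -1
model = Transformer(params)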
