
Conversation

@fairydreaming
Collaborator

When reshaping kqv at the end of llm_build_kqv(), n_embd_head_k is incorrectly used instead of n_embd_head_v to calculate the kqv dimensions.

@mofosyne mofosyne added the labels bugfix (fixes an issue or bug) and Review Complexity: Medium (generally requires more time to grok but manageable by beginner to medium expertise level) on May 16, 2024
…d n_embd_head_k when making a view of cached value vectors.
@fairydreaming
Collaborator Author

I found another place where variables for key vectors were used for processing value vectors, so I added another commit to this PR.

Member

@ggerganov ggerganov left a comment


Which models are affected by this?

@fairydreaming
Collaborator Author

DeepSeek-V2 needs this since it has n_embd_head_k != n_embd_head_v; I'm not sure about other models:

llm_load_print_meta: n_embd_head_k    = 192
llm_load_print_meta: n_embd_head_v    = 128

@ggerganov ggerganov merged commit 27b0406 into ggml-org:master May 17, 2024
@fairydreaming fairydreaming deleted the llm_build_kqv_fix branch March 22, 2025 17:50
