Fix for KeyError on Loading LLaMA #1978

imgaojun · 2023-12-08T03:49:09Z

This PR addresses the issue where loading LLaMA model parameters into vLLM results in a KeyError due to the presence of rotary_emb.cos_cached and rotary_emb.sin_cached. These cached values from LLaMA are not required or directly managed by vLLM, leading to compatibility issues.

Related issue #1977

Changes Made:

Added conditional checks to ignore rotary_emb.cos_cached and rotary_emb.sin_cached during the loading process.
Ensured that other necessary parameters are loaded correctly, without interfering with the rotary embeddings.

Yard1

Thanks!

Fix for KeyError on Loading LLaMA

b376d3a

allenhaozi added a commit to allenhaozi/vllm that referenced this pull request Dec 8, 2023

sync Fix for KeyError on Loading LLaMA vllm-project#1978

01735f2

Yard1 approved these changes Dec 9, 2023

View reviewed changes

Yard1 merged commit 3a8c238 into vllm-project:main Dec 9, 2023

Yard1 mentioned this pull request Dec 10, 2023

KeyError on Loading LLaMA Parameters in vLLM due to Unhandled Cached Rotary Embeddings #1977

Closed

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Fix for KeyError on Loading LLaMA (vllm-project#1978)

d6cd08a

This was referenced Apr 2, 2024

[Bugfix] Fix KeyError on loading GPT-NeoX jsato8094/vllm#1

Draft

[Bugfix] Fix KeyError on loading GPT-NeoX #3925

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix for KeyError on Loading LLaMA #1978

Fix for KeyError on Loading LLaMA #1978

Uh oh!

imgaojun commented Dec 8, 2023

Uh oh!

Yard1 left a comment

Uh oh!

Uh oh!

Uh oh!

Fix for KeyError on Loading LLaMA #1978

Fix for KeyError on Loading LLaMA #1978

Uh oh!

Conversation

imgaojun commented Dec 8, 2023

Uh oh!

Yard1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!