[FIX] Fix the case when `input_is_parallel=False` for `ScaledActivation` #1737

zhuohan123 · 2023-11-21T07:50:47Z

Fix an error in #1731.

WoosukKwon

Oh my bad. Thanks for the fix!

…on` (vllm-project#1737)

…project#1737) ## Essential Elements of an Effective PR Description Checklist - [x] The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)". - [ ] The test plan, such as providing test command. - [ ] The test results, such as pasting the results comparison before and after, or e2e results ## Purpose Found current m_rope check is always call transformer API, which leads to a deeper python stack and longer CPU time Before: thinker_uses_mrope is called 33 times => leads to model_is_mrope spent 0.097ms. <img width="1682" height="573" alt="image" src="https://github.com/user-attachments/assets/f5de5586-8aa9-4028-b1ba-05b85dc6eaa1" /> With this PR: we removed thinker_uses_mrope call (only call once to set local property) => leads to model_is_mrope only spends 0.006ms <img width="1685" height="548" alt="image" src="https://github.com/user-attachments/assets/1b311199-e1a6-4dc5-b663-e2592fe18a57" /> ## Test Plan ## Test Result  Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

[FIX] Fix the case when input_is_parallel=False for ScaledActivation

095deb2

zhuohan123 requested a review from WoosukKwon November 21, 2023 07:50

WoosukKwon approved these changes Nov 21, 2023

View reviewed changes

zhuohan123 merged commit 7d761fe into main Nov 21, 2023

zhuohan123 deleted the fix_parallel_scaled_activation_weight_loading branch November 28, 2023 00:05

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

[FIX] Fix the case when input_is_parallel=False for `ScaledActivati…

751d3c3

…on` (vllm-project#1737)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[FIX] Fix the case when `input_is_parallel=False` for `ScaledActivation` #1737

[FIX] Fix the case when `input_is_parallel=False` for `ScaledActivation` #1737

zhuohan123 commented Nov 21, 2023

Uh oh!

WoosukKwon left a comment

Uh oh!

Uh oh!

Uh oh!

[FIX] Fix the case when input_is_parallel=False for ScaledActivation #1737

[FIX] Fix the case when input_is_parallel=False for ScaledActivation #1737

Conversation

zhuohan123 commented Nov 21, 2023

Uh oh!

WoosukKwon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

[FIX] Fix the case when `input_is_parallel=False` for `ScaledActivation` #1737

[FIX] Fix the case when `input_is_parallel=False` for `ScaledActivation` #1737