Add support GPT-NeoX Models without attention biases #2301

dalgarak · 2023-12-29T14:14:32Z

This PR enables support for GPT-NeoX models without attention bias.

All models with the currently released GPT-NeoX architecture use attention bias by default, but the next version of huggingface transformers will support models without attention bias (merged PR - huggingface/transformers#28126), and we want to modify the code to properly initialize these models in vLLM as well.

In previous versions of the transformers, there was no attention_bias argument in GPTNeoXConfig, so we used hasattr() to apply a default value to avoid compatibility issues with version differences.

WoosukKwon

Hi @dalgarak, thanks for submitting the PR. It seems we can simplify the change a bit. Please take a look at the review.

vllm/model_executor/models/gpt_neox.py

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon

LGTM! Thanks for submitting the PR!

dalgarak added 2 commits December 29, 2023 22:28

fix to supports GPT-NeoX models without attention biases

3cb9932

reformat code for guidelines

7d53798

WoosukKwon reviewed Dec 30, 2023

View reviewed changes

vllm/model_executor/models/gpt_neox.py Outdated Show resolved Hide resolved

simplify fetch attribute from config

1b8f069

Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

WoosukKwon approved these changes Dec 30, 2023

View reviewed changes

WoosukKwon merged commit 4934d49 into vllm-project:main Dec 30, 2023
2 checks passed

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Support GPT-NeoX Models without attention biases (vllm-project#2301)

22ec576

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support GPT-NeoX Models without attention biases #2301

Add support GPT-NeoX Models without attention biases #2301

dalgarak commented Dec 29, 2023

WoosukKwon left a comment

WoosukKwon left a comment

Add support GPT-NeoX Models without attention biases #2301

Add support GPT-NeoX Models without attention biases #2301

Conversation

dalgarak commented Dec 29, 2023

WoosukKwon left a comment

Choose a reason for hiding this comment

WoosukKwon left a comment

Choose a reason for hiding this comment