Add support for Xverse models. #417

LaaZa · 2023-11-09T04:08:53Z

Adds support for xverse/XVERSE models.

Quantization and inference were tested with xverse/XVERSE-7B.

Version 2 of the models is just further training, so those checkpoints should work fine too.

The architecture is very close to Llama, so FusedLlamaAttentionForQuantizedModel and FusedLlamaMLPForQuantizedModel are enabled.
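For readers unfamiliar with how a model type gets wired in, here is a rough, self-contained sketch of the pattern. It is not the code from this PR: `BaseGPTQSketch`, `XverseGPTQSketch`, `MODEL_MAP_SKETCH`, and `resolve_model_class` are hypothetical stand-ins. The module names are an assumption taken from the Llama layout, and `"XverseDecoderLayer"` is assumed from the remote modeling code. The fused module types are given as plain strings so the snippet runs without AutoGPTQ installed.

```python
# Hypothetical stand-in for the library's base class; plain Python only.
class BaseGPTQSketch:
    layer_type = None                # name of the decoder layer class
    layers_block_name = None         # attribute path to the list of decoder layers
    outside_layer_modules = []       # modules outside the repeated decoder blocks
    inside_layer_modules = []        # linear layers quantized inside each block, in order
    fused_attn_module_type = None    # fused attention replacement, if any
    fused_mlp_module_type = None     # fused MLP replacement, if any


class XverseGPTQSketch(BaseGPTQSketch):
    # Assumption: XVERSE follows the Llama module layout.
    layer_type = "XverseDecoderLayer"
    layers_block_name = "model.layers"
    outside_layer_modules = ["model.embed_tokens", "model.norm"]
    inside_layer_modules = [
        ["self_attn.k_proj", "self_attn.v_proj", "self_attn.q_proj"],
        ["self_attn.o_proj"],
        ["mlp.up_proj", "mlp.gate_proj"],
        ["mlp.down_proj"],
    ]
    # The two fused modules named in this PR, kept as strings for illustration.
    fused_attn_module_type = "FusedLlamaAttentionForQuantizedModel"
    fused_mlp_module_type = "FusedLlamaMLPForQuantizedModel"


# Registry keyed by the model_type string from the model's config.
MODEL_MAP_SKETCH = {"xverse": XverseGPTQSketch}


def resolve_model_class(model_type):
    """Return the registered class for model_type, or raise if unsupported."""
    try:
        return MODEL_MAP_SKETCH[model_type]
    except KeyError:
        raise ValueError(f"{model_type} is not supported in this sketch") from None
```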

trust_remote_code=True is required.
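A minimal usage sketch, assuming the usual AutoGPTQ quantize/save/load flow, with trust_remote_code=True passed to every loader. The quantization settings (4-bit, group size 128), the helper names `quantize_xverse` and `generate_with_xverse`, and the calibration texts are illustrative assumptions, not values taken from this PR. Imports are deferred into the functions so the snippet can be read and imported without a GPU or the libraries installed.

```python
MODEL_ID = "xverse/XVERSE-7B"

# XVERSE ships its modeling code with the checkpoint, so every loader needs this.
LOAD_KWARGS = {"trust_remote_code": True}


def quantize_xverse(save_dir, calibration_texts):
    """Quantize XVERSE-7B with GPTQ and save the result to save_dir."""
    from transformers import AutoTokenizer
    from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, **LOAD_KWARGS)
    # Assumed settings for illustration; tune for your own use.
    quantize_config = BaseQuantizeConfig(bits=4, group_size=128, desc_act=False)
    model = AutoGPTQForCausalLM.from_pretrained(MODEL_ID, quantize_config, **LOAD_KWARGS)

    # Each calibration example is a dict with input_ids and attention_mask.
    examples = [tokenizer(text) for text in calibration_texts]
    model.quantize(examples)
    model.save_quantized(save_dir, use_safetensors=True)
    return save_dir


def generate_with_xverse(save_dir, prompt, device="cuda:0", max_new_tokens=32):
    """Load the quantized model with the fused Llama modules and generate text."""
    from transformers import AutoTokenizer
    from auto_gptq import AutoGPTQForCausalLM

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, **LOAD_KWARGS)
    model = AutoGPTQForCausalLM.from_quantized(
        save_dir,
        device=device,
        use_safetensors=True,
        inject_fused_attention=True,  # fused attention path
        inject_fused_mlp=True,        # fused MLP path
        **LOAD_KWARGS,
    )
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)
    output = model.generate(input_ids=input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

To try it, call `quantize_xverse("xverse-7b-4bit", texts)` with a handful of calibration strings, then `generate_with_xverse("xverse-7b-4bit", "Hello")`. The output directory name is arbitrary.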


fxmarty

LGTM

LaaZa added 2 commits November 9, 2023 06:00

Add support for Xverse models.

2d6f7ae

Merge branch 'main' into Xverse

98f8c77

# Conflicts: # auto_gptq/modeling/__init__.py # auto_gptq/modeling/auto.py

fxmarty approved these changes Nov 16, 2023


fxmarty merged commit 11fa862 into AutoGPTQ:main Nov 16, 2023
