Bug in Qwen2.5-Math-7B-Instruct's config file #61

@tristancinquin

Describe the bug

There seems to be a bug in Qwen2.5-Math-7B-Instruct's configuration file. Launching Qwen2.5-Math-7B-Instruct fails with the following error:

ValueError: User-specified max_model_len (32768) is greater than the derived max_model_len
(max_position_embeddings=4096 or model_max_length=None in model's config.json). 
This may lead to incorrect model outputs or CUDA errors. 
To allow overriding this maximum, set the env var VLLM_ALLOW_LONG_MAX_MODEL_LEN=1
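
For reference, the comparison behind this error can be reproduced in isolation. The following is a minimal sketch, assuming the limit is taken as the smallest non-null value among the two config.json fields named in the message. The helper names `derived_max_len` and `check_max_model_len` are hypothetical and are not vLLM's actual implementation; the values 32768 and 4096 are taken from the error above.

```python
import json

# Fields the error message says the limit is derived from.
LENGTH_KEYS = ("max_position_embeddings", "model_max_length")


def derived_max_len(config: dict):
    """Return the smallest length limit present in the config, or None if absent.

    Hypothetical helper: a simplified stand-in for the real derivation logic.
    """
    values = [config[key] for key in LENGTH_KEYS if config.get(key) is not None]
    return min(values) if values else None


def check_max_model_len(requested: int, config: dict) -> bool:
    """Return True when the requested length fits within the derived limit."""
    limit = derived_max_len(config)
    return limit is None or requested <= limit


# Values as reported in the error message for Qwen2.5-Math-7B-Instruct.
config = json.loads('{"max_position_embeddings": 4096, "model_max_length": null}')

print(derived_max_len(config))             # limit derived from the config
print(check_max_model_len(32768, config))  # the requested length from the error
print(check_max_model_len(4096, config))   # a length that fits the config
```

Under these assumptions the requested 32768 exceeds the 4096 found in the config, which matches the mismatch reported in the error.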

To Reproduce

Run vec-inf launch Qwen2.5-Math-7B-Instruct, then run vec-inf status {$JobID}, where JobID is the Slurm job ID.

Screenshots

Error:

Image

Expected:

Image
The same output, but with Qwen2.5-Math-7B-Instruct rather than Meta-Llama-3.1-8B-Instruct.

Version

0.4.0.post1
