Skip to content

GPTBigCode models generate noise results #675

@anxietymonger

Description

@anxietymonger

if (getattr(self.hf_config, "multi_query", False) and

Adding conditions for falcon models seems break inference for GPTBigCode models. It worked well on commit df5dd3c, but broke on commit aa84c92.

I have tried WizardCoder 15B, if I change the condition

if (getattr(self.hf_config, "multi_query", False) and
            (self.hf_config.model_type == "falcon" and
             not getattr(self.hf_config, "new_decoder_architecture", False))):

into

if getattr(self.hf_config, "multi_query", False):

, it works well.

I'm not sure whether this is a bug, or I just need some extra configuration?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions