Something wrong with OpenAI's Large V2 model? #675

@ggerganov

So, I haven't looked into this in detail, but I suspect there might be something wrong with the new large model released by OpenAI. Keep in mind this is very anecdotal evidence at the moment, so I might be completely wrong.

Running the main example with color coding of the token probabilities enabled (-pc), we normally get the following results:

  • base.en

image

  • small.en

image

  • small

image

  • medium

image

  • medium.en

image

However, this is what the color coding looks like when using the new large model (i.e. v2):

  • large

image

As a comparison, this is the same run, but using the old large model - i.e. v1:

  • large-v1

image

So somehow the logits with v2 seem to be all over the place, which is not observed for any of the other models.
I still need to double-check all these observations, but I think there is something not quite right with large-v2.
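
For readers unfamiliar with what the colors represent, here is a minimal Python sketch of the general idea: the logits for each decoding step are turned into probabilities with a softmax, and the probability of the emitted token picks a color bucket. The palette, the linear bucketing, and the function names are all assumptions made for illustration; this is not the code the main example actually uses.

```python
import math

# Hypothetical palette, ordered from low confidence to high confidence.
PALETTE = ["red", "orange", "yellow", "green"]

def softmax(logits):
    """Convert raw logits into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def color_for(p, palette=PALETTE):
    """Map a probability in [0, 1] to a palette entry (assumed linear buckets)."""
    idx = int(p * len(palette))
    idx = max(0, min(len(palette) - 1, idx))  # clamp so p == 1.0 stays in range
    return palette[idx]

def token_color(logits, token_id):
    """Color for the token that was emitted at this decoding step."""
    return color_for(softmax(logits)[token_id])
```

Under this sketch, a sharply peaked distribution such as token_color([10.0, 0.0, 0.0], 0) lands in the highest bucket, while a flat one such as token_color([0.0, 0.0, 0.0, 0.0], 0) lands in a low bucket. Output that is mostly low-confidence colors would therefore mean the emitted tokens are getting low probabilities, which is what the large-v2 run appears to show.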

Labels: question (Further information is requested)