Skip to content

Conversation

@changwangss
Copy link
Contributor

What does this PR do?

I want to use TSModelForCausalLM class loading starcode model, but the model_type "gpt_bigcode" missed. so I create GPTBigCodeNormalizedTextConfig to normalize the config.
here is the model config https://huggingface.co/bigcode/starcoder/blob/main/config.json

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Signed-off-by: changwangss <chang1.wang@intel.com>
@HuggingFaceDocBuilderDev
Copy link

HuggingFaceDocBuilderDev commented Jul 7, 2023

The documentation is not available anymore as the PR was closed or merged.

Copy link

@hshen14 hshen14 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@hshen14
Copy link

hshen14 commented Jul 7, 2023

@fxmarty It seems the test failure is not related with the code change. Please help trigger the retest if needed. Thanks.

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
@changwangss changwangss requested a review from echarlaix July 10, 2023 13:16
@fxmarty fxmarty merged commit 53e09fe into huggingface:main Jul 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants