Starcoder2: Update config and change GQA to use repeat #520

Merged
awni merged 3 commits into ml-explore:main from Blaizzy:pc/starcoder2config on Mar 3, 2024

Blaizzy (Contributor) commented Mar 3, 2024

This PR updates the config to match the HF Starcoder config and changes GQA to use repeat instead of concatenate, as suggested in #443.
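
For readers unfamiliar with the GQA change, here is a minimal sketch of the idea. It is not the diff from this PR, which is not shown in the conversation. NumPy stands in for MLX as an assumption (MLX exposes a similar `mx.repeat`), and the helper names `repeat_kv_concat` and `repeat_kv` and the toy shapes are hypothetical. Both helpers expand the key/value heads so that each one is shared by a group of query heads, and they produce the same head ordering.

```python
import numpy as np


def repeat_kv_concat(x, repeats):
    """Concatenate-based expansion (hypothetical helper).

    x has shape (B, n_kv_heads, L, D). Copies are stacked on a new axis
    next to the head axis, then folded back into the head axis.
    """
    B, n_kv_heads, L, D = x.shape
    x = np.concatenate([np.expand_dims(x, 2)] * repeats, axis=2)
    return x.reshape(B, n_kv_heads * repeats, L, D)


def repeat_kv(x, repeats):
    """Repeat-based expansion: each KV head is repeated consecutively."""
    return np.repeat(x, repeats, axis=1)


# Toy sizes, chosen only for illustration.
B, n_heads, n_kv_heads, L, D = 1, 8, 2, 3, 4
repeats = n_heads // n_kv_heads  # query heads per KV head

keys = np.arange(B * n_kv_heads * L * D, dtype=np.float32)
keys = keys.reshape(B, n_kv_heads, L, D)

expanded = repeat_kv(keys, repeats)
```

In this sketch, expanded heads 0 to 3 are copies of KV head 0 and heads 4 to 7 are copies of KV head 1, so the repeat version is a drop-in replacement for the concatenate version with less code.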

@mzbac the PR is ready!
#502 (comment)

Blaizzy (Contributor, Author) commented Mar 3, 2024

@mzbac let me know if there are any further changes needed.

mzbac (Contributor) commented Mar 3, 2024

Looks good to me. Let's see if @awni could help review and approve it :)

Blaizzy (Contributor, Author) commented Mar 3, 2024

Awesome!

awni (Member) left a comment

LGTM, thanks!!

@awni awni merged commit 3fdf85e into ml-explore:main Mar 3, 2024
devonthomas35 pushed a commit to devonthomas35/mlx-examples that referenced this pull request Mar 11, 2024
* update config

* change gqa to use repeat instead of concatenate

* contribution
@Blaizzy Blaizzy deleted the pc/starcoder2config branch March 13, 2024 16:02
@Blaizzy Blaizzy restored the pc/starcoder2config branch March 13, 2024 16:03
@Blaizzy Blaizzy deleted the pc/starcoder2config branch March 13, 2024 16:03

3 participants