MPT: add optional bias parameters #5638

Merged
merged 1 commit into from
Feb 22, 2024


@datquocnguyen (Contributor) commented Feb 21, 2024

The pre-trained MPT models do not include 'bias' parameters. This PR makes llama.cpp compatible with the PhoGPT and SEA-LION models for Southeast Asian languages, which were pre-trained using the MPT architecture but with added 'bias' parameters.

This PR is based on @nviet's commit, with modifications to make the 'bias' parameters optional.
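For readers unfamiliar with the idea, here is a minimal, self-contained sketch of what "optional bias" means for a linear layer. It is not the code from this PR and does not use the ggml or llama.cpp APIs; `Tensor`, `TensorMap`, `find_optional`, `Linear`, and `linear_forward` are hypothetical names chosen for illustration. The assumption is simply that a bias tensor is looked up by name, may be absent from the checkpoint, and is added only when present.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <optional>
#include <string>
#include <vector>

// Illustrative stand-ins only; these are NOT llama.cpp or ggml types.
using Tensor    = std::vector<float>;
using TensorMap = std::map<std::string, Tensor>;

// Hypothetical helper: return the tensor if the checkpoint has it,
// otherwise std::nullopt instead of failing the load.
std::optional<Tensor> find_optional(const TensorMap & tensors, const std::string & name) {
    auto it = tensors.find(name);
    if (it == tensors.end()) {
        return std::nullopt;
    }
    return it->second;
}

struct Linear {
    Tensor                weight; // row-major, n_out x n_in
    std::optional<Tensor> bias;   // n_out values; absent for bias-free checkpoints
    std::size_t           n_in;
    std::size_t           n_out;
};

// y = W * x, plus the bias only when one was loaded.
Tensor linear_forward(const Linear & layer, const Tensor & x) {
    assert(x.size() == layer.n_in);
    assert(layer.weight.size() == layer.n_in * layer.n_out);

    Tensor y(layer.n_out, 0.0f);
    for (std::size_t o = 0; o < layer.n_out; ++o) {
        for (std::size_t i = 0; i < layer.n_in; ++i) {
            y[o] += layer.weight[o * layer.n_in + i] * x[i];
        }
    }
    if (layer.bias) {
        assert(layer.bias->size() == layer.n_out);
        for (std::size_t o = 0; o < layer.n_out; ++o) {
            y[o] += (*layer.bias)[o];
        }
    }
    return y;
}
```

With this shape, a checkpoint without bias tensors and one with them go through the same forward path, which is the behaviour the description above asks for.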

Update for MPT with optional bias parameters: to work with PhoGPT and SEA-LION models that were pre-trained with 'bias'.
@ggerganov ggerganov merged commit 4ef245a into ggerganov:master Feb 22, 2024
ggerganov added a commit that referenced this pull request Feb 22, 2024
@hiepxanh

@datquocnguyen nice, I will try this, thank you

jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
Update for MPT with optional bias parameters: to work with PhoGPT and SEA-LION models that were pre-trained with 'bias'.
jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
Update for MPT with optional bias parameters: to work with PhoGPT and SEA-LION models that were pre-trained with 'bias'.
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
@bryanSwk bryanSwk mentioned this pull request Apr 3, 2024
3 participants