PC: Add support for Starcoder2 by Blaizzy · Pull Request #518 · ml-explore/mlx-examples

Blaizzy · 2024-03-02T23:49:36Z

I was super excited about integrating StarCoder2 into MLX. However, while working on the mlx-langchain integration, I realized that @Muhtasham beat me to the punch and submitted a PR. Kudos to you!

I took a look at the original draft PR a few days ago and noticed a few areas that could use some improvement (the MLP, among others). When I tested that code with the quantized models uploaded to the hub, it didn't quite work as expected. But not to worry, I've fixed it! ✨ ✅

This implementation works well and matches the output of the original model in transformers, in both full-precision and quantized configurations.

python -m mlx_lm.generate --model mlx-community/starcoder2-3b-4bit --prompt "Write a quick sort in C++" --colorize --max-tokens 200

Output:

Prompt: Write a quick sort in C++

```cpp
//https://www.hackerrank.com/challenges/quicksort3/problem

#include <bits/stdc++.h>

using namespace std;

vector<int> quickSort(vector<int> ar) {
    vector<int> output;
    int pivot = ar[0];
    for(int i = 1; i < ar.size(); i++){
        if(ar[i] < pivot)
            output.push_back(ar[i]);
    }
    output.push_back(pivot);
    for(int i = 1; i < ar.size(); i++){
        if(ar[i] > pivot)
            output.push_back(ar[i]);
    }
    return output;
}

int main(void) {
    vector <int>  ar;
    int num_of_elements;
    cin >> num
```

mzbac · 2024-03-03T00:32:35Z

Nice work! Just curious, does this work with the FIM prompt?

mzbac · 2024-03-03T01:45:24Z

+        scores = (queries * self.scale) @ keys.transpose(0, 1, 3, 2)
+        if mask is not None:
+            scores += mask
+        scores = mx.softmax(scores, axis=-1).astype(values.dtype)


Maybe we need to keep scores.astype(mx.float32) in case softmax overflows on float16 values.
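
To illustrate the concern, here is a minimal sketch in NumPy rather than MLX (an assumption made so that it runs anywhere). The `softmax` helper is deliberately naive and hypothetical; library softmax implementations usually subtract the row maximum first, so they may not overflow this way. It only shows why computing in float32 and casting back afterwards, as the `.astype(values.dtype)` in the diff does, is the safer order.

```python
import numpy as np

def softmax(x, axis=-1):
    # Deliberately naive softmax, computed in the dtype of x (no max subtraction).
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def softmax_upcast(x, axis=-1):
    # Upcast to float32 for the softmax, then cast back to the input dtype.
    probs = softmax(x.astype(np.float32), axis=axis)
    return probs.astype(x.dtype)

# exp(12) is larger than the biggest finite float16 value (65504).
scores = np.array([[12.0, 11.0, 0.0]], dtype=np.float16)

with np.errstate(over="ignore", invalid="ignore"):
    naive = softmax(scores)      # overflows inside exp, so the row is corrupted
safe = softmax_upcast(scores)    # finite float16 probabilities

print(naive, safe)
```

In MLX the analogous change would be to cast `scores` to `mx.float32` before the softmax, which is what the comment above suggests.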

awni · 2024-03-03T03:41:03Z

@Blaizzy thanks for adding! I just merged #502 since it was basically done. I'm going to close this in favor of that. Appreciate the insights here which helped with #502!

Blaizzy · 2024-03-03T06:47:34Z

> Nice work! Just curious, does this work with the FIM prompt?

Thanks! It should work without a problem.

If not I can investigate.
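
For anyone who wants to try it, a fill-in-the-middle prompt could be assembled roughly like this. This is a sketch that assumes StarCoder2 keeps the StarCoder-family sentinel tokens `<fim_prefix>`, `<fim_suffix>`, and `<fim_middle>`; check the tokenizer's special tokens before relying on them. `build_fim_prompt` is a hypothetical helper, not part of mlx_lm.

```python
def build_fim_prompt(prefix: str, suffix: str) -> str:
    # Prefix-suffix-middle layout: the model is expected to generate the
    # missing middle after the final sentinel token (assumed token names).
    return f"<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>"

# The code before and after the gap that the model should fill in.
prompt = build_fim_prompt(
    prefix="def add(a, b):\n    return ",
    suffix="\n\nprint(add(1, 2))\n",
)
print(prompt)
```

The resulting string can then be passed as the `--prompt` value of the `mlx_lm.generate` command shown at the top of this PR.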

Blaizzy · 2024-03-03T06:49:00Z

> @Blaizzy thanks for adding! I just merged #502 since it was basically done. I'm going to close this in favor of that. Appreciate the insights here which helped with #502!

Most welcome!

It's my pleasure 😊.

Blaizzy added 12 commits March 1, 2024 22:00

d396df9 add starcoder 2
1979c45 add tie_word_embeddings
b099f95 add weight sharing comments
079ac4e format with black
3b31f09 add starcoder2 to mlx-lm tuner
5f5bb85 Add to readme list of supported models
34a9245 fix typo
2fe2d33 fix transpose
0a630c8 fix call method
86a7cf7 fix gibberish output and formatting
d4383fb black formatting
7c47a8f Merge branch 'ml-explore:main' into pc/starcoder2

Blaizzy changed the title ~~PC: Add support for starcoder2~~ PC: Add support for Starcoder2 Mar 3, 2024

mzbac mentioned this pull request Mar 3, 2024

Add Starcoder 2 #502

Merged

mzbac reviewed Mar 3, 2024

awni closed this Mar 3, 2024

Blaizzy wants to merge 12 commits into ml-explore:main from Blaizzy:pc/starcoder2
3 participants