-
Notifications
You must be signed in to change notification settings - Fork 417
NNX Migration for Mistral models #2088
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
b2bfd1d
to
65c58f2
Compare
0591ca3
to
fa13350
Compare
3675744
to
5973fdb
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM just a couple small comments. Are the inference decode test results in the description from after the rebase with #2370? I see some very slight memory diff
Also discussed this offline already, but do you mind running some Maxengine/Jetstream test from a checkpoint?
c325f3e
to
5984d9c
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see both of accuracy is 0 from JetStream. Did you kill the process or others?
Results
{'accuracy': 0.0, 'gen_num': 5000}
I did not kill the process. I am not sure what happened here, maybe using the wrong tokenizer? (I used the mistral tokenizer under assets instead of huggingface in llama2) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM after the testing you have been continuing. Just one comment
45af6de
to
bbb8ce7
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks for the great thorough testing so far. Please do follow up here when you have the results of the golden logits test you mentioned you were running
bbb8ce7
to
6bcfdc2
Compare
6bcfdc2
to
ad74a01
Compare
Description
Migrate Mistral model to NNX. Comparing f4f286c (before mistral migration) and 5984d9c (after mistral migration).
Tests
Training tests
Command
Webdiff
https://diff.googleplex.com/#key=qvnNP8tANEIK
Decode Inference tests
Command
Webdiff
https://diff.googleplex.com/#key=dbqiUSGbkqov
Jetstream Inference test
Command
Step 1: https://paste.googleplex.com/6590290169298944
Step 2: https://paste.googleplex.com/6674235103772672
Step 3: https://paste.googleplex.com/6448284356968448
Before Mistral Migration
xprof
After Mistral Migration
xprof
Web diff
https://diff.googleplex.com/#key=siw4m0hLYUAT
Checklist
Before submitting this PR, please make sure (put X in square brackets):