Support converting hf gemma weights #71

lsy323 · 2024-05-08T18:17:49Z

Update convert_checkpoint.py to convert Gemma weights from HuggingFace to safetensor format, and PyTorch state_dict.
Enable loading converted Gemma weights in both safetensor format and state_dict
Update README.md for running Gemma

Tested with running convert_checkpoint.py and run_interactive.py

run_interactive.py

jetstream_pt/engine.py

convert_checkpoints.py

qihqi · 2024-05-08T23:04:23Z

Can you also add the instructions in your PR to the README.md? thanks!

convert_checkpoints.py

FanhaiLu1

Can you share the real output result with the gemma weights?

convert_checkpoints.py

run_interactive.py

lsy323 · 2024-05-09T00:05:40Z

Can you share the real output result with the gemma weights?

There is some issue with the response, it's generating English but it will repeat the sentence. @qihqi will update the attention module for Gemma in the following PR, which should fix the accuracy issue.

Prompt

"I believe the meaning of life is"

The response is:

 to experience.

Here is my reasoning:

* **Life is a journey.** We are all on a journey through time. The experiences we gain through life's journey is to experience the world and all that we experience, both positive and negative experiences, both good and bad experiences, both positive and negative experiences, both joy and suffering. We all go through life to experience and learn from both good and bad and positive and negative and positive and negative experiences, both joy and suffering and pain and joy, both good and bad and good and bad and negative and positive and negative and positive and negative and good and bad and the good and bad and negative and positive and negative and positive and negative and the good and bad and negative and positive and negative experiences, both good and bad and the good and bad and negative and the good and bad. The is the good and bad and the good and bad and the the good and bad and the the are the experiences. Through the joy and negative. The the the journey is the good. The through the the.

FanhaiLu1 · 2024-05-09T00:36:26Z

Can you share the real output result with the gemma weights?

There is some issue with the response, it's generating English but it will repeat the sentence. @qihqi will update the attention module for Gemma in the following PR, which should fix the accuracy issue.

Prompt

"I believe the meaning of life is"

The response is:

 to experience.

Here is my reasoning:

* **Life is a journey.** We are all on a journey through time. The experiences we gain through life's journey is to experience the world and all that we experience, both positive and negative experiences, both good and bad experiences, both positive and negative experiences, both joy and suffering. We all go through life to experience and learn from both good and bad and positive and negative and positive and negative experiences, both joy and suffering and pain and joy, both good and bad and good and bad and negative and positive and negative and positive and negative and good and bad and the good and bad and negative and positive and negative and positive and negative and the good and bad and negative and positive and negative experiences, both good and bad and the good and bad and negative and the good and bad. The is the good and bad and the good and bad and the the good and bad and the the are the experiences. Through the joy and negative. The the the journey is the good. The through the the.

Great! The result is reasonable.

…etstream-pytorch into lsiyuan/convert-gemma-hf-weight

lsy323 · 2024-05-09T02:40:21Z

Can you also add the instructions in your PR to the README.md? thanks!

Added to the README.md for how to run Gemma.

lsy323 · 2024-05-09T02:45:54Z

Updated convert_checkpoint.py to generate weight in safetensor format by default. Loading weights in state_dict is still added in the weight loading logic.

FanhaiLu1 · 2024-05-09T03:28:06Z

Updated convert_checkpoint.py to generate weight in safetensor format by default. Loading weights in state_dict is still added in the weight loading logic.

Great! All looks good to me now.

add convert hf gemma weights

5d7b970

lsy323 requested review from FanhaiLu1, qihqi and wang2yn84 May 8, 2024 18:17

lsy323 marked this pull request as draft May 8, 2024 18:26

format

e7037d8

lsy323 marked this pull request as ready for review May 8, 2024 22:54

Merge branch 'main' into lsiyuan/convert-gemma-hf-weight

1cd590e

qihqi reviewed May 8, 2024

View reviewed changes

run_interactive.py Show resolved Hide resolved

jetstream_pt/engine.py Outdated Show resolved Hide resolved

jetstream_pt/engine.py Outdated Show resolved Hide resolved

convert_checkpoints.py Outdated Show resolved Hide resolved

convert_checkpoints.py Show resolved Hide resolved

qihqi reviewed May 8, 2024

View reviewed changes

convert_checkpoints.py Outdated Show resolved Hide resolved

FanhaiLu1 reviewed May 8, 2024

View reviewed changes

convert_checkpoints.py Show resolved Hide resolved

run_interactive.py Show resolved Hide resolved

lsy323 added 2 commits May 9, 2024 02:29

update convert script

7d4900d

Merge branch 'lsiyuan/convert-gemma-hf-weight' of github.com:lsy323/j…

9b2207c

…etstream-pytorch into lsiyuan/convert-gemma-hf-weight

lsy323 requested review from FanhaiLu1 and qihqi May 9, 2024 02:44

FanhaiLu1 approved these changes May 9, 2024

View reviewed changes

add sign agreement

0ba908d

qihqi approved these changes May 9, 2024

View reviewed changes

qihqi merged commit 811d718 into AI-Hypercomputer:main May 9, 2024

lsy323 deleted the lsiyuan/convert-gemma-hf-weight branch May 9, 2024 16:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support converting hf gemma weights #71

Support converting hf gemma weights #71

Uh oh!

lsy323 commented May 8, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qihqi commented May 8, 2024

Uh oh!

Uh oh!

FanhaiLu1 left a comment

Uh oh!

Uh oh!

Uh oh!

lsy323 commented May 9, 2024 •

edited

Loading

Uh oh!

FanhaiLu1 commented May 9, 2024

Uh oh!

lsy323 commented May 9, 2024

Uh oh!

lsy323 commented May 9, 2024

Uh oh!

FanhaiLu1 commented May 9, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Support converting hf gemma weights #71

Support converting hf gemma weights #71

Uh oh!

Conversation

lsy323 commented May 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qihqi commented May 8, 2024

Uh oh!

Uh oh!

FanhaiLu1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lsy323 commented May 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

FanhaiLu1 commented May 9, 2024

Uh oh!

lsy323 commented May 9, 2024

Uh oh!

lsy323 commented May 9, 2024

Uh oh!

FanhaiLu1 commented May 9, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lsy323 commented May 8, 2024 •

edited

Loading

lsy323 commented May 9, 2024 •

edited

Loading