Fix conversion parity script by defaulting RMSNorm to 1e-6 #152

rohan-varma · 2024-01-04T22:09:04Z

Changelog

Defaults the RMSNorm eps to 1e-6, which follows the llama default (https://github.com/facebookresearch/llama/blob/main/llama/model.py#L35).
Makes sure the RMSNorm eps is propagated throughout the Transformer.
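
For illustration, here is a minimal, framework-free sketch of why the eps default matters for parity. It is not the torchtune implementation: the `rms_norm` helper and the sample values are assumptions made up for this example. It only mirrors the usual RMSNorm formula, `x * rsqrt(mean(x^2) + eps) * scale`.

```python
import math


def rms_norm(x, scale, eps=1e-6):
    """Hypothetical pure-Python RMSNorm: x * rsqrt(mean(x^2) + eps) * scale."""
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * s for v, s in zip(x, scale)]


# Small activations make eps a noticeable part of the denominator
# (sample values chosen only for illustration).
x = [1e-3, -2e-3, 3e-3]
scale = [1.0, 1.0, 1.0]

out_1e6 = rms_norm(x, scale, eps=1e-6)  # default after this change
out_1e5 = rms_norm(x, scale, eps=1e-5)  # previous default

# Largest element-wise gap between the two outputs.
max_diff = max(abs(a - b) for a, b in zip(out_1e6, out_1e5))
print(f"max abs difference: {max_diff:.4f}")
```

When activations are small, the two eps values produce visibly different outputs, which is the kind of mismatch a conversion parity check would likely flag.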

Test plan

Run conversion script: python -m scripts.llama2_checkpoint.convert_llama2_to_native --checkpoint_path /home/rvarm1/local/dev/assets/llama2-7b/consolidated.00.pth --device cuda:1 &> out
pytest tests/torchtune/models/llama2/test_transformer_decoder.py -k test_rms_norm_propagation
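
As a rough sketch of what an eps propagation check could look like (this is not the actual `test_rms_norm_propagation` test; `NormStub`, `LayerStub`, `DecoderStub`, `build_decoder`, and `collect_eps` are hypothetical stand-ins, not torchtune APIs):

```python
from dataclasses import dataclass
from typing import List


@dataclass
class NormStub:
    """Hypothetical stand-in for a norm layer; only tracks eps."""
    eps: float


@dataclass
class LayerStub:
    """Hypothetical decoder layer with two norm layers."""
    attn_norm: NormStub
    mlp_norm: NormStub


@dataclass
class DecoderStub:
    """Hypothetical decoder: a stack of layers plus a final norm."""
    layers: List[LayerStub]
    final_norm: NormStub


def build_decoder(num_layers: int, norm_eps: float = 1e-6) -> DecoderStub:
    # Pass the same norm_eps to every norm so none falls back to its own default.
    layers = [
        LayerStub(attn_norm=NormStub(norm_eps), mlp_norm=NormStub(norm_eps))
        for _ in range(num_layers)
    ]
    return DecoderStub(layers=layers, final_norm=NormStub(norm_eps))


def collect_eps(decoder: DecoderStub) -> List[float]:
    # Gather eps from the final norm and from both norms in each layer.
    values = [decoder.final_norm.eps]
    for layer in decoder.layers:
        values.extend([layer.attn_norm.eps, layer.mlp_norm.eps])
    return values


# Use a non-default eps so a layer that ignored the argument would stand out.
decoder = build_decoder(num_layers=2, norm_eps=1e-8)
all_eps = collect_eps(decoder)
assert all(e == 1e-8 for e in all_eps)
```

Using a non-default value in the check is a deliberate choice: if a layer silently kept its own default, comparing against the default would not catch it.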

netlify · 2024-01-04T22:09:11Z

✅ Deploy Preview for torchtune-preview ready!

Name	Link
🔨 Latest commit	`6debf84`
🔍 Latest deploy log	https://app.netlify.com/sites/torchtune-preview/deploys/6597373b7831a8000878dc48
😎 Deploy Preview	https://deploy-preview-152--torchtune-preview.netlify.app

joecummings

Can you create a GI to add a test for script conversion?

joecummings · 2024-01-04T22:27:28Z

torchtune/models/llama2/rms_norm.py

@@ -23,7 +23,7 @@ class RMSNorm(nn.Module):
        eps (float): small value to avoid division by zero. Default: 1e-5


Change the default in the docstring description too (it still says 1e-5).

rohan-varma · 2024-01-04T22:30:09Z

> Can you create a GI to add a test for script conversion?

Yes, filed an issue: #153

tests/torchtune/models/llama2/test_transformer_decoder.py

gokulavasan

Just a question about the test; the rest seems okay. I will let Joe stamp it.

joecummings

awesome - thanks for such a quick fix.

rohan-varma · 2024-01-05T01:12:05Z

@pytorch-labs/team-torchtune-repo-owner

patch

11078b7

facebook-github-bot added the CLA Signed label Jan 4, 2024

rohan-varma changed the title ~~[Not for land] patch~~ Fix conversion parity script by defaulting RMSNorm to 1e-6 Jan 4, 2024

joecummings reviewed Jan 4, 2024

update

664352f

rohan-varma requested a review from joecummings January 4, 2024 22:28

rohan-varma mentioned this pull request Jan 4, 2024

Add conversion script as an integration test #153

Closed

gokulavasan reviewed Jan 4, 2024

tests/torchtune/models/llama2/test_transformer_decoder.py (resolved)

gokulavasan reviewed Jan 4, 2024

joecummings approved these changes Jan 4, 2024

Update

6debf84

rohan-varma merged commit 08844cb into main Jan 5, 2024
15 checks passed

joecummings pushed a commit that referenced this pull request Jan 11, 2024

Fix conversion parity script by defaulting RMSNorm to 1e-6 (#152)

1220dd1

ebsmothers deleted the patch branch April 11, 2024 15:43
