SC2 Conversion scripts #8530

suiyoubi · 2024-02-27T17:03:04Z

What does this PR do ?

Add SC2 conversion script between .nemo and HF format in both ways.
Add model config for SC2 with comments for all 3 variant.

PR Type:

New Feature
Bugfix
Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

for more information, see https://pre-commit.ci

scripts/nlp_language_modeling/convert_hf_starcoder2_to_nemo.py

scripts/nlp_language_modeling/convert_nemo_starcoder2_to_hf.py

scripts/nlp_language_modeling/convert_hf_starcoder2_to_nemo.py

for more information, see https://pre-commit.ci

scripts/nlp_language_modeling/convert_hf_starcoder2_to_nemo.py

github-actions · 2024-03-16T01:43:53Z

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

for more information, see https://pre-commit.ci

scripts/checkpoint_converters/convert_starcoder2_hf_to_nemo.py

+            input_ln_base_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_weight'
+            input_ln_bias_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_bias'
+        else:
+            input_ln_base_name = f'model.language_model.encoder.layers.{l}.input_layernorm.weight'


scripts/checkpoint_converters/convert_starcoder2_nemo_to_hf.py

+            input_ln_base_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_weight'
+            input_ln_base_name_bias = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_bias'
+        else:
+            input_ln_base_name = f'model.language_model.encoder.layers.{l}.input_layernorm.weight'


scripts/checkpoint_converters/convert_starcoder2_hf_to_nemo.py

+            post_attn_ln_base_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_weight'
+            post_attn_ln_bias_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_bias'
+        else:
+            post_attn_ln_base_name = f'model.language_model.encoder.layers.{l}.post_attention_layernorm.weight'


scripts/checkpoint_converters/convert_starcoder2_nemo_to_hf.py

+            post_attn_ln_base_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_weight'
+            post_attn_ln_base_name_bias = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_bias'
+        else:
+            post_attn_ln_base_name = f'model.language_model.encoder.layers.{l}.mlp.linear_fc1.layer_norm.weight'


yaoyu-33 · 2024-03-20T23:34:14Z

jenkins

add conversion scripts

5b5f8c6

github-actions bot added the NLP label Feb 27, 2024

[pre-commit.ci] auto fixes from pre-commit.com hooks

b1d09bb

for more information, see https://pre-commit.ci

github-advanced-security bot found potential problems Feb 27, 2024

View reviewed changes

fix tp2 issue

b59b77b

suiyoubi force-pushed the aot/sc2-conversion branch from b1d09bb to 5b5f8c6 Compare March 1, 2024 15:39

[pre-commit.ci] auto fixes from pre-commit.com hooks

82d7e71

for more information, see https://pre-commit.ci

github-advanced-security bot found potential problems Mar 1, 2024

View reviewed changes

scripts/nlp_language_modeling/convert_hf_starcoder2_to_nemo.py Fixed Show fixed Hide fixed

scripts/nlp_language_modeling/convert_hf_starcoder2_to_nemo.py Fixed Show fixed Hide fixed

github-actions bot added the stale label Mar 16, 2024

suiyoubi added 3 commits March 19, 2024 10:59

Rename args based on refactor PR

e30e44f

rename the conversion script

5c4b9ad

remove to checkpoint_converters

ccec698

suiyoubi changed the base branch from main to ckpt_convert_script_refactor2 March 19, 2024 18:05

suiyoubi changed the base branch from ckpt_convert_script_refactor2 to main March 19, 2024 18:06

suiyoubi requested a review from yaoyu-33 March 19, 2024 21:05

yaoyu-33 previously approved these changes Mar 20, 2024

View reviewed changes

github-actions bot removed the stale label Mar 20, 2024

merge

5942353

suiyoubi dismissed yaoyu-33’s stale review via 5942353 March 20, 2024 16:42

[pre-commit.ci] auto fixes from pre-commit.com hooks

261cd61

for more information, see https://pre-commit.ci

github-advanced-security bot found potential problems Mar 20, 2024

View reviewed changes

remove setencepiece

8f47fba

yaoyu-33 approved these changes Mar 21, 2024

View reviewed changes

yaoyu-33 merged commit 3c670e2 into main Mar 21, 2024
17 of 127 checks passed

yaoyu-33 deleted the aot/sc2-conversion branch March 21, 2024 16:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SC2 Conversion scripts #8530

SC2 Conversion scripts #8530

suiyoubi commented Feb 27, 2024

github-actions bot commented Mar 16, 2024

yaoyu-33 commented Mar 20, 2024

SC2 Conversion scripts #8530

SC2 Conversion scripts #8530

Conversation

suiyoubi commented Feb 27, 2024

What does this PR do ?

Who can review?

Additional Information

github-actions bot commented Mar 16, 2024

yaoyu-33 commented Mar 20, 2024