Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SC2 Conversion scripts #8530

Merged
merged 10 commits into from Mar 21, 2024
Merged

SC2 Conversion scripts #8530

merged 10 commits into from Mar 21, 2024

Conversation

suiyoubi
Copy link
Collaborator

What does this PR do ?

Add SC2 conversion script between .nemo and HF format in both ways.
Add model config for SC2 with comments for all 3 variant.

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

@github-actions github-actions bot added the NLP label Feb 27, 2024
Copy link
Contributor

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

@github-actions github-actions bot added the stale label Mar 16, 2024
@suiyoubi suiyoubi changed the base branch from main to ckpt_convert_script_refactor2 March 19, 2024 18:05
@suiyoubi suiyoubi changed the base branch from ckpt_convert_script_refactor2 to main March 19, 2024 18:06
@suiyoubi suiyoubi requested a review from yaoyu-33 March 19, 2024 21:05
yaoyu-33
yaoyu-33 previously approved these changes Mar 20, 2024
@github-actions github-actions bot removed the stale label Mar 20, 2024
input_ln_base_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_weight'
input_ln_bias_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_bias'
else:
input_ln_base_name = f'model.language_model.encoder.layers.{l}.input_layernorm.weight'

Check warning

Code scanning / CodeQL

Unreachable code Warning

This statement is unreachable.
input_ln_base_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_weight'
input_ln_base_name_bias = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_bias'
else:
input_ln_base_name = f'model.language_model.encoder.layers.{l}.input_layernorm.weight'

Check warning

Code scanning / CodeQL

Unreachable code Warning

This statement is unreachable.
post_attn_ln_base_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_weight'
post_attn_ln_bias_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_bias'
else:
post_attn_ln_base_name = f'model.language_model.encoder.layers.{l}.post_attention_layernorm.weight'

Check warning

Code scanning / CodeQL

Unreachable code Warning

This statement is unreachable.
post_attn_ln_base_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_weight'
post_attn_ln_base_name_bias = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_bias'
else:
post_attn_ln_base_name = f'model.language_model.encoder.layers.{l}.mlp.linear_fc1.layer_norm.weight'

Check warning

Code scanning / CodeQL

Unreachable code Warning

This statement is unreachable.
@yaoyu-33
Copy link
Collaborator

jenkins

@yaoyu-33 yaoyu-33 merged commit 3c670e2 into main Mar 21, 2024
17 of 127 checks passed
@yaoyu-33 yaoyu-33 deleted the aot/sc2-conversion branch March 21, 2024 16:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants