SC2 Conversion scripts #8530
This PR is stale because it has been open for 14 days with no activity. Remove the stale label, comment, or update the PR, or it will be closed in 7 days.
    input_ln_base_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_weight'
    input_ln_bias_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_bias'
else:
    input_ln_base_name = f'model.language_model.encoder.layers.{l}.input_layernorm.weight'
CodeQL code scanning warning: Unreachable code
    input_ln_base_name = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_weight'
    input_ln_base_name_bias = f'model.decoder.layers.{l}.self_attention.linear_qkv.layer_norm_bias'
else:
    input_ln_base_name = f'model.language_model.encoder.layers.{l}.input_layernorm.weight'
CodeQL code scanning warning: Unreachable code
    post_attn_ln_base_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_weight'
    post_attn_ln_bias_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_bias'
else:
    post_attn_ln_base_name = f'model.language_model.encoder.layers.{l}.post_attention_layernorm.weight'
CodeQL code scanning warning: Unreachable code
    post_attn_ln_base_name = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_weight'
    post_attn_ln_base_name_bias = f'model.decoder.layers.{l}.mlp.linear_fc1.layer_norm_bias'
else:
    post_attn_ln_base_name = f'model.language_model.encoder.layers.{l}.mlp.linear_fc1.layer_norm.weight'
CodeQL code scanning warning: Unreachable code
What does this PR do?
Add SC2 conversion scripts between the .nemo and HF formats, in both directions.
Add a model config for SC2 with comments covering all 3 variants.
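The review snippets in this PR pick layer-norm parameter names according to the checkpoint layout. A minimal sketch of that selection, written as a standalone helper, might look like the following. The function name `layernorm_param_names` and the `mcore` flag are hypothetical and not taken from the PR; the key strings are copied from the snippets under review. The legacy bias names do not appear in those snippets, so they are left out rather than guessed.

```python
def layernorm_param_names(layer: int, mcore: bool) -> dict:
    """Return layer-norm parameter names for one transformer layer.

    `mcore` is a hypothetical flag (an assumption, not the PR's API) that
    selects between the two naming layouts seen in the review snippets.
    """
    if mcore:
        prefix = f'model.decoder.layers.{layer}'
        return {
            'input_ln_weight': f'{prefix}.self_attention.linear_qkv.layer_norm_weight',
            'input_ln_bias': f'{prefix}.self_attention.linear_qkv.layer_norm_bias',
            'post_attn_ln_weight': f'{prefix}.mlp.linear_fc1.layer_norm_weight',
            'post_attn_ln_bias': f'{prefix}.mlp.linear_fc1.layer_norm_bias',
        }
    # Legacy layout: only the weight names are visible in the snippets,
    # so bias names are intentionally omitted here.
    prefix = f'model.language_model.encoder.layers.{layer}'
    return {
        'input_ln_weight': f'{prefix}.input_layernorm.weight',
        'post_attn_ln_weight': f'{prefix}.post_attention_layernorm.weight',
    }
```

Passing the layout as an argument keeps both branches reachable. If the condition were instead a constant set earlier in the script, a static analyzer could flag the `else` branch as unreachable, which is one possible reading of the CodeQL warnings above.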
PR Type:
If you haven't finished some of the above items, you can still open a "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines list specific people who can review PRs in various areas.
Additional Information