@Satrat commented on Jan 30, 2024

These modifiers were previously not FSDP-compatible because they updated module weights directly, which fails once FSDP has sharded the parameters across ranks. This PR wraps the weight updates in an apply call so they work with FSDP.
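
In short, rather than assigning to a module's weight directly, the modifier now passes an update function to the module's apply call; FSDP overrides apply to gather the full, unsharded parameters before running the function on each submodule. A minimal sketch of the pattern (illustrative only, not the exact diff; scale and update_weight are hypothetical names standing in for the per-channel scales the modifiers actually compute):

import torch

scale = 2.0  # hypothetical smoothing scale; the real modifiers compute these

def update_weight(module: torch.nn.Module):
    # Runs on every submodule; under FSDP, apply() materializes the full
    # parameters before this function is called, so in-place updates are safe.
    if isinstance(module, torch.nn.Linear):
        module.weight.data.div_(scale)

# Before: layer.weight.data.div_(scale)       # breaks when the weight is sharded
# After:  parent_module.apply(update_weight)  # FSDP-safe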

Slack thread for the issue: https://neuralmagic.slack.com/archives/C064P557R8B/p1706330735831899

Testing

recipe.yaml

test_stage:
  obcq_modifiers:
    LogarithmicEqualizationModifier:
      mappings: [
        [["re:.*q_proj", "re:.*k_proj", "re:.*v_proj"], "re:.*input_layernorm"],
        [["re:.*gate_proj", "re:.*up_proj"], "re:.*post_attention_layernorm"],
      ] 
    QuantizationModifier:
      ignore:
        # These operations don't make sense to quantize
        - LlamaRotaryEmbedding
        - LlamaRMSNorm
        - SiLUActivation
        - MatMulOutput_QK
        - MatMulOutput_PV
        # Skip quantizing the layers with the most sensitive activations
        - model.layers.1.mlp.down_proj 
        - model.layers.30.mlp.down_proj
        - model.layers.31.mlp.down_proj
        - model.layers.28.mlp.down_proj  
        - model.layers.29.mlp.down_proj   
      post_oneshot_calibration: false
      scheme_overrides:
        Linear:
          weights:
            num_bits: 8
            symmetric: true
            strategy: channel
        MatMulLeftInput_QK:
          input_activations:
            num_bits: 8
            symmetric: true
        MatMulLeftInput_PV:
          input_activations:
            num_bits: 8
            symmetric: true
        Embedding:
          input_activations: null
          weights:
            num_bits: 8
            symmetric: false
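
For reference, the mappings entries above use a "re:" prefix to mark regex patterns that are matched against module names. A quick sketch of how the first balance-layer pattern resolves, assuming standard Hugging Face Llama module naming and prefix-anchored matching:

import re

# Module names as they appear in a HF LlamaForCausalLM (assumed for illustration)
names = [
    "model.layers.0.self_attn.q_proj",
    "model.layers.0.input_layernorm",
    "model.layers.0.mlp.gate_proj",
]

pattern = "re:.*q_proj"[len("re:"):]  # strip the "re:" marker
print([n for n in names if re.match(pattern, n)])
# -> ['model.layers.0.self_attn.q_proj']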

Run quantization:
With FSDP: accelerate launch --config_file integrations/huggingface-transformers/finetuning/example_fsdp_config.yaml test_quant.py
Without FSDP: python test_quant.py
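
The FSDP config file referenced above ships with the repo; a minimal accelerate FSDP config of the same general shape looks roughly like the following (values are illustrative, not a copy of example_fsdp_config.yaml):

compute_environment: LOCAL_MACHINE
distributed_type: FSDP
fsdp_config:
  fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
  fsdp_offload_params: false
  fsdp_sharding_strategy: 1  # FULL_SHARD
  fsdp_state_dict_type: FULL_STATE_DICT
  fsdp_transformer_layer_cls_to_wrap: LlamaDecoderLayer
machine_rank: 0
main_training_function: main
mixed_precision: 'no'
num_machines: 1
num_processes: 2
use_cpu: false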

test_quant.py

from sparseml.transformers.finetune.text_generation import oneshot

# One-shot application of recipe.yaml to a Llama 2 7B checkpoint,
# calibrating on the train split of open_platypus.
model = "mgoin/llama2-7b-gsm8k-pt"
dataset_name = "open_platypus"
concatenate_data = False
output_dir = "./debug_smoothing"
recipe = "recipe.yaml"
overwrite_output_dir = True
splits = {"calibration": "train"}

oneshot(
    model_name_or_path=model,
    dataset_name=dataset_name,
    output_dir=output_dir,
    recipe=recipe,
    overwrite_output_dir=overwrite_output_dir,
    concatenate_data=concatenate_data,
    splits=splits,
)

@Satrat marked this pull request as ready for review on January 30, 2024.
@bfineran merged commit 7ede036 into main on February 15, 2024.
@bfineran deleted the fsdp_log_modifier branch on February 15, 2024.