pytorch · macandro96 · Jul 20, 2022 · Jul 26, 2022 · Jul 26, 2022
diff --git a/torch/ao/sparsity/_experimental/data_sparsifier/lightning/callbacks/README.md b/torch/ao/sparsity/_experimental/data_sparsifier/lightning/callbacks/README.md
@@ -0,0 +1,77 @@
+# Lightning callbacks for data sparsifier and scheduler
+
+**These are callback scripts for lightning and does not introduce pytorch lightning dependency on PyTorch.**
+
+## Introduction
+Callbacks for PytorchLightning that specifies on when and how to to sparsify the data weights of the model.
+
+## Types of Data Sparsity Callbacks
+There are 2 types of data sparsity callbacks
+1. **Post Training data sparsifier callback**: Sparsification of the model parameters *post* training.
+
+2. **Training Aware data sparsifier callback**: Sparsification of the model parameters *during* training.
+
+## API Design
+1. `PostTrainingDataSparsity`: callback class that sparsifies the model parameters post training. Accepts
+    1.  `data_sparsifier_class`: class/type of data sparsifier that needs to be used. Only the class should be passed, the data sparsifier object
+    will be created internally and will be attached to the model by the callback whenever necessary.
+    2. `data_sparsifier_args`: the arguments/config for the data sparsifier constructor that will be used while creating the object.
+
+    Example:
+    ```
+    from data_sparsity import PostTrainingDataSparsity
+    sparsifier_args = {
+        'sparsity_level': 0.5,
+        'sparse_block_shape': (1, 4),
+        'zeros_per_block': 4
+    }
+    pt_callback = PostTrainingDataSparsity(data_sparsifier_class=DataNormSparsifier, data_sparsifier_args=sparsifier_args)
+    ```
+
+2. `TrainingAwareDataSparsity`: callback class to sparsify model during training. In addition to `data_sparsifier_class` and `data_sparsifier_args`,
+    also accepts
+    1. `data_scheduler_class`: class/type of data scheduler to schedule the sparsity levels during training. Only the class should be passed, the object
+    will be created internally whenever necessary.
+    2. `data_scheduler_args`: the arguments/config for the data scheduler constructor that will be used while creating the object.
+
+    Example:
+
+    ```
+    from data_sparsity import TrainingAwareDataSparsity
+    sparsifier_args = {
+        'sparsity_level': 0.5,
+        'sparse_block_shape': (1, 4),
+        'zeros_per_block': 4
+    }
+    scheduler_args = {
+        'gamma': 2,
+        'step_size': 1
+    }
+
+    ta_callback = TrainingAwareDataSparsity(
+        data_sparsifier_class=DataNormSparsifier,
+        data_sparsifier_args=sparsifier_args,
+        data_scheduler_class=StepSLScheduler,
+        data_scheduler_args=scheduler_args
+    )
+    ```
+
+**Note:**
+1. The model is copied and then sparsified, so the existing model is not modified.
+2. The sparsified model can be accessed using `sparsified` attribute and can be used for comparison with the original version.
+3. The data sparsifier/scheduler object will be created internally and will be attached to the model by the callback whenever necessary.
+
+## Usage
+```
+pl_module = SomePLModule()  # pl_module.model should specify the pytorch model
+
+ds_callback = SomeDataSparsifierCallback(data_sparsifier_class=..., data_sparsifier_args=..., ...)  # add scheduler if TrainingAwareDataSparsifier
+trainer = Trainer(callbacks=[ds_callback])
+
+trainer.fit(pl_module, train_data_loader, val_data_loader)
+
+# NOTE: pl_module.model is not sparsified
+
+# access sparsified model
+sparsified_model = ds_callback.sparsified
+```