[Pipe] Tied layers

## 🚀 Feature


Continuing the [requests](https://github.com/pytorch/pytorch/pull/50693) to support various needs of the models in the new Pipe pytorch feature, this one brings up 

### Tied layers

For models that have tied weights (e.g. encoder/decoder in transformer-type of models. It's important to:
- avoid memory duplication
- handle the gradients correctly (have to be reduced twice in the typical encoder+decoder transformer scenario)

You can see this feature provided in DeepSpeed [Tied Layers](https://www.deepspeed.ai/tutorials/pipeline/#tied-layers)  and its discussion.

Thank you.

@pritamdamania87

cc @pietern @mrshenli @pritamdamania87 @zhaojuanmao @satgera @rohan-varma @gqchen @aazzolini @osalpekar @jiayisuse @agolynski @SciPioneer @H-Huang @mrzzd @cbalioglu @gcramer23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Pipe] Tied layers #51931

🚀 Feature

Tied layers

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Pipe] Tied layers #51931

Description

🚀 Feature

Tied layers

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions