Skip to content

[FEA] Support HSTU Context parallelism in training #7

@shijieliu

Description

@shijieliu

Is your feature request related to a problem? Please describe.
Context parallelism to scale HSTU training.

Describe the solution you'd like
TBD

Describe alternatives you've considered
TBD
Additional context
See Context Parallelism

Metadata

Metadata

Assignees

No one assigned

    Labels

    TBDTo be determined

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions