Grouped swizzle kernel #2451

@ptrendx

Description

Develop a grouped swizzle kernel for the GroupedTensor type to support efficient memory layout transformations in MoE training. The kernel must also work when the problem sizes are supplied on the device.
