How to support topk>2 (topk=6 is needed in our experiment) in MoE model? #5155

We found the following check in "deepspeed.inference.v2.modules.implementations.moe.cutlass_multi_gemm.DSMultiGemmMoE":

@staticmethod
def supports_config(config: DSMoEConfig) -> bool:
    if config.input_dtype != config.output_dtype:
        return False

    if config.input_dtype != torch.float16 and config.input_dtype != torch.bfloat16:
        return False

    if config.top_k != 1 and config.top_k != 2:
        return False

    return True

This means DSMoE only supports topk=1 or topk=2.

However, our experiment needs support for topk=6.
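
To make the rejection concrete, here is a minimal sketch that mirrors the logic of the check quoted above without importing DeepSpeed or torch. `FakeMoEConfig`, `SUPPORTED_DTYPES`, and `SUPPORTED_TOP_K` are hypothetical stand-ins introduced only for illustration (dtypes are plain strings here); they are not part of the DeepSpeed API.

```python
from dataclasses import dataclass


# Hypothetical stand-in for DSMoEConfig: only the fields the check reads.
@dataclass
class FakeMoEConfig:
    input_dtype: str
    output_dtype: str
    top_k: int


# Assumed equivalents of the values accepted by the quoted check.
SUPPORTED_DTYPES = ("float16", "bfloat16")
SUPPORTED_TOP_K = (1, 2)


def supports_config(config: FakeMoEConfig) -> bool:
    """Mirror of the quoted check, rewritten with membership tests."""
    if config.input_dtype != config.output_dtype:
        return False
    if config.input_dtype not in SUPPORTED_DTYPES:
        return False
    if config.top_k not in SUPPORTED_TOP_K:
        return False
    return True


if __name__ == "__main__":
    # top_k=2 passes the check, top_k=6 does not.
    print(supports_config(FakeMoEConfig("bfloat16", "bfloat16", 2)))
    print(supports_config(FakeMoEConfig("bfloat16", "bfloat16", 6)))
```

With these stand-ins, a config with topk=6 is rejected purely because of the last condition. Whether widening that condition alone would be enough is unclear to us; the underlying kernels may also be written specifically for topk=1 and topk=2.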
