Skip to content

[paddle-adapt] gemm/test_tgv_gemm + test_group_gemm: zero-diff, both PASS out-of-the-box#20

Merged
BingooYang merged 1 commit into
PFCCLab:0.6from
BingooYang:adapt/gemm_tgv_group
May 18, 2026
Merged

[paddle-adapt] gemm/test_tgv_gemm + test_group_gemm: zero-diff, both PASS out-of-the-box#20
BingooYang merged 1 commit into
PFCCLab:0.6from
BingooYang:adapt/gemm_tgv_group

Conversation

@BingooYang
Copy link
Copy Markdown

Description

Verify that tests/gemm/test_tgv_gemm.py and tests/gemm/test_group_gemm.py work under Paddle-compat mode without any source file modifications.

All APIs are either fully aligned or covered by paddle.enable_compat().

Test file Passed Skipped Reason for skips
test_tgv_gemm.py 90 0
test_group_gemm.py 288 360 sm90 hw not present (288) + batch too large (72)

Both skip reasons are upstream hardware guards, not paddle-adaptation issues.

Changed files: scripts/paddle_all_test_cases.sh: +7 lines

No paddle-adaptation changes: git diff upstream/0.6 -- *.py *.h *.cu *.cc = 0 lines

Regression:

  • norm/test_fused_rmsnorm_silu: PASS
  • norm/test_fused_dit_layernorm: PASS (35/35)
  • comm/test_trtllm_allreduce_fusion: PASS (1/1)
  • attention_sink / moe_smoke: pre-existing CCCL submodule issue (not caused by this PR)

Related Issues

N/A

Pull Request Checklist

  • pre-commit run --files scripts/paddle_all_test_cases.sh: all checks passed

Tests

  • test_tgv_gemm.py: 90/90 PASS
  • test_group_gemm.py: 288 PASS, 360 SKIP (hardware constraints)
  • Regression tests (norm, allreduce) PASS

Reviewer Notes

This PR only adds 7 lines to the shell test script. Zero changes to Python/C++ source.
The test files run out-of-the-box under paddle.enable_compat() -- no adaptation needed.

…PASS out-of-the-box

- test_tgv_gemm.py: 90/90 PASS; paddle.enable_compat() covers all APIs
  (F.linear, F.cosine_similarity, torch.randn, torch.float16/bfloat16)
  Note: all 90 cases SKIP on non-SM100 hardware (_match_sm_version guard)
- test_group_gemm.py: 288 PASS, 360 SKIP (sm90 hw / batch too large, upstream guards)
  SegmentGEMMWrapper, torch.testing.assert_close, torch.manual_seed all in ALIGNED_APIS
- No source file changes (git diff upstream/0.6 = 0 lines)
- Regression: norm/allreduce PASS; attention_sink/moe_smoke pre-existing CCCL
  submodule issue (not paddle-adaptation related)

Refs: adaptation-paddle/adaptation_exp.md (no new entries needed)
@BingooYang BingooYang merged commit 7f84b32 into PFCCLab:0.6 May 18, 2026
1 of 2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant