chore(debug): remove FusedRMSNormGPU sub-step probes (E98 T98.3.2)#91

Merged
dndungu merged 2 commits into main from e98-t98.3.2-cleanup-debug-probes
Apr 15, 2026

dndungu commented Apr 15, 2026

Strips the ZERFOO_GQA_DEBUG-gated probes that T98.2.2 added to gpu_fused_rmsnorm.go. The bug those probes localized was fixed in T98.2.3 / PR #89.

dndungu added 2 commits April 14, 2026 19:07
Probes were added in T98.2.2 to pin the corrupting sub-step inside
FusedRMSNormGPU. Bug was found (devIn=0x0, pass-through aliasing
issue) and fixed in T98.2.3. Probes no longer needed.

Refs E98 T98.3.2.
dndungu merged commit fd646fb into main Apr 15, 2026
1 check passed
dndungu deleted the e98-t98.3.2-cleanup-debug-probes branch April 15, 2026 02:09