fix exclusive cumsum calculation by bkal01 · Pull Request #109 · ScalingIntelligence/KernelBench

bkal01 · 2025-12-22T16:05:08Z

as @AKatydid pointed out on #72, the current exclusive cumsum computation is incorrect.

now we apply torch.cumsum on x (without the last elements along the dimension) and shifts them right by one by prepending 0s.

@AKatydid

as @AKatydid pointed out on #72, the current exclusive cumsum computation is incorrect. now we apply torch.cumsum on x (without the last elements along the dimension) and shifts them right by one by prepending 0s.

simonguozirui · 2025-12-24T17:01:37Z

Double-checked the math and wrote a short script to verify it against the PyTorch reference implementation. using roll (not efficient, but easy to check). Equivalent on both cpu and gpu.

def exclusive_cumsum_ref(x: torch.Tensor, dim: int) -> torch.Tensor:
    c = torch.cumsum(x, dim=dim)
    y = torch.roll(c, shifts=1, dims=dim)

    idx = [slice(None)] * x.ndim
    idx[dim] = 0
    y[tuple(idx)] = 0
    return y

Thanks for the fix @bkal01 and @AKatydid for pointing out on #72. Also started change log to document ongoing problem updates!

@AKatydid

* fix exclusive cumsum calculation as @AKatydid pointed out on #72, the current exclusive cumsum computation is incorrect. now we apply torch.cumsum on x (without the last elements along the dimension) and shifts them right by one by prepending 0s. * check and add changelog --------- Co-authored-by: Simon Guo <simonguo@stanford.edu>

@AKatydid

* fix exclusive cumsum calculation as @AKatydid pointed out on #72, the current exclusive cumsum computation is incorrect. now we apply torch.cumsum on x (without the last elements along the dimension) and shifts them right by one by prepending 0s. * check and add changelog --------- Co-authored-by: Simon Guo <simonguo@stanford.edu>

fix exclusive cumsum calculation

4907b86

as @AKatydid pointed out on #72, the current exclusive cumsum computation is incorrect. now we apply torch.cumsum on x (without the last elements along the dimension) and shifts them right by one by prepending 0s.

bkal01 requested a review from simonguozirui December 22, 2025 16:05

simonguozirui mentioned this pull request Dec 24, 2025

[Puzzle] Level1 cumsum_exclusive #72

Closed

check and add changelog

d1e18fa

simonguozirui approved these changes Dec 24, 2025

View reviewed changes

simonguozirui merged commit 6bab08b into main Dec 24, 2025

simonguozirui mentioned this pull request Dec 24, 2025

Fix shape for exclusive cumulative sum #63

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix exclusive cumsum calculation#109

fix exclusive cumsum calculation#109
simonguozirui merged 2 commits intomainfrom
fix-exclusive-cumsum

bkal01 commented Dec 22, 2025

Uh oh!

simonguozirui commented Dec 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bkal01 commented Dec 22, 2025

Uh oh!

simonguozirui commented Dec 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants