
Conversation

@beverlylytle (Collaborator) commented Oct 10, 2025

Fixes #2599

In the HF model ibm-granite/granite-3.1-3b-a800m-instruct, a certain set of indices is computed by dividing two integer tensors. In general, `div` is differentiable, but not when both operands have an exact dtype and the rounding mode is `"trunc"`. Because there is no exception for this case, autodiff fetches the augmented forward coming from `div`'s grad transform, and this returns an inexact result. This PR introduces a new prim, `DIV_EXACT`, which is the same as `prims.DIV` except that no grad transform is registered for it. A check on dtypes forwards the call to `DIV_EXACT` instead of `DIV`.
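
For context, here is a rough sketch of the dtype-based dispatch described above. The names `div_with_grad` and `div_exact` are illustrative stand-ins for the prims, not thunder's actual symbols:

```py
import torch

# Stand-in for prims.DIV: this path has a grad transform registered, so autodiff
# would pick up its augmented forward (which returns an inexact result).
def div_with_grad(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    return torch.true_divide(a, b)

# Stand-in for the new DIV_EXACT prim: integer division with no grad transform.
def div_exact(a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    return torch.div(a, b, rounding_mode="trunc")

def div(a: torch.Tensor, b: torch.Tensor, rounding_mode: str | None = None) -> torch.Tensor:
    both_exact = not a.dtype.is_floating_point and not b.dtype.is_floating_point
    if rounding_mode == "trunc" and both_exact:
        # Exact-dtype trunc division is not differentiable: skip the grad transform.
        return div_exact(a, b)
    out = div_with_grad(a, b)
    return out.trunc() if rounding_mode == "trunc" else out
```

The point of the routing is that the exact-dtype, `"trunc"` branch never touches the differentiable path, so no augmented forward is recorded for it.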

@beverlylytle beverlylytle marked this pull request as ready for review October 10, 2025 12:28
@kshitij12345 (Collaborator) left a comment


LGTM, thanks @beverlylytle

@t-vi (Collaborator) commented Oct 10, 2025

Is `div_exact` a good name? I thought `floor_div` was great to represent `//`

@beverlylytle (Collaborator, Author) replied:

> Is `div_exact` a good name? I thought `floor_div` was great to represent `//`

I am open to changing the name, but since this prim is used for both `floor_div` and `trunc_div`, I don't think it should be named `floor_div`. Do you have another suggestion?
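
For reference, the two rounding modes disagree on negative operands, which is why a prim shared by `//` (floor) and `rounding_mode="trunc"` can't reasonably carry the name `floor_div`:

```py
import torch

a = torch.tensor([-7, 7])
b = torch.tensor([2, 2])
print(torch.div(a, b, rounding_mode="floor"))  # tensor([-4,  3])  -- matches //
print(torch.div(a, b, rounding_mode="trunc"))  # tensor([-3,  3])  -- rounds toward zero
```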

@kiya00 (Collaborator) left a comment


LGTM, thank you @beverlylytle

@t-vi (Collaborator) left a comment


@t-vi merged commit 80461ba into main Oct 13, 2025
48 of 51 checks passed
@t-vi deleted the add_div_exact branch October 13, 2025 08:21
The following review comment was left on `test_div_exact` in the tests:

```py
assert_close(fn(a), jfn(a))


def test_div_exact():
```

The test function works even without the change introduced in this PR:

`git checkout 80461ba^`

```py
In [1]: import torch, thunder

In [2]:     def fn(a, b, c):
   ...:         indices = torch.div(a, b, rounding_mode="trunc")
   ...:         # this would throw an error if indices are not ints
   ...:         return c[indices]
   ...: 

In [3]:     jfn = thunder.jit(fn)
   ...:     a = torch.randint(1, 5, (5,))
   ...:     b = torch.ones(5, dtype=torch.int32)
   ...:     c = torch.randn(5, 5)
   ...:     fn(a, b, c), jfn(a, b, c)
Out[3]: 
(tensor([[ 0.5290, -2.2824,  1.0693, -1.6769, -1.9725],
         [-1.0802, -0.4437, -0.7387, -1.1378,  0.6227],
         [-0.1425,  0.3009,  1.3933, -0.8111,  0.2195],
         [-0.1425,  0.3009,  1.3933, -0.8111,  0.2195],
         [-0.1425,  0.3009,  1.3933, -0.8111,  0.2195]]),
 tensor([[ 0.5290, -2.2824,  1.0693, -1.6769, -1.9725],
         [-1.0802, -0.4437, -0.7387, -1.1378,  0.6227],
         [-0.1425,  0.3009,  1.3933, -0.8111,  0.2195],
         [-0.1425,  0.3009,  1.3933, -0.8111,  0.2195],
         [-0.1425,  0.3009,  1.3933, -0.8111,  0.2195]))
```

The `c` tensor must have `requires_grad=True` to trigger the buggy code path.

@beverlylytle (Collaborator, Author) replied:

Oh, shoot. `c` lost its `requires_grad` in the various iterations. Good catch.
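
For completeness, the reviewer's snippet with that fix applied (a sketch; the merged test may differ in its details):

```py
import torch
import thunder

def fn(a, b, c):
    indices = torch.div(a, b, rounding_mode="trunc")
    # this would throw an error if indices are not ints
    return c[indices]

jfn = thunder.jit(fn)
a = torch.randint(1, 5, (5,))
b = torch.ones(5, dtype=torch.int32)
# requires_grad=True is what engages autodiff and, before this PR, div's grad transform
c = torch.randn(5, 5, requires_grad=True)
torch.testing.assert_close(fn(a, b, c), jfn(a, b, c))
```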



Successfully merging this pull request may close these issues.

HF ibm-granite/granite-3.1-3b-a800m-instruct: index_add_(): Expected dtype int32/int64 for index but got: Float
