Fix Dirichlet.log_prob() when x=0 and alpha=1 (#103605)
Conversation
Change sounds good!
Just a small question on the test.
x = torch.tensor([0, 1])
actual_log_prob = dist.log_prob(x)
expected_log_prob = scipy.stats.dirichlet.logpdf(x.numpy(), alpha.numpy())
self.assertEqual(actual_log_prob, expected_log_prob, atol=1e-3, rtol=0)
Do you really need to override the tolerances here? I would expect the defaults to work fine.
I just copied the whole assertion line from the existing Dirichlet.log_prob() test. I don't know whether there's a good reason these tolerances were chosen in the first place; maybe some test environment uses low-precision floats. In any case, my goal was to be consistent. Also, this test only really needs to distinguish NaN from non-NaN, so the tolerance doesn't matter for that.
OK, let's keep it as is in the name of consistency then. I agree that it checks what we want here anyway.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Dirichlet.log_prob() incorrectly returns NaN when x contains a zero in a position where the corresponding concentration parameter alpha is 1. This corresponds to the case where one of the terms in the log-density has the form 0 * log(0). This PR adopts the same approach that scipy.stats.dirichlet uses to avoid this behavior, namely computing xlogy(alpha - 1, x) instead of (alpha - 1) * log(x). It also adds a test case comparing the PyTorch and SciPy implementations on this specific input.
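The xlogy trick can be sketched outside PyTorch with NumPy and SciPy. This is a minimal illustration of the fix, not the PR's actual code; the alpha and x values below are made up for the example:

```python
import numpy as np
from scipy.special import xlogy

# Example concentration parameters and a boundary point of the simplex:
# alpha[0] == 1 and x[0] == 0, the case this PR fixes.
alpha = np.array([1.0, 2.0])
x = np.array([0.0, 1.0])

# Naive term: (alpha - 1) * log(x) produces 0 * log(0) = 0 * (-inf) = nan.
with np.errstate(divide="ignore", invalid="ignore"):
    naive = ((alpha - 1.0) * np.log(x)).sum()

# xlogy(a, b) is defined to return 0 whenever a == 0, even if b == 0,
# so the 0 * log(0) term contributes 0 instead of nan.
fixed = xlogy(alpha - 1.0, x).sum()

print(naive)  # nan
print(fixed)  # 0.0
```

PyTorch exposes the same primitive as torch.xlogy (also torch.special.xlogy), which is what makes this a small change in the log_prob() computation.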