fix: eliminate RuntimeWarnings in von Mises-Fisher loss backward pass#824

Merged
carlosm-silva merged 2 commits into graphnet-team:main from The-Blenderers:main
Sep 8, 2025
Conversation

@carlosm-silva
Collaborator

🎯 Summary

Fixes RuntimeWarnings in LogCMK.backward when processing arrays containing zero values, while maintaining mathematical accuracy and numerical stability.

🔬 Mathematical Background

This PR implements a numerically stable solution for computing gradients in the von Mises-Fisher loss function. The core issue was that while the mathematical limit:

$$\lim_{\kappa \to 0} \left(\frac{1}{\kappa} - \frac{1}{\tanh(\kappa)}\right) = 0$$

is well-defined, naive floating-point evaluation triggers division by zero warnings.
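The limit follows from the standard Laurent expansion of the hyperbolic cotangent about zero (restated here for convenience):

```latex
% coth(kappa) = 1/tanh(kappa) expanded about kappa = 0:
\coth\kappa = \frac{1}{\kappa} + \frac{\kappa}{3} - \frac{\kappa^{3}}{45} + O(\kappa^{5})
% hence the gradient term is
\frac{1}{\kappa} - \frac{1}{\tanh\kappa}
  = -\frac{\kappa}{3} + \frac{\kappa^{3}}{45} - O(\kappa^{5})
  \;\xrightarrow{\;\kappa \to 0\;}\; 0
```

Truncating after the linear term gives the Taylor approximation used below, with leading truncation error |κ|³/45.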

Detailed mathematical derivations and proofs are provided in:

  • Enhanced docstrings in src/graphnet/training/loss_functions.py
  • New documentation: docs/source/models/von_mises_fisher_mathematical_background.md

🔧 Changes

Code Changes

  • Fixed zero handling: Replaced np.where() with boolean masking, since np.where() evaluates both branches eagerly and the exact formula was still being computed at κ = 0
  • Enhanced docstrings: Added comprehensive mathematical background with Google-style formatting
  • Comprehensive tests: Added test_logcmk_backward_zero_handling() with warning capture
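The root cause is that np.where() computes both branches over the full array before selecting between them, so the exact expression still runs on the zero entries. A minimal reproduction of the original warnings (illustrative only, not the PR's actual code):

```python
import warnings
import numpy as np

kappa = np.array([0.0, 1.0])
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    # np.where evaluates the second branch eagerly, so 1/kappa and
    # 1/tanh(kappa) both run on the zero entry before selection happens.
    result = np.where(kappa == 0, 0.0, 1.0 / kappa - 1.0 / np.tanh(kappa))

# The selected output is still correct at kappa = 0...
assert result[0] == 0.0
# ...but the divide-by-zero / invalid-value RuntimeWarnings already fired.
assert any(issubclass(w.category, RuntimeWarning) for w in caught)
```

Boolean masking avoids this by indexing the array first, so the exact formula is only ever evaluated on entries safely away from zero.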

Implementation Details

  • Small κ: Uses Taylor approximation -κ/3 for |κ| < 1e-6
  • Large κ: Uses exact formula 1/κ - 1/tanh(κ)
  • Error bound: Truncation error ≤ |κ|³/45 ≈ 2×10⁻²⁰ at the threshold |κ| = 10⁻⁶
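The two-branch split can be sketched as follows. This is a self-contained illustration of the boolean-masking approach; the actual function and variable names inside LogCMK.backward may differ.

```python
import numpy as np

def grad_log_cmk_term(kappa: np.ndarray, threshold: float = 1e-6) -> np.ndarray:
    """Stably evaluate f(k) = 1/k - 1/tanh(k), which tends to 0 as k -> 0.

    Sketch of the boolean-masking approach (names are illustrative).
    """
    kappa = np.asarray(kappa, dtype=float)
    out = np.empty_like(kappa)
    small = np.abs(kappa) < threshold
    # Taylor branch: f(k) ~ -k/3, truncation error <= |k|^3 / 45
    out[small] = -kappa[small] / 3.0
    # Exact branch: only evaluated where kappa is safely away from zero
    out[~small] = 1.0 / kappa[~small] - 1.0 / np.tanh(kappa[~small])
    return out
```

Because the exact expression never sees κ = 0, no warning suppression or post-hoc NaN cleanup is needed.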

Documentation

  • Mathematical background: Complete derivations, Taylor series analysis, and error bounds
  • Implementation details: Explains boolean masking approach and threshold selection
  • Integration: Added to docs/source/models/models.rst

🧪 Testing

# Run the specific test
python -m pytest tests/training/test_loss_functions.py::test_logcmk_backward_zero_handling -v

# Expected: PASSED with NO RuntimeWarnings

Test Coverage

  • ✅ Zero values don't raise exceptions
  • ✅ No RuntimeWarnings generated
  • ✅ Gradients remain finite for all inputs
  • ✅ Mathematical accuracy: f(0) = 0 exactly
  • ✅ Correct Taylor approximation for small κ
  • ✅ Arrays with multiple zeros handled correctly
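The warning-capture pattern used by the test can be sketched as below. The real test lives in tests/training/test_loss_functions.py and exercises LogCMK.backward directly; the helper here merely stands in for that call.

```python
import warnings
import numpy as np

def stable_term(kappa: np.ndarray) -> np.ndarray:
    """Stand-in for the gradient term computed in LogCMK.backward."""
    out = np.empty_like(kappa)
    small = np.abs(kappa) < 1e-6
    out[small] = -kappa[small] / 3.0
    out[~small] = 1.0 / kappa[~small] - 1.0 / np.tanh(kappa[~small])
    return out

def test_logcmk_backward_zero_handling_sketch() -> None:
    kappa = np.array([0.0, 1e-8, 0.5, 0.0, 3.0])
    with warnings.catch_warnings():
        # Escalate any RuntimeWarning to an error so the test fails loudly
        warnings.simplefilter("error", RuntimeWarning)
        out = stable_term(kappa)
    assert np.all(np.isfinite(out))          # gradients stay finite
    assert out[0] == 0.0 and out[3] == 0.0   # f(0) = 0 exactly

test_logcmk_backward_zero_handling_sketch()
```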

📊 Before/After

Before:

RuntimeWarning: divide by zero encountered in divide
RuntimeWarning: invalid value encountered in subtract

After:

tests/training/test_loss_functions.py::test_logcmk_backward_zero_handling PASSED [100%]

🔗 References

  • von Mises-Fisher distribution: Wikipedia
  • arXiv:1812.04616, Section 8.2
  • Original MIT License implementation by Max Ryabinin (2019)

✅ Checklist

  • Follows PEP8 conventions
  • Google-style docstrings with type hints
  • Comprehensive unit tests added
  • Mathematical background documented
  • No RuntimeWarnings generated
  • All existing tests pass
  • Clean implementation without code duplication

Mathematical accuracy preserved • Numerical stability achieved • Warnings eliminated

- Replace np.where with boolean masking to avoid double evaluation
- Add comprehensive unit tests for zero handling in LogCMK.backward
- Enhance docstrings with mathematical background and implementation details
- Add mathematical background documentation in docs/source/models/

Fixes division by zero warnings while maintaining numerical accuracy.
Error bound analysis shows truncation error of at most |κ|³/45 ≈ 2×10⁻²⁰ for |κ| < 10⁻⁶.

@shubhamos-ai shubhamos-ai left a comment


📚 Well-structured code! Consider adding more detailed documentation, implementing CI/CD pipelines, and adding performance monitoring for production readiness.

Collaborator

@RasmusOrsoe RasmusOrsoe left a comment


Hey @carlosm-silva thank you very much for this clean contribution!

I did a few checks on the compatibility of the gradients from this change in backward w.r.t. the current implementation and was happy to find that they agree except at kappa = 0.

When I took a closer look at the documentation introduced in the PR, I found that the math appears to be rendered incorrectly (see here). Could you double-check that it's OK?

Tagging @Aske-Rosted for completeness. @Aske-Rosted Do you have comments?

@@ -0,0 +1,200 @@
# Mathematical Background: von Mises-Fisher Loss Implementation
Collaborator


Most of the math here appears not to render correctly in the markdown file.

Collaborator Author


Interestingly, it renders correctly when I copy and paste it into my local Markdown environment (Obsidian). The minus sign is being interpreted as the start of a list. I'm unfamiliar with GitHub's Markdown notation, so I'm unsure how to proceed.

Collaborator Author


Never mind, I fixed it in 510f4ba

Corrected mathematical expressions in the von Mises-Fisher documentation so they render properly in GitHub Markdown.
@RasmusOrsoe
Collaborator

@carlosm-silva thank you for the quick iteration! The docs now look much better. I have no further comments at this stage.

Let's give @Aske-Rosted a chance to catch up before we proceed.

@Aske-Rosted
Collaborator

Hi @carlosm-silva,

Thanks for this very well documented contribution!
I too think that this looks ready to go and do not have any further comments or suggestions.

@RasmusOrsoe RasmusOrsoe self-requested a review September 5, 2025 07:21
Collaborator

@RasmusOrsoe RasmusOrsoe left a comment


🚀

@carlosm-silva carlosm-silva merged commit 1bb42ad into graphnet-team:main Sep 8, 2025
7 checks passed