Adds fix for sporadic CI bug in Barnes-Hut test #1580
peterdsharpe merged 7 commits into NVIDIA:main
… consistent TensorDict structure. Update test tolerances for output comparisons to enhance robustness against numerical discrepancies.
Root cause: |
Greptile Summary
This PR fixes a sporadic CI failure in the Barnes-Hut kernel caused by TensorDict returning leaves in a non-deterministic iteration order across library versions.
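To illustrate the kind of failure described above, here is a minimal sketch of an order-independent comparison with explicit tolerances. It is not the code from this PR: plain nested dicts stand in for a TensorDict, float leaves stand in for tensors, and the helper names `flatten_leaves` and `assert_trees_close` as well as the tolerance values are hypothetical.

```python
import math


def flatten_leaves(tree, prefix=()):
    """Map each leaf's key path (a tuple) to its value.

    Plain nested dicts are an assumption here; they stand in for a
    TensorDict so the sketch has no third-party dependencies.
    """
    flat = {}
    for key, value in tree.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            flat.update(flatten_leaves(value, path))
        else:
            flat[path] = value
    return flat


def assert_trees_close(actual, expected, rel_tol=1e-5, abs_tol=1e-8):
    """Compare leaves by key path, never by iteration position."""
    actual_flat = flatten_leaves(actual)
    expected_flat = flatten_leaves(expected)
    # The set of key paths must match, whatever order they were inserted in.
    assert actual_flat.keys() == expected_flat.keys(), "leaf key paths differ"
    # Visit paths in sorted order so any failure message is reproducible.
    for path in sorted(expected_flat):
        a, e = actual_flat[path], expected_flat[path]
        assert math.isclose(a, e, rel_tol=rel_tol, abs_tol=abs_tol), (
            f"mismatch at {path}: {a} vs {e}"
        )


# Same leaves, different insertion order, tiny numerical difference.
expected = {"field": {"x": 1.0, "y": 2.0}, "mass": 3.0}
actual = {"mass": 3.0 + 1e-9, "field": {"y": 2.0, "x": 1.0}}
assert_trees_close(actual, expected)
```

Pairing leaves by key path means a test of this shape does not depend on whichever order a given library version happens to iterate in.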
/blossom-ci
PhysicsNeMo Pull Request
Description
In CI, seeing a sporadic bug reported by @ktangsali (example failing run):
CI trace report
This is possibly introduced by #1494, which recently modified these files. However, CI passed on this PR, and this bug seems to occur only sporadically.
On local machines, these tests seem to pass consistently.
For now, adding diagnostic code here to try reproducing this on CI and get more debugging details.
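As a rough idea of what such diagnostics could record, the sketch below collects the Python version and installed package versions so that a failing CI run can be compared against a passing local run. This is an assumption about what might be useful, not the diagnostic code actually added in this PR; the function name `collect_env_report` and the package names passed to it are hypothetical.

```python
import platform
from importlib import metadata


def collect_env_report(packages):
    """Return a dict of interpreter and package versions for a CI log."""
    report = {"python": platform.python_version()}
    for name in packages:
        try:
            report[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Record the absence instead of failing the diagnostic itself.
            report[name] = "not installed"
    return report


# Package names below are assumptions chosen for illustration.
report = collect_env_report(["torch", "tensordict"])
for name, version in sorted(report.items()):
    print(f"{name}: {version}")
```

Printing this at the start of the failing test would show whether the CI environment resolves different library versions than a local machine.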
Checklist
Dependencies
Review Process
All PRs are reviewed by the PhysicsNeMo team before merging.
Depending on which files are changed, GitHub may automatically assign a maintainer for review.
We are also testing AI-based code review tools (e.g., Greptile), which may add automated comments with a confidence score.
This score reflects the AI’s assessment of merge readiness and is not a qualitative judgment of your work, nor is
it an indication that the PR will be accepted / rejected.
AI-generated feedback should be reviewed critically for usefulness.
You are not required to respond to every AI comment, but they are intended to help both authors and reviewers.
Please react to Greptile comments with 👍 or 👎 to provide feedback on their accuracy.