Patch non-repetitive sampling for "inv_pop_f" #300
optimized for non-repetitive sampling
for more information, see https://pre-commit.ci
Walkthrough
The candidate selection logic in the adaptive lower report module was updated.
Estimated code review effort: 2 (Simple), ~8 minutes
Codecov Report
All modified and coverable lines are covered by tests.
Additional details and impacted files
@@           Coverage Diff           @@
##           master     #300   +/-   ##
=======================================
  Coverage   84.22%   84.22%
=======================================
  Files         104      104
  Lines        6111     6112    +1
=======================================
+ Hits         5147     5148    +1
  Misses        964      964
Use non-repetitive sampling (sampling without replacement) instead of the original repetitive sampling (sampling with replacement) when using "candi_sel_prob": "inv_pop_f".
random.choices -> numpy.random.choice(replace=False)
The original behavior could pick a large portion of repeated long-tail, low-frequency samples (the longer the tail, the worse the case). This caused tens of percent of the downstream fp calculations to be repeated and amplified the label noise from these high-force configurations.
The non-repetitive sampling renormalizes the probabilities after screening out each picked sample.
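The difference between the two calls can be illustrated with a minimal sketch. This is not the project's code: the helper names (`inv_pop_weights`, `pick_with_replacement`, `pick_without_replacement`), the toy cluster layout, and the assumption that "inv_pop_f" weights each candidate by the inverse of its cluster population are all hypothetical and chosen only for illustration. A seeded `numpy.random.RandomState` is used so the sketch is reproducible; its `choice` method takes the same arguments as `numpy.random.choice`.

```python
import random
from collections import Counter

import numpy as np


def inv_pop_weights(cluster_ids):
    """Hypothetical weighting: 1 / (population of the candidate's cluster)."""
    counts = Counter(cluster_ids)
    return [1.0 / counts[c] for c in cluster_ids]


def pick_with_replacement(candidates, weights, k, seed=0):
    """Old behavior: random.choices may return the same candidate many times."""
    rng = random.Random(seed)
    return rng.choices(candidates, weights=weights, k=k)


def pick_without_replacement(candidates, weights, k, seed=0):
    """New behavior: every pick is distinct.

    Note: with replace=False, p must have at least k non-zero entries,
    otherwise NumPy raises a ValueError.
    """
    rs = np.random.RandomState(seed)
    p = np.asarray(weights, dtype=float)
    p = p / p.sum()  # numpy requires probabilities that sum to 1
    idx = rs.choice(len(candidates), size=k, replace=False, p=p)
    return [candidates[i] for i in idx]


# Toy data: 90 candidates share one big cluster, 10 candidates are singletons.
# The singletons form the low-frequency tail and carry most of the weight.
cluster_ids = [0] * 90 + list(range(1, 11))
candidates = list(range(len(cluster_ids)))
weights = inv_pop_weights(cluster_ids)
k = 20

repeated = pick_with_replacement(candidates, weights, k)
unique_picks = pick_without_replacement(candidates, weights, k)

print("with replacement:   ", len(set(repeated)), "distinct of", k)
print("without replacement:", len(set(unique_picks)), "distinct of", k)
```

In this toy setup the ten singleton candidates hold 10/11 of the total weight, so sampling with replacement tends to draw them repeatedly, while sampling without replacement always returns `k` distinct candidates.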