[grid] Improve Node handling concurrent sessions creation #16913
Signed-off-by: Viet Nguyen Duc <nguyenducviet4496@gmail.com>
🔗 Related Issues
💥 What does this PR do?
Adds a configurable threshold that automatically marks a Node as DOWN once the number of consecutive session creation failures reaches it. This helps detect and isolate unhealthy Nodes that consistently fail to create sessions, improving Grid reliability.
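A minimal sketch of the consecutive-failure idea described above, in Java. This is not the code in this PR: the class name `FailureThresholdSketch` and the `onSessionCreated()` / `onSessionCreationFailed()` hooks are hypothetical. It also assumes that a successful session resets the counter and that a manual reset clears the DOWN state; neither is confirmed by the description.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration only; not the actual Selenium Grid Node class.
final class FailureThresholdSketch {
  // Assumed to be the value passed via --node-down-failure-threshold (must be > 0 here).
  private final int threshold;
  private final AtomicInteger consecutiveFailures = new AtomicInteger();
  private volatile boolean down;

  FailureThresholdSketch(int threshold) {
    if (threshold <= 0) {
      throw new IllegalArgumentException("threshold must be positive");
    }
    this.threshold = threshold;
  }

  // Assumption: any successful session creation clears the failure streak.
  void onSessionCreated() {
    consecutiveFailures.set(0);
  }

  // Each failure extends the streak; reaching the threshold marks the node DOWN.
  void onSessionCreationFailed() {
    if (consecutiveFailures.incrementAndGet() >= threshold) {
      down = true;
    }
  }

  // Assumption: a manual reset clears both the counter and the DOWN flag.
  void resetConsecutiveSessionFailures() {
    consecutiveFailures.set(0);
    down = false;
  }

  int getConsecutiveSessionFailures() {
    return consecutiveFailures.get();
  }

  boolean isDown() {
    return down;
  }
}
```

With a threshold of 3, two failures followed by a success leave the node up with a streak of 0; three failures in a row mark it DOWN until the reset method is called.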
- --node-down-failure-threshold
Behavior
- resetConsecutiveSessionFailures() method
Additional Improvements Included
- maxSessionCount
- tryReserve() instead of synchronized block for better concurrency
- restoreSessionCount() prevents premature draining on failed session attempts
- ConsecutiveSessionFailures attribute exposed for monitoring
- resetConsecutiveSessionFailures() allows programmatic recovery
🔧 Implementation Notes
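As a rough illustration of the "tryReserve() instead of synchronized block" point, here is a lock-free slot reservation built on a compare-and-set loop. Only the names `tryReserve` and `maxSessionCount` come from the description; the class `SessionSlotsSketch`, the `release()` method, and the overall shape are assumptions, not the PR's implementation.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration only; not the actual Selenium Grid code.
final class SessionSlotsSketch {
  private final int maxSessionCount;
  private final AtomicInteger active = new AtomicInteger();

  SessionSlotsSketch(int maxSessionCount) {
    this.maxSessionCount = maxSessionCount;
  }

  // Claims one slot without locking. Returns false when the node is full.
  boolean tryReserve() {
    while (true) {
      int current = active.get();
      if (current >= maxSessionCount) {
        return false;
      }
      // Retry if another thread changed the count between get() and here.
      if (active.compareAndSet(current, current + 1)) {
        return true;
      }
    }
  }

  // Hypothetical counterpart: frees a slot when a session ends or its creation fails.
  void release() {
    active.updateAndGet(c -> Math.max(0, c - 1));
  }

  int activeCount() {
    return active.get();
  }
}
```

Because the capacity check and the increment happen in a single atomic step, concurrent callers cannot push the count past `maxSessionCount`, and a caller that fails to create a session can hand its slot back with `release()`.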
💡 Additional Considerations
🔄 Types of changes