avoid thundering herd #9150

Closed

Conversation

polvorin
Contributor

If multiple agents try to call the same model and the ARN hasn't been created or cached yet, let only one of them create it; the others wait for that result instead of each issuing their own create call.
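
For illustration only, here is a minimal sketch of that idea (often called "single flight") in Python with `asyncio`. It is not the code from this PR: `SingleFlightCache`, `create_profile`, and the fake ARN string are all hypothetical names made up for the example. Concurrent cache misses for the same key share one in-flight task, so the creation call runs once.

```python
import asyncio


class SingleFlightCache:
    """Cache one value per key; concurrent misses for a key share one create call."""

    def __init__(self, create):
        self._create = create   # hypothetical async factory: key -> value
        self._values = {}       # key -> cached value
        self._in_flight = {}    # key -> asyncio.Task currently creating the value

    async def get(self, key):
        # Fast path: value already cached.
        if key in self._values:
            return self._values[key]
        # Join the creation already in progress, or start one if we are first.
        task = self._in_flight.get(key)
        if task is None:
            task = asyncio.ensure_future(self._run(key))
            self._in_flight[key] = task
        # shield() keeps one cancelled waiter from cancelling the shared task.
        return await asyncio.shield(task)

    async def _run(self, key):
        try:
            value = await self._create(key)
            self._values[key] = value
            return value
        finally:
            # Clear the slot on success or failure so a later miss can retry.
            self._in_flight.pop(key, None)


async def demo():
    calls = 0

    async def create_profile(model_id):
        # Stand-in for the expensive remote call; the ARN format is made up.
        nonlocal calls
        calls += 1
        await asyncio.sleep(0.01)
        return f"arn:example:{model_id}"

    cache = SingleFlightCache(create_profile)
    # Five "agents" ask for the same model at the same time.
    arns = await asyncio.gather(*(cache.get("model-a") for _ in range(5)))
    return calls, arns
```

Running `asyncio.run(demo())` in this sketch invokes `create_profile` once even though five callers request the same key. If creation fails, every waiter sees the same exception and the next call retries, which is one possible design choice among several.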

snandam and others added 2 commits June 19, 2025 10:22
if there are multiple agents trying to call the same model, and the arn
hasn't been created/cached yet,  let only 1 of them create it
@polvorin polvorin requested a review from a team as a code owner June 19, 2025 18:20
@polvorin polvorin marked this pull request as draft June 19, 2025 18:20
mrinalwadhwa
mrinalwadhwa previously approved these changes Jun 19, 2025
@snandam snandam force-pushed the snandam/cache-inference-profiles branch 3 times, most recently from d01d157 to edf6b47 Compare June 19, 2025 20:04
Base automatically changed from snandam/cache-inference-profiles to develop June 19, 2025 21:49
@snandam snandam dismissed mrinalwadhwa’s stale review June 19, 2025 21:49

The base branch was changed.

@polvorin polvorin closed this Jun 20, 2025
Labels
None yet
Projects
None yet
3 participants