Skip to content

Conversation

@flesher0813
Copy link
Contributor

@flesher0813 flesher0813 commented Sep 6, 2025

Purpose

What this PR does / why we need it?

Recover from load failure, requests can still running even load failed.

Modifications

Does this PR introduce any user-facing change?

Support continuing inference after a load failure.

Test

How was this patch tested?

Test with offline test
ESA tp=2
image
tp=1
image
nfs tp = 2
image
nfs, tp = 2, load_async = True
image

@flesher0813 flesher0813 force-pushed the develop branch 12 times, most recently from 2c43eef to cad784d Compare September 6, 2025 14:45
Signed-off-by: flesher0813 <1208954694@qq.com>
@flesher0813 flesher0813 changed the title [WIP][Feat]Support load async and load failure [Feat]Support load async and load failure Sep 6, 2025
@ygwpz ygwpz merged commit 3d6c86e into ModelEngine-Group:develop Sep 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants