Sending two "load" requests to server makes it load twice #7018

Open
ShuaiShao93 opened this issue Mar 21, 2024 · 2 comments
Labels: question (Further information is requested)

@ShuaiShao93

Description
When I use two clients to send `/v2/repository/models/MODEL/load` requests to the same server at the same time, the model is loaded twice.

Triton Information
What version of Triton are you using?
23.11

Are you using the Triton container or did you build it yourself?
Container nvcr.io/nvidia/tritonserver:23.11-py3

To Reproduce
Start a server in explicit model-control mode (`--model-control-mode=explicit`) without loading any model.

Open two terminals and run `curl -X POST "http://localhost:8000/v2/repository/models/MODEL/load" -d "{}"` in both at the same time. You will see logs like:

```
successfully loaded MODEL
loading: MODEL
successfully loaded MODEL
successfully unloaded MODEL
```
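
For convenience, here is a minimal Python sketch of the same repro, firing both load requests concurrently from a single script instead of two terminals (`MODEL` is a placeholder model name, and the server is assumed to be listening on localhost:8000, as in the curl command):

```python
import concurrent.futures

import requests

MODEL = "MODEL"  # placeholder: replace with a model in your repository
URL = f"http://localhost:8000/v2/repository/models/{MODEL}/load"

def load_model():
    # Same call as the curl command above: POST with an empty JSON body.
    return requests.post(URL, json={})

# Fire both load requests at (roughly) the same time.
with concurrent.futures.ThreadPoolExecutor(max_workers=2) as pool:
    futures = [pool.submit(load_model) for _ in range(2)]
    for future in concurrent.futures.as_completed(futures):
        response = future.result()
        print(response.status_code, response.text)
```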

Expected behavior
The model should be loaded only once. Also, the `successfully unloaded MODEL` log line should appear before `successfully loaded MODEL`, not after it.

@indrajit96
Contributor

Hi @ShuaiShao93, thanks a lot for reaching out.
Can you provide the following details:

  1. What type of model/backend?
  2. Can you reproduce this behavior with other types of models/backends? Or is it specific to this one?
  3. I am not sure how you are getting the unloaded log. Are you making an unload request?

I am unable to reproduce this.

When I make simultaneous load requests for a model, it just gets loaded once.

indrajit96 added the question (Further information is requested) label on Mar 25, 2024
@ShuaiShao93
Author

> Hi @ShuaiShao93, thanks a lot for reaching out. Can you provide the following details:
>
> 1. What type of model/backend?

An ensemble pipeline with the Python and ONNX backends.

> 2. Can you reproduce this behavior with other types of models/backends? Or is it specific to this one?

Sorry, I didn't get a chance to test more.

> 3. I am not sure how you are getting the unloaded log. Are you making an unload request?

No, I just made load requests simultaneously from two clients, and I saw the unloaded logs.
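
For reference, this is roughly what the two-client setup looked like; a minimal sketch using the Python `tritonclient` package (assuming `tritonclient[http]` is installed, the server is on localhost:8000, and `MODEL` is a placeholder for the actual model name):

```python
import concurrent.futures

import tritonclient.http as httpclient

MODEL = "MODEL"  # placeholder: replace with the actual model name

def load_from_fresh_client():
    # A separate client per thread, mimicking two independent client processes.
    client = httpclient.InferenceServerClient(url="localhost:8000")
    client.load_model(MODEL)
    return client.is_model_ready(MODEL)

# Submit both load requests before waiting on either result,
# so the two requests are in flight simultaneously.
with concurrent.futures.ThreadPoolExecutor(max_workers=2) as pool:
    futures = [pool.submit(load_from_fresh_client) for _ in range(2)]
    ready = [f.result() for f in futures]

print(ready)  # both should report True; the duplicate load shows up in the server log
```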

> I am unable to reproduce this.
>
> When I make simultaneous load requests for a model, it just gets loaded once.
