Skip to content

[Bug]: pullImage error when the image is actually pulled successfully #2503

@jvstme

Description

@jvstme

Steps to reproduce

Run a job on a VM-based backend that will require pulling a Docker image. For example, use a custom Docker image that is not yet pulled on the instance.

The issue happens randomly, you may need to try several times to reproduce.

Actual behaviour

The job fails:

 NAME     BACKEND              INSTANCE    RESOURCES                          RESERVATION  PRICE  STATUS  SUBMITTED  ERROR                      
 httpbin  datacrunch (FIN-01)  CPU.4V.16G  4xCPU, 16GB, 100.0GB (disk), SPOT               $0.01  failed  10:15      JOB_FAILED                 
                                                                                                                     (CREATING_CONTAINER_ERROR) 

httpbin provisioning completed (failed)
Run failed with error code CREATING_CONTAINER_ERROR.
Error: pullImage error: failed pulling kennethreitz/httpbin:latest: downloaded 211188154/214167220 bytes (21.26MB/s), errors: []
Check CLI, server, and run logs for more details.

shim.log:

time=2025-04-11T08:15:30.303305Z level=debug msg=Pulling image
time=2025-04-11T08:15:39.773158Z level=debug msg=Status: Downloaded newer image for kennethreitz/httpbin:latest
time=2025-04-11T08:15:39.779208Z level=error msg=pullImage error: failed pulling kennethreitz/httpbin:latest: downloaded 211188154/214167220 bytes (21.26MB/s), errors: []
time=2025-04-11T08:15:39.780045Z level=debug msg=released GPU(s) gpus=[] task=f14ca241-2bfa-4c0d-af09-10b2af0d3912
time=2025-04-11T08:15:39.780085Z level=error msg=failed to run task=f14ca241-2bfa-4c0d-af09-10b2af0d3912 err=failed pulling kennethreitz/httpbin:latest: downloaded 211188154/214167220 bytes (21.26MB/s), errors: []

The image is actually pulled successfully to the runner instance

# docker image ls
REPOSITORY             TAG       IMAGE ID       CREATED       SIZE
kennethreitz/httpbin   latest    b138b9264903   6 years ago   534MB

Expected behaviour

If the image is pulled successfully, shim does not fail with pullImage error and proceeds to start the runner.

dstack version

0.19.2

Server logs

Additional information

No response

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingmajor

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions