Skip to content

[CLI] BlockingIOError: [Errno 11] Resource temporarily unavailable on specific Linux distros #2182

Closed
@konstantinjdobler

Description

@konstantinjdobler

Describe the bug
I am logging a wandb.Image at the end of every epoch. The docs state that you should log less than 50 at every step, so that should be fine. However I get the following behavior when trying to look at the Images in the wandb web app:

Bildschirmaufnahme.2021-05-08.um.12.34.53.mov

Sometimes the web app also crashes like this after some time
Bildschirmfoto 2021-05-08 um 12 38 23

Indeed, I cannot find the images that are said to be "missing" in the video in the logged files. I have the following error in my training run, but the training continues:

Bildschirmfoto 2021-05-08 um 12 38 11

So it seems like some error occurs when transferring the files and after that point, no more images are uploaded. However, metrics continue to be logged. Do you have any idea what is going on here and how I could fix this?

Metadata

Metadata

Assignees

No one assigned

    Labels

    a:appArea: Frontend/Backend

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions