Some hosting tests failing on Azdo with docker+network issues #10085

Closed
@radical

Failing build on an unrelated PR #10053.

A bunch of test runs failed on Helix.

Looking at Hosting.MongoDb, it has a lot of entries like the following before the whole job times out:

fail: Aspire.Hosting.Dcp.dcpctrl.NetworkReconciler[0] could not create the network as all available subnet ranges from the default pool are allocated, retrying... {"NetworkName": {"name":"default-aspire-network"}, "Reconciliation": 52, "Network": "default-aspire-network-vskli6mtis", "error": "docker command 'CreateNetwork' returned with non-zero exit code 1\nobject could not allocate required resources\nnetwork subnet pool full\nError response from daemon: could not find an available, non-overlapping IPv4 address pool among the defaults to assign to the network"} [+44/x3/?0] Aspire.Hosting.MongoDB.Tests.dll (net8.0|x64) - Aspire.Hosting.MongoDB.Tests.MongoDbFunctionalTests.VerifyWaitForOnMongoBlocksDependentResources (19m 36s)

could not create the network as all available subnet ranges from the default pool are allocated, retrying...

docker command 'CreateNetwork' returned with non-zero exit code 1
object could not allocate required resources
network subnet pool full
Error response from daemon: could not find an available, non-overlapping IPv4 address pool among the defaults to assign to the network
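To pull the nested error out of an entry like the one above, the JSON payload can be decoded straight from the log line. This is a minimal sketch, assuming each dcpctrl entry carries exactly one JSON object after the message text. The helper names `parse_reconciler_entry` and `error_lines` are hypothetical and not part of Aspire or DCP, and `SAMPLE` is the MongoDB entry quoted above with the trailing test name shortened.

```python
import json

# The MongoDB log entry quoted above, kept on one logical line.
# Raw strings preserve the literal "\n" escapes so the JSON stays valid.
SAMPLE = (
    r'''fail: Aspire.Hosting.Dcp.dcpctrl.NetworkReconciler[0] could not create the network '''
    r'''as all available subnet ranges from the default pool are allocated, retrying... '''
    r'''{"NetworkName": {"name":"default-aspire-network"}, "Reconciliation": 52, '''
    r'''"Network": "default-aspire-network-vskli6mtis", '''
    r'''"error": "docker command 'CreateNetwork' returned with non-zero exit code 1\n'''
    r'''object could not allocate required resources\nnetwork subnet pool full\n'''
    r'''Error response from daemon: could not find an available, non-overlapping IPv4 '''
    r'''address pool among the defaults to assign to the network"} '''
    r'''[+44/x3/?0] Aspire.Hosting.MongoDB.Tests.dll (net8.0|x64)'''
)


def parse_reconciler_entry(line: str) -> dict:
    """Decode the first JSON object embedded in a log line (hypothetical helper)."""
    start = line.find("{")
    if start == -1:
        raise ValueError("no JSON payload found in log line")
    # raw_decode stops at the end of the object, so trailing test-runner
    # progress text after the closing brace is ignored.
    payload, _end = json.JSONDecoder().raw_decode(line, start)
    return payload


def error_lines(entry: dict) -> list[str]:
    """Split the nested error chain into one message per line."""
    return entry.get("error", "").split("\n")


if __name__ == "__main__":
    entry = parse_reconciler_entry(SAMPLE)
    print(entry["Network"], "reconciliation", entry["Reconciliation"])
    for message in error_lines(entry):
        print("  ", message)
```

Decoding the payload instead of searching the raw text keeps the reconciliation count and network name available, which helps when comparing retries across jobs.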

Looking at Hosting.Milvus, I see a lot of entries like:

fail: Aspire.Hosting.Dcp.dcpctrl.NetworkReconciler[0] could not create a network {"NetworkName": {"name":"default-aspire-network"}, "Reconciliation": 2, "Network": "default-aspire-network-fcj9i4r2t0", "error": "docker command 'InspectNetworks' returned with non-zero exit code 1\nobject not found\nnetwork not found\nError response from daemon: network default-aspire-network-fcj9i4r2t0 not found\nnot all requested objects were returned\nonly 0 out of 1 networks were successfully inspected\ndocker command 'CreateNetwork' returned with non-zero exit code 1\nerror\nError response from daemon: failed to update bridge store for object type *bridge.networkConfiguration: open /datadisks/disk1/docker/network/files/local-kv.db: no such file or directory"}:

docker command 'InspectNetworks' returned with non-zero exit code 1
object not found
network not found
Error response from daemon: network default-aspire-network-fcj9i4r2t0 not found
not all requested objects were returned
only 0 out of 1 networks were successfully inspected
docker command 'CreateNetwork' returned with non-zero exit code 1
error
Error response from daemon: failed to update bridge store for object type *bridge.networkConfiguration: open /datadisks/disk1/docker/network/files/local-kv.db: no such file or directory

Looking at some of the other failures, they seem to be of the same form.
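One way to check whether the other failures really share these two forms is to tally log lines by error signature. The following is a rough sketch, not an existing tool: the labels and the `tally` helper are made up for illustration, and the only strings it matches on are the two daemon messages quoted in this issue.

```python
from collections import Counter

# Substrings taken from the two daemon errors quoted above.
# The labels on the left are hypothetical names for this sketch only.
SIGNATURES = {
    "subnet-pool-exhausted": "network subnet pool full",
    "bridge-store-missing": "failed to update bridge store",
}


def classify(line: str) -> str:
    """Return the label of the first signature found in the line, else 'other'."""
    for label, needle in SIGNATURES.items():
        if needle in line:
            return label
    return "other"


def tally(lines) -> Counter:
    """Count how many log lines fall under each failure signature."""
    return Counter(classify(line) for line in lines)


if __name__ == "__main__":
    import sys

    # Usage: python tally.py < console.log
    for label, count in tally(sys.stdin).most_common():
        print(f"{count:6d}  {label}")
```

A large "other" count would suggest a third failure mode that is not covered by the two errors above.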

This happened only on Azdo, and the changes in the PR are based on 223ff6a from main.

cc @karolz-ms @davidfowl

Report

Summary

| 24-Hour Hit Count | 7-Day Hit Count | 1-Month Count |
| --- | --- | --- |
| 0 | 0 | 0 |
