New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to download container image from private repository #904
Comments
Server core host should be fine and work. Is this host deployed on Azure or local onebox? My suspicion is that image download is timing out, but I could be wrong and would need traces to diagnose the issue further. |
Local one box. If you can point me to docs on enabling tracing I'll gladly do that and provide it. |
Just for fun I deployed another Windows Server 2016 core server, had docker running on it, installed service fabric configuration same as my full GUI version and the dockerd process isn't getting started.
|
I get the same "Failed to download container" error but on an Azure cluster (I tried Windows preview and Ubuntu).
I have the latest preview installed though. |
Are you using Server2016DataCenter-WithContainers image? The error you are seeing indicates we are not able to find docker on the node. |
@RajeetN No, I only have the preview SF SDK & Docker for Windows installed on my local machine. |
Can you please retry with the 5.7 bits (non-preview) and tell us what you find? With Server2016DataCenter-WithContainers, it should work. |
I've retried this with Azure Service Fabric 5.7 with Linux. |
No you cannot run Linux containers on windows Service Fabric clusters today. |
Ok, in that case I'm still blocked by this issue. |
Can you RDP on your machine and check if Docker is running? |
Here are my compose files: docker-compose.yml
docker-compose.override.yml
Well, it's a Linux cluster, so I guess I cannot RDP into it, but I'll try if I can access it via SSH somehow. |
The directives look ok, but SF only supports docker compose version 3, so I don't know if it is rejecting checking the version - https://docs.microsoft.com/en-us/azure/service-fabric/service-fabric-docker-compose#supported-compose-directives Yes please SSH into the node using the cluster FQDN and port 3389 for node 0, 3390 for node 1 and so on. |
When I change the versions to 3 I get the same error. @prasadker Thanks for the quick info on how to access the nodes via SSH. felix_schroeter@LinuxNode000000:~$ docker ps
Got permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Get http://%2Fvar%2Frun%2Fdocker.sock/v1.30/containers/json: dial unix /var/run/docker.sock: connect: permission denied
felix_schroeter@LinuxNode000000:~$ sudo su
root@LinuxNode000000:/home/felix_schroeter# docker ps
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
root@LinuxNode000000:/home/felix_schroeter# And here is the result of root@LinuxNode000000:/home/felix_schroeter# ps -aux | grep "[d]ocker"
root 6811 0.0 0.3 275856 7068 ? Ssl Sep19 0:42 docker-containerd -l unix:///var/run/docker/libcontainerd/docker-containerd.sock --metrics-interval=0 --start-timeout 2m --state-dir /var/run/docker/libcontainerd/containerd --shim docker-containerd-shim --runtime docker-runc
root 105012 0.1 1.7 699400 34704 pts/7 Ssl+ Sep18 3:02 /usr/bin/dockerd -H localhost:2375 -H unix:///var/run/docker.sock --pidfile /mnt/sfroot/sfdocker.pid |
Is docker-compose supposed to be isntalled?: root@LinuxNode000000:/home/felix_schroeter# docker-compose ps
The program 'docker-compose' is currently not installed. You can install it by typing:
apt install docker-compose |
If you are operating against a SF cluster, you shouldn't need that. You will be using instructions @ https://docs.microsoft.com/en-us/azure/service-fabric/service-fabric-docker-compose. Since many issues have been discussed, to clarify exactly where you are blocked - you are trying to run (Linux) containers on a (Linux) SF cluster in Azure, correct? |
@mani-ramaswamy Yes, Linux containers on a Linux Service Fabric cluster. It's an ASP.NET Core project created with Docker support via Visual Studio. |
Can you try connecting to the cluster via CLI, and then running the install script below? sfctl cluster select --endpoint http://:19080 And if you don't have a default service in the app manifest, then create the service as well. If you share your manifest files, I can try it on my cluster and see what's going on. Are you able to get it to work with a public docker hub image? |
And to install sfctl, use the instructions @ https://docs.microsoft.com/en-us/azure/service-fabric/service-fabric-cli. |
I have an issue that may be related. I am getting the "failed to download" error when I try to deploy a SF Container project to my local cluster. I have noticed that when the local cluster starts up, the Docker engine dies. If Service Fabric is trying to use Docker to pull and install the image, then it would fail. I have already posted an issue in Docker For Windows (docker/for-win#1175) but would it be helpful for me to add more details here, or open a new issue for Service Fabric? Edit: never mind, I just got a comment on that issue saying that SF does not support containers on Windows 10. |
So, is it possible to run Linux container in Service Fabric on Windows? I'm getting:
|
No, to run Linux containers, you need a SF Linux cluster at this time. |
@mani-ramaswamy ok, thanks. After switch Docker to the Windows container I'm also getting
https://hub.docker.com/r/micdenny/rabbitmq-windows/ What is the problem? |
It could be timing out. Can you try the instructions @ https://docs.microsoft.com/en-us/azure/service-fabric/service-fabric-get-started-containers#configure-container-image-download-time and increase the time out and report back? |
Please also ensure that you are using docker EE (and not the CE variant) |
@mani-ramaswamy Thanks for response
I tried very small containers and got error immediately after start.
I'm using Docker CE on my dev machine.
Docker EE for Windows requires Windows Server 2016 or later. I'm not developing on Windows Server |
Service Fabric cannot run Windows containers on Windows 10 locally at present. This will be fixed in an upcoming release. |
@mani-ramaswamy Any idea on how soon it will be fixed? |
We're presently testing internal builds with this - the next minor version update of SF (6.2) will have this fixed. |
Getting the same error SF running on local cluster (6.1.467.9494). |
To re-confirm, the VM is running Windows Server 2016 Datacenter with containers, and the guest container is Windows Server 2016 Datacenter Server Core. The reason I ask is that Windows containers aren't compatible across releases - thus, Windows Server 2016 container aren't compatible with Windows Server 2016 version 1709 hosts and vice versa. |
BTW, Docker for Windows (CE) isn't supported, if you were using that. You need EE on the Server. |
Yes, the guest & the host are both - Microsoft Windows [Version 10.0.14393].
Sent from Mail<https://go.microsoft.com/fwlink/?LinkId=550986> for Windows 10
…________________________________
From: Mani Ramaswamy <notifications@github.com>
Sent: Friday, March 30, 2018 9:24:14 AM
To: Azure/service-fabric-issues
Cc: rn-3; Comment
Subject: Re: [Azure/service-fabric-issues] Unable to download container image from private repository (#315)
To re-confirm, the VM is running Windows Server 2016 Datacenter with containers, and the guest container is Windows Server 2016 Datacenter Server Core. The reason I ask is that Windows containers aren't compatible across releases - thus, Windows Server 2016 container aren't compatible with Windows Server 2016 version 1709 hosts and vice versa.
@harahma<https://github.com/harahma> @RajeetN<https://github.com/RajeetN>
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub<https://github.com/Azure/service-fabric-issues/issues/315#issuecomment-377562257>, or mute the thread<https://github.com/notifications/unsubscribe-auth/ARnaj5bmuO8rKveuWyDFDNMPzotji65Xks5tjlwugaJpZM4N7UTD>.
|
Using Docker version 17.06.2-ee-6, build e75fdb8.
Sent from Mail<https://go.microsoft.com/fwlink/?LinkId=550986> for Windows 10
…________________________________
From: Mani Ramaswamy <notifications@github.com>
Sent: Friday, March 30, 2018 9:36:04 AM
To: Azure/service-fabric-issues
Cc: rn-3; Comment
Subject: Re: [Azure/service-fabric-issues] Unable to download container image from private repository (#315)
BTW, Docker for Windows (CE) isn't supported, if you were using that. You need EE on the Server.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub<https://github.com/Azure/service-fabric-issues/issues/315#issuecomment-377564939>, or mute the thread<https://github.com/notifications/unsubscribe-auth/ARnajzMuRH5V-CcOe1BBXecJMcccVIS0ks5tjl70gaJpZM4N7UTD>.
|
@mani-ramaswamy @RajeetN |
@rn-3, do you have version tag specified for your image? |
microsoft/windowsservercore:latest |
alright, this is not about download from private repository that the original issue reported was about then? Could you please share the traces from your machine? If this is a cluster deployed on azure, please send us the cluster resource name and region. |
I tried deploying again using the docker-compose file and it seems like the Container's have created & started... |
I'm also getting same error when using locally (dev cluster) the local registry.
|
This shall be SF issue. I deploy this to 3 different SF servers and only one of them showing this issue. All are Windows 2016 LTSC and image is exactly the same, so it can not be mismatch etc. |
@aryamsft Can you take look at it? |
@harahma This is relevant to the issue in other thread with bunch of files created in temporary folder on that server. microsoft/service-fabric-issues#1122 |
Now can we deploy containers in windows 10 1909 machine? |
I have a SF (Microsoft.Azure.ServiceFabric.WindowsServer.5.6.220.9494) running the ClusterConfig.Unsecure.DevCluster configuration.
Docker was installed prior and was working and running these same images. Before installed SF I removed all running containers and cleaned up images and private repository logins.
I am trying to deploy an application pointing to my private repo, e.g. myrepo.azurecr.io/sf/myapp
I am able to use the repository credentials specified in the manifest to login to the repo from docker cli.
When SF tries to deploy the container it states:
In the admin log I see this sequence of events:
ServiceManifest
ApplicationManifest
Also, it appears that dockerd is not running though I saw in a previous debug log that the docker process manager started dockerd successfully, but then exited with error code 1 which the log said was ok. I haven't seen this happen again in the debug log as of yet.
One other thing to note, the image is rather large at about 9gb
Update: I installed docker on another host, logged into the private repo, pulled the image and then installed SF and was able to deploy the same manifests and run the container successfully. If the image does not exist in docker prior to SF trying to pull it then it fails. with the errors above.
dockerd process is not running successfully if the image doesn't exist
Update: I wiped the images
docker rmi $(docker images -q)
from my local development workstation running Windows Server 2016, using the local dev SF deployment and deploying the application causes docker to download the image from my private repo. I did the same process on the broken 2016 container host but the main difference is that server is running the Core OS so there is no GUI, the docker host that is working is my 2016 development workstation so it's local to Visual Studio.Is my core container host broke, unsupported, or what? I am mostly at this point interested in how do I find out what's wrong with it so if it's something I did I don't do it again.
The text was updated successfully, but these errors were encountered: