
Flannel (NetworkPlugin cni) error: /run/flannel/subnet.env: no such file or directory #70202

Closed
ghost opened this issue Oct 24, 2018 · 25 comments
Labels
sig/network Categorizes an issue or PR as relevant to SIG Network.

Comments

@ghost

ghost commented Oct 24, 2018

/kind bug

@kubernetes/sig-contributor-experience-bugs

What happened:
Installed a single-node Kubernetes cluster on CentOS 7 (a VM running on VirtualBox); my application pod (created via a k8s Deployment) won't go into the Ready state.

Pod Event: Warning FailedCreatePodSandBox . . . Kubelet . . . Failed create pod sandbox: rpc error: code = Unknown desc = failed to set up sandbox . . . network for pod "companyemployees-deployment-766c7c7767-t7mc5": NetworkPlugin cni failed to set up pod "companyemployees-deployment-766c7c7767-t7mc5_default" network: open /run/flannel/subnet.env: no such file or directory

In addition, it looks like the kubernetes coredns docker container keeps exiting – e.g. docker ps -a | grep -i coredns: 6341ce0be652 k8s.gcr.io/pause:3.1 "/pause" . . . Exited (0) 1 second ago k8s_POD_coredns-576cbf47c7-9bxxg_kube-system_e84afb7a-d7b7-11e8-bafa-08002745c4bc_581

What you expected to happen:
Flannel not to produce the error, and the Pod to go into the Ready state.

How to reproduce it (as minimally and precisely as possible):
Create a simple deployment after building a Docker image and pushing it to a private Docker registry:
kubectl create -f companyemployees-deployment.yaml
deployment yaml:

apiVersion: apps/v1beta1
kind: Deployment
metadata:
  name: companyemployees-deployment
  labels:
    app: companyemployees
spec:
  replicas: 1
  selector:
    matchLabels:
      app: companyemployees
  template:
    metadata:
      labels:
        app: companyemployees
    spec:
      containers:
      - name: companyemployees
        image: localhost:5000/companyemployees:1.0
        ports:
        - containerPort: 9092
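
Checking the pod status and events (the pod name below is a placeholder):

kubectl get pods -l app=companyemployees
kubectl describe pod <companyemployees-pod-name>   # shows the FailedCreatePodSandBox event quoted above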

Anything else we need to know?:
ip link
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode DEFAULT qlen 1
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
2: enp0s3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP mode DEFAULT qlen 1000
link/ether 08:00:27:45:c4:bc brd ff:ff:ff:ff:ff:ff
3: enp0s8: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP mode DEFAULT qlen 1000
link/ether 08:00:27:21:0f:92 brd ff:ff:ff:ff:ff:ff
4: docker0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP mode DEFAULT
link/ether 02:42:1b:04:1f:7c brd ff:ff:ff:ff:ff:ff
6: veth3f5bcb4@if5: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue master docker0 state UP mode DEFAULT
link/ether b2:1f:d4:fb:84:2e brd ff:ff:ff:ff:ff:ff link-netnsid 0
7: flannel.1: <BROADCAST,MULTICAST> mtu 1450 qdisc noop state DOWN mode DEFAULT
link/ether e6:44:ed:15:dd:97 brd ff:ff:ff:ff:ff:ff

Environment:

  • Kubernetes version (use kubectl version):
    Client Version: version.Info{Major:"1", Minor:"12", GitVersion:"v1.12.1", GitCommit:"4ed3216f3ec431b140b1d899130a69fc671678f4", GitTreeState:"clean", BuildDate:"2018-10-05T16:46:06Z", GoVersion:"go1.10.4", Compiler:"gc", Platform:"linux/amd64"}
    Server Version: version.Info{Major:"1", Minor:"12", GitVersion:"v1.12.1", GitCommit:"4ed3216f3ec431b140b1d899130a69fc671678f4", GitTreeState:"clean", BuildDate:"2018-10-05T16:36:14Z", GoVersion:"go1.10.4", Compiler:"gc", Platform:"linux/amd64"}

  • Cloud provider or hardware configuration:
Single-node Kubernetes cluster on a CentOS 7 VM running on VirtualBox (VirtualBox is running on Windows 7 Pro)

  • OS (e.g. from /etc/os-release):
    cat /etc/os-release:
    NAME="CentOS Linux"
    VERSION="7 (Core)"
    ID="centos"
    ID_LIKE="rhel fedora"
    VERSION_ID="7"
    PRETTY_NAME="CentOS Linux 7 (Core)"
    ANSI_COLOR="0;31"
    CPE_NAME="cpe:/o:centos:centos:7"
    HOME_URL="https://www.centos.org/"
    BUG_REPORT_URL="https://bugs.centos.org/"

CENTOS_MANTISBT_PROJECT="CentOS-7"
CENTOS_MANTISBT_PROJECT_VERSION="7"
REDHAT_SUPPORT_PRODUCT="centos"
REDHAT_SUPPORT_PRODUCT_VERSION="7"

rpm -q centos-release
centos-release-7-4.1708.el7.centos.x86_64

My team's CentOS image already had Docker, Kubernetes, Flannel, and a private Docker registry on it; it was working, but I recently had issues with it that led me to uninstall Kubernetes, Docker, and Flannel and reinstall them.

Install steps:

Switch to root: su - root

install docker

  1. yum install -y yum-utils device-mapper-persistent-data lvm2
  2. yum-config-manager --add-repo https://download.docker.com/linux/centos/docker-ce.repo
  3. yum install docker-ce
  4. systemctl daemon-reload
  5. systemctl enable docker
  6. systemctl start docker
  7. docker run hello-world

install private docker registry

  1. docker pull registry
  2. docker run -d -p 5000:5000 --restart=always --name registry registry
  3. Note: firewalld is not running

install k8s:

  1. setenforce 0
  2. sed -i --follow-symlinks 's/SELINUX=enforcing/SELINUX=disabled/g' /etc/sysconfig/selinux
  3. swapoff -a
  4. Edit /etc/fstab and comment-out /dev/mapper/centos-swap swap
  5. Add kubernetes repo for yum - edit /etc/yum.repos.d/kubernetes.repo and add
[kubernetes]
	name=Kubernetes
	baseurl=https://packages.cloud.google.com/yum/repos/kubernetes-el7-x86_64
	enabled=1
	gpgcheck=1
	repo_gpgcheck=1
	gpgkey=https://packages.cloud.google.com/yum/doc/yum-key.gpg
		https://packages.cloud.google.com/yum/doc/rpm-package-key.gpg
  6. yum install -y kubelet kubeadm kubectl
  7. systemctl enable kubelet
  8. systemctl start kubelet
  9. kubeadm init --pod-network-cidr=10.244.0.0/16
  10. k8s config for user – running as root: export KUBECONFIG=/etc/kubernetes/admin.conf

install flannel:

  1. sysctl net.bridge.bridge-nf-call-iptables=1
  2. kubectl apply -f https://raw.githubusercontent.com/coreos/flannel/bc79dd1505b0c8681ece4de4c0d86c5cd2643275/Documentation/kube-flannel.yml

Remove master node taint (to allow scheduling pods on master): kubectl taint nodes --all node-role.kubernetes.io/master-
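
A quick way to verify that flannel came up and wrote the subnet file (the label and daemonset name assume the stock kube-flannel manifest applied above):

kubectl get pods -n kube-system -l app=flannel -o wide   # one Running kube-flannel-ds pod per node
kubectl logs -n kube-system <kube-flannel-pod-name>      # should mention writing the subnet file
cat /run/flannel/subnet.env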

  • Others:
Prior to installing, I uninstalled using the following steps:

Switch to root: su - root

Uninstall k8s
(Although this is the master node, I did this a few times and included draining the node the last time)

  1. kubectl drain mynodename --delete-local-data --force --ignore-daemonsets
  2. kubectl delete node mynodename
  3. kubeadm reset
  4. systemctl stop kubelet
  5. yum remove kubeadm kubectl kubelet kubernetes-cni kube*
  6. yum autoremove
  7. rm -rf ~/.kube
  8. rm -rf /var/lib/kubelet/*

Uninstall docker:

  1. docker rm $(docker ps -a -q)
  2. docker stop (as needed)
  3. docker rmi -f $(docker images -q)
  4. Check that all containers and images were deleted: docker ps -a; docker images
  5. systemctl stop docker
  6. yum remove yum-utils device-mapper-persistent-data lvm2
  7. yum remove docker docker-client docker-client-latest docker-common docker-latest docker-latest-logrotate docker-logrotate docker-selinux docker-engine-selinux docker-engine
  8. yum remove docker-ce
  9. rm -rf /var/lib/docker
  10. rm -rf /etc/docker

Uninstall flannel

  1. rm -rf /var/lib/cni/
  2. rm -rf /run/flannel
  3. rm -rf /etc/cni/
  4. Remove interfaces related to docker and flannel:
    ip link
    For each interface for docker or flannel, do the following
    ifconfig <name of interface from ip link> down
    ip link delete <name of interface from ip link>
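
For example, when the only leftover interfaces are docker0, flannel.1, and cni0 (adjust to whatever ip link actually shows):

for iface in docker0 flannel.1 cni0; do
  ip link set "$iface" down 2>/dev/null
  ip link delete "$iface" 2>/dev/null
done
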
@k8s-ci-robot k8s-ci-robot added kind/bug Categorizes issue or PR as related to a bug. needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Oct 24, 2018
@dims
Member

dims commented Oct 24, 2018

/sig network

@k8s-ci-robot k8s-ci-robot added sig/network Categorizes an issue or PR as relevant to SIG Network. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Oct 24, 2018
@prodanlabs

I am running flannel from a binary.
After starting flanneld via systemctl, mk-docker-opts.sh -i is executed automatically to generate the following two environment-variable files:

/run/flannel/subnet.env
/run/docker_opts.env

In flanneld.service, add ExecStartPost=/usr/libexec/flannel/mk-docker-opts.sh -k DOCKER_NETWORK_OPTIONS -d /run/flannel/docker
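
For example, a systemd drop-in that adds that line (the unit name and script path are taken from above; adjust to your install):

mkdir -p /etc/systemd/system/flanneld.service.d
cat << 'EOF' > /etc/systemd/system/flanneld.service.d/10-docker-opts.conf
[Service]
ExecStartPost=/usr/libexec/flannel/mk-docker-opts.sh -k DOCKER_NETWORK_OPTIONS -d /run/flannel/docker
EOF
systemctl daemon-reload
systemctl restart flanneld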

@katanyoussef

Hi,
I got the same error. If someone has an answer, can you please help? I re-did it 3 times with the same result... maybe I'm missing something. Also, coredns is showing ContainerCreating all the time.

@jwatte

jwatte commented Jan 16, 2019

I have the same problem on Ubuntu 18.04.1 with kubelet 1.13 and docker-ce 18.09.
The same setup worked with kubelet 1.12 and docker-ce 18.06.
(Note that kubelet and docker were updated in place and the machine rebooted; downgrading versions goes back to working.)
One question I have: do I need to run flanneld on the node hosting my Kubernetes, even though it's single-node (master==slave)? Ubuntu doesn't have a modern flanneld package to install, and no installation instructions cover this -- apparently just applying the .yml should be enough?

@wborgo

wborgo commented Jan 29, 2019

I have the same problem on CentOS Linux release 7.6.1810 (Core)
kubelet 1.13.2
Docker 18.09.1
Image k8s.gcr.io/coredns:1.2.6

Every time I reboot the server, the file /run/flannel/subnet.env is created after a minute or two.
I tried to change the owner and group to the non-root user:

sudo chown $(id -u):$(id -g) /run/flannel/subnet.env

@jansmets

Does anyone have a possible solution for this chicken-and-egg problem? It's a problem when you want to use jumbo frames.
Thanks

@thockin thockin added the triage/unresolved Indicates an issue that can not or will not be resolved. label Mar 8, 2019
@discostur

Just got the same problem - fixed it by manually adding the file:

/run/flannel/subnet.env

FLANNEL_NETWORK=10.244.0.0/16
FLANNEL_SUBNET=10.244.0.1/24
FLANNEL_MTU=1450
FLANNEL_IPMASQ=true

@unmurphy

@discostur thanks, resolved this issue with your answer!

@suryastef

In my case, using CentOS on DO, the file /run/flannel/subnet.env exists, but same issue: /run/flannel/subnet.env: no such file or directory

At first I tried a different subnet when running kubeadm init --pod-network-cidr=192.168.255.0/24

I tried @discostur's solution of changing the file manually, but subnet.env was restored to its original state when I restarted the master

This was only solved by kubeadm reset and using flannel's default network CIDR: kubeadm init --pod-network-cidr=10.244.0.0/16

@discostur

discostur commented Apr 20, 2019 via email

@manukasa

Just got the same problem - fixed it by manually adding the file:

/run/flannel/subnet.env

FLANNEL_NETWORK=10.244.0.0/16
FLANNEL_SUBNET=10.244.0.1/24
FLANNEL_MTU=1450
FLANNEL_IPMASQ=true

Thanks, this worked for us.

@RamanPndy

Just got the same problem - fixed it by manually adding the file:

/run/flannel/subnet.env

FLANNEL_NETWORK=10.244.0.0/16
FLANNEL_SUBNET=10.244.0.1/24
FLANNEL_MTU=1450
FLANNEL_IPMASQ=true

This solution worked for me, but I have one doubt: what do these values mean, and how does flannel use them?

@ryanjfrizzell

Just got the same problem - fixed it by manually adding the file:
/run/flannel/subnet.env

FLANNEL_NETWORK=10.244.0.0/16
FLANNEL_SUBNET=10.244.0.1/24
FLANNEL_MTU=1450
FLANNEL_IPMASQ=true

This solution worked for me, but I have one doubt: what do these values mean, and how does flannel use them?

This will get it started, but it won't survive a reboot... still struggling with this myself.

@caseydavenport
Member

The subnet.env file is written out by the flannel daemonset pods and probably shouldn't be modified by hand.

If that file isn't getting written, it suggests another problem preventing the flannel pod from starting up. Are there other logs in the flannel pod? You can check with something like kubectl logs -n kube-system <flannel-pod-name>

Happy to continue discussing, but I'm going to close this since it appears to be a flannel issue rather than a Kubernetes one. Might also be worth raising as a support issue against the flannel repo too: https://github.com/coreos/flannel

/remove-triage unresolved
/remove-kind bug
/close

@k8s-ci-robot k8s-ci-robot removed the triage/unresolved Indicates an issue that can not or will not be resolved. label May 2, 2019
@k8s-ci-robot
Contributor

@caseydavenport: Closing this issue.

In response to this:

The subnet.env file is written out by the flannel daemonset pods and probably shouldn't be modified by hand.

If that file isn't getting written, it suggests another problem preventing the flannel pod from starting up. Are there other logs in the flannel pod? You can check with something like kubectl logs -n kube-system <flannel-pod-name>

Happy to continue discussing, but I'm going to close this since it appears to be a flannel issue rather than a Kubernetes one. Might also be worth raising as a support issue against the flannel repo too: https://github.com/coreos/flannel

/remove-triage unresolved
/remove-kind bug
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@HankTheCrank

I know this is old, but I wanted to comment here as I too had this issue; in my case it was a symptom of a different issue. There was no subnet.env file, but it was not getting created because my flannel daemonset was failing. The error from the pod (kubectl --namespace=kube-system logs <POD_NAME>) showed "Error registering network: failed to acquire lease: node "<NODE_NAME>" pod cidr not assigned". The node was missing a spec for podCIDR, so I ran kubectl patch node <NODE_NAME> -p '{"spec":{"podCIDR":"10.244.0.0/16"}}' for each node and the issue went away.
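
You can check whether a node already has a podCIDR assigned before patching (the node name is a placeholder; empty output means it is missing):

kubectl get node <NODE_NAME> -o jsonpath='{.spec.podCIDR}'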

@sudo-undefined

Thanks @HankTheCrank.
I just deployed kubeadm, kubelet, and kubectl 1.15.7 together with whatever CIDR handling is in Kubernetes 1.17. I've read there were some CIDR changes, but I don't really know what they are, and now I don't have to spend hours figuring it out (at the moment).
The node patch command made flannel come alive on my worker nodes.

@rprasad17088

I also encountered exactly the same problem while creating the rook-ceph-operator pod; setting SELinux enforcement to 0 (setenforce 0) on the worker nodes resolved the issue.

@rjshk013

In my case, myAmazonEKSCNIRole was the culprit. I didn't give the proper OIDC ID in that role. I rechecked the trust relationship section and corrected the values. After that, my pods show Running:
kube-system aws-node-29rwz 1/1 Running 20 74m
kube-system aws-node-ffvc4 1/1 Running 20 74m
kube-system coredns-65ccb76b7c-f96q4 1/1 Running 0 92m
kube-system coredns-65ccb76b7c-x7gdg 1/1 Running 0 92m

@ilmal

ilmal commented Jan 18, 2022

Hope I can be of help!

The solution for me:

My problem was that the flannel-ds pods weren't running on all of my nodes (check that the number of flannel pods in kube-system matches the number of nodes in the cluster).

In my case, two of my nodes had the NoExecute taint, which also blocks flannel pods. If this is the case for you, edit the daemonset and add a toleration for NoExecute. Problem solved!
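
For example, a patch that adds a blanket NoExecute toleration to the daemonset (the daemonset name and namespace depend on which kube-flannel manifest you applied):

kubectl -n kube-system patch daemonset kube-flannel-ds --type=json \
  -p='[{"op": "add", "path": "/spec/template/spec/tolerations/-", "value": {"operator": "Exists", "effect": "NoExecute"}}]'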

@pint1022

The solution works for me.

@audioscavenger

audioscavenger commented Feb 17, 2022

Creating /run/flannel/subnet.env manually fixes coredns not starting, but it's only temporary.
My solution for the master/control-plane:

  1. kubeadm init --control-plane-endpoint=whatever --node-name whatever --pod-network-cidr=10.244.0.0/16
  2. kubectl apply -f https://raw.githubusercontent.com/flannel-io/flannel/master/Documentation/kube-flannel.yml
  3. restart all
systemctl stop kubelet
systemctl stop docker
iptables --flush
iptables -tnat --flush
systemctl start kubelet
systemctl start docker

@attila123

Thanks, I just needed a quick solution for a test system running some old k8s. I scripted the workaround, which recreates the missing /run/flannel/subnet.env:

#!/bin/bash

set -x

# See https://github.com/kubernetes/kubernetes/issues/70202
# Run as root (e.g. with sudo)

mkdir -p /run/flannel

cat << EOF > /run/flannel/subnet.env
FLANNEL_NETWORK=10.244.0.0/16
FLANNEL_SUBNET=10.244.0.1/24
FLANNEL_MTU=1450
FLANNEL_IPMASQ=true
EOF

@laurijssen

laurijssen commented Sep 1, 2023

This issue started on 6 nodes after I removed the apparmor daemon from the system (after a while). As soon as I reinstalled apparmor, it worked again.
One node required a kubeadm reset/join and a restart before working again. The flannel pod itself had a permission-denied error obtaining the network interface, so subnet.env did not appear.
I don't know why, as there are no flannel profiles in the apparmor config, but it worked!

@abdelghanimeliani

Creating /run/flannel/subnet.env worked for me, thanks.
