Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NFD Operator on OperatorHub not installing #54

Closed
madorn opened this issue Jan 28, 2020 · 15 comments
Closed

NFD Operator on OperatorHub not installing #54

madorn opened this issue Jan 28, 2020 · 15 comments

Comments

@madorn
Copy link

madorn commented Jan 28, 2020

Description of problem:

After installing NFD Operator, pods keep crashing.

{"level":"info","ts":1579886681.9943285,"logger":"cmd","msg":"Go Version: go1.12.12"} {"level":"info","ts":1579886681.9943712,"logger":"cmd","msg":"Go OS/Arch: linux/amd64"} {"level":"info","ts":1579886681.9943788,"logger":"cmd","msg":"Version of operator-sdk: v0.4.0+git"} {"level":"info","ts":1579886681.9950516,"logger":"leader","msg":"Trying to become the leader."} {"level":"info","ts":1579886682.1206093,"logger":"leader","msg":"Found existing lock with my name. I was likely restarted."} {"level":"info","ts":1579886682.120643,"logger":"leader","msg":"Continuing as the leader."} {"level":"info","ts":1579886682.2016144,"logger":"cmd","msg":"Registering Components."} {"level":"info","ts":1579886682.2017787,"logger":"kubebuilder.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="} {"level":"info","ts":1579886682.2019343,"logger":"kubebuilder.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="} {"level":"info","ts":1579886682.202073,"logger":"kubebuilder.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="} {"level":"info","ts":1579886682.2021816,"logger":"kubebuilder.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="} {"level":"info","ts":1579886682.20229,"logger":"kubebuilder.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="} {"level":"error","ts":1579886682.202312,"logger":"cmd","msg":"","error":"no kind is registered for the type v1.SecurityContextConstraints in scheme \"k8s.io/client-go/kubernetes/scheme/register.go:60\"","stacktrace":"github.com/openshift/cluster-nfd-operator/vendor/github.com/go-logr/zapr.(*zapLogger).Error\n\t/go/src/github.com/openshift/cluster-nfd-operator/vendor/github.com/go-logr/zapr/zapr.go:128\nmain.main\n\t/go/src/github.com/openshift/cluster-nfd-operator/cmd/manager/main.go:92\nruntime.main\n\t/opt/rh/go-toolset-1.12/root/usr/lib/go-toolset-1.12-golang/src/runtime/proc.go:200"}

Upon first glance, controller appears to be attempting to watch /api/v1/securitycontextconstraints which is not available in OpenShift 4.x.

The OperatorHub controller image should be updated to watch /apis/security.openshift.io/v1/securitycontextconstraints.

@ArangoGutierrez
Copy link
Contributor

Hi @madorn , could you try the namescpace-scoped branch?, although it has been merged now into master, it had the fix for the SCC API issue

@madorn
Copy link
Author

madorn commented Feb 3, 2020

@ArangoGutierrez Hi Carlos, thank you!!

can we can get this change committed to the image that currently resides in OpenShift 4x? It is located at registry.redhat.io/openshift4/ose-cluster-nfd-operator@sha256:ff782d4c2a3c9436ea9d9713cd6e360337aebdf8c9d6f02d1c4d07ba305de847

If you need assistance pushing the commit there, happy to help!

@ArangoGutierrez
Copy link
Contributor

is now on master, we can close this issue

@madorn
Copy link
Author

madorn commented Feb 6, 2020

@ArangoGutierrez
Copy link
Contributor

Heya @madorn , is it working for you? or do we need some extra work here :)

@jaypal17
Copy link

I'm experiencing the a similar issue when installing through the web console on openshift.
My environment:
crc version: 1.6.0+8ef676f
OpenShift version: 4.3.0
Error msg I get:
Failed: install failed: Timeout: deployment nfd-operator not ready before timeout: deployment "nfd-operator" exceeded its progress deadline
I also see this:
installing: Waiting: waiting for deployment nfd-operator to become ready: Waiting for rollout to finish: 0 of 1 updated replicas are available...

@madorn
Copy link
Author

madorn commented Mar 24, 2020

Heya @madorn , is it working for you? or do we need some extra work here :)

Yes it is working now!

@madorn
Copy link
Author

madorn commented Mar 24, 2020

I'm experiencing the a similar issue when installing through the web console on openshift.
My environment:
crc version: 1.6.0+8ef676f
OpenShift version: 4.3.0
Error msg I get:
Failed: install failed: Timeout: deployment nfd-operator not ready before timeout: deployment "nfd-operator" exceeded its progress deadline
I also see this:
installing: Waiting: waiting for deployment nfd-operator to become ready: Waiting for rollout to finish: 0 of 1 updated replicas are available...

Justin, I don't think Node Discovery will run on CRC.....needs to be multi-node environment, i.e. master and work nodes properly labeled.

@mythi
Copy link

mythi commented Mar 25, 2020

Is this expected to work on vanilla kubernetes? I get the same error as originally reported.

@madorn
Copy link
Author

madorn commented Mar 26, 2020

@mythi how did you install the Operator on the vanilla kubernetes cluster? Is OLM installed in the cluster?

@mythi
Copy link

mythi commented Mar 26, 2020

@madorn I used make deploy based on instructions in the README. What is OLM?

@madorn
Copy link
Author

madorn commented Mar 27, 2020

@mythi
Copy link

mythi commented Mar 30, 2020

@madorn ah, thanks I had that missing but even with that the same error occurs:

$ kubectl logs nfd-operator-5ffb978c7c-pv64k -n openshift-nfd
{"level":"info","ts":1585594183.2338152,"logger":"cmd","msg":"Go Version: go1.13.8"}
{"level":"info","ts":1585594183.2338634,"logger":"cmd","msg":"Go OS/Arch: linux/amd64"}
{"level":"info","ts":1585594183.2338767,"logger":"cmd","msg":"Version of operator-sdk: v0.4.0+git"}
{"level":"info","ts":1585594183.2342427,"logger":"leader","msg":"Trying to become the leader."}
{"level":"info","ts":1585594183.2773724,"logger":"leader","msg":"Found existing lock with my name. I was likely restarted."}
{"level":"info","ts":1585594183.2774174,"logger":"leader","msg":"Continuing as the leader."}
{"level":"info","ts":1585594183.7321594,"logger":"controller-runtime.metrics","msg":"metrics server is starting to listen","addr":":8080"}
{"level":"info","ts":1585594183.73271,"logger":"cmd","msg":"Registering Components."}
{"level":"info","ts":1585594183.7331934,"logger":"cmd","msg":"Starting the Cmd."}
{"level":"info","ts":1585594183.7336092,"logger":"controller-runtime.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="}
{"level":"info","ts":1585594183.733983,"logger":"controller-runtime.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="}
{"level":"info","ts":1585594183.7342155,"logger":"controller-runtime.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="}
{"level":"info","ts":1585594183.7343142,"logger":"controller-runtime.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="}
{"level":"info","ts":1585594183.7344038,"logger":"controller-runtime.controller","msg":"Starting EventSource","controller":"nodefeaturediscovery-controller","source":"kind source: /, Kind="}
{"level":"info","ts":1585594183.7346737,"logger":"controller-runtime.manager","msg":"starting metrics server","path":"/metrics"}
{"level":"error","ts":1585594185.6808777,"logger":"controller-runtime.source","msg":"if kind is a CRD, it should be installed before calling Start","kind":"SecurityContextConstraints.security.openshift.io","error":"no matches for kind \"SecurityContextConstraints\" in version \"security.openshift.io/v1\"","stacktrace":"github.com/go-logr/zapr.(*zapLogger).Error\n\t/go/src/github.com/openshift/cluster-nfd-operator/vendor/github.com/go-logr/zapr/zapr.go:128\nsigs.k8s.io/controller-runtime/pkg/source.(*Kind).Start\n\t/go/src/github.com/openshift/cluster-nfd-operator/vendor/sigs.k8s.io/controller-runtime/pkg/source/source.go:88\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func1\n\t/go/src/github.com/openshift/cluster-nfd-operator/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:165\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start\n\t/go/src/github.com/openshift/cluster-nfd-operator/vendor/sigs.k8s.io/controller-runtime/pkg/internal/controller/controller.go:198\nsigs.k8s.io/controller-runtime/pkg/manager.(*controllerManager).startLeaderElectionRunnables.func1\n\t/go/src/github.com/openshift/cluster-nfd-operator/vendor/sigs.k8s.io/controller-runtime/pkg/manager/internal.go:477"}
{"level":"error","ts":1585594185.6810625,"logger":"cmd","msg":"Manager exited non-zero","error":"no matches for kind \"SecurityContextConstraints\" in version \"security.openshift.io/v1\"","stacktrace":"github.com/go-logr/zapr.(*zapLogger).Error\n\t/go/src/github.com/openshift/cluster-nfd-operator/vendor/github.com/go-logr/zapr/zapr.go:128\nmain.main\n\t/go/src/github.com/openshift/cluster-nfd-operator/cmd/manager/main.go:106\nruntime.main\n\t/usr/local/go/src/runtime/proc.go:203"}

Image ID: quay.io/openshift-psap/cluster-nfd-operator@sha256:c0924c5a8297864a396d1bfa69a1f42f8bb2e481e65547099ec5c224c13cb201

@madorn
Copy link
Author

madorn commented Mar 31, 2020

@mythi The Operator you are attempting to install only works on an OpenShift environment - you can see the Operator is looking for "kind":"SecurityContextConstraints.security.openshift.io".

This endpoint is only found in Openshift environments.

Because this specific issue deals with installing NFD via OperatorHub, I am going to close this issue but please open new issue and specify the type of Kubernetes distribution you are using and Kubernetes version and error.

@madorn madorn closed this as completed Mar 31, 2020
@mythi
Copy link

mythi commented Apr 1, 2020

please open new issue and specify the type of Kubernetes distribution you are using and
Kubernetes version and error.

I can work without the operator and deploy NFD manually for now but it'd be really nice if this had the option to deploy on vanilla kubernetes too (my setup is just a barametal kubeadm install).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants