Error on reconciliation of odh-model-controller #86

Closed
vaibhavjainwiz opened this issue Sep 20, 2023 · 8 comments · Fixed by #100 or #111
Labels
kind/bug Something isn't working rhods-2.5

Comments

@vaibhavjainwiz
Contributor

The following error occurs during reconciliation in odh-model-controller.

2023-09-20T00:50:14Z	ERROR	Reconciler error	{"controller": "inferenceservice", "controllerGroup": "serving.kserve.io", "controllerKind": "InferenceService", "InferenceService": {"name":"caikit-example-isvc","namespace":"kserve-demo"}, "namespace": "kserve-demo", "name": "caikit-example-isvc", "reconcileID": "f399dd1f-1ee1-45c2-badc-c37507bc51fe", "error": "servicemonitors.monitoring.coreos.com \"caikit-example-isvc-metrics\" is invalid: metadata.resourceVersion: Invalid value: 0x0: must be specified for an update"}
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.14.6/pkg/internal/controller/controller.go:329
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.14.6/pkg/internal/controller/controller.go:274
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).Start.func2.2
	/opt/app-root/src/go/pkg/mod/sigs.k8s.io/controller-runtime@v0.14.6/pkg/internal/controller/controller.go:235
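
For context, the API server returns this message when an update request carries an object with no resourceVersion. One common way to hit it is to pass a freshly built "desired" object straight to Update instead of first copying the resourceVersion from the object already stored. Whether that is what happens in odh-model-controller is an assumption here, not something confirmed in this thread. The sketch below reproduces the rule with an in-memory stand-in for the API server; `fakeAPI`, `serviceMonitor`, its `Port` field, `applyNaive`, and `applyFixed` are all hypothetical names used only for illustration.

```go
package main

import (
	"errors"
	"fmt"
)

// serviceMonitor is a hypothetical, stripped-down stand-in for a Kubernetes
// object. Port is an illustrative field, not taken from the real CRD.
type serviceMonitor struct {
	Name            string
	ResourceVersion string
	Port            string
}

var errNotFound = errors.New("not found")

// fakeAPI is an in-memory stand-in for the API server. It enforces one rule:
// an update must carry the resourceVersion of the stored object.
type fakeAPI struct {
	stored map[string]serviceMonitor
	rev    int
}

func newFakeAPI() *fakeAPI {
	return &fakeAPI{stored: map[string]serviceMonitor{}}
}

func (a *fakeAPI) Get(name string) (serviceMonitor, error) {
	obj, ok := a.stored[name]
	if !ok {
		return serviceMonitor{}, errNotFound
	}
	return obj, nil
}

// save stores obj under a fresh resourceVersion.
func (a *fakeAPI) save(obj serviceMonitor) {
	a.rev++
	obj.ResourceVersion = fmt.Sprint(a.rev)
	a.stored[obj.Name] = obj
}

func (a *fakeAPI) Create(obj serviceMonitor) error {
	a.save(obj)
	return nil
}

func (a *fakeAPI) Update(obj serviceMonitor) error {
	if obj.ResourceVersion == "" {
		return errors.New("metadata.resourceVersion: must be specified for an update")
	}
	if obj.ResourceVersion != a.stored[obj.Name].ResourceVersion {
		return errors.New("conflict: the object has been modified")
	}
	a.save(obj)
	return nil
}

// applyNaive sends the freshly built desired object straight to Update.
func applyNaive(a *fakeAPI, desired serviceMonitor) error {
	if _, err := a.Get(desired.Name); errors.Is(err, errNotFound) {
		return a.Create(desired)
	}
	return a.Update(desired) // desired.ResourceVersion is empty here
}

// applyFixed copies the resourceVersion from the stored object first.
func applyFixed(a *fakeAPI, desired serviceMonitor) error {
	existing, err := a.Get(desired.Name)
	if errors.Is(err, errNotFound) {
		return a.Create(desired)
	}
	desired.ResourceVersion = existing.ResourceVersion
	return a.Update(desired)
}

func main() {
	api := newFakeAPI()
	desired := serviceMonitor{Name: "caikit-example-isvc-metrics", Port: "8080"}
	fmt.Println("create:", applyNaive(api, desired))
	desired.Port = "9090"
	fmt.Println("naive update:", applyNaive(api, desired))
	fmt.Println("fixed update:", applyFixed(api, desired))
}
```

The first pass creates the object in both variants, so the failure only shows up on a later reconcile, once the object already exists.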

Currently this error is not visible in the logs because the error object is not sent back. We capture the error and then just ignore it. In the line below, we capture err but never return it to the calling function.
https://github.com/opendatahub-io/odh-model-controller/blob/main/controllers/inferenceservice_controller.go#L99
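
The pattern described above can be sketched without any Kubernetes dependency. This is a minimal illustration, not the code at the linked line: `step`, `failingStep`, `reconcileSwallowing`, and `reconcileReturning` are hypothetical names, and the real reconciler has a different signature.

```go
package main

import (
	"errors"
	"fmt"
)

// step stands in for one sub-reconciler call (hypothetical name).
type step func() error

// failingStep always fails, standing in for the rejected ServiceMonitor update.
func failingStep() error {
	return errors.New("must be specified for an update")
}

// reconcileSwallowing captures the error and then drops it, so the caller
// sees success and nothing is logged or retried.
func reconcileSwallowing(steps ...step) error {
	for _, s := range steps {
		err := s()
		_ = err // captured, never returned
	}
	return nil
}

// reconcileReturning stops at the first failure and hands it to the caller.
func reconcileReturning(steps ...step) error {
	for _, s := range steps {
		if err := s(); err != nil {
			return fmt.Errorf("reconcile step failed: %w", err)
		}
	}
	return nil
}

func main() {
	fmt.Println("swallowing:", reconcileSwallowing(failingStep))
	fmt.Println("returning: ", reconcileReturning(failingStep))
}
```

In controller-runtime, a non-nil error returned from Reconcile is logged as "Reconciler error" and the request is requeued with backoff, which is why returning the error makes the failure visible.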

@vaibhavjainwiz vaibhavjainwiz self-assigned this Sep 20, 2023
@vaibhavjainwiz
Contributor Author

Depends on the changes in #84.

@heyselbi heyselbi added the kind/bug Something isn't working label Oct 11, 2023
@vaibhavjainwiz vaibhavjainwiz linked a pull request Oct 12, 2023 that will close this issue
3 tasks
@heyselbi
Contributor

@rpancham will retest it. Once the test passes, @vaibhavjainwiz please make sure the fix is in RHODS 2.4.

@heyselbi heyselbi reopened this Oct 25, 2023
@vaibhavjainwiz vaibhavjainwiz linked a pull request Oct 26, 2023 that will close this issue
3 tasks
@bdattoma

bdattoma commented Nov 8, 2023

@vaibhavjainwiz how can I try to reproduce the issue?

@vaibhavjainwiz
Contributor Author

This error will not be visible in the logs, but you can see its impact on the ServiceMeshMemberRoll.
If you delete the namespace from the ServiceMeshMemberRoll of a running application, the operator should ideally add it back, but because of this error it is not added back.

@bdattoma

> This error will not be visible in the logs, but you can see its impact on the ServiceMeshMemberRoll. If you delete the namespace from the ServiceMeshMemberRoll of a running application, the operator should ideally add it back, but because of this error it is not added back.

I tried, and I don't see any error in the odh-model-controller pod.
However, the ServiceMeshMemberRoll doesn't get reconciled automatically.

@heyselbi heyselbi reopened this Nov 16, 2023
@israel-hdez israel-hdez self-assigned this Nov 16, 2023
@israel-hdez
Contributor

I updated the label to rhods-2.5, because the fix is missing in 2.4.
Once there is a build for 2.5, I'll re-check and close the issue if the fix is correctly in place.

@bdattoma

> I updated the label to rhods-2.5, because the fix is missing in 2.4. Once there is a build for 2.5, I'll re-check and close the issue if the fix is correctly in place.

sounds good, thank you!

@israel-hdez
Contributor

The issue seems to be gone in 2.5-RC.
Closing.
