Grafana Dashboard not updating the deployments #1854
What versions of seldon-core-operator and seldon-core-analytics are you running? Master was recently updated and requires #1809 to match. Otherwise, you need to make sure your versions are aligned with the compatibility table: https://docs.seldon.io/projects/seldon-core/en/latest/reference/upgrading.html#wrapper-compatibility-table. Once you share the versions of both, we'll have better insight into what the issue is.
Currently, I am using seldon-core 1.1.0 and seldon-core-analytics 1.1.0. Seldon-core-analytics is configured under
Could you try port-forwarding to Prometheus instead of Grafana, and in Prometheus run each of:
That should help us determine the problem. I'm guessing you have left the executor enabled, as that is now the default instead of the Java engine. The metrics changed when that switch was made, but with the new analytics I'd expect the new models to be showing metrics and not the old ones.
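The same kind of check can also be scripted against the Prometheus HTTP API rather than typed into the UI. The following is only a sketch and is not taken from the thread: it assumes Prometheus has been port-forwarded to `localhost:9090`, it uses Prometheus' built-in `up` metric as a stand-in for the queries requested above, and the `kubernetes_pod_name` label is an assumption that depends on the scrape relabelling in your installation.

```python
import json
import urllib.parse
import urllib.request

# Assumption: Prometheus is reachable here after a local port-forward.
PROMETHEUS_URL = "http://localhost:9090"


def build_query_url(base_url, promql):
    """Return the instant-query URL (/api/v1/query) for a PromQL expression."""
    params = urllib.parse.urlencode({"query": promql})
    return base_url.rstrip("/") + "/api/v1/query?" + params


def label_values(response, label):
    """Collect the distinct values of one label from an instant-query reply."""
    if response.get("status") != "success":
        raise ValueError("query failed: %r" % response.get("error"))
    series = response["data"]["result"]
    return sorted({s["metric"][label] for s in series if label in s["metric"]})


def run_query(promql, base_url=PROMETHEUS_URL):
    """Send the query to Prometheus and decode the JSON reply."""
    url = build_query_url(base_url, promql)
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)


# Example usage (needs a live port-forward, so it is left commented out):
# reply = run_query('up{kubernetes_namespace="default"}')
# print(label_values(reply, "kubernetes_pod_name"))  # hypothetical label name
```

If a redeployed model's pod is missing from the returned list, Prometheus is not scraping it, which would place the problem before Grafana rather than in the dashboard.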
I ran the following command to port-forward to Prometheus: Since my namespace is default, I changed all the Test commands run in Prometheus:
Result: Shows all the old models but not the new models
Result: Shows nothing
Result: Shows nothing
Is there something that needs to be set up first, or am I doing something wrong?
Oh, in that first query kubernetes_namespace= What version of Seldon Core were the old models created with? Is it an option for you to uninstall and reinstall seldon-core-analytics? Is that how you upgraded seldon-core-analytics?
It is a fresh installation of seldon-core-analytics; all the models created are using Just to clarify what I meant by old and new models: Grafana will not show any newly deployed models and will not show any models that were re-deployed.
I guess all the pods are up and responding to requests? Would you be able to copy a working and a non-working pod's manifest so we can compare?
FWIW, it has been working for me, so I'm just thinking about possible differences: 1) I've been running latest master, and 2) I've been installing by cloning the repo and doing a helm install from the local path rather than from the published/hosted version of the chart.
I ended up with this same problem when using a router. All pods in the graph are functional, and when I open a bash shell in a pod and run the following, the data is there:
When running non-router models, nothing comes in at all. This is consistent across all 1.1.0 deployments, using the Helm charts (
This is my deployment.yaml.
Non-working:
Working:
I did download the seldon-core repo locally in my cloud before using helm install.
@Deunitato-sentient Interesting. So the working and non-working ones are quite different. Are you sure the failure is related to the model being newly deployed and not to differences in the manifest? Have you been able to test that (e.g. by removing a working one and adding it in again)?
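To narrow down which manifest differences matter, it can help to list them field by field instead of comparing by eye. The helper below is a minimal sketch, not something used in the thread: it assumes both manifests have already been loaded into Python dicts (for example from `kubectl get ... -o json` output), and the two toy manifests are hypothetical placeholders rather than the deployments posted above.

```python
_MISSING = object()  # sentinel for "path not present in this manifest"


def flatten(obj, prefix=""):
    """Flatten nested dicts and lists into {dotted.path: leaf_value}."""
    items = {}
    if isinstance(obj, dict) and obj:
        for key, value in obj.items():
            path = f"{prefix}.{key}" if prefix else str(key)
            items.update(flatten(value, path))
    elif isinstance(obj, list) and obj:
        for index, value in enumerate(obj):
            items.update(flatten(value, f"{prefix}[{index}]"))
    else:
        # Scalars and empty containers are treated as leaves.
        items[prefix] = obj
    return items


def diff_manifests(working, broken):
    """Return {path: (working_value, broken_value)} for every differing leaf.

    A path that is absent on one side is reported with None on that side.
    """
    a, b = flatten(working), flatten(broken)
    return {
        path: (a.get(path), b.get(path))
        for path in sorted(a.keys() | b.keys())
        if a.get(path, _MISSING) != b.get(path, _MISSING)
    }


# Hypothetical manifests, for illustration only.
working = {"spec": {"containers": [{"name": "classifier", "image": "example:1"}]}}
broken = {"spec": {"containers": [{"name": "model", "image": "example:1"}]}}
for path, (good, bad) in diff_manifests(working, broken).items():
    print(f"{path}: working={good!r} non-working={bad!r}")
```

Redeploying the non-working manifest with one differing field changed at a time would then show whether a single field, rather than the age of the deployment, decides if the model appears in Grafana.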
So I managed to see some new deployments in Grafana when I made a different deployment and used a different container name. May I ask whether this is related to issue #618 and whether there are any fixes for this bug?
If I delete the deployment YAML file and reapply it, it no longer shows up in Grafana.
@Deunitato-sentient could you test with the latest version of Seldon Core and with an updated wrapper? We have done some testing of the Grafana metrics and it all seems to work.
Please reopen if this can be replicated on the latest version.
I have encountered a problem where newly deployed deployments are not being captured by the Grafana dashboard for seldon-core-analytics. I am able to view any old deployments that I made before creating the Grafana dashboard, which I set up following the metrics Helm example in the documentation.
Is this a bug, or did I not do something correctly?