Prometheus operator helm chart fun and games #824

gitfool · 2019-10-01T21:26:52Z

I've been struggling to install the prometheus-operator to an EKS cluster via the helm chart.

I wanted everything to be tucked into a "monitoring" namespace, and I eventually worked out what I needed to disable, which makes sense in hindsight given the EKS control plane, and now have the following:

const monitoringNamespace = new k8s.core.v1.Namespace("monitoring", { metadata: { name: "monitoring" } }, { provider: provider });

function fixGrafanaTest(obj: any) {
    if (obj.metadata.name === "po-grafana-test") {
        if (obj.kind === "Pod") {
            obj.metadata.annotations = {
                "pulumi.com/skipAwait": "true"
            };
        }
    }
}

function setMonitoringNamespace(obj: any) {
    if (obj.metadata.namespace === undefined) {
        obj.metadata.namespace = "monitoring";
    }
}

const prometheusOperatorChart = new k8s.helm.v2.Chart("po", {
    repo: "stable",
    chart: "prometheus-operator",
    version: "6.11.0",
    namespace: "monitoring",
    transformations: [ fixGrafanaTest, setMonitoringNamespace ],
    values: {
        kubeControllerManager: { enabled: false },
        kubeEtcd: { enabled: false },
        kubeScheduler: { enabled: false },
        kubeTargetVersionOverride: k8sVersion,
        prometheusOperator: { createCustomResource: false }
    }
}, {
    dependsOn: monitoringNamespace,
    provider: provider
});

I'd like highlight the following issues:

If the prometheusOperatorChart referenced monitoringNamespace using namespace: monitoringNamespace.metadata.name, then I'd get a warning about [Can't preview] all chart values must be known ahead of time to generate an accurate preview, so I'm using the same constant value instead. Maybe this could be improved? (Propagate inputs to outputs during preview. pulumi#3245).
The Grafana test is using a helm hook on test-success and is always failing its grafana health check, which I initially thought was due to the timing of helm hooks not being supported when using helm template, like Pulumi does behind the scenes, but then I'd expect subsequent runs to succeed since Grafana is running then, but it always fails so probably something more subtle with the test itself.
- Subsequently there is the problem that Pulumi waits until it times out; handily I could add an annotation to "skipAwait", which works around my impatience, but then it seems to me that Pulumi could be smarter here since the pod specifies to never restart, so it should stop waiting after the first failure.
- So now it fails fast, but I still need a workaround to avoid the failing test, like using another transformation to neuter or preferably remove it altogether, along with all of its supporting resource detritus. I can see how to neuter the pod by modifying its image etc, but not so much the supporting resources.

The text was updated successfully, but these errors were encountered:

gitfool · 2019-10-01T21:33:18Z

@hausdorff mentioned in Slack a possible hack to remove resources in a transformation:

if you set apiVersion: "v1" and kind: "List" I think it should work

I'll give that a go in the meantime, but it would obviously be better to have a first-class way to remove resources via a transformation!

pgavlin · 2019-10-01T21:36:33Z

I'll give that a go in the meantime, but it would obviously be better to have a first-class way to remove resources via a transformation!

FWIW, that capability is tracked via #486

gitfool · 2019-10-01T22:33:28Z

I used kubectl to run the equivalent Grafana test and it worked:

kubectl exec -it -n monitoring -c grafana po-grafana-56fd4bc598-lb94k bash
curl -s -o /dev/null -I -w '%{http_code}' http://po-grafana/api/health
200

... so I'm still not sure why it's always failing.

Meanwhile, I'm very pleased to say the hack to remove resources in a transformation works for me:

function removeGrafanaTest(obj: any) {
    if (obj.metadata.name === "po-grafana-test") {
        obj.apiVersion = "v1";
        obj.kind = "List";
        obj.items = [];
    }
}

lblackstone · 2022-12-21T18:18:26Z

Looks like this was fixed in #486

lblackstone mentioned this issue Nov 6, 2019

Previews with computed values are not useful (show output<string>) pulumi/pulumi#3455

Closed

lblackstone added kind/bug Some behavior is incorrect or out of spec resolution/fixed This issue was fixed labels Dec 21, 2022

lblackstone closed this as completed Dec 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prometheus operator helm chart fun and games #824

Prometheus operator helm chart fun and games #824

gitfool commented Oct 1, 2019

gitfool commented Oct 1, 2019

pgavlin commented Oct 1, 2019

gitfool commented Oct 1, 2019

lblackstone commented Dec 21, 2022

Prometheus operator helm chart fun and games #824

Prometheus operator helm chart fun and games #824

Comments

gitfool commented Oct 1, 2019

gitfool commented Oct 1, 2019

pgavlin commented Oct 1, 2019

gitfool commented Oct 1, 2019

lblackstone commented Dec 21, 2022