Facing error err="parsing YAML file /etc/prometheus/config_out/prometheus.env.yaml: empty duration string" #5197

Closed
pankdhnd opened this issue Dec 1, 2022 · 12 comments


@pankdhnd

pankdhnd commented Dec 1, 2022

What happened?
We upgraded to Prometheus Operator version 0.61.1, and after the upgrade we found that the Prometheus pods were failing with the following error:

level=error msg="Error loading config (--config.file=/etc/prometheus/config_out/prometheus.env.yaml)" file=/etc/prometheus/config_out/prometheus.env.yaml err="parsing YAML file /etc/prometheus/config_out/prometheus.env.yaml: empty duration string"

We load the configuration from a secret; the config-reloader parses it and writes it into the /etc/prometheus/config_out/prometheus.env.yaml file.

We found that global.scrape_interval is not being populated in the output file and ends up as an empty string, which causes Prometheus to keep crashing. Below is a snippet of the prometheus.env.yaml file:

global:
  evaluation_interval: 40s
  scrape_interval: ""
  external_labels:
    prometheus: monitoring-prometheus
    prometheus_replica: monitoring-prometheus-0

If we downgrade the operator to the previous version (0.60.1), everything works fine.

How to reproduce it (as minimally and precisely as possible):

  1. Deploy the Prometheus stack with version 0.60.1 of the Prometheus operator
  2. Edit the operator deployment and change the operator image tag to 0.61.1 (a sketch of this step is shown below)
  3. Let the operator and other pods restart.
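
For reference, step 2 can be done with kubectl set image. This is only a sketch: it assumes the operator runs as a Deployment named prometheus-operator in the monitoring namespace, with a container of the same name, so adjust the names to your install:

kubectl -n monitoring set image deployment/prometheus-operator \
  prometheus-operator=quay.io/prometheus-operator/prometheus-operator:v0.61.1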

Environment
Any. We checked on OCP, AWS and Azure

  • Prometheus Operator version: v0.61.1

  • Kubernetes version information: 1.23.5

  • Prometheus Operator Logs:

level=warn ts=2022-12-01T11:20:58.497128297Z caller=operator.go:2018 component=prometheusoperator msg="skipping servicemonitor" error="invalid scrapeInterval \"\": empty duration string" servicemonitor=monitoring-grafana namespace=namespace prometheus=monitoring-prometheus
@simonpasquier
Contributor

Have you upgraded the prometheus-operator CRDs to the same v0.61.1 version?
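
One way to check which CRD version is actually installed: the CRDs shipped in the operator bundle carry an operator.prometheus.io/version annotation (at least in recent releases; verify against your own manifests), so something like the following shows what the cluster currently has:

kubectl get crd prometheuses.monitoring.coreos.com -o yaml | grep operator.prometheus.io/version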

@slashpai
Contributor

slashpai commented Dec 2, 2022

We had removed some of the default values and validations from the operator code in v0.60.1, since they are already covered by the OpenAPI schema. So, as Simon mentioned, you would need to update the CRDs to 0.61.0 to make it work.
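
In other words, the defaulting now lives in the CRD's OpenAPI schema rather than in the operator. In an up-to-date prometheuses.monitoring.coreos.com CRD the relevant fields look roughly like this (an illustrative excerpt, not copied verbatim from a specific release):

scrapeInterval:
  default: 30s
  type: string
evaluationInterval:
  default: 30s
  type: string

With an older CRD that lacks these defaults, an unset field stays empty and the operator renders scrape_interval: "" into prometheus.env.yaml.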

@bdbrink

bdbrink commented Dec 13, 2022

Seeing the same issue after updating the CRDs + prometheus-operator to v0.61.1.

EDIT: Fixed after updating the CRDs and prometheus-operator; I had to restart the operator after applying the CRDs, then restart Prometheus to get it working.

@billiford

I was still having this problem with the kube-prometheus-stack, so I wanted to share how I debugged and fixed it.

I had gotten the stack running in one cluster but not another, so I compared the config.file contents, which according to kubectl -n monitoring describe po prometheus-kube-prometheus-stack-prometheus-0 are held in the volume named config, backed by the secret prometheus-kube-prometheus-stack-prometheus.

I dumped each secret's prometheus.yaml.gz data to a file.

$ echo '<SECRET_CONTENT>' | base64 -d | gunzip > /tmp/config-<CLUSTER_NAME>.yaml
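
If you'd rather not copy the base64 content by hand, the same data can be pulled straight from the cluster (secret name taken from the describe output above; the backslash-escaped dots are standard kubectl jsonpath syntax for keys containing dots):

$ kubectl -n monitoring get secret prometheus-kube-prometheus-stack-prometheus \
    -o jsonpath='{.data.prometheus\.yaml\.gz}' | base64 -d | gunzip > /tmp/config-<CLUSTER_NAME>.yaml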

Then I performed a diff:

$ diff /tmp/config-<WORKING_CLUSTER>.yaml /tmp/config-<BROKEN_CLUSTER>.yaml
2,3c2,3
<   evaluation_interval: 30s
<   scrape_interval: 30s
---
>   evaluation_interval: ""
>   scrape_interval: ""

Sure enough, the evaluation and scrape intervals were not being set on lines 2 and 3!

To fix it, I set them explicitly in my Helm values, redeployed, and bounced the Prometheus pod (see the commands after the snippet below).

prometheus:
  prometheusSpec:
    scrapeInterval: 30s
    evaluationInterval: 30s
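
For reference, applying that and bouncing the pod looks roughly like this; the release name kube-prometheus-stack and the prometheus-community repo alias are assumptions based on a default install, so adjust to yours:

$ helm upgrade -n monitoring kube-prometheus-stack prometheus-community/kube-prometheus-stack -f values.yaml
$ kubectl -n monitoring delete pod prometheus-kube-prometheus-stack-prometheus-0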

@simonpasquier
Contributor

@billiford it's very likely that you have a mismatch between the CRD version and the operator version (e.g. operator version newer than CRD version).

@billiford

I deployed all the CRDs that I found here to both clusters.

It would be nice to know which CRD specifically is the root of this problem and why it is causing these intervals to not be set.

@simonpasquier
Contributor

It is the Prometheus CRD. You need to check that spec.scrapeInterval and spec.evaluationInterval come back with their default values when left unset.
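
A quick way to check this against a live cluster (the Prometheus resource name here is an assumption based on a default kube-prometheus-stack install): if the CRD defaults are in place, both fields should print 30s even when they are not set in your manifest.

kubectl -n monitoring get prometheus kube-prometheus-stack-prometheus \
  -o jsonpath='{.spec.scrapeInterval}{"\n"}{.spec.evaluationInterval}{"\n"}'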

@darkpwny

darkpwny commented Feb 9, 2023

I think the bug is that scrapeInterval and evaluationInterval are defined in values.yaml as empty strings:

line 2624 : scrapeInterval: ""
line 2632 : evaluationInterval: ""

These either need to be commented out so the defaults get applied, or set explicitly to the default value "30s".

I edited my kube-prometheus-stack/values.yaml so that the values were:

line 2624 : scrapeInterval: "30s"
line 2632 : evaluationInterval: "30s"

and then installed via helm:

helm install promstack ./kube-prometheus-stack --namespace monitoring --create-namespace -f kube-prometheus-stack/values.yaml

@bvanelli

As others stated, the CRDs were incompatible and I was also getting this error message. Upgrading the CRDs solved the problem for me. I was installing chart version 45.X.X, so the following CRDs were applicable:

kubectl apply --server-side -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/v0.63.0/example/prometheus-operator-crd/monitoring.coreos.com_alertmanagerconfigs.yaml
kubectl apply --server-side -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/v0.63.0/example/prometheus-operator-crd/monitoring.coreos.com_alertmanagers.yaml
kubectl apply --server-side -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/v0.63.0/example/prometheus-operator-crd/monitoring.coreos.com_podmonitors.yaml
kubectl apply --server-side -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/v0.63.0/example/prometheus-operator-crd/monitoring.coreos.com_probes.yaml
kubectl apply --server-side -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/v0.63.0/example/prometheus-operator-crd/monitoring.coreos.com_prometheuses.yaml
kubectl apply --server-side -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/v0.63.0/example/prometheus-operator-crd/monitoring.coreos.com_prometheusrules.yaml
kubectl apply --server-side -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/v0.63.0/example/prometheus-operator-crd/monitoring.coreos.com_servicemonitors.yaml
kubectl apply --server-side -f https://raw.githubusercontent.com/prometheus-operator/prometheus-operator/v0.63.0/example/prometheus-operator-crd/monitoring.coreos.com_thanosrulers.yaml

After installing them, the problem went away. See more in the documentation: https://github.com/prometheus-community/helm-charts/tree/main/charts/kube-prometheus-stack#upgrading-an-existing-release-to-a-new-major-version
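
If the new config still isn't picked up after applying the CRDs, restarting the operator and then Prometheus (as bdbrink noted above) usually does it. A sketch, assuming default kube-prometheus-stack resource names:

kubectl -n monitoring rollout restart deployment kube-prometheus-stack-operator
kubectl -n monitoring delete pod prometheus-kube-prometheus-stack-prometheus-0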

@pankdhnd
Author

The issue was resolved after the CRD update. Thanks everyone for the help :-)

@sfxworks

It's still in the default values.yaml: https://github.com/prometheus-community/helm-charts/blob/main/charts/kube-prometheus-stack/values.yaml#L2815

@dauberson

dauberson commented Oct 19, 2023

I had the same issue, and the problem was that the spec.scrapeInterval and spec.evaluationInterval values were empty.
I am using Terraform to provision the resources (EKS, charts, etc.). I changed the spec values manually, but somehow they rolled back to the old values. To fix it, I copied this file, set the values there, and referenced it in the Terraform helm_release (see the values file sketch after the resource block):

resource "helm_release" "chart_prometheus" {
  name       = "kube-prometheus-stack"
  chart      = "kube-prometheus-stack"
  version    = "51.9.4"
  repository = "https://prometheus-community.github.io/helm-charts"
  namespace  = "monitoring"


  values = compact(distinct(concat([
    file("${path.module}/configs/prometheus-values.yaml"),
  ])))
}
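
The referenced prometheus-values.yaml would then pin the intervals explicitly, along the lines of billiford's snippet above (30s matches the usual chart defaults; adjust as needed):

prometheus:
  prometheusSpec:
    scrapeInterval: 30s
    evaluationInterval: 30s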
