Cloud Run service fails to apply update #832

knowhoper · 2023-06-23T09:03:06Z

Checklist

I did not find a related open issue.
I did not find a solution in the troubleshooting guide: (https://cloud.google.com/config-connector/docs/troubleshooting)
If this issue is time-sensitive, I have submitted a corresponding issue with GCP support.

Bug Description

I am seeing the error

{"name":"xxx-api","namespace":"xxx-api"},"namespace":"xxx-api","name":"xxx-api","reconcileID":"feb918c5-8c38-4e40-b0c6-e4f080b2660b","error":"Update call failed: error generating the diffs from desired state: \"Location\" must be set"}

When updating the CloudRun service. Below is the resource YAML. Noting deleting and re-creating works.

apiVersion: run.cnrm.cloud.google.com/v1beta1
kind: RunService
metadata:
  name: xxx-api
  namespace: xxx-api
  annotations:
    argocd.argoproj.io/sync-wave: "20"
    cnrm.cloud.google.com/project-id: acme-uat
spec:
  ingress: "INGRESS_TRAFFIC_ALL"
  launchStage: "GA"
  location: australia-southeast1
  projectRef:
    external: projects/acme-uat
  template:
    containerConcurrency:  80
    scaling:
      minInstanceCount: 1
      maxInstanceCount: 2
    revision: xxx-api-v1-4-44-uatj
    serviceAccountRef:
      external: "serviceAccount:svc-xxx-api@acme-uat.iam.gserviceaccount.com"
    containers:
      - env:
          - name: default_badge_limit
            value: "6"
          - name: bucket_id
            value: "acme-public-images-uat"
        image: "australia-southeast1-docker.pkg.dev/acme-dev-tooling/acme-docker/xxx-api:v1.4.44-uat"
        ports:
          - name: http1
            containerPort: 5000
        resources:
          limits:
            cpu: 1000m
            memory: 1Gi
    serviceAccountRef:
      external: svc-xxx-api@acme-uat.iam.gserviceaccount.com
    vpcAccess:
      connectorRef:
        external: projects/acme-uat/locations/australia-southeast1/connectors/acme-svpc
      egress: PRIVATE_RANGES_ONLY
  traffic:
    - percent: 100
      type: "TRAFFIC_TARGET_ALLOCATION_TYPE_REVISION"
      revision: xxx-api-v1-4-44-uatj

Additional Diagnostic Information

None

Kubernetes Cluster Version

1.25.8-gke.1000

Config Connector Version

1.105.0

Config Connector Mode

cluster mode

Log Output

{"severity":"info","timestamp":"2023-06-23T08:59:13.837Z","logger":"runservice-controller","msg":"starting reconcile","resource":{"namespace":"xxx-api","name":"xxx-api"}}
{"severity":"error","timestamp":"2023-06-23T08:59:13.917Z","msg":"Reconciler error","controller":"runservice-controller","controllerGroup":"run.cnrm.cloud.google.com","controllerKind":"RunService","RunService":{"name":"xxx-api","namespace":"xxx-api"},"namespace":"xxx-api","name":"xxx-api","reconcileID":"f405ed57-31a1-4aab-a1f0-f1e4977dec34","error":"Update call failed: error generating the diffs from desired state: "Location" must be set"}

Steps to reproduce the issue

Create CloudRun service. Add a revision.

YAML snippets

apiVersion: run.cnrm.cloud.google.com/v1beta1
kind: RunService
metadata:
  name: xxx-api
  namespace: xxx-api
  annotations:
    argocd.argoproj.io/sync-wave: "20"
    cnrm.cloud.google.com/project-id: acme-uat
spec:
  ingress: "INGRESS_TRAFFIC_ALL"
  launchStage: "GA"
  location: australia-southeast1
  projectRef:
    external: projects/acme-uat
  template:
    containerConcurrency:  80
    scaling:
      minInstanceCount: 1
      maxInstanceCount: 2
    revision: xxx-api-v1-4-44-uatj
    serviceAccountRef:
      external: "serviceAccount:svc-xxx-api@acme-uat.iam.gserviceaccount.com"
    containers:
      - env:
          - name: default_badge_limit
            value: "6"
          - name: bucket_id
            value: "acme-public-images-uat"
        image: "australia-southeast1-docker.pkg.dev/acme-dev-tooling/acme-docker/xxx-api:v1.4.44-uat"
        ports:
          - name: http1
            containerPort: 5000
        resources:
          limits:
            cpu: 1000m
            memory: 1Gi
    serviceAccountRef:
      external: svc-xxx-api@acme-uat.iam.gserviceaccount.com
  traffic:
    - percent: 100
      type: "TRAFFIC_TARGET_ALLOCATION_TYPE_REVISION"
      revision: xxx-api-v1-4-44-uatj

The text was updated successfully, but these errors were encountered:

diviner524 · 2023-06-27T18:34:48Z

@knowhoper thanks for reporting this issue! We saw other users report similar errors as well and we are looking into it.

knowhoper · 2023-07-03T11:00:50Z

Hi @diviner524, thanks for the response. Any ETA on a fix? This is preventing us deploying some workloads.

We have found a workaround by adding the cnrm.cloud.google.com/deletion-policy: abandon annotation to the CR service, then forcing a recreate of the service in GKE. This appears to clear the issue but it's less than ideal.

tsallou · 2023-07-06T11:32:06Z

I have the same issue using the CloudSchedulerJob ressource : Update call failed: error generating the diffs from desired state: "Location" must be set

knowhoper · 2023-07-07T00:17:32Z

@tsallou the way we work around this - we use ArgoCD to manage GKE resources BTW, was to set the annotation cnrm.cloud.google.com/deletion-policy: abandon in our Cloud Run services, so they are never removed from GCP. Then delete the CR crd from the relevant namespace in GKE, removing finalisers if necessary.

jpeterson-bestbuy · 2023-08-25T18:46:04Z

I have the same issue using the CloudSchedulerJob ressource : Update call failed: error generating the diffs from desired state: "Location" must be set

We also observed this issue with CloudSchedulerJob. Abandon/recreate worked to get the resource back into a healthy state. @diviner524 -- any idea on when we can expect a resolution?

davireis · 2023-09-30T11:03:02Z

I am also observing this with a skaffold + config-connector setup. The deletion-policy workaround works as long as I delete the CR by hand before running skaffold. If I run skaffold without the manual delete, the Location problems shows up and never goes away. Then I am left with another workaround: creating new clusters. But when I try to delete the old clusters, I get

ERROR: (gcloud.anthos.config.controller.delete) Operation https://krmapihosting.googleapis.com/v1/projects/trash-362115/locations/us-central1/operations/operation-1696069559461-60690f79b2821-90f34dd1-a106ef60 has not finished in 1800 seconds. The operations may still be underway remotely and may still succeed; use gcloud list and describe commands or https://console.developers.google.com/ to check resource state.

And now my bill is going through the roof. Would love to see things more robust here as skaffold + config connector is a great experience when it works. Let me know if I can help somehow.

diviner524 · 2023-09-30T17:46:21Z

We just released CloudRun as a stable CRD in the latest 1.110.0 release, with a few more bug fixes on this resource. Please give it a try and see if this location issue persists.

jpeterson-bestbuy · 2023-11-14T20:15:02Z

Thank you @diviner524 -- I can confirm that we're no longer affected by this issue.

diviner524 · 2023-11-14T20:18:02Z

@jpeterson-bestbuy Thank you for letting us know!

knowhoper added the bug Something isn't working label Jun 23, 2023

fmichaelobrien mentioned this issue Jul 6, 2023

Use Case: POC Serverless Canary Application (frontend/backend/persistence) as a Profile 3 LZ workload with PSC, PSA and VPC-SC GoogleCloudPlatform/pubsec-declarative-toolkit#418

Open

jpeterson-bestbuy mentioned this issue Oct 20, 2023

CloudSchedulerJob fails to update with '"Location" must be set' error #956

Open

3 tasks

diviner524 closed this as completed Nov 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cloud Run service fails to apply update #832

Cloud Run service fails to apply update #832

knowhoper commented Jun 23, 2023

diviner524 commented Jun 27, 2023

knowhoper commented Jul 3, 2023 •

edited

Loading

tsallou commented Jul 6, 2023

knowhoper commented Jul 7, 2023

jpeterson-bestbuy commented Aug 25, 2023

davireis commented Sep 30, 2023

diviner524 commented Sep 30, 2023

jpeterson-bestbuy commented Nov 14, 2023

diviner524 commented Nov 14, 2023

Cloud Run service fails to apply update #832

Cloud Run service fails to apply update #832

Comments

knowhoper commented Jun 23, 2023

Checklist

Bug Description

Additional Diagnostic Information

Kubernetes Cluster Version

Config Connector Version

Config Connector Mode

Log Output

Steps to reproduce the issue

YAML snippets

diviner524 commented Jun 27, 2023

knowhoper commented Jul 3, 2023 • edited Loading

tsallou commented Jul 6, 2023

knowhoper commented Jul 7, 2023

jpeterson-bestbuy commented Aug 25, 2023

davireis commented Sep 30, 2023

diviner524 commented Sep 30, 2023

jpeterson-bestbuy commented Nov 14, 2023

diviner524 commented Nov 14, 2023

knowhoper commented Jul 3, 2023 •

edited

Loading