Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pufanyi/MIMICIT is now data-only #2805

Merged
merged 1 commit into from
May 15, 2024

Conversation

severo
Copy link
Collaborator

@severo severo commented May 15, 2024

Part of #2804 (the dataset had already been moved to data-only by the maintainers).

https://huggingface.co/datasets/pufanyi/MIMICIT/tree/main

Copy link

ArgoCD Diff for commit e203fe3

Updated at 5/15/2024, 10:19:27 AM CEST

App: datasets-server-prod
YAML generation: Success 🟢
App sync status: Out of Sync ⚠️

===== apps/Deployment datasets-server/prod-datasets-server-admin ======
--- /tmp/argocd-diff1830201432/prod-datasets-server-admin-live.yaml	2024-05-15 08:19:26.091461602 +0000
+++ /tmp/argocd-diff1830201432/prod-datasets-server-admin	2024-05-15 08:19:26.091461602 +0000
@@ -409,7 +409,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -456,7 +456,7 @@
           value: "9"
         - name: ADMIN_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-admin:sha-93252dd
+        image: huggingface/datasets-server-services-admin:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-api ======
--- /tmp/argocd-diff3065412814/prod-datasets-server-api-live.yaml	2024-05-15 08:19:26.131461422 +0000
+++ /tmp/argocd-diff3065412814/prod-datasets-server-api	2024-05-15 08:19:26.127461441 +0000
@@ -409,7 +409,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -468,7 +468,7 @@
           value: "9"
         - name: API_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-api:sha-93252dd
+        image: huggingface/datasets-server-services-api:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-rows ======
--- /tmp/argocd-diff1376145005/prod-datasets-server-rows-live.yaml	2024-05-15 08:19:26.155461315 +0000
+++ /tmp/argocd-diff1376145005/prod-datasets-server-rows	2024-05-15 08:19:26.155461315 +0000
@@ -452,7 +452,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -501,7 +501,7 @@
           value: "8080"
         - name: ROWS_INDEX_MAX_ARROW_DATA_IN_MEMORY
           value: "300_000_000"
-        image: huggingface/datasets-server-services-rows:sha-93252dd
+        image: huggingface/datasets-server-services-rows:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-search ======
--- /tmp/argocd-diff2988552074/prod-datasets-server-search-live.yaml	2024-05-15 08:19:26.179461207 +0000
+++ /tmp/argocd-diff2988552074/prod-datasets-server-search	2024-05-15 08:19:26.175461225 +0000
@@ -420,7 +420,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -477,7 +477,7 @@
           value: /tmp/duckdb-extensions
         - name: HF_HUB_ENABLE_HF_TRANSFER
           value: "1"
-        image: huggingface/datasets-server-services-search:sha-93252dd
+        image: huggingface/datasets-server-services-search:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-sse-api ======
--- /tmp/argocd-diff1714039001/prod-datasets-server-sse-api-live.yaml	2024-05-15 08:19:26.191461154 +0000
+++ /tmp/argocd-diff1714039001/prod-datasets-server-sse-api	2024-05-15 08:19:26.187461172 +0000
@@ -274,7 +274,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -315,7 +315,7 @@
           value: "1"
         - name: API_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-sse-api:sha-93252dd
+        image: huggingface/datasets-server-services-sse-api:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-storage-admin ======
--- /tmp/argocd-diff1879187260/prod-datasets-server-storage-admin-live.yaml	2024-05-15 08:19:26.199461119 +0000
+++ /tmp/argocd-diff1879187260/prod-datasets-server-storage-admin	2024-05-15 08:19:26.195461136 +0000
@@ -206,7 +206,7 @@
         helm.sh/chart: datasets-server
     spec:
       containers:
-      - image: huggingface/datasets-server-services-storage-admin:sha-93252dd
+      - image: huggingface/datasets-server-services-storage-admin:sha-af80a24
         imagePullPolicy: IfNotPresent
         name: prod-datasets-server-storage-admin
         resources:

===== apps/Deployment datasets-server/prod-datasets-server-worker-heavy ======
--- /tmp/argocd-diff3907243261/prod-datasets-server-worker-heavy-live.yaml	2024-05-15 08:19:26.223461011 +0000
+++ /tmp/argocd-diff3907243261/prod-datasets-server-worker-heavy	2024-05-15 08:19:26.219461029 +0000
@@ -543,7 +543,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -682,7 +682,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-93252dd
+        image: huggingface/datasets-server-services-worker:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-worker-light ======
--- /tmp/argocd-diff3025479138/prod-datasets-server-worker-light-live.yaml	2024-05-15 08:19:26.251460885 +0000
+++ /tmp/argocd-diff3025479138/prod-datasets-server-worker-light	2024-05-15 08:19:26.247460904 +0000
@@ -542,7 +542,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -681,7 +681,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-93252dd
+        image: huggingface/datasets-server-services-worker:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/prod-datasets-server-worker-medium ======
--- /tmp/argocd-diff2085610616/prod-datasets-server-worker-medium-live.yaml	2024-05-15 08:19:26.279460760 +0000
+++ /tmp/argocd-diff2085610616/prod-datasets-server-worker-medium	2024-05-15 08:19:26.275460778 +0000
@@ -542,7 +542,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -681,7 +681,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-93252dd
+        image: huggingface/datasets-server-services-worker:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== batch/CronJob datasets-server/prod-datasets-server-job-backfill ======
--- /tmp/argocd-diff2705324414/prod-datasets-server-job-backfill-live.yaml	2024-05-15 08:19:26.299460671 +0000
+++ /tmp/argocd-diff2705324414/prod-datasets-server-job-backfill	2024-05-15 08:19:26.295460689 +0000
@@ -214,7 +214,7 @@
             - name: COMMON_BLOCKED_DATASETS
               value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
             - name: COMMON_HF_ENDPOINT
               value: https://huggingface.co
             - name: HF_ENDPOINT
@@ -255,7 +255,7 @@
               value: backfill
             - name: LOG_LEVEL
               value: debug
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-93252dd
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-af80a24
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-backfill
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-backfill-retryable-errors ======
--- /tmp/argocd-diff2790997768/prod-datasets-server-job-backfill-retryable-errors-live.yaml	2024-05-15 08:19:26.311460617 +0000
+++ /tmp/argocd-diff2790997768/prod-datasets-server-job-backfill-retryable-errors	2024-05-15 08:19:26.311460617 +0000
@@ -215,7 +215,7 @@
             - name: COMMON_BLOCKED_DATASETS
               value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
             - name: COMMON_HF_ENDPOINT
               value: https://huggingface.co
             - name: HF_ENDPOINT
@@ -256,7 +256,7 @@
               value: backfill-retryable-errors
             - name: LOG_LEVEL
               value: debug
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-93252dd
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-af80a24
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-backfill-retryable-errors
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-cache-metrics-collector ======
--- /tmp/argocd-diff2744718419/prod-datasets-server-job-cache-metrics-collector-live.yaml	2024-05-15 08:19:26.319460582 +0000
+++ /tmp/argocd-diff2744718419/prod-datasets-server-job-cache-metrics-collector	2024-05-15 08:19:26.319460582 +0000
@@ -175,7 +175,7 @@
             - name: COMMON_BLOCKED_DATASETS
               value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
             - name: COMMON_HF_ENDPOINT
               value: https://huggingface.co
             - name: HF_ENDPOINT
@@ -188,7 +188,7 @@
                   optional: false
             - name: CACHE_MAINTENANCE_ACTION
               value: collect-cache-metrics
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-93252dd
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-af80a24
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-cache-metrics-collector
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-post-messages ======
--- /tmp/argocd-diff3490388547/prod-datasets-server-job-post-messages-live.yaml	2024-05-15 08:19:26.327460546 +0000
+++ /tmp/argocd-diff3490388547/prod-datasets-server-job-post-messages	2024-05-15 08:19:26.327460546 +0000
@@ -186,7 +186,7 @@
             - name: COMMON_BLOCKED_DATASETS
               value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
             - name: COMMON_HF_ENDPOINT
               value: https://huggingface.co
             - name: HF_ENDPOINT
@@ -211,7 +211,7 @@
               value: post-messages
             - name: LOG_LEVEL
               value: info
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-93252dd
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-af80a24
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-post-messages
             resources:

===== batch/CronJob datasets-server/prod-datasets-server-job-queue-metrics-collector ======
--- /tmp/argocd-diff2544686290/prod-datasets-server-job-queue-metrics-collector-live.yaml	2024-05-15 08:19:26.335460510 +0000
+++ /tmp/argocd-diff2544686290/prod-datasets-server-job-queue-metrics-collector	2024-05-15 08:19:26.335460510 +0000
@@ -175,7 +175,7 @@
             - name: COMMON_BLOCKED_DATASETS
               value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
             - name: COMMON_HF_ENDPOINT
               value: https://huggingface.co
             - name: HF_ENDPOINT
@@ -188,7 +188,7 @@
                   optional: false
             - name: CACHE_MAINTENANCE_ACTION
               value: collect-queue-metrics
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-93252dd
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-af80a24
             imagePullPolicy: IfNotPresent
             name: prod-datasets-server-queue-metrics-collector
             resources:

App: datasets-server-staging
YAML generation: Success 🟢
App sync status: Out of Sync ⚠️

===== apps/Deployment datasets-server/staging-datasets-server-admin ======
--- /tmp/argocd-diff2287032566/staging-datasets-server-admin-live.yaml	2024-05-15 08:19:26.819458344 +0000
+++ /tmp/argocd-diff2287032566/staging-datasets-server-admin	2024-05-15 08:19:26.815458362 +0000
@@ -402,7 +402,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -449,7 +449,7 @@
           value: "1"
         - name: ADMIN_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-admin:sha-93252dd
+        image: huggingface/datasets-server-services-admin:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-api ======
--- /tmp/argocd-diff1434976750/staging-datasets-server-api-live.yaml	2024-05-15 08:19:26.835458273 +0000
+++ /tmp/argocd-diff1434976750/staging-datasets-server-api	2024-05-15 08:19:26.835458273 +0000
@@ -399,7 +399,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -453,7 +453,7 @@
           value: "1"
         - name: API_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-api:sha-93252dd
+        image: huggingface/datasets-server-services-api:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-rows ======
--- /tmp/argocd-diff3305026773/staging-datasets-server-rows-live.yaml	2024-05-15 08:19:26.863458148 +0000
+++ /tmp/argocd-diff3305026773/staging-datasets-server-rows	2024-05-15 08:19:26.859458166 +0000
@@ -463,7 +463,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -507,7 +507,7 @@
           value: "8080"
         - name: ROWS_INDEX_MAX_ARROW_DATA_IN_MEMORY
           value: "300_000_000"
-        image: huggingface/datasets-server-services-rows:sha-93252dd
+        image: huggingface/datasets-server-services-rows:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-search ======
--- /tmp/argocd-diff2852572665/staging-datasets-server-search-live.yaml	2024-05-15 08:19:26.883458058 +0000
+++ /tmp/argocd-diff2852572665/staging-datasets-server-search	2024-05-15 08:19:26.879458076 +0000
@@ -430,7 +430,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -482,7 +482,7 @@
           value: /tmp/duckdb-extensions
         - name: HF_HUB_ENABLE_HF_TRANSFER
           value: "1"
-        image: huggingface/datasets-server-services-search:sha-93252dd
+        image: huggingface/datasets-server-services-search:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-sse-api ======
--- /tmp/argocd-diff420472095/staging-datasets-server-sse-api-live.yaml	2024-05-15 08:19:26.895458005 +0000
+++ /tmp/argocd-diff420472095/staging-datasets-server-sse-api	2024-05-15 08:19:26.891458022 +0000
@@ -283,7 +283,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -319,7 +319,7 @@
           value: "1"
         - name: API_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-sse-api:sha-93252dd
+        image: huggingface/datasets-server-services-sse-api:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-storage-admin ======
--- /tmp/argocd-diff2017741926/staging-datasets-server-storage-admin-live.yaml	2024-05-15 08:19:26.903457969 +0000
+++ /tmp/argocd-diff2017741926/staging-datasets-server-storage-admin	2024-05-15 08:19:26.903457969 +0000
@@ -206,7 +206,7 @@
         helm.sh/chart: datasets-server
     spec:
       containers:
-      - image: huggingface/datasets-server-services-storage-admin:sha-93252dd
+      - image: huggingface/datasets-server-services-storage-admin:sha-af80a24
         imagePullPolicy: IfNotPresent
         name: staging-datasets-server-storage-admin
         resources:

===== apps/Deployment datasets-server/staging-datasets-server-worker-all ======
--- /tmp/argocd-diff3675266281/staging-datasets-server-worker-all-live.yaml	2024-05-15 08:19:26.931457844 +0000
+++ /tmp/argocd-diff3675266281/staging-datasets-server-worker-all	2024-05-15 08:19:26.927457861 +0000
@@ -541,7 +541,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -680,7 +680,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-93252dd
+        image: huggingface/datasets-server-services-worker:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== apps/Deployment datasets-server/staging-datasets-server-worker-light ======
--- /tmp/argocd-diff2249460558/staging-datasets-server-worker-light-live.yaml	2024-05-15 08:19:26.955457736 +0000
+++ /tmp/argocd-diff2249460558/staging-datasets-server-worker-light	2024-05-15 08:19:26.951457754 +0000
@@ -541,7 +541,7 @@
         - name: COMMON_BLOCKED_DATASETS
           value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
         - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+          value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
         - name: COMMON_HF_ENDPOINT
           value: https://huggingface.co
         - name: HF_ENDPOINT
@@ -680,7 +680,7 @@
           value: "1"
         - name: WORKER_UVICORN_PORT
           value: "8080"
-        image: huggingface/datasets-server-services-worker:sha-93252dd
+        image: huggingface/datasets-server-services-worker:sha-af80a24
         imagePullPolicy: IfNotPresent
         livenessProbe:
           failureThreshold: 30

===== batch/CronJob datasets-server/staging-datasets-server-job-cache-metrics-collector ======
--- /tmp/argocd-diff3177727301/staging-datasets-server-job-cache-metrics-collector-live.yaml	2024-05-15 08:19:26.975457647 +0000
+++ /tmp/argocd-diff3177727301/staging-datasets-server-job-cache-metrics-collector	2024-05-15 08:19:26.971457665 +0000
@@ -173,7 +173,7 @@
             - name: COMMON_BLOCKED_DATASETS
               value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
             - name: COMMON_HF_ENDPOINT
               value: https://huggingface.co
             - name: HF_ENDPOINT
@@ -186,7 +186,7 @@
                   optional: false
             - name: CACHE_MAINTENANCE_ACTION
               value: collect-cache-metrics
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-93252dd
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-af80a24
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-cache-metrics-collector
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-post-messages ======
--- /tmp/argocd-diff1010011708/staging-datasets-server-job-post-messages-live.yaml	2024-05-15 08:19:26.983457612 +0000
+++ /tmp/argocd-diff1010011708/staging-datasets-server-job-post-messages	2024-05-15 08:19:26.979457629 +0000
@@ -185,7 +185,7 @@
             - name: COMMON_BLOCKED_DATASETS
               value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
             - name: COMMON_HF_ENDPOINT
               value: https://huggingface.co
             - name: HF_ENDPOINT
@@ -210,7 +210,7 @@
               value: post-messages
             - name: LOG_LEVEL
               value: info
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-93252dd
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-af80a24
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-post-messages
             resources:

===== batch/CronJob datasets-server/staging-datasets-server-job-queue-metrics-collector ======
--- /tmp/argocd-diff3851455485/staging-datasets-server-job-queue-metrics-collector-live.yaml	2024-05-15 08:19:26.991457575 +0000
+++ /tmp/argocd-diff3851455485/staging-datasets-server-job-queue-metrics-collector	2024-05-15 08:19:26.987457593 +0000
@@ -174,7 +174,7 @@
             - name: COMMON_BLOCKED_DATASETS
               value: open-llm-leaderboard/*,lunaluan/*,atom-in-the-universe/*,cot-leaderboard/cot-eval-traces,mitermix/yt-links,mcding-org/*
             - name: COMMON_DATASET_SCRIPTS_ALLOW_LIST
-              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,pufanyi/MIMICIT,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
+              value: '{{ALL_DATASETS_WITH_NO_NAMESPACE}},hf-internal-testing/dataset_with_script,togethercomputer/RedPajama-Data-1T,togethercomputer/RedPajama-Data-V2,gaia-benchmark/GAIA,poloclub/diffusiondb,mozilla-foundation/common_voice_*,google/fleurs,speechcolab/gigaspeech,espnet/yodas'
             - name: COMMON_HF_ENDPOINT
               value: https://huggingface.co
             - name: HF_ENDPOINT
@@ -187,7 +187,7 @@
                   optional: false
             - name: CACHE_MAINTENANCE_ACTION
               value: collect-queue-metrics
-            image: huggingface/datasets-server-jobs-cache_maintenance:sha-93252dd
+            image: huggingface/datasets-server-jobs-cache_maintenance:sha-af80a24
             imagePullPolicy: IfNotPresent
             name: staging-datasets-server-queue-metrics-collector
             resources:

Legend Status
The app is synced in ArgoCD, and diffs you see are solely from this PR.
⚠️ The app is out-of-sync in ArgoCD, and the diffs you see include those changes plus any from this PR.
🛑 There was an error generating the ArgoCD diffs due to changes in this PR.

@severo severo requested a review from lhoestq May 15, 2024 08:19
@severo severo merged commit 4740508 into main May 15, 2024
2 checks passed
@severo severo deleted the 2804-remove-one-exception-to-script-datasets branch May 15, 2024 12:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants