fix(oss-health): correct April 2026 telemetry snapshot and pause fetch cron #508

Merged
tym83 merged 1 commit into main from fix/oss-health-april-counters on Apr 22, 2026

Conversation

@tym83
Contributor

@tym83 tym83 commented Apr 22, 2026

Summary

Manually correct the April 2026 telemetry snapshot on cozystack.io/oss-health/telemetry/ with live VictoriaMetrics values (112 clusters / 450 nodes / 444 tenants as of 2026-04-22) and pause the daily fetch cron until the upstream server fix ships.

Root cause: the telemetry-server /api/overview endpoint freezes the current-month snapshot at the first-of-month request. See cozystack/cozystack-telemetry-server#5 for the server fix. Until that is deployed, every fetch-telemetry.yml run would overwrite this correction with stale 43 / 164 / 83.
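The freeze described above can be pictured with a toy model. This is only an illustration of the reported behaviour, not the telemetry-server code: the class `FrozenOverview`, its `overview` method, and the field names `clusters`, `nodes`, and `tenants` are hypothetical. Only the numbers come from this PR.

```python
from datetime import date


class FrozenOverview:
    """Toy stand-in for the reported behaviour; not the real server code."""

    def __init__(self):
        # (year, month) -> counters captured on the first request of that month
        self._snapshots = {}

    def overview(self, today, live):
        key = (today.year, today.month)
        # Behaviour being illustrated: the first response of the month is
        # stored, and every later request is answered from the stored copy.
        if key not in self._snapshots:
            self._snapshots[key] = dict(live)
        return self._snapshots[key]


server = FrozenOverview()
first = server.overview(date(2026, 4, 1), {"clusters": 43, "nodes": 164, "tenants": 83})
later = server.overview(date(2026, 4, 22), {"clusters": 112, "nodes": 450, "tenants": 444})
print(later)  # {'clusters': 43, 'nodes': 164, 'tenants': 83}
```

Under this model a scheduled fetch on April 22 still receives the April 1 values, which is why an unpaused cron would undo a manual correction.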

What

  • static/oss-health-data/telemetry.json:
    • month (April 2026) → 112 / 450 / 444. Apps scaled linearly by the cluster ratio 112 / 43 ≈ 2.6047.
    • quarter and year averages recomputed: March derived from (2 × old_avg − stale_april), new average computed with the corrected April. Apps given the same treatment.
  • .github/workflows/fetch-telemetry.yml: schedule: block commented out with a TODO pointing at the upstream fix. workflow_dispatch: kept for manual refresh.
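The arithmetic behind the JSON changes can be sketched as follows. The function names are hypothetical, rounding to the nearest integer is an assumption (the PR does not say how fractional app counts were handled), and the `old_avg` value of 40.0 is made up for illustration. The formula `2 × old_avg − stale_april` implies the averaged window holds two months, March and April.

```python
CLUSTERS_STALE = 43   # frozen April value served by the API
CLUSTERS_LIVE = 112   # live VictoriaMetrics value as of 2026-04-22
ratio = CLUSTERS_LIVE / CLUSTERS_STALE  # about 2.6047


def scale_app(stale_count: int) -> int:
    """Scale a per-app count by the cluster ratio (nearest-integer rounding assumed)."""
    return round(stale_count * ratio)


def corrected_two_month_average(old_avg: float, stale_april: float, live_april: float) -> float:
    """Recover March from the old two-month average, then re-average with live April."""
    march = 2 * old_avg - stale_april
    return (march + live_april) / 2


print(round(ratio, 4))                              # 2.6047
print(scale_app(5))                                 # 13
print(corrected_two_month_average(40.0, 43, 112))   # 74.5 (40.0 is a hypothetical old average)
```

The `scale_app(5)` result matches the Redis change (5→13) reported in the CodeRabbit summary on this PR. Equivalently, the new average is the old average plus half the April correction.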

Why

Without pausing the cron the next 08:00 UTC run would revert this commit. Keeping workflow_dispatch: means we can trigger the fetch on-demand once the server fix is live, to verify it returns live data before re-enabling the schedule.

Test plan

  • Hugo build + Netlify preview show April 112 / 450 / 444 and scaled apps on /oss-health/telemetry/
  • GitHub Actions page shows Fetch Telemetry Data with only manual trigger
  • Once the server fix is deployed: trigger the workflow manually, confirm it produces a diff of 0 (or within rounding) against this committed JSON, then re-enable the daily schedule in a follow-up PR
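The last check ("a diff of 0, or within rounding") could be automated with a small comparison helper. This is a minimal sketch: the helper `numeric_diffs`, the tolerance of 1.0, and the nested keys in the sample data are assumptions, since the actual layout of telemetry.json is not shown in this PR. Non-numeric leaves such as `updated_at` are skipped on purpose because the timestamp is expected to change.

```python
def numeric_diffs(committed, fetched, tolerance=1.0, path=""):
    """Return descriptions of numeric leaves that differ by more than `tolerance`."""
    diffs = []
    if isinstance(committed, dict) and isinstance(fetched, dict):
        for key in sorted(set(committed) | set(fetched)):
            child = f"{path}.{key}" if path else str(key)
            if key not in committed or key not in fetched:
                diffs.append(f"{child}: present on one side only")
            else:
                diffs.extend(numeric_diffs(committed[key], fetched[key], tolerance, child))
    elif isinstance(committed, list) and isinstance(fetched, list):
        if len(committed) != len(fetched):
            diffs.append(f"{path}: list length {len(committed)} -> {len(fetched)}")
        else:
            for index, (old, new) in enumerate(zip(committed, fetched)):
                diffs.extend(numeric_diffs(old, new, tolerance, f"{path}[{index}]"))
    elif isinstance(committed, (int, float)) and isinstance(fetched, (int, float)):
        if abs(committed - fetched) > tolerance:
            diffs.append(f"{path}: {committed} -> {fetched}")
    # Strings and other leaves (for example timestamps) are ignored.
    return diffs


# Hypothetical sample data; only the April numbers come from this PR.
committed = {"updated_at": "2026-04-22T18:11:05Z",
             "month": {"clusters": 112, "nodes": 450, "tenants": 444}}
fetched = {"updated_at": "2026-04-23T08:00:00Z",
           "month": {"clusters": 113, "nodes": 450, "tenants": 460}}
print(numeric_diffs(committed, fetched))  # ['month.tenants: 444 -> 460']
```

In practice both arguments would be loaded with `json.load` from the committed file and from the output of a manually triggered run. Lists are compared by position, so a reordered app roster would show up as differences even when the counts agree.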

Summary by CodeRabbit

  • Chores
    • Updated system health and telemetry metrics with latest data

…h cron

The telemetry-server /api/overview endpoint freezes its current-month
snapshot at the first request of the month, so April 2026 has been
serving 43 clusters / 164 nodes / 83 tenants since April 1 while live
VictoriaMetrics shows 112 / 450 / 444 as of 2026-04-22. Update the
cached month values on-site with the live numbers, scale apps linearly
by the cluster ratio (112 / 43), and recompute quarter / year averages
so the rolled-up periods reflect the correction too.

Pause the daily fetch cron until the server-side fix ships
(cozystack/cozystack-telemetry-server: current-month snapshots must be
regenerated on every request and never persisted). Without the pause,
tomorrow's 08:00 UTC run would pull the stale API response and revert
this commit. workflow_dispatch is kept for manual refresh.

Related server-side fix: cozystack/cozystack-telemetry-server branch
fix/current-month-live-and-tenant-label also corrects the total_tenants
query (kind="tenant" -> kind="Tenant") which was silently returning 0.

Signed-off-by: tym83 <6355522@gmail.com>
@netlify

netlify Bot commented Apr 22, 2026

Deploy Preview for cozystack ready!

Name Link
🔨 Latest commit ab06e60
🔍 Latest deploy log https://app.netlify.com/projects/cozystack/deploys/69e9125c0c2df300089262a9
😎 Deploy Preview https://deploy-preview-508--cozystack.netlify.app

@coderabbitai
Contributor

coderabbitai Bot commented Apr 22, 2026

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 4159a05c-c537-45d8-b465-d2c845091ab8

📥 Commits

Reviewing files that changed from the base of the PR and between 78f644b and ab06e60.

📒 Files selected for processing (2)
  • .github/workflows/fetch-telemetry.yml
  • static/oss-health-data/telemetry.json

📝 Walkthrough

Walkthrough

The GitHub Actions workflow disables automatic daily telemetry data fetching by commenting out the cron schedule, while preserving manual trigger capability. Simultaneously, telemetry metrics are updated with refreshed timestamps and revised cluster, node, tenant, and application deployment counts.

Changes

| Cohort / File(s) | Summary |
|---|---|
| GitHub Actions Workflow (.github/workflows/fetch-telemetry.yml) | Disabled scheduled cron trigger (previously 0 8 * * *) by commenting out the schedule block; workflow_dispatch remains active for manual execution. |
| Telemetry Data (static/oss-health-data/telemetry.json) | Updated updated_at timestamp and refreshed all period summaries (month, quarter, year) with new cluster, node, and tenant counts. Per-app deployment values revised across multiple applications; app roster reordered with entries like Redis (5→13) and modified placements for Qdrant, ClickHouse, and NATS. |

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Poem

🐰 We've paused the daily fetch, no more the morning call,
Yet telemetry data flows—refreshed through it all!
Redis hops to thirteen, clusters shine anew,
Manual triggers ready when you need them to—who knew?
📊✨

Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Code Review

This pull request updates the telemetry data in telemetry.json, including timestamps and various usage metrics for clusters, nodes, tenants, and applications. A critical issue was identified: the workflow modification mentioned in the PR description to pause the daily fetch cron is missing from the changes, which would result in these manual updates being overwritten by the next scheduled run.

@@ -1,5 +1,5 @@
 {
-  "updated_at": "2026-04-17T19:38:08Z",
+  "updated_at": "2026-04-22T18:11:05Z",

high

The pull request description states that the .github/workflows/fetch-telemetry.yml file was modified to comment out the schedule block and pause the daily fetch cron. However, this file is missing from the current pull request. Without this change, the manual corrections in telemetry.json will be overwritten by the next scheduled run of the telemetry fetch workflow. Please include the workflow changes in this PR to ensure the fix persists.

@tym83 tym83 marked this pull request as ready for review April 22, 2026 18:26
@tym83 tym83 requested review from kvaps and lllamnyp as code owners April 22, 2026 18:26
@tym83 tym83 merged commit 5415111 into main Apr 22, 2026
5 of 6 checks passed
@tym83 tym83 deleted the fix/oss-health-april-counters branch April 22, 2026 18:26

1 participant