v0.6.0

@ish-hcc ish-hcc released this 30 Jun 10:20
93635bd

Release Notes v0.6.0

What's Changed

Major Features & Enhancements

☸️ Kubernetes Monitoring

  • feat: K8s node host agents — Telegraf (metrics) and Fluent Bit (logs) installed per node via privileged Jobs, with per-node metric selection by @ish-hcc in #327, #332
  • feat: Extend Insight, Alert, and Log to Kubernetes clusters and nodes by @ish-hcc in #329
  • fix: Resolve exec-plugin kubeconfigs (EKS/GKE/NKS) via a cb-tumblebug token broker, schedule Jobs onto the real node (instance-id → node name), and install the log agent as a Fluent Bit host binary so it works on EKS Amazon Linux nodes by @ish-hcc in #334, #340, #342
  • feat: Persist K8s node agent task state with a per-node lock so install/uninstall is consistent and shown across browser tabs by @ish-hcc in #348
  • fix: Cache the K8s cluster list and refresh it with a background warmer so the NS/Infra overview never pays the cold cb-spider discovery cost by @ish-hcc in #332, #345
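
The cluster-list cache with a background warmer can be pictured roughly as follows. This is a minimal Python sketch of the general pattern, not the project's implementation: `WarmedCache`, `warm_once`, and the 60-second interval are hypothetical names and values chosen for illustration. Readers only ever see the last warmed value, and only the warmer calls the slow loader.

```python
import threading
from typing import Callable, Generic, Optional, TypeVar

T = TypeVar("T")


class WarmedCache(Generic[T]):
    """Serve the last warmed value; only the warmer calls the slow loader."""

    def __init__(self, loader: Callable[[], T], interval_s: float = 60.0) -> None:
        self._loader = loader            # slow discovery call (hypothetical)
        self._interval_s = interval_s    # refresh period (assumed value)
        self._lock = threading.Lock()
        self._value: Optional[T] = None
        self._stop = threading.Event()

    def warm_once(self) -> None:
        # Call the loader outside the lock so readers never wait on it.
        value = self._loader()
        with self._lock:
            self._value = value

    def get(self) -> Optional[T]:
        # Returns None until the first warm has completed.
        with self._lock:
            return self._value

    def start(self) -> threading.Thread:
        def run() -> None:
            while not self._stop.is_set():
                try:
                    self.warm_once()
                except Exception:
                    pass  # keep serving the previous value if a refresh fails
                self._stop.wait(self._interval_s)

        thread = threading.Thread(target=run, daemon=True)
        thread.start()
        return thread

    def stop(self) -> None:
        self._stop.set()
```

In this sketch a caller that arrives before the first warm gets `None`, so warming once at startup (before serving requests) is what keeps the overview off the cold path.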

🔭 Distributed Tracing

  • feat: Add distributed tracing (Grafana Tempo) and rename MCI/VM to Infra/Node throughout by @ish-hcc in #302
  • feat: Windows OTel Java auto-instrumentation and Beyla-based tracing by @basinibi in #294, #274, #275, #276
  • feat: Add paging to the Tracing trace list by @ish-hcc in #308

🖥️ Web Frontend

  • feat: New React-based observability frontend by @ish-hcc in #278
  • feat: Namespace-scoped iframe UI for embedding consoles — cross-origin parent-namespace detection via postMessage, land on the Monitoring view, and remember the last selected namespace/section by @ish-hcc in #326, #335, #336, #337, #338, #339
  • feat: Make the Insight menu usable — anomaly detection, prediction, and server error analysis by @ish-hcc in #307, #300
  • fix: Display timestamps in the browser's local timezone and show point markers for sparse metric samples by @ish-hcc in #319, #314

🚨 Alert & Trigger

  • feat: Editable trigger policies — thresholds and settings after creation, multi-channel notifications by @ish-hcc in #318, #320
  • feat: Discord and Teams notification channels by @tryoo0607 in #298
  • feat: Kakao AlimTalk notifications via NCP-registered templates by @tryoo0607 in #312, #313, #324

🖥️ GPU & Insight

  • feat: GPU monitoring with DCGM Exporter by @ish-hcc in #299
  • feat: OpenAI-compatible LLM provider and base-URL support for Insight by @50thk in #323

⚡ Performance & Caching

  • feat: Caffeine-based monitoring metric + CSP cache with periodic, portal-range-aligned warming by @ish-hcc in #270, #271, #281, #287, #273
  • feat: Downsampling DB support across multiple InfluxDB instances for long-range queries by @ish-hcc in #264, #273
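
One way to think about downsampling across several InfluxDB instances is as a routing decision based on the requested time range. The sketch below is illustrative only: the tier names (`raw`, `downsampled_5m`, `downsampled_1h`) and the range thresholds are assumptions, not values taken from this release.

```python
from datetime import timedelta

# Hypothetical tiers: (largest query range served, database name).
TIERS = [
    (timedelta(hours=6), "raw"),
    (timedelta(days=7), "downsampled_5m"),
]
FALLBACK = "downsampled_1h"  # assumed coarsest tier for very long ranges


def pick_database(query_range: timedelta) -> str:
    """Return the first tier whose maximum range covers the query."""
    for max_range, name in TIERS:
        if query_range <= max_range:
            return name
    return FALLBACK
```

Short ranges stay on full-resolution data, while long ranges read fewer, pre-aggregated points.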

🏗️ Platform & CB Integration

  • refactor: Route all infrastructure calls through cb-tumblebug only (no direct cb-spider) and align with the v0.12.x Infra/Node model by @basinibi and @ish-hcc in #297, #291, #302
  • feat: Update cb-tumblebug / cb-spider across the v0.12.x line by @ish-hcc in #265, #288, #292, #295

Bug Fixes

  • fix: Harden VM/K8s agent install and status — node-id registration, cross-tab task guard, status caching, Azure AKS node-name mapping, and a 63-char Job-name limit; drop GROUP BY time() when the projection has no aggregate (empty graphs); bypass the apt sandbox so the K8s log agent installs on hardened nodes by @ish-hcc in #346, #347
  • fix: Surface backend-unreachable errors in the overview with a Retry instead of a silent empty state by @ish-hcc in #333
  • fix: Precise CSP "unsupported" labelling, persistent infra selector, and no agent buttons on non-running VMs by @ish-hcc in #341, #343
  • fix: Auto-create InfluxDB databases and reset stuck agent task status on manager startup by @ish-hcc in #303, #305
  • fix: Auto-derive VM-reachable host for Fluent Bit (Loki) and Beyla (Tempo) agents by @ish-hcc in #310
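
Two of the fixes above lend themselves to a short illustration: keeping generated Job names within the 63-character limit, and omitting `GROUP BY time()` when the projection has no aggregate (InfluxQL expects an aggregate or selector function alongside `GROUP BY time()`). The following Python sketch shows one plausible approach to each. The function names, the hash-suffix scheme, the fixed one-hour window, and the naive substring check are all assumptions made for illustration, not the project's code.

```python
import hashlib
import re

K8S_JOB_NAME_MAX = 63  # Kubernetes caps Job names at 63 characters

# Naive, non-exhaustive list of InfluxQL aggregate/selector prefixes (assumption).
AGGREGATES = ("mean(", "sum(", "count(", "max(", "min(", "first(", "last(", "median(")


def job_name(prefix: str, node: str, limit: int = K8S_JOB_NAME_MAX) -> str:
    """Build a DNS-label-safe Job name; shorten with a hash suffix if too long."""
    raw = re.sub(r"[^a-z0-9-]+", "-", f"{prefix}-{node}".lower()).strip("-")
    if len(raw) <= limit:
        return raw
    # The digest keeps truncated names distinct for different nodes.
    digest = hashlib.sha1(raw.encode()).hexdigest()[:8]
    head = raw[: limit - len(digest) - 1].rstrip("-")
    return f"{head}-{digest}"


def build_query(projection: str, measurement: str, interval: str) -> str:
    """Append GROUP BY time() only when the projection contains an aggregate."""
    query = f'SELECT {projection} FROM "{measurement}" WHERE time > now() - 1h'
    if any(fn in projection.lower() for fn in AGGREGATES):
        query += f" GROUP BY time({interval})"
    return query
```

For example, `build_query('"usage_idle"', "cpu", "1m")` yields a plain `SELECT` with no `GROUP BY` clause, while `build_query('mean("usage_idle")', "cpu", "1m")` keeps it.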

Full Changelog: v0.5.0...v0.6.0