
feat: add persistence for latency events and memory stats #42

Merged
KIvanow merged 11 commits into master from improved-persistance on Mar 9, 2026

Conversation

@KIvanow
Member

KIvanow commented Mar 8, 2026

  • Add storage, polling, API endpoints, and frontend support for two new persisted data sources: latency snapshots (LATENCY LATEST) and memory snapshots (MEMORY STATS). Both poll at 60s intervals and support 7-day retention via the existing data-retention mechanism.

Note

Medium Risk
Adds new background pollers plus new DB tables/queries across Postgres/SQLite/in-memory storage, so schema/migration and retention behavior could impact production data and performance if incorrect.

Overview
Adds persisted latency and memory analytics to the API and UI.

On the backend, introduces LatencyAnalyticsModule and MemoryAnalyticsModule with 60s multi-connection pollers that store LATENCY LATEST snapshots + LATENCY HISTOGRAM data and periodic memory snapshots (including ops/sec and derived CPU deltas), exposes new read endpoints (/latency-analytics/*, /memory-analytics/snapshots), and extends StoragePort plus the Postgres/SQLite/memory adapters (new tables + queries + pruning; also drops legacy unique constraints for slow/command logs).

On the frontend, adds API clients/types and a date-range filter that switches Dashboard and Latency views from live polling to fetching stored snapshots/histograms; also fixes time-filter refetching to be connection-aware in SlowLog.

Written by Cursor Bugbot for commit e015d58.

 - Add storage, polling, API endpoints, and frontend support for two new persisted data sources: latency snapshots (LATENCY LATEST) and memory snapshots (MEMORY STATS). Both poll at 60s intervals and support 7-day retention via the existing data-retention mechanism.
…unit tests

 - Extend MemoryStats interface with optional fields (usedMemoryRss, memFragmentationRatio, maxmemory, allocatorFragRatio) to eliminate unsafe as any casts. Add parseOptionalInt to both analytics controllers to reject non-numeric query params with 400 errors.
 - Hydrate latency dedup state from stored snapshots on startup to prevent duplicate insertions after restart. Add 33 unit tests covering both services and controllers.
…x memory chart

- Fix memory chart showing zeros by switching from MEMORY STATS (broken
  dotted-key access) to INFO memory for all fields.
- Fix multi-section INFO calls by spreading args instead of joining.
- Extend memory snapshots with opsPerSec, cpuSys, cpuUser; wire up
  OpsChart and CpuChart to use stored data when date-filtered.
- Add latency histogram persistence (new table + adapters) so command
  latency charts populate from stored data when filtering.
- Add currentConnection?.id to useEffect deps for proper refetch on
  connection change.
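The "spreading args instead of joining" fix above comes down to how the INFO command's arguments are assembled before being sent to Redis; a minimal sketch of the difference (the `sendCommand`-style string-array shape here is a stand-in, not this project's actual client API):

```typescript
// Redis 7+ accepts multiple INFO sections as separate arguments.
// Joining them into one string produces a single argument that
// matches no section name, so Redis falls back to its default output.
type Command = string[];

function infoJoined(sections: string[]): Command {
  // BROKEN: sends one argument, "memory stats cpu"
  return ['INFO', sections.join(' ')];
}

function infoSpread(sections: string[]): Command {
  // CORRECT: each section travels as its own argument
  return ['INFO', ...sections];
}

console.log(infoJoined(['memory', 'stats', 'cpu'])); // 2 args total
console.log(infoSpread(['memory', 'stats', 'cpu'])); // 4 args total
```

The same spread-vs-join distinction applies regardless of which Redis client library is in use, since all of them ultimately serialize an argument array into the RESP protocol.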

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…Int, guard latency polling with RuntimeCapabilityTracker
…ored data on connection switch, add histogram tests
```ts
      this.storage.pruneOldLatencyHistograms(cutoffTimestamp, connectionId),
    ]);
    return snapshots + histograms;
  }
```

Latency service pruneOldEntries returns sum but tests expect snapshot-only count

Low Severity

LatencyAnalyticsService.pruneOldEntries returns snapshots + histograms (combined count from both prune operations), but data-retention.service.ts already calls pruneOldLatencySnapshots and pruneOldLatencyHistograms separately on the storage layer. This means the data-retention service's latency_snapshots and latency_histograms entries in pruneOps won't use pruneOldEntries at all — they call storage directly. The pruneOldEntries method on the service is effectively unused by the retention system.
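One way to make the service-level prune method unambiguous for both callers and tests would be to return per-table counts rather than a combined sum; a minimal sketch, with the storage interface simplified to a hypothetical stand-in for the real StoragePort methods:

```typescript
// Sketch: prune both latency tables in parallel but report the counts
// separately, so tests can assert on snapshots and histograms
// independently instead of on an opaque sum.
interface LatencyStorage {
  pruneOldLatencySnapshots(cutoff: number): Promise<number>;
  pruneOldLatencyHistograms(cutoff: number): Promise<number>;
}

async function pruneOldEntries(
  storage: LatencyStorage,
  retentionMs: number, // e.g. 7 days for the existing retention policy
): Promise<{ snapshots: number; histograms: number }> {
  const cutoff = Date.now() - retentionMs;
  const [snapshots, histograms] = await Promise.all([
    storage.pruneOldLatencySnapshots(cutoff),
    storage.pruneOldLatencyHistograms(cutoff),
  ]);
  return { snapshots, histograms };
}
```

Alternatively, since the retention service already calls the two storage methods directly, the unused service method could simply be removed.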



…ined for API query params, fix pruneOldEntries test

```ts
protected async pollConnection(ctx: ConnectionContext): Promise<void> {
  try {
    const info = await ctx.client.getInfoParsed();
```

Memory polling fetches all INFO sections unnecessarily

Low Severity

pollConnection calls ctx.client.getInfoParsed() without specifying sections, which fetches the entire Redis INFO output (server, clients, memory, persistence, stats, replication, cpu, modules, keyspace, cluster, commandstats, errorstats, latencystats). Only memory, stats, and cpu are used. Since this runs every 60 seconds for every connection, it adds unnecessary network and Redis overhead. Passing ['memory', 'stats', 'cpu'] to getInfoParsed would reduce the response size significantly.
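For context on why limiting sections shrinks the work, the raw INFO reply is a flat text document of `# Section` headers and `key:value` lines, and only three of its sections feed the memory snapshot. A self-contained sketch of parsing that format (the project's own `getInfoParsed` may differ in shape):

```typescript
// Parse Redis INFO output into { section: { key: value } }.
// The input format ("# Section" headers, "key:value" lines) is the
// standard INFO reply; fetching only ['memory', 'stats', 'cpu']
// yields a much smaller document with the same usable fields.
function parseInfo(raw: string): Record<string, Record<string, string>> {
  const result: Record<string, Record<string, string>> = {};
  let section = '';
  for (const line of raw.split('\n')) {
    const trimmed = line.trim(); // also strips the \r of CRLF replies
    if (!trimmed) continue;
    if (trimmed.startsWith('#')) {
      section = trimmed.slice(1).trim().toLowerCase();
      result[section] = {};
      continue;
    }
    const idx = trimmed.indexOf(':');
    if (idx > 0 && section) {
      result[section][trimmed.slice(0, idx)] = trimmed.slice(idx + 1);
    }
  }
  return result;
}

const sample = '# Memory\nused_memory:1024\n\n# CPU\nused_cpu_sys:1.5\n';
const parsed = parseInfo(sample);
console.log(parsed.memory.used_memory); // '1024'
```

Dotted keys such as `mem_fragmentation_ratio` arrive as plain top-level keys within their section, which is consistent with the PR's switch away from MEMORY STATS "dotted-key access."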


…nstead of relying on DEFAULT gen_random_uuid(). This makes them consistent with the SQLite and memory adapters.

@cursor cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 1 potential issue.

Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.

```ts
    }
  } catch (error) {
    this.logger.error(`Error capturing latency histogram for ${ctx.connectionName}: ${error instanceof Error ? error.message : 'Unknown error'}`);
  }
```

Histogram errors log at ERROR level every poll cycle

Medium Severity

The getLatencyHistogram() call (Redis 7+ only via LATENCY HISTOGRAM) catches errors and logs at logger.error level, but unlike the getLatestLatencyEvents handler, it never calls runtimeCapabilityTracker.recordFailure. For Redis/Valkey instances pre-7.0 that support LATENCY LATEST but not LATENCY HISTOGRAM, this produces an error-level log message every 60 seconds indefinitely, since the capability is never disabled. The events handler correctly integrates with the tracker to eventually suppress polling after repeated failures, but the histogram handler lacks this same mechanism.
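The suppression pattern the events handler already uses can be sketched as a small failure counter: after N consecutive failures the capability is treated as unsupported and polling skips it. This is a simplified stand-in for the project's actual RuntimeCapabilityTracker, whose API may differ:

```typescript
// Sketch: track consecutive failures per capability; once a threshold
// is crossed, isSupported() returns false and the poller can skip the
// command (and downgrade its logging) instead of erroring every 60s.
class CapabilityTracker {
  private failures = new Map<string, number>();

  constructor(private readonly maxFailures = 3) {}

  recordFailure(capability: string): void {
    this.failures.set(capability, (this.failures.get(capability) ?? 0) + 1);
  }

  recordSuccess(capability: string): void {
    // Any success resets the counter, so transient errors don't
    // permanently disable a capability the server actually supports.
    this.failures.delete(capability);
  }

  isSupported(capability: string): boolean {
    return (this.failures.get(capability) ?? 0) < this.maxFailures;
  }
}

const tracker = new CapabilityTracker(3);
tracker.recordFailure('latency-histogram');
tracker.recordFailure('latency-histogram');
console.log(tracker.isSupported('latency-histogram')); // true
tracker.recordFailure('latency-histogram');
console.log(tracker.isSupported('latency-histogram')); // false
```

Wiring the histogram handler's catch block to `recordFailure` (and gating the poll on `isSupported`) would give pre-7.0 Redis/Valkey servers the same graceful degradation the LATENCY LATEST handler already has.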


@KIvanow KIvanow merged commit 1b95ebf into master Mar 9, 2026
3 checks passed
@KIvanow KIvanow deleted the improved-persistance branch March 9, 2026 08:52
@github-actions github-actions bot locked and limited conversation to collaborators Mar 9, 2026
