Metric description mapping by Demonstrandum · Pull Request #66 · Demonstrandum/tensorbored

Demonstrandum · 2026-03-06T14:21:00Z

Motivation for features / changes

Previously, hovering over a run name in the dashboard often displayed "Multiple descriptions per run" for metrics. This was confusing because descriptions are intended to be a global 1:1 mapping from metric name to description, local to a profile. The issue arose because the system collected descriptions from individual run event metadata, which could vary, leading to a composite "Multiple descriptions" text. Additionally, profile-defined metric descriptions were only used as fallbacks and did not override event metadata.

This change ensures that metric descriptions are consistently 1:1 and that profile definitions take precedence, aligning with the intended behavior.

Technical description of changes

Removed "Multiple descriptions" logic: The functions _get_tag_description_info and _build_combined_description were removed from metrics_plugin.py, eliminating the generation of composite descriptions.
Enforced 1:1 description mapping: _get_tag_to_description in metrics_plugin.py was simplified to always produce a single description per tag. If multiple runs provide different descriptions for the same tag, the description from the alphabetically first run is deterministically chosen.
Profile descriptions override event metadata: _merge_profile_tag_descriptions in metrics_plugin.py was modified to ensure that profile-defined metricDescriptions always take precedence over any descriptions found in event metadata.
Updated tests:
- test_tags_conflicting_description and test_tags_unsafe_conflicting_description in metrics_plugin_test.py were updated to expect a single, deterministic description instead of the composite.
- A new test, test_tags_profile_descriptions_override_event_descriptions, was added to verify that profile descriptions correctly override event metadata descriptions.

Screenshots of UI changes (or N/A)

N/A (Backend logic change affecting tooltip text)

Detailed steps to verify changes work correctly (as executed by you)

Conflicting descriptions: Create multiple runs where the same metric has different summary_description values in their event metadata. Observe that the dashboard tooltip for this metric now shows a single, deterministic description (from the alphabetically first run that provided one), not the "Multiple descriptions" composite.
Profile override: Create runs where a metric has a summary_description in its event metadata, but also define a different description for the same metric in the user profile. Observe that the dashboard tooltip for this metric displays the description from the profile, overriding the event metadata description.
Run unit tests: Execute the updated unit tests in metrics_plugin_test.py to ensure all new and modified test cases pass. (This was done via a standalone test script during development due to environment constraints).

Alternate designs / implementations considered (or N/A)

Considered attempting full integration tests, but opted for targeted unit tests and a standalone test script due to complexities with Bazel, TensorFlow, and proto compilation in the development environment. The core design choice to prioritize profile descriptions and enforce a 1:1 mapping was maintained.

The old behavior collected descriptions per-tag from each run's event metadata. When the same tag had different summary_description strings across runs, _build_combined_description() generated a confusing '# Multiple descriptions / ## For run: ...' composite text. This contradicts TensorBored's model where metric descriptions are a global 1:1 mapping from metric name to description (set via the profile's metricDescriptions). Two fixes: 1. Profile descriptions now override event-metadata descriptions (previously they only filled gaps for tags with no event description). 2. When no profile description exists and event metadata has per-run variation, pick the description from the alphabetically first run instead of compositing. Remove _get_tag_description_info and _build_combined_description (dead code after this change). Co-authored-by: Samuel <samuel@knutsen.co>

cursor · 2026-03-06T14:21:01Z

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
_{Learn more about Cursor Agents}

github-actions · 2026-03-06T14:31:25Z

Preview Deployment


Status	✅ Running
Live Preview	https://Demonstrandum-tensorbored-pr-66.hf.space
Space	https://huggingface.co/spaces/Demonstrandum/tensorbored-pr-66

Details

Wheel: tensorbored_nightly-2.21.0a20260312-py3-none-any.whl
Commit: 4664037
Build status: success

When different runs provide different summary_description text for the same metric tag, the backend now returns them as tagRunDescriptions alongside the global tagDescriptions. The global description (from profile or first-run fallback) is always shown at the top. Backend: - Add _get_per_run_tag_descriptions() which collects per-run descriptions only for tags where descriptions actually differ. - Include tagRunDescriptions in the /tags response (omitted when empty). Frontend data flow: - Add TagToRunDescriptions type - Thread tagRunDescriptions through backend types, data source, store types, reducers, and test helpers. UI (scalar, histogram, image cards): - When per-run descriptions exist, replace the plain text matTooltip with a rich popover showing the global description at top and a tab strip below where each tab corresponds to a run with its own description text. - When no per-run descriptions exist, behavior is unchanged (simple matTooltip with tag + description). - Shared popover styles live in _common.scss. Co-authored-by: Samuel <samuel@knutsen.co>

- Don't assign undefined to optional properties (TS2375/TS2412); conditionally spread tagRunDescriptions only when defined. - Run Prettier on changed .ts/.ng.html files. - Run Black on metrics_plugin_test.py. Co-authored-by: Samuel <samuel@knutsen.co>

…tier 2.4.1 - Change tagRunDescriptions observable/input from nullable to empty-object default, avoiding pipe() overload resolution issues with null unions. - Reformat with project-pinned Prettier 2.4.1 (not latest 3.x). Co-authored-by: Samuel <samuel@knutsen.co>

Co-authored-by: Samuel <samuel@knutsen.co>

- Tag name shown in gray/secondary-text weight, description in primary text color with medium weight -- description is the useful info, tag name is just context. - Always use the rich popover when a description exists (not just for per-run descriptions). matTooltip only fires as a plain tag fallback when there is no description at all. - Remove now-unused getTagTooltip/buildTagTooltip from card components. Co-authored-by: Samuel <samuel@knutsen.co>

cursor · 2026-03-11T15:44:17Z

      <tb-truncated-path
        class="tag"
-        [matTooltip]="getTagTooltip(tag, tagDescription)"
+        [matTooltip]="tagDescription ? '' : tag"


Dual tooltip appears when only run descriptions exist

Medium Severity

The matTooltip binding tagDescription ? '' : tag only suppresses the native tooltip when tagDescription is truthy, but the custom popover's *ngIf also activates when hasRunDescriptions is true. When a tag has per-run descriptions but no global tagDescription, both the matTooltip (showing the tag name) and the custom popover (also showing the tag name plus run descriptions) render simultaneously on hover. The condition needs to account for hasRunDescriptions as well, e.g. (tagDescription || hasRunDescriptions) ? '' : tag.

Additional Locations (2)

tensorbored/webapp/metrics/views/card_renderer/histogram_card_component.ng.html#L24-L25

tensorbored/webapp/metrics/views/card_renderer/image_card_component.ng.html#L26-L27

Write a diagnostics/convergence_rate scalar via the native writer with a description that varies per run (includes optimizer, lr, batch size, expected convergence step). This makes the per-run description tab view exercisable in the PR preview deployment. Co-authored-by: Samuel <samuel@knutsen.co>

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

cursor · 2026-03-12T10:55:33Z

+  }
+
+  selectRunTab(run: string) {
+    this.selectedRunTab = run;


Identical popover logic duplicated across three card components

Low Severity

The properties descriptionTooltipVisible, selectedRunTab, and the methods hasRunDescriptions, runDescriptionEntries, selectedRunDescription, showDescriptionTooltip, hideDescriptionTooltip, and selectRunTab are copy-pasted identically across ScalarCardComponent, HistogramCardComponent, and ImageCardComponent. The same HTML popover template is also duplicated in all three .ng.html files. This triples the maintenance surface — any future fix (like the matTooltip condition bug) needs updating in three places.

Additional Locations (2)

tensorbored/webapp/metrics/views/card_renderer/histogram_card_component.ts#L63-L95

tensorbored/webapp/metrics/views/card_renderer/image_card_component.ts#L69-L101

cursoragent and others added 3 commits March 6, 2026 14:51

Demonstrandum marked this pull request as ready for review March 11, 2026 13:50

cursoragent and others added 2 commits March 11, 2026 13:51

Fix CI: add tagRunDescriptions to storeForm test fixture

a4a6563

Co-authored-by: Samuel <samuel@knutsen.co>

cursor Bot reviewed Mar 11, 2026

View reviewed changes

cursor Bot reviewed Mar 12, 2026

View reviewed changes

Demonstrandum merged commit 6547c78 into master Mar 16, 2026
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metric description mapping#66

Metric description mapping#66
Demonstrandum merged 7 commits into
masterfrom
cursor/metric-description-mapping-f8b3

Demonstrandum commented Mar 6, 2026

Uh oh!

cursor Bot commented Mar 6, 2026

Uh oh!

github-actions Bot commented Mar 6, 2026 •

edited

Loading

Uh oh!

cursor Bot Mar 11, 2026

Uh oh!

cursor Bot left a comment

Uh oh!

cursor Bot Mar 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Demonstrandum commented Mar 6, 2026

Motivation for features / changes

Technical description of changes

Screenshots of UI changes (or N/A)

Detailed steps to verify changes work correctly (as executed by you)

Alternate designs / implementations considered (or N/A)

Uh oh!

cursor Bot commented Mar 6, 2026

Uh oh!

github-actions Bot commented Mar 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Preview Deployment

Uh oh!

cursor Bot Mar 11, 2026

Choose a reason for hiding this comment

Dual tooltip appears when only run descriptions exist

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot Mar 12, 2026

Choose a reason for hiding this comment

Identical popover logic duplicated across three card components

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions Bot commented Mar 6, 2026 •

edited

Loading