Normalize machine names: drop redundant cloud prefix, normalize T-shirt sizes #881

Merged
alexey-milovidov merged 2 commits into main from normalize-machine-names
May 8, 2026
@alexey-milovidov
Member

Summary

For cloud-service result files whose .machine value contains a
memory size (GB / GiB) or a T-shirt size (XS / S / M / L / XL / NXL),
drop the redundant cloud-name prefix. The system field already
carries the cloud name, so repeating it inside .machine bloats the
dashboard's machine axis. Examples:

  • AlloyDB: 8 vCPU 64 GB → 8 vCPU 64 GB
  • ByteHouse: XS → XS
  • ClickHouse ☁️: 120GiB → 120GiB
  • Databricks: 2X-Large → 2XL (also normalized; see below)
  • Hydra: XL → XL
  • Snowflake: 2XL → 2XL
  • Supabase: 4XL → 4XL
  • Tablespace: L1 - 16CPU 32GB → L1 - 16CPU 32GB
  • Timescale ☁️: 16 vCPU 64GB → 16 vCPU 64GB
  • pgpro_tam: 16 vCPU 32GB → 16 vCPU 32GB

Also normalize T-shirt sizing variants to their letter form:

  • Small → S, Medium → M, Large → L
  • X-Small → XS, X-Large → XL
  • 2X-Small → 2XS, 2X-Large → 2XL, 3X-Large → 3XL,
    4X-Large → 4XL, 5X-Large → 5XL
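
The word-to-letter mapping above can be sketched as a single regular-expression substitution. This is only an illustration, not the script used in this PR: the function name normalize_tshirt and the pattern are assumptions, and the function is meant to be applied to managed-cloud labels only.

```python
import re

# Hypothetical pattern (not taken from the PR): either "<N>X-Small" / "<N>X-Large"
# with an optional numeric multiplier, or a bare "Small" / "Medium" / "Large".
_TSHIRT_WORD = re.compile(r"\b(\d*)X-(Small|Large)\b|\b(Small|Medium|Large)\b")


def normalize_tshirt(label: str) -> str:
    """Rewrite word-form T-shirt sizes to their letter form."""

    def repl(m: re.Match) -> str:
        if m.group(2):
            # "2X-Large" -> "2" + "X" + "L"; "X-Small" -> "" + "X" + "S"
            return f"{m.group(1)}X{m.group(2)[0]}"
        # Bare word: keep only its initial letter.
        return m.group(3)[0]

    return _TSHIRT_WORD.sub(repl, label)


if __name__ == "__main__":
    for sample in ["Small", "X-Large", "Databricks: 2X-Large", "ra3.4xlarge"]:
        print(sample, "->", normalize_tshirt(sample))
```

The match is case-sensitive and anchored on word boundaries, so an instance type such as ra3.4xlarge passes through unchanged.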

What's left alone

  • Bare-metal hardware descriptions (2x Intel Xeon ...,
    AMD EPYC ..., AWS c5.metal 100GB, Steam Deck 512 GB, etc.)
    — those are CPU/instance descriptions, not managed-cloud labels.
  • Aurora: 16acu (Aurora Capacity Unit, no GB)
  • Hologres: 16 CU (Compute Unit, no GB)
  • Redshift: ra3.4xlarge and similar (xlarge is part of an
    instance type name, not a T-shirt size)
  • SingleStore: S2 / SingleStore: S24 (Sn isn't the same as the
    XS/S/M/L/XL/NXL convention)
  • Motherduck: jumbo / mega / pulse / standard (tier names but
    not the T-shirt convention either)
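
A minimal sketch of the prefix rule and its exclusions, under stated assumptions: the function name drop_cloud_prefix and both patterns are hypothetical, the prefix is assumed to be separated by ": ", and word-form sizes such as 2X-Large are assumed to have been rewritten to letter form (2XL) beforehand.

```python
import re

# Assumed patterns, not taken from the PR:
# a memory size such as "64 GB" or "120GiB" ...
_MEMORY = re.compile(r"\d+\s*Gi?B\b")
# ... or a standalone T-shirt token: XS, S, M, L, XL, or NXS / NXL.
_TSHIRT = re.compile(r"\b\d*(?:XS|XL|S|M|L)\b")


def drop_cloud_prefix(machine: str) -> str:
    """Drop a leading '<cloud>: ' only when the rest names a GB/GiB or T-shirt size."""
    _prefix, sep, rest = machine.partition(": ")
    if not sep:
        # No "<cloud>: " prefix at all (e.g. bare-metal descriptions).
        return machine
    if _MEMORY.search(rest) or _TSHIRT.search(rest):
        return rest
    # Labels such as "16acu", "16 CU", "ra3.4xlarge", "S2", "jumbo" stay as they are.
    return machine


if __name__ == "__main__":
    for sample in ["ByteHouse: XS", "Aurora: 16acu", "SingleStore: S24"]:
        print(sample, "->", drop_cloud_prefix(sample))
```

Because the T-shirt pattern requires a word boundary right after the letter, S2 and S24 do not count as sizes, which matches the SingleStore exclusion above.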

Test plan

  • Spot-check the dashboard locally: rows for AlloyDB, ByteHouse,
    ClickHouse Cloud, Databricks, and Snowflake should group runs by
    their tier/size only, with no cloud name repeated on the
    machine axis.

🤖 Generated with Claude Code

alexey-milovidov and others added 2 commits May 8, 2026 19:56
…rt sizes

For cloud-service results whose .machine value contains a memory size
(GB / GiB) or a T-shirt size (XS / S / M / L / XL / NXL etc), drop
the redundant cloud-name prefix so the dashboard groups runs by the
actual size rather than the (system, machine) tuple. The system field
on each entry already carries the cloud name; repeating it inside
.machine just bloats the X axis.

Also normalize T-shirt sizing variants to their letter form:
  Small → S, Medium → M, Large → L,
  X-Small → XS, X-Large → XL,
  2X-Small → 2XS, 2X-Large → 2XL, 3X-Large → 3XL, 4X-Large → 4XL,
  5X-Large → 5XL.

Affected systems: AlloyDB, ByteHouse, CHYT, ClickHouse Cloud
(every aws/azure/gcp tier), CrunchyBridge, Databricks, Hydra,
Snowflake, Supabase, Tablespace, Timescale Cloud, pgpro_tam.

Bare-metal hardware descriptions (CPU model + RAM, "AWS c5.metal
100GB", etc) are left unchanged — the rule applies to managed-cloud
machine labels only.

Aurora's "16acu", Hologres' "16 CU", Redshift's "ra3.4xlarge", and
SingleStore's "S2"/"S24" don't match the GB or T-shirt-size pattern
and are also left alone.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Convert "<digits><space?>GB" → "<digits>GiB" in cloud-service machine
names. Where the value also carries an "<N> vCPU " prefix in front
of the GB amount, drop that prefix — the GiB tier already conveys
the size, so "8 vCPU 64 GB" simplifies to "64GiB".

Examples:
- "8 vCPU 64 GB" (AlloyDB) → "64GiB"
- "10 vCPU 40GB" (CHYT) → "40GiB"
- "720GB" (CHYT) → "720GiB"
- "Analytics-256GB" (Crunchy Bridge) → "Analytics-256GiB"
- "L1 - 16CPU 32GB" (Tablespace) → "L1 - 16CPU 32GiB"
  (16CPU is not "vCPU" so it stays)
- "8 vCPU 32GB" (Timescale ☁️) → "32GiB"
- "16 vCPU 32GB" / "30 vCPU 480GB" (pgpro_tam) → "32GiB" / "480GiB"
- "64 vCPU 256GB" (YDB) → "256GiB"

Bare-metal hardware descriptions in hardware/, versions/, gravitons/
(e.g. "AWS c5.metal 100GB", "Linode 16GB", "Steam Deck 512 GB",
"AMD EPYC 3.2 GHz, Micron 5100 MAX 960 GB") are left alone.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
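
The GB → GiB conversion described in the second commit message can be sketched as two substitutions. This is an assumption-laden illustration, not the commit's actual code: the name to_gib and both patterns are hypothetical. The function does not itself distinguish cloud labels from bare-metal descriptions, so it would have to be applied to cloud-service result files only.

```python
import re

# Hypothetical patterns (not taken from the commit):
# "<N> vCPU " immediately followed by a GB amount ...
_VCPU_BEFORE_GB = re.compile(r"\b\d+ vCPU (?=\d+ ?GB\b)")
# ... and "<digits><optional space>GB".
_GB = re.compile(r"\b(\d+) ?GB\b")


def to_gib(machine: str) -> str:
    """Drop an '<N> vCPU ' prefix in front of a GB amount, then rewrite GB as GiB."""
    machine = _VCPU_BEFORE_GB.sub("", machine)
    return _GB.sub(r"\1GiB", machine)


if __name__ == "__main__":
    for sample in ["8 vCPU 64 GB", "Analytics-256GB", "L1 - 16CPU 32GB"]:
        print(sample, "->", to_gib(sample))
```

The lookahead keeps "16CPU" in the Tablespace label, since only the exact "<N> vCPU " form is dropped, and values already written as GiB are not matched by the second pattern.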
@alexey-milovidov alexey-milovidov self-assigned this May 8, 2026
@alexey-milovidov alexey-milovidov merged commit 806441e into main May 8, 2026