Skip to content

coverage: footnote/decimal artifacts pollute auto-generated report (#3) #16

Description

@Seungpyo1007

Problem

The auto-generated coverage report (#3) listed non-SKU garbage entries, e.g.:

  • 1200-4 — from a Wikipedia reference marker [4]
  • 1200-af-16-c — from 1200 (AF) [ 16 ] [ c ]
  • 1-25, 1-50 — from decimal clock cells (1.25, 1.50)

Cause

wikitable_first_cells grabbed each row's first cell via get_text, keeping <sup> reference markers and bracketed footnotes; slugify then turned them into -4 / -16-c suffixes. Decimal cells slugified to <digits>-<digits>.

Fix

  • Strip <sup> reference nodes and leftover [...] text in wikitable_first_cells (benefits all categories).
  • Reject decimal-derived <digits>-<digits> slugs via a shared is_probable_model_slug filter used by the CPU and GPU sources.
  • Offline parser tests cover both cases.

The report itself (#3) is regenerated by re-running the coverage workflow.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No fields configured for Bug.

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions