Skip to content

Fix cpWER#77

Merged
EduardoPach merged 4 commits intomainfrom
eduardo/fix-cpwer
Nov 21, 2025
Merged

Fix cpWER#77
EduardoPach merged 4 commits intomainfrom
eduardo/fix-cpwer

Conversation

@EduardoPach
Copy link
Copy Markdown
Collaborator

What does this PR do?

This PR fixes the ConcatenatedMinimumPermutationWER metric, which would incorrectly compute cpWER hypothesis_speakers < reference_speakers in the Greedy matching path, and for the Hungarian Matcher path, cpWER would be incorrectly computed for hypothesis_speakers > reference_speakers.

We spotted this while running evaluations on callhome-english, therefore, I took the references and hypotheses and added them as tests. Since the callhome-english license doesn't allow redistributing the transcriptions, these files exist on a private HF repository.

Although I was able to fix the implementation using MeetEval cpWER implementation as baseline (same amount of edits per word, although some edits were classified differently compared to jiwer), I decided to add meeteval as a new dependency, as they provide a bunch of other metrics implementations. Their implementation is battle-tested, and as a bonus, they have a pretty neat visualization tool that we could consider using in the future.

@EduardoPach EduardoPach requested review from atiorh and dbrkn November 21, 2025 15:28
Copy link
Copy Markdown
Contributor

@dbrkn dbrkn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@EduardoPach EduardoPach merged commit 8800d6b into main Nov 21, 2025
2 checks passed
@EduardoPach EduardoPach deleted the eduardo/fix-cpwer branch November 21, 2025 19:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants