Use CutSet for whisper annotation workflow #834

desh2608 · 2022-10-03T14:23:32Z

This PR allows transcribing a CutSet with the whisper workflow. It is currently WIP because it depends on #832 and #822, but I have verified that it works.

desh2608 · 2022-10-05T16:29:34Z

@pzelasko I have separated it into two helper functions. Let me know if it looks reasonable now.
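
A minimal sketch of such a split, assuming the public entry point dispatches on the manifest type. `Recording`, `Cut`, `annotate`, `_annotate_recordings`, and `_annotate_cuts` are simplified, hypothetical stand-ins rather than the code in this PR, and the transcription step is replaced by a placeholder callable.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, Tuple, Union


@dataclass
class Recording:  # hypothetical stand-in for a recording manifest entry
    id: str
    duration: float


@dataclass
class Cut:  # hypothetical stand-in for a cut (a span within a recording)
    id: str
    recording_id: str
    start: float
    duration: float


# Placeholder for the model call: (recording_id, offset, duration) -> text.
Transcriber = Callable[[str, float, float], str]


def _annotate_recordings(
    recordings: Iterable[Recording], transcribe: Transcriber
) -> Iterator[Tuple[str, str]]:
    # A recording is transcribed in full, starting at offset 0.
    for rec in recordings:
        yield rec.id, transcribe(rec.id, 0.0, rec.duration)


def _annotate_cuts(
    cuts: Iterable[Cut], transcribe: Transcriber
) -> Iterator[Tuple[str, str]]:
    # A cut is transcribed only over its own span of the recording.
    for cut in cuts:
        yield cut.id, transcribe(cut.recording_id, cut.start, cut.duration)


def annotate(
    manifest: Iterable[Union[Recording, Cut]], transcribe: Transcriber
) -> Iterator[Tuple[str, str]]:
    # Dispatch to the helper that matches the manifest's item type.
    items = list(manifest)
    if all(isinstance(item, Recording) for item in items):
        yield from _annotate_recordings(items, transcribe)
    elif all(isinstance(item, Cut) for item in items):
        yield from _annotate_cuts(items, transcribe)
    else:
        raise ValueError("Manifest must contain only recordings or only cuts.")
```

Keeping the two paths separate means each helper handles offsets and durations for its own input type, while callers still see a single entry point.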

pzelasko · 2022-10-05T18:19:01Z

lhotse/workflows/whisper.py

+                id=f"{cut.id}-{segment['id']:06d}",
+                recording_id=cut.recording_id,
+                start=round(segment["start"], ndigits=8),
+                duration=round(segment["end"], ndigits=8),


I think with this single change we can prevent timestamps from going beyond the cut's duration; can you also include _postprocess_timestamps in the cut-based workflow?

Suggested change

-                duration=round(segment["end"], ndigits=8),
+                duration=max(cut.duration, round(segment["end"], ndigits=8)),

BTW, why are we using "end" as the duration? Shouldn't it be "end-start"?

Because in my own testing I discovered that their "end" is actually a "duration" 😂 but if you could triple-check that I got it right, maybe using some longer recording, that'd be great.

So I ran the model on one of the AMI headset recordings (~5000s) and it seems like the "end" actually shows the end of the segment, not the duration. Here is the JSON containing the results["segments"]: https://drive.google.com/file/d/169igkcDY2SmMs5k3hOhHip89T4MQDnKs/view?usp=sharing
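
If "end" is indeed an end time, a supervision's duration would be `end - start` rather than `end`, and keeping a segment inside the cut means capping its end with `min` rather than `max`. The following is a minimal sketch under those assumptions. `Span` and `segment_to_span` are hypothetical helpers, not lhotse or whisper APIs, and the segment dict only mirrors the "id", "start", and "end" keys used in the diff above.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Span:  # hypothetical stand-in for a supervision segment
    id: str
    start: float
    duration: float


def segment_to_span(
    cut_id: str, cut_duration: float, segment: Dict
) -> Optional[Span]:
    # Treat "start" and "end" as times relative to the beginning of the cut.
    start = max(0.0, segment["start"])
    # Cap the end at the cut boundary so the span never exceeds the cut.
    end = min(cut_duration, segment["end"])
    if end <= start:
        # The segment lies entirely outside the cut; nothing to keep.
        return None
    return Span(
        id=f"{cut_id}-{segment['id']:06d}",
        start=round(start, ndigits=8),
        # Duration is the difference of the two timestamps, not "end" itself.
        duration=round(end - start, ndigits=8),
    )


# A segment that overshoots a 10-second cut is trimmed to end at 10.0.
span = segment_to_span("cut1", 10.0, {"id": 3, "start": 8.5, "end": 10.4})
# span.id == "cut1-000003", span.start == 8.5, span.duration == 1.5
```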

pzelasko · 2022-10-05T19:54:58Z

Thanks, LGTM

jtrmal and others added 12 commits September 30, 2022 15:34

kaldi: add an switch/option to read the durations from kaldi utt2dur instead of touching the audio file

340ffbb

declare the option properly in @click

9a8fd3b

add more tests

6ed8b5f

add the file to fixtures

7e75f90

Document the change and change the defaults

9b8ebb5

fix test failure mode due to change of defaults

0677aa9

use cutset for whisper workflow

0d7fa2c

Merge branch 'master' into feature/whisper_cut

857e250

resolve conflicts

f114887

remove kaldi changes

6b1db93

Merge branch 'feature/whisper_cut' of https://github.com/desh2608/lhotse into feature/whisper_cut

dedc7dc

Merge branch 'master' into feature/whisper_cut

fc40902

pzelasko reviewed Oct 5, 2022

lhotse/workflows/whisper.py

desh2608 changed the title ~~WIP: Use CutSet for whisper annotation workflow~~ Use CutSet for whisper annotation workflow Oct 5, 2022

desh2608 added 4 commits October 4, 2022 21:24

Merge branch 'master' of https://github.com/lhotse-speech/lhotse into feature/whisper_cut

77b61e2

different methods for annotating recordings and cuts

ae464af

remove merge supervisions

9c389d9

Merge branch 'feature/whisper_cut' of https://github.com/desh2608/lhotse into feature/whisper_cut

4528ffb

pzelasko reviewed Oct 5, 2022

ensure supervisions do not exceed cut boundary

38095b1

pzelasko added this to the v1.9 milestone Oct 5, 2022

pzelasko merged commit 41c44e8 into lhotse-speech:master Oct 5, 2022

desh2608 mentioned this pull request Oct 6, 2022

Whisper workflow supervision end may be incorrect #839

Closed

desh2608 deleted the feature/whisper_cut branch November 2, 2023 19:18
