Instructions for running the cli version? #140
Labels
bug
Something isn't working
documentation
Improvements or additions to documentation
enhancement
Improves existing code
good first issue
Good for newcomers
Is the word
whisperkit-cli
missing from the README?If I don't include it, I get
error: no executable product named 'transcribe'
.Transcription seems to be pretty slow, with no use of the GPU.
The output is a wall of text, with some capitalisation anomalies.
Using the mlx whisper, you can add timestamps to the output, so that if two people are speaking, the transcript starts each change of speaker on a new line. Is the same capability available here?
I'm not sure what MP3 formats are supported? I got a
Error when transcribing /Users/xxx.mp3: loadAudioFailed("Unable to resample audio")
from a stereo 44.1 kHz .mp3 file.I'm not sure whether I'm using the large-v3 for 30s clips, or the one for full length transcripts.
The text was updated successfully, but these errors were encountered: