v1.1.0
What's Changed
- Updated Dockerfile by @gaspardpetit in #61
- (c) 2025 by @gaspardpetit in #62
- Refer to HUGGINGFACE_TOKEN not TOKEN_HUGGINGFACE by @gaspardpetit in #64
- Refer to HUGGINGFACE_TOKEN not TOKEN_HUGGINGFACE by @gaspardpetit in #65
- Use
uvfor development and packaging by @linozen in #60 - Updated gitignore by @gaspardpetit in #66
- Cleanup uv merge by @gaspardpetit in #67
- WIP: Add support for voice separation using pyannote/AMI by @gaspardpetit in #69
- Add support for voice separation using pyannote/AMI by @gaspardpetit in #70
- Configure ruff format (allow up to 150 characters per line) by @gaspardpetit in #71
- Rename utterances by @gaspardpetit in #72
- Update README.md - docker run by @mayeulk in #74
- Fix/dockerfile by @gaspardpetit in #77
- WIP: Integrate
pywhispercppby @linozen in #76 - Fix/improve pywhispercpp by @gaspardpetit in #87
- add mlx-whisper by @linozen in #88
- Fix/improve pywhispercpp by @gaspardpetit in #89
- pylint fixes for mlx-whisper by @gaspardpetit in #90
- Fix/fix configure languages by @gaspardpetit in #91
- Feat/support m4a by @gaspardpetit in #92
- Fix/add dev dependencies by @gaspardpetit in #97
- run workflows on windows and macos by @gaspardpetit in #98
- Feat/use mps on apple silicon by @gaspardpetit in #99
- ruff check; add check in workflow by @gaspardpetit in #100
- fix - diarization fails when the filename contains a space character by @gaspardpetit in #101
- Fixed type assignment based on pyright linter by @gaspardpetit in #102
- Add ids to utterances by @gaspardpetit in #103
- Improve json format by @gaspardpetit in #104
- Add stereo diarization strategy by @linozen in #106
- Add new output format writer for
diarizationlmand some tests based upon it by @linozen in #95 - adding code coverage by @gaspardpetit in #105
- Pylint ruff style fixes by @gaspardpetit in #107
- Minor fixes by @linozen in #108
- Fix "could not broadcast" exception when window is full by @gaspardpetit in #112
- Use ABC as abstract base class by @gaspardpetit in #113
- linter fixes by @gaspardpetit in #114
- Expose the number of channels of audio sources by @gaspardpetit in #115
- Feat/support more than two channels by @gaspardpetit in #116
- Feat/support multiple separation strategies by @gaspardpetit in #117
- Allow each source to have its own diarization by @gaspardpetit in #120
- Improve txt formatting (support speaker, language, probability, and t… by @gaspardpetit in #119
- Feat/Add support for separation by channel by @gaspardpetit in #121
- refactor: reuse convert_to_wav, remove convert_mp3_to_wav by @gaspardpetit in #122
- Add diarization postprocessing via LLM served by Ollama by @linozen in #126
- Fix number of speakers when args.diarize is set without a value by @gaspardpetit in #123
- Double check language when transcription is outside of detection window by @gaspardpetit in #124
- Log start of audio source transcript by @gaspardpetit in #131
- Addinig sample_to_timestr and unittests by @gaspardpetit in #130
- Adding unittests from diarizationml by @gaspardpetit in #132
- log audio offset in addition to duration by @gaspardpetit in #133
- Read json back to utterances; adding unit tests for json transcript by @gaspardpetit in #134
- Cherry pick from #125 by @gaspardpetit in #135
- Use github caches to accelerate workflows by @gaspardpetit in #136
- Simplify transcribe function (no functional change) by @gaspardpetit in #137
- Fix/fix repeated words on zero duration timestamps by @gaspardpetit in #138
- Adding silence based sentence tokenizer and unittests by @gaspardpetit in #140
- Added create_joint_speaker_sources function by @gaspardpetit in #141
- Feat/skip silence until next word by @gaspardpetit in #142
- Feat/do not advance beyond next word by @gaspardpetit in #143
- Feat/force smart acknowledge of long utterances by @gaspardpetit in #144
- Use dotenv to parse .env file by @gaspardpetit in #145
- simplify main by @gaspardpetit in #146
- Doc/verbatim cli by @gaspardpetit in #147
- Updated README and add architecture document by @gaspardpetit in #149
- Release/update dependencies by @gaspardpetit in #148
- Updated version to 1.1.0 by @gaspardpetit in #150
New Contributors
Full Changelog: v1.0.2...v1.1.0