Releases · aTrainTranscription/aTrain

Release list

v1.4.1 Latest

Latest

JuergenFleiss released this 28 Jan 17:10

v1.4.1

a618b4b

Bugfix Release!

Fixes...

speaker recognition running on CPU when it should run on GPU. Thanks @Biga73 for reporting #109
increase in repetitions in transcripts (restores default temperature behavior). Thanks @KlausFarmer for reporting #111
transcription folder button not working in some linux distros. Thanks @Dorianux for reporting #113

Contributors

Dorianux, Biga73, and KlausFarmer

Assets 2

v1.4.0

JuergenFleiss released this 22 Jan 09:59

v1.4.0

c569a71

This new major release of aTrain features a complete rewrite of the user interface in NiceGUI, which greatly simplified the codebase.

This also fixed issues with GPU-transcription not producing output #84

In addition, this new version includes:

Improved speaker diarization using the newest pyannote.audio 4.0.3
Support for Nvidia RTX 50xx series

Assets 2

v1.3.1

ArminHaberl released this 15 May 08:43

v1.3.1

c969214

Bugfixes only in this release.

This release fixes some bugs regarding error handling in child processes and updates the torch dependency to address recent security concerns.

Assets 2

v1.3.0

JuergenFleiss released this 31 Mar 14:29

v1.3.0

19688e5

Major Bugfixes and Development Guidelines

Those were some rough months. Updating the backend to the newest upstream software and faster-whisper models as well as our rewrite of the codebase, seperating frontend GUI and backend in atrain_core, together intrudeced similar looking bugs that each only appeared with some of the files.

Thanks to the community feedback and provision of problematic files, we found our own backend bug and then, later, also the upstream bug in faster whisper.

This should now run smoothly.

We also introduced guidelines on how we will do branching in the future and how to contribute.

Thanks for providing feedback, logs, and problematic files for testing:
@Beefthong @Biga73 @OhsawaSenpai @allan-simon @architeck @aereimer @happypig @adlihm @OhsawaSenpai @wenyuan-wu @MonikaBarget @fstolba

Contributors

allan-simon, fstolba, and 9 other contributors

Assets 2

v1.2.1

andreafooo released this 24 Jan 10:29

v1.2.1

6454a91

Release v.1.2.1

We are happy to announce the release of our newest improvements. This version includes some additional features and bug fixes, for example:

Replacement of multiprocessing with stoppable threads
Possibility to add an initial prompt in advanced settings thanks to the contribution of @Bytestorm5
Add float32 to possible inputs
Add note for macOS users that file names with white spaces can lead to errors
Fix model hash error
Update FAQs

Contributors

Bytestorm5

Assets 2

v1.2.0

JuergenFleiss released this 26 Nov 15:27

v1.2.0

7c09f2e

Release v1.2.0: Engine Upgrade

With this release, we also welcome @andreafooo to our team who did a lot of the heavy lifting for it; glad to have you! What also makes us really happy is the increased involvement of all the contributors for the growing aTrain user community.

Also, as a shameless plug, it is really cool that aTrain is now recommended at Harvard University (and many other Universities worldwide) for the local transcription of sensitive audio material

But on to the release, and it is a big one, as it includes a completely rewritten backend, lots of new features and improvements and the use of faster-large-v3-turbo as the default model.

Major Improvements

Completely new backend aTrain_core
Support for faster-whisper large-v3
New default model for a great balance between speed and accuracy: faster-whisper large-v3-turbo
Support for distil-whisper models single language models for large speed improvement. Currently only for English and marked as beta as there is an illusive bug that leads to sometimes not showing the completion in the GUI, while the transcript is already finished and available in the folder.
Introduction of our Model Manager: Download only models you really use, reducing the installer size greatly (we include speaker diarization model and faster-large-v3-turbo as the defaults)
Halfed installer size thanks to model manager
Rewrote transcription time estimate function, now estimated live and it should be very accurate

Minor improvements

Updated faster-whisper to 1.0.2
Updated pyannote-audio to 3.2.0

Major bugfixes

aTrain no longer crashes when having special characters in the filename. Thanks to @wenyuan-wu and @hirowa for figuring this out and to @SjDayg for fixing it
Numpy error when installing fixed, thanks @samfisherirl

Contributors

wenyuan-wu, hirowa, and 2 other contributors

Assets 2

v1.1.0

ArminHaberl released this 09 Nov 12:16

v1.1.0

3ce7fb6

This version of aTrain introduces two new features:

Advanced settings for specifying the compute device (CPU vs. GPU) and the compute type (int8 vs. float16) used for transcription.
Additional output format: SRT files for subtitles

The code base of this release is also restructured to a proper Python project installable with pip.

Assets 2

v1.0.0

ArminHaberl released this 06 Oct 11:15

v1.0.0

8bb7c53

Initial release

Assets 2

Uh oh!

Releases: aTrainTranscription/aTrain

Release list

v1.4.1

Contributors

Uh oh!

v1.4.0

Uh oh!

v1.3.1

Bugfixes only in this release.

Uh oh!

v1.3.0

Major Bugfixes and Development Guidelines

Contributors

Uh oh!

v1.2.1

Release v.1.2.1

Contributors

Uh oh!

v1.2.0

Release v1.2.0: Engine Upgrade

Contributors

Uh oh!

v1.1.0

Uh oh!

v1.0.0

Uh oh!