AllTalk v1.9c

erew123 released this 28 Mar 21:57
· 110 commits to main since this release
6c20f45

Quite a large update, in preparation for a more structured application and future possibilities.

  • TTS Generator - Various interface bugs & filtering options cleaned up.
  • TTS Generator - TTSDiff now scans generated text and TTS for errors.
  • TTS Generator - TTSSRT now creates subtitle files for video production, e.g. for a YouTube video.
  • Finetune - Now uses a customised tokenizer to deal with Japanese.
  • Finetune - Pre-flight checks and warning messages.
  • Finetune - Extra documentation and warnings.
  • Entire file structure has been re-organised to simplify management and future changes.
  • Documentation (built-in and GitHub) has been rewritten/tidied up.
  • Requirements files have been cleaned up and simplified.
  • ATsetup has been re-written as necessary with additional options.
  • Diagnostics now performs some additional checks.
  • DeepSpeed moved up to version 0.14.
  • Standalone Application moved to PyTorch 2.2.1.
  • Nvidia CUDA Toolkit installation is NO LONGER needed (other than to compile DeepSpeed on Linux).
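
For readers unfamiliar with the subtitle files mentioned in the TTSSRT item, the following is a minimal sketch of the general SRT layout: a cue number, a `start --> end` timestamp line, the text, then a blank line between cues. It is not AllTalk's implementation; the function names `format_timestamp` and `build_srt` and the sample cues are hypothetical and exist only for illustration.

```python
def format_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def build_srt(cues) -> str:
    """Build SRT text from (start_seconds, end_seconds, text) tuples.

    Hypothetical helper: cues are numbered from 1 and separated by a blank line.
    """
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        timing = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{timing}\n{text}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    # Sample cues invented for this example.
    print(build_srt([(0.0, 2.5, "Hello there."), (2.5, 5.0, "Welcome back.")]))
```

The timings would normally come from the length of each generated audio clip; here they are hard-coded purely to show the file layout.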

Tested on Linux and Windows.

65 changed files with 10,298 additions and 300 deletions.

If you download and use the ZIP file from here, it will NOT be linked to this GitHub repository and so CANNOT be updated automatically with a git pull in future.
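
One rough way to tell which kind of install you have is to look for Git metadata in the install folder: a folder created by `git clone` contains a `.git` entry, while an extracted ZIP does not. The sketch below assumes that convention; the function name `can_git_pull` and the example path are hypothetical and are not part of AllTalk.

```python
from pathlib import Path


def can_git_pull(install_dir) -> bool:
    """Return True if install_dir looks like a Git checkout.

    Only checks for a `.git` entry (a directory in a normal clone,
    a file in a worktree or submodule). It does not verify the remote.
    """
    return (Path(install_dir) / ".git").exists()


if __name__ == "__main__":
    # Hypothetical path; replace with your own install location.
    folder = Path("alltalk_tts")
    if can_git_pull(folder):
        print(f"{folder} is a Git checkout; `git pull` can update it.")
    else:
        print(f"{folder} has no .git entry; it was likely installed from a ZIP.")
```

If the check reports no `.git` entry, updating means downloading a newer ZIP or reinstalling from a clone of the repository.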