Skip to content

Releases: lukaszliniewicz/Pandrator

0.0.7

23 Apr 02:54
78b9693
Compare
Choose a tag to compare

This release addresses primarily VoiceCraft and its recent updates. It adds VoiceCraft model selection to the GUI as well as advanced generation settings. The selected model will be downloaded automatically.

The .exe one-click installer files were created using pyinstaller from pandrator-start-minimal_xtts.py, pandrator_start_minimal_silero.py and pandrator_start_minimal_voicecraft.py in the repository. Please remember to run them as an administrator if you want them to install git, ffmpeg and/or calibre.

It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

0.0.6

15 Apr 02:46
1ba6886
Compare
Choose a tag to compare

New features:

  • Support for EPUB files using ebook-convert from Calibre.

Improvements:

  • Enhanced sentence splitting logic.

The .exe one-click installer files were created using pyinstaller from pandrator-start-minimal_xtts.py, pandrator_start_minimal_silero.py and pandrator_start_minimal_voicecraft.py in the repository. Please remember to run them as an administrator if you want them to install git, ffmpeg and/or calibre.

It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

0.0.5

07 Apr 01:21
42419c5
Compare
Choose a tag to compare

New features:

  • The ability to use PDF files as input (they are converted to .txt, and before the final conversion happens you can see a preview, enable or disable paragraph retention and edit the text). It is not perfect and uses a relatively simple conversion method, so results may vary depending on the layout complexity of the input PDF and other factors. You may use the LLM workflow to try and remove OCR artifacts/misspelled words etc. I'm looking for a better conversion method, and if you have any suggestions, please let me know.
  • The option to select an external (remote) XTTS server, for example hosted on a service like RunPod or a Google Colab like this one created by the author of XTTS Api Server.

Fixes:

  • Corrected dependencies (ffmpeg-python).
  • Improved lowering of the original track's volume during subtitle speech segments when mixing the synchronized audio output with a video track.
  • Minor UI improvements.

The .exe files were created using pyinstaller from pandrator-start-minimal_xtts.py, pandrator_start_minimal_silero.py and pandrator_start_minimal_voicecraft.py in the repository. Please remember to run them as an administrator.

It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

0.0.4

03 Apr 22:53
5709376
Compare
Choose a tag to compare
  • Added the VoiceCraft model through the VoiceCraft API Server I made for this purpose. You can install Pandrator with VoiceCraft using the pandrator_start_minimal_voicecraft.exe.
  • It is now possible to generate speech from an .srt file and automatically mix it with a video's sound track.

The .exe files were created using pyinstaller from pandrator-start-minimal_xtts.py, pandrator_start_minimal_silero.py and pandrator_start_minimal_voicecraft.py in the repository. Please remember to run them as an administrator.

It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

0.0.3

30 Mar 02:56
463c80e
Compare
Choose a tag to compare

I added processing of .srt subtitle files. If a subtitle file is chosen, the output will be synchronized to its timestamps, effectively creating dubbing that you can, for example, mix with a video's sound track.
Additionally, you may now choose the TTS engine - XTTS or Silero. Silero produces worse results (though still decent), but can run on a cpu, so it is recommended for lower-end systems.
There are two executables now - one installs (and later runs) Pandrator with the XTTS Api Server, the other with the Silero API server. You may of course install another TTS server manually later.
The .exe files were created using pyinstaller from pandrator-start-minimal_xtts.py and pandrator_start_minimal_silero.py in the repository. Please remember to run them as an administrator. It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

0.0.2

26 Mar 02:42
53d78bf
Compare
Choose a tag to compare

I added the option to edit sentences from the GUI, addressed some playback issues and added a few tooltips explaining several options in the GUI.
The .exe file was created using pyinstaller from pandrator-start-minimal.py in the repository. Please remember to run it as an administrator. It's possible that your antivirus software flags it as a threat. In that case, add it as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

Initial release

22 Mar 17:22
848791b
Compare
Choose a tag to compare

This is the initial release. The .exe was created using pyinstaller from pandrator-start-minimal.py in the repository. Please remember to run it as an administrator. It's possible that your antivirus software flags it as a threat. In that case, add it as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.