Releases: ThioJoe/Auto-Synced-Translated-Dubs
Releases · ThioJoe/Auto-Synced-Translated-Dubs
0.14.1
🎉 [0.14.0] Major New Features:
-
Synthesizing with Azure no longer requires two-passes
- Azure now supports a new SSML tag that lets you specify the exact desired duration of an audio clip before it is processed. This eliminates the need to do a second pass to correct the speed.
- This should make the overall process much faster, because it requires half the API calls, and no stretching of audio, all while keeping the highest voice quality.
- NOTE: It seems that at the moment some of the tags the scripts use (namely
Leading-exact
andTailing-exact
), which determines the amount of silence at the beginning and end of a clip, do not function perfectly. So there may be a slight amount of extra silence between some sentences (probably around 60ms). This shouldn't be a dramatic amount, but if it bothers you, in the mean time you can go back to using the 0.13.1 release which uses the two-pass technique. I've submitted a question about this and it has been forwarded to a team at Microsoft, so hopefully they will fix it soon.
🟢 0.14.0 → 0.14.1:
- Fixed sys import error for several tool scripts.
- Slightly improved message displayed while waiting for Azure batch process
0.14.0
🎉 [0.14.0] Major New Features:
-
Synthesizing with Azure no longer requires two-passes
- Azure now supports a new SSML tag that lets you specify the exact desired duration of an audio clip before it is processed. This eliminates the need to do a second pass to correct the speed.
- This should make the overall process much faster, because it requires half the API calls, and no stretching of audio, all while keeping the highest voice quality.
- NOTE: It seems that at the moment some of the tags the scripts use (namely
Leading-exact
andTailing-exact
), which determines the amount of silence at the beginning and end of a clip, do not function perfectly. So there may be a slight amount of extra silence between some sentences (probably around 60ms). This shouldn't be a dramatic amount, but if it bothers you, in the mean time you can go back to using the 0.13.1 release which uses the two-pass technique. I've submitted a question about this and it has been forwarded to a team at Microsoft, so hopefully they will fix it soon.
0.13.1
🎉 [0.13.0] New Features:
-
Support for Azure Exact Comma Pause Time
- New config setting
azure_comma_pause
. Similar toazure_sentence_pause
, this allows you to set the exact pause time of the voice after a comma.
- New config setting
🟢 0.13.0 → 0.13.1:
- Moved Google Authentication to auth.py. Fixed logic so won't try to authorize if neither Google Translate nor Google TTS will be used.
- Fixed case where target and original language are the same, so it doesn't send a translate request to cloud
0.13.0
0.12.0
🎉 [0.12.0] New Features:
-
New Tool: Subtitle Track Remover
- Found in Tools folder as
SubtitleTrackRemover.py
- Fetches the subtitle tracks of a video, and allows the user to select which ones to delete. Good if needing to replace a subtitles track with an updated one.
- Found in Tools folder as
📈 [0.12.0] Other Improvements
-
Created new folders called
Scripts
andTools
for better organization. Tools are standalone while Scripts are used as part of the main program. The configs stay in the main root directory along withmain.py
. -
Script now shows progress for how many out of the total languages have been processed so far
0.11.2
[0.11.0] 📈 Improvements:
-
Importing of Pre-Translated SRT Files When Translation is Skipped
- Now when
skip_translation
is enabled in the config, the script will instead search for pre-existing translated srt files in the relevant output folder for each language and load it in. - Therefore you can use
stop_after_translation
, then inspect the translated subtitle files before they are synthesized. Then when you are ready, run the script withskip_translation
enabled.
- Now when
0.11.0 → 0.11.1:
- Now if skip_translation is enabled, the
translate_service
setting doesn't matter. So no translation service is needed if only synthesizing voice. - Cleaned up and consolidated many variables for easier future changes
🟢 0.11.1 → 0.11.2:
- Fixed the text replacement functionality (like dont translate) for languages using unicode characters like Hindi ( #52 )
0.11.1
[0.11.0] 📈 Improvements:
-
Importing of Pre-Translated SRT Files When Translation is Skipped
- Now when
skip_translation
is enabled in the config, the script will instead search for pre-existing translated srt files in the relevant output folder for each language and load it in. - Therefore you can use
stop_after_translation
, then inspect the translated subtitle files before they are synthesized. Then when you are ready, run the script withskip_translation
enabled.
- Now when
🟢 0.11.0 → 0.11.1:
- Now if skip_translation is enabled, the
translate_service
setting doesn't matter. So no translation service is needed if only synthesizing voice. - Cleaned up and consolidated many variables for easier future changes
0.11.0
📈 [0.11.0] Improvements:
-
Importing of Pre-Translated SRT Files When Translation is Skipped
- Now when
skip_translation
is enabled in the config, the script will instead search for pre-existing translated srt files in the relevant output folder for each language and load it in. - Therefore you can use
stop_after_translation
, then inspect the translated subtitle files before they are synthesized. Then when you are ready, run the script withskip_translation
enabled.
- Now when
0.10.0
🎉 [0.10.0] New Features:
-
Manual Word Translations
- Allows you to set manual translations of chosen words or phrases. See
READ_THIS.txt
file inSSML_Customization
folder.
- Allows you to set manual translations of chosen words or phrases. See
-
Phonetic Pronunciation
- Added ability to specify exact phonetic pronunciation of chosen words or phrases. See
READ_THIS.txt
file inSSML_Customization
folder.
- Added ability to specify exact phonetic pronunciation of chosen words or phrases. See
-
URL Handling
- Added ability to list URLs that might be included in the text, so that it will be better pronounced in the TTS stage. See
READ_THIS.txt
file inSSML_Customization
folder.
- Added ability to list URLs that might be included in the text, so that it will be better pronounced in the TTS stage. See
📈 [0.10.0] Other Improvements
-
Fixed bug that resulted in Google Translate and DeepL translating words that were set to not be translated.
0.9.2
🎉 [0.9.0] New Features:
-
Dynamically Named Output Folders
- There will be a new
Outputs
directory, and in that, every output will get its own folder based on the original video file name
- There will be a new
-
Added ability to stop certain words/phrases from being translated
- In the new
SSML_Customization
folder, seedont_translate_phrases.txt
- In the new
-
Added support for aliases with TTS (Azure Only)
- You can now use
aliases.csv
in theSSML_Customization
folder to change what should be spoken instead of certain words or phrases - For example, you can make "btw" be spoken as "by the way"
- See here and
SSML_Customization/READ_THIS.txt
for more details
- You can now use
-
Added support for 'interpret-as' SSML parameter (Azure Only)
- You can now use
interpret-as.csv
in theSSML_Customization
folder to specify how a word or phrase should be spoken - For example, if there is a word TEST, you can have it sounded out as 'T E S T'
- This will also avoid misinterpretations of abbreviations, such as where the Portuguese TTS voice pronounced "RJ45" as "Rio de Janeiro 45"
- See here and
SSML_Customization/READ_THIS.txt
for more details - There is also an example csv file
- You can now use
📈 [0.9.0] Other Improvements
-
Moved many translation related functions from
main.py
into their own file calledtranslate.py
. Eventually I will better organize all these scripts.
0.9.0 → 0.9.1:
- Fixed subtitle combining feature which was not behaving as intended
🟢 0.9.1 → 0.9.2:
- Fixed
<span>
notranslate tags not being removed after translation