ForcedAlignment is a graphical tool to automatically synchronize audio and text.
Contact: fandrei2000@yahoo.com
ForcedAlignment generates a html file with an audio player and a list, from a text file and an audio file. The text and audio file must match. The text is highlighted as the speech progresses. The text is split by delimiters like .(full stop) !(exclamation mark) ?(question mark) ,(comma) ;(semicolon) in units for easy following. By clicking on such an unit the reading begins at that moment. A text file and an audio file exists as samples. The generated temp_csv_file.csv and temp_txt_file.txt can be deleted.
- Python 2.x or 3.x or later
- Aeneas https://github.com/readbeyond/aeneas/
- A html5 compatible browser (for the audio tag)
- Depending on the browser the audio file can have many audio formats.
- Python must be in the PATH
- Tkinter and other python modules
Linux, Mac OS X, Windows
Requires the dependencies.
Select the text file, the audio file, and the language for both files and a html file and run the script.
ForcedAlignment is released under the terms of the GNU General Public License Version 3. See the LICENSE file for details.
ForcedAlignment_python2_1.py for python2
ForcedAlignment_python3_1.py for python3