Audiobook Simplifier is a tool that creates audiobooks from text documents or eBooks using TTS (Text-to-Speech) technology.
This project converts text documents or eBooks into audio files using TTSv2, making written content accessible in audio format. Ideal for accessibility, visually impaired users, or anyone who prefers listening to a book instead of reading.
Make sure the following components are installed:
-
Python 3.10
👉 Download Python 3.10.11 (Windows) OR Download Python 3.10.11 (Windows-Store)During installation:
- Check "Install for all users".
- Check "Add Python to PATH".
-
Download Microsoft C++ Authoring Tools -👉 https://visualstudio.microsoft.com/fr/visual-cpp-build-tools/
-
FFmpeg
👉 Download FFmpegAdd FFmpeg to your system
PATH
variable.Command terminal
winget install "FFmpeg (Essentials Build)"
-
CUDA Toolkit 11.8
👉 Download CUDA Toolkit 11.8 -
cuDNN v8.9.7 for CUDA 11
👉 Download cuDNN v8.9.7Unzip and copy the files into :
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.8
- Clone this repository:
git clone https://github.com/EasyAI-France/audiobook-simplifier.git cd audiobook-simplifier
- (Recommended) Install dependencies in a virtual environment:
python -m venv .venv .venv\Scripts\activate pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu118 pip install -r requirements.txt
- (Recommended) Install dependencies in a virtual environment:
python -m venv .venv .venv\Scripts\activate pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cpu pip install -r requirements.txt
- Double-click
install.bat
. - Select your graphics
- Once the project is installed, the terminal closes by itself
Clone the GitHub directory, then navigate to the root of the Audiobook-Simplifier directory, where the Dockerfile and docker-compose.yml files are located. To create your Docker image, type in the terminal:
docker build -t audiobook_simplifier .
👉 To launch it in English, type:
docker run --name EasyAI -e LANGUE=eng -p 7860:7860 audiobook_simplifier
👉 Or to launch it in French:
docker run --name EasyAI -e LANGUE=fr -p 7860:7860 audiobook_simplifier
- Double-click
start.bat
. - Select your language.
- Open the displayed URL in your browser.
.venv\Scripts\activate
python scripts\main_eng.py
- Select a TTS voice in
.wav
format. - Choose the language from the dropdown menu.
- Enable or disable subtitles.
- Upload your text file or eBook (optimized for one paragraph at a time).
- Process the text to make it more understandable for the TTS engine.
- Convert text files into audio files (multiple segments will be created in the
cache
folder). - In the terminal: You must accept the terms of the CPML non-commercial license: https://coqui.ai/cpml
- Compile segments into a single audio file using the "Compile audio files" button.
- Listen to and download your audiobook.
- Clear the cache using the "Delete files" button.
- Click on “Transform an audio”.
- Enter the name of the
.wav
file. - Rewrite the text to be fixed.
- Click “Recompile audio file”.
This project is maintained by EasyAI-France.
It uses the open-source TTS engine from Coqui TTS.
Contributions are welcome!
Feel free to submit improvements, bug fixes, or issues.
For questions, feedback, or suggestions:
📧 easyaivideo@gmail.com