Book2Vision is an automated system designed to transform digital books (PDF, EPUB, TXT) into complete multimedia packages, including audiobooks, video summaries, image packs, and knowledge tools.
- Ingestion: Supports PDF, EPUB, and TXT formats.
- Analysis: Extracts text, identifies chapters, and performs semantic analysis.
- Audiobook: Generates audiobooks using TTS.
- Install Tesseract OCR (required for scanned PDFs).
- Download Spacy model:
python -m spacy download en_core_web_sm
- Clone the repository.
- Create a virtual environment:
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
- Install Spacy model:
python -m spacy download en_core_web_sm
- Create a
.envfile in the root directory and add your API keys (see.env.example):GEMINI_API_KEY=your_key_here ELEVENLABS_API_KEY=your_key_here # Add other keys as needed
- Start the server:
python src/server.py ```
- Open your browser and navigate to
http://localhost:8000. - Upload a book to start the transformation.T