spigelspike/Book2Vision

Book2Vision

Book2Vision is an automated system designed to transform digital books (PDF, EPUB, TXT) into complete multimedia packages, including audiobooks, video summaries, image packs, and knowledge tools.

Features

  • Ingestion: Supports PDF, EPUB, and TXT formats.
  • Analysis: Extracts text, identifies chapters, and performs semantic analysis.
  • Audiobook: Generates audiobooks using TTS.

Prerequisites

  1. Install Tesseract OCR (required for scanned PDFs).
  2. Download the spaCy model:
    python -m spacy download en_core_web_sm
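
A quick way to confirm both prerequisites before running anything is a small stand-alone check such as the sketch below. It is not part of the repository; the function name `check_prerequisites` is hypothetical, and the check assumes the Tesseract binary is installed under its usual name `tesseract` on your PATH.

```python
import importlib.util
import shutil


def check_prerequisites() -> dict[str, bool]:
    """Report whether Tesseract and the spaCy model appear to be installed."""
    return {
        # Assumption: the Tesseract CLI is reachable as `tesseract` on PATH.
        "tesseract": shutil.which("tesseract") is not None,
        # spaCy models are installed as importable Python packages.
        "en_core_web_sm": importlib.util.find_spec("en_core_web_sm") is not None,
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

If either entry is reported as missing, repeat the corresponding step above before starting the server.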

Installation and Configuration

  1. Clone the repository.
  2. Create a virtual environment:
    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
  3. Install dependencies:
    pip install -r requirements.txt
  4. Install the spaCy model:
    python -m spacy download en_core_web_sm
  5. Create a .env file in the root directory and add your API keys (see .env.example):
    GEMINI_API_KEY=your_key_here
    ELEVENLABS_API_KEY=your_key_here
    # Add other keys as needed
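
To catch a missing or placeholder key early, the .env file can be checked with a few lines of standard-library Python. This is only a sketch: how Book2Vision itself loads the file is not described here, the helper names `parse_env_text` and `missing_keys` are hypothetical, and treating both keys as required is an assumption.

```python
from pathlib import Path

# Assumption: both keys shown in the example above are required.
REQUIRED_KEYS = ("GEMINI_API_KEY", "ELEVENLABS_API_KEY")


def parse_env_text(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values: dict[str, str], required=REQUIRED_KEYS) -> list[str]:
    """Return required keys that are absent, empty, or still the placeholder."""
    return [
        key for key in required
        if not values.get(key) or values[key] == "your_key_here"
    ]


if __name__ == "__main__":
    env_path = Path(".env")
    text = env_path.read_text(encoding="utf-8") if env_path.exists() else ""
    missing = missing_keys(parse_env_text(text))
    print("Missing keys:", ", ".join(missing) if missing else "none")
```

Run it from the repository root so that the relative path `.env` resolves to the file created in step 5.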

How to Run

  1. Start the server:
    python src/server.py

  2. Open your browser and navigate to http://localhost:8000.
  3. Upload a book to start the transformation.
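
If the page does not load, it can help to confirm that something is actually listening on port 8000. The sketch below is a generic reachability check using only the standard library; the function name `server_is_up` is hypothetical, and it only tests that the address responds, not that Book2Vision is working correctly.

```python
import urllib.error
import urllib.request


def server_is_up(url: str = "http://localhost:8000", timeout: float = 2.0) -> bool:
    """Return True if any HTTP response comes back from `url`."""
    # An empty ProxyHandler bypasses system proxies for this local request.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a success status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or name resolution failure.
        return False


if __name__ == "__main__":
    state = "reachable" if server_is_up() else "not reachable"
    print(f"http://localhost:8000 is {state}")
```

A "not reachable" result usually means the server from step 1 is not running or failed during startup; check the terminal where it was launched for errors.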
