🧠 Voice-to-Vector AI System

A Flask-based AI system that transforms voice notes into structured insights and stores them in a vector database for semantic retrieval.
This project demonstrates an end-to-end intelligence pipeline — from voice recording to actionable knowledge — integrating transcription, LLM reasoning, and vector storage.

🚀 Overview

The Voice-to-Vector system automates the process of turning spoken thoughts into searchable, structured data.

Workflow Summary:

Voice Note Input → uploaded or recorded audio
Transcription → processed via Whisper API
AI Enrichment → Gemini for intent recognition and tagging
Vector Embedding → stored in Qdrant using Sentence Transformers
Semantic Search → query system retrieves related insights by meaning

🧩 Architecture

Voice-to-Vector Architecture Diagram

🎙️Voice Input

 ⬇️️

💻[Flask API Server] — handles upload, processing, and response

 ⬇️

🧩[Whisper API] — transcription

 ⬇️

🤖[Gemini ] — enrichment + intent recognition

 ⬇️

🗄️[SentenceTransformer + Qdrant] — embeddings + vector storage

 ⬇️

🔍[Search Endpoint] — query for semantically similar notes

Key Components

Flask: API backend handling requests and orchestration
Qdrant: Vector database storing embeddings and metadata
SentenceTransformer: Generates text embeddings
LLM API (Gemini 2.5 Pro): AI reasoning layer for structuring and tagging insights
Whisper API: Speech-to-text transcription

🧠 Features

🎙️ Transcribe voice notes automatically
🪄 AI-powered enrichment and sentiment tagging
🔎 Semantic search for finding related ideas and tasks
🧱 Modular components: replace or extend with your own APIs
📦 Local vector storage using Qdrant
⚙️ Easily deployable with Docker

🧰 Tech Stack

Layer	Tools Used
Frontend/Input	Local or browser-based audio recorder
Backend/API	Flask, Python
AI/ML	Whisper API, (Gemini 2.5 Pro)
Vector DB	Qdrant
Embeddings	SentenceTransformers (`all-MiniLM-L6-v2`)

🖥️ Installation & Setup

1️⃣ Clone the Repository

git clone https://github.com/98Devops/Voice-to-vector.git
cd voice-to-vector
Set Up Environment Variables

Create a .env file:

QDRANT_HOST=localhost
QDRANT_PORT=6333
GEMINI_API_KEY=your_api_key_here
WHISPER_API_KEY=your_whisper_api_key_here

3️⃣ Install Dependencies
pip install -r requirements.txt

4️⃣ Run Qdrant (Docker)
docker run -p 6333:6333 qdrant/qdrant

5️⃣ Start the Flask Server
python app.py


Flask will start on http://localhost:5000

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
README.md		README.md
WhatsApp Image 2025-09-30 at 7.24.58 PM.jpeg		WhatsApp Image 2025-09-30 at 7.24.58 PM.jpeg
WhatsApp Image 2025-09-30 at 7.26.11 PM.jpeg		WhatsApp Image 2025-09-30 at 7.26.11 PM.jpeg
WhatsApp Image 2025-09-30 at 7.26.14 PM.jpeg		WhatsApp Image 2025-09-30 at 7.26.14 PM.jpeg
WhatsApp Image 2025-09-30 at 7.26.15 PM (1).jpeg		WhatsApp Image 2025-09-30 at 7.26.15 PM (1).jpeg
WhatsApp Image 2025-09-30 at 7.26.15 PM.jpeg		WhatsApp Image 2025-09-30 at 7.26.15 PM.jpeg
WhatsApp Image 2025-09-30 at 7.26.16 PM.jpeg		WhatsApp Image 2025-09-30 at 7.26.16 PM.jpeg
app.py		app.py
architecture-diagram.png.png		architecture-diagram.png.png
index.html		index.html
memory_manager.py		memory_manager.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧠 Voice-to-Vector AI System

🚀 Overview

🧩 Architecture

🧠 Features

🧰 Tech Stack

🖥️ Installation & Setup

1️⃣ Clone the Repository

About

Uh oh!

Releases

Packages

Languages

98Devops/Voice-to-vector

Folders and files

Latest commit

History

Repository files navigation

🧠 Voice-to-Vector AI System

🚀 Overview

🧩 Architecture

🧠 Features

🧰 Tech Stack

🖥️ Installation & Setup

1️⃣ Clone the Repository

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages