🎙️ Voice-Assistant (Memo AI)

A privacy-first, full-stack local pipeline that transforms voice commands into technical actions. This assistant leverages Llama 3 for orchestration and Whisper for transcription to execute code and manage files within a secure, local sandbox, also save sessions on pine cone to fetch them back in future.

🏗️ System Architecture

The project follows a modular pipeline designed for low latency and high reliability:

Input Layer: Captures real-time audio via streamlit-mic-recorder, uploaded audio files (.wav, .mp3), or direct text commands.
STT Layer (OpenAI Whisper): Local transcription of audio into text. We use the base model with fp16=False to ensure stability across CPU/GPU configurations.
Task Router (Ollama/Llama 3): A high-precision intent classifier that breaks down user requests into actionable JSON tasks.
Execution Layer:
- Logic Engine: Generates code or summaries based on detected intents.
- File System: Automatically writes outputs to the local /output directory.
Memory Layer (Pinecone RAG):
- Uses mxbai-embed-large to vectorize interactions.
- Stores data in the voice-assist index for long-term session persistence and retrieval.

🛠️ Prerequisites

Before running the application, ensure you have the following installed:

Ollama: The engine for running LLMs locally. Download here
Python 3.10+: Ensure Python is added to your system PATH.
Pinecone API Key: Required for the vector database. Get it from the Pinecone Console.

🚀 Setup & Installation

1. Pull Local Models

Open your terminal and pull the models required for logic and embeddings:

ollama pull llama3:latest
ollama pull mxbai-embed-large

2. Install Python Dependencies

Install all necessary libraries using the requirements.txt file:

pip install -r requirements.txt

3. Environment Repair (Optional)

If you encounter library conflicts or version mismatches (common with PyTorch/Whisper), run the recovery script:

python repair_env.py

4. Configure API Key

Set your Pinecone API key in your environment variables or directly in agent.py:

# In agent.py
api_key = "YOUR_PINECONE_API_KEY"

🔌 Hardware Workarounds

Cross-Platform STT: fp16=False is set in the Whisper configuration to prevent CUDA-related crashes on non-NVIDIA hardware.
Local Vectorization: Using Ollama for embeddings ensures that no data leaves the local network, maximizing privacy and reducing latency.
Self-Healing Index: The system automatically detects if the voice-assist index exists in Pinecone with the correct 1024 dimensions; if not, it creates it automatically.

📁 File Structure

File	Function
`app.py`	Streamlit frontend and terminal-themed UI logic.
`agent.py`	Backend logic, Task Router, and Pinecone integration.
`requirements.txt`	Comprehensive list of project dependencies.
`repair_env.py`	Environment diagnostic and fix script.
`/output`	Secure directory for AI-generated files and scripts.

📝 Usage

Launch: streamlit run app.py
Initialize: Click "Start New Session" in the sidebar.
Command: Type or speak: "Write a python script to add two numbers and save it."
Recall: Use the sidebar dropdown to reload any past session stored in Pinecone.

⚖️ License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
output		output
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Requirements.txt		Requirements.txt
agent.py		agent.py
app.py		app.py
repair_env.py		repair_env.py
temp_mic.wav		temp_mic.wav
temp_v.wav		temp_v.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎙️ Voice-Assistant (Memo AI)

🏗️ System Architecture

🛠️ Prerequisites

🚀 Setup & Installation

1. Pull Local Models

2. Install Python Dependencies

3. Environment Repair (Optional)

4. Configure API Key

🔌 Hardware Workarounds

📁 File Structure

📝 Usage

⚖️ License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎙️ Voice-Assistant (Memo AI)

🏗️ System Architecture

🛠️ Prerequisites

🚀 Setup & Installation

1. Pull Local Models

2. Install Python Dependencies

3. Environment Repair (Optional)

4. Configure API Key

🔌 Hardware Workarounds

📁 File Structure

📝 Usage

⚖️ License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages