MEMO - Vision-First AI Desktop Companion

MEMO (Memory & Environmental Monitoring Observer) is an intelligent, vision-based desktop companion designed to enhance productivity, health, and spatial awareness. It transforms your standard webcam or mobile phone camera into a smart observer that tracks objects, monitors your posture, guards your focus, and recognizes you.

Built with YOLOv8, FaceNet, and OpenCV, MEMO runs entirely locally on your machine, ensuring privacy and speed.

🚀 Features

🧠 Spatial Memory & Querying

MEMO "remembers" where it saw objects on your desk.

"Where is my bottle?" → "The bottle is currently on the left, seen just now."
"Do you see my wallet?" → "Yes, I see the wallet."
Supports fuzzy matching and synonyms (e.g., "phone" = "cell phone").

👤 Face Recognition & Greeting

MEMO knows who you are.

Personalized Greetings: "Hello Jayadeep. Welcome back."
Privacy-First: Face embeddings are stored locally (user_embedding.npy) and never uploaded.
Smart Re-Greeting: Only greets you again if you've been away for a while.

📵 Smart Focus Mode

Need to get work done? Tell MEMO to guard your focus.

Command: focus on
Behavior: If MEMO sees a cell phone in the frame, it verbally scolds you: "Put the phone away and focus on your work!"
Visuals: Bounding boxes turn RED when threats (phones) are detected.

🧘 Health & Posture Coach

MEMO watches out for your physical well-being.

Proximity Alert: If you lean too close to the screen (>55% frame width), it warns: "You are too close to the screen. Please move back."
Sedentary Alert: Tracks how long you've been sitting. If > 60 minutes, it reminds you to stretch.

🛠️ Technology Stack

Vision: YOLOv8 (Object Detection & Pose Estimation), FaceNet (Face Recognition).
Core: OpenCV (Video Processing), NumPy.
Audio: pyttsx3 (Text-to-Speech).
Architecture: Modular Python design (Perception → State → Reasoning → Interface).

📦 Installation

Clone the Repository

git clone https://github.com/jay7-tech/memo.git
cd memo

Install Dependencies MEMO requires Python 3.8+ and PyTorch.
```
pip install -r requirements.txt
```
Note: Visual C++ Build Tools may be required for some dependencies on Windows.

🎮 Usage

Run the System:

# Default Webcam (Source 0)
python main.py

# IP Camera (Example)
python main.py http://192.168.1.5:8080/video 90

Web Dashboard 🌐: Visit http://localhost:5000 to view the live feed and control the system remotely.
Voice Commands 🗣️:
- Toggle Voice: Press v or type voice on.
- Commands:
  - "Focus on" / "Focus off"
  - "Register me"
  - "Where is my [object]?"
  - "Selfie" (Takes a photo)
Keyboard Shortcuts:
- q: Quit
- v: Toggle Voice
- f: Toggle Focus Mode
- s: Take Selfie

🚀 Future Roadmap

See the complete feature roadmap for Raspberry Pi 4B deployment:

📄 docs/FEATURE_ROADMAP.md

This includes:

📸 Enhanced Vision: Emotion detection, hand gestures, eye gaze tracking
🤖 Robot Behaviors: Head tracking gimbal, idle animations, personality modes
🧠 AI Integration: Gemini chat, vision-language models, conversation memory
🎛️ Hardware: LED eyes, environment sensors, better audio
🌐 Dashboard: Live stats, object map, settings panel

Author: [Jayadeep / Jay7-Tech]

Name		Name	Last commit message	Last commit date
Latest commit History 218 Commits
camera_input		camera_input
core		core
data		data
design		design
docs		docs
interface		interface
models		models
perception		perception
reasoning		reasoning
scripts		scripts
state		state
utils		utils
.gitignore		.gitignore
INSTALL_RPI.md		INSTALL_RPI.md
LICENSE		LICENSE
PI_DEPLOY_INSTRUCTIONS.md		PI_DEPLOY_INSTRUCTIONS.md
PI_SETUP.md		PI_SETUP.md
RASPBERRY_PI_SETUP.md		RASPBERRY_PI_SETUP.md
README.md		README.md
REPLICATION_GUIDE.md		REPLICATION_GUIDE.md
config.json		config.json
config.py		config.py
config_rpi.json		config_rpi.json
debug_buzz.py		debug_buzz.py
debug_camera.py		debug_camera.py
debug_cmd_logic.py		debug_cmd_logic.py
debug_dashboard_standalone.py		debug_dashboard_standalone.py
debug_pi_env.py		debug_pi_env.py
debug_quotes.py		debug_quotes.py
demo_emotions.py		demo_emotions.py
demo_features.py		demo_features.py
demo_features_lite.py		demo_features_lite.py
demo_gestures.py		demo_gestures.py
demo_unified.py		demo_unified.py
diag_camera.py		diag_camera.py
diag_ollama.py		diag_ollama.py
diag_pi_full.py		diag_pi_full.py
enable_dual_touch.sh		enable_dual_touch.sh
fix_pi_build.sh		fix_pi_build.sh
install_log.txt		install_log.txt
install_pi.sh		install_pi.sh
install_rpi.sh		install_rpi.sh
main.py		main.py
main_legacy.py		main_legacy.py
model.zip		model.zip
reproduce_issue.py		reproduce_issue.py
requirements.txt		requirements.txt
requirements_pi.txt		requirements_pi.txt
requirements_verified.txt		requirements_verified.txt
run.sh		run.sh
run_memo.bat		run_memo.bat
run_memo.sh		run_memo.sh
setup_pi.sh		setup_pi.sh
start_memo.sh		start_memo.sh
test_tinyllama.py		test_tinyllama.py
verify_face_onnx.py		verify_face_onnx.py
verify_gemini.py		verify_gemini.py
verify_lcd.py		verify_lcd.py
verify_lcd_hardware.py		verify_lcd_hardware.py
verify_system.py		verify_system.py
verify_touch_hardware.py		verify_touch_hardware.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MEMO - Vision-First AI Desktop Companion

🚀 Features

🧠 Spatial Memory & Querying

👤 Face Recognition & Greeting

📵 Smart Focus Mode

🧘 Health & Posture Coach

🛠️ Technology Stack

📦 Installation

🎮 Usage

🚀 Future Roadmap

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MEMO - Vision-First AI Desktop Companion

🚀 Features

🧠 Spatial Memory & Querying

👤 Face Recognition & Greeting

📵 Smart Focus Mode

🧘 Health & Posture Coach

🛠️ Technology Stack

📦 Installation

🎮 Usage

🚀 Future Roadmap

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages