MeemawAssist 🤖💙

Autonomous AI tech-support assistant for elderly users.

The user describes a problem by voice or text — the app figures out what needs to be done and acts on it automatically.

Operating Modes

The AI picks a mode automatically (priority top to bottom):

Mode	What it does	Example
🛡 Anti-Scam Guardian	Blocks all actions, shows a scam warning overlay	"Someone from the bank called and asked me to install AnyDesk"
🔧 Phone Agent	Silently performs actions via system APIs (Wi-Fi, Bluetooth, volume, brightness)	"I have no sound" → automatically turns volume up
✉️ Compose & Route	Opens the right app with a pre-filled message	"Text Ivan on Telegram" → opens Telegram compose
💬 Chat & Advice	Answers general questions as a friendly helper	"What's the weather?", "How do I boil rice?"

Tech Stack

Kotlin · Min SDK 26 · Target SDK 34
OpenAI GPT-4o-mini — reasoning engine
Retrofit2 + OkHttp + Gson — networking
AccessibilityService — screen interaction (tap, swipe, type)
Kotlin Coroutines — async
MVVM — ViewModel + StateFlow
Material Design 3 — UI

Project Structure

app/src/main/java/com/meemaw/assist/
├── MainActivity.kt                  # Chat UI, voice input, status indicators
├── MainViewModel.kt                 # Mode routing, StateFlow
├── data/
│   ├── LLMRepository.kt            # OpenAI API calls, JSON response parsing
│   └── api/
│       ├── Models.kt                # Request/response data classes
│       └── OpenAIService.kt         # Retrofit interface
├── prompt/
│   └── PromptBuilder.kt            # System prompt + JSON schema for AI
├── agent/
│   ├── AgentLoop.kt                # Multi-step command execution
│   ├── ScreenReader.kt             # Read UI elements via AccessibilityService
│   ├── ScreenActions.kt            # Tap, swipe, type (gestures)
│   └── SystemConfigExecutor.kt     # Wi-Fi, Bluetooth, volume, brightness, Settings
├── accessibility/
│   └── MeemawAccessibilityService.kt  # AccessibilityEvent handling
└── ui/
    ├── ChatAdapter.kt              # RecyclerView adapter (user/ai/scam bubbles)
    └── MessageItem.kt              # Sealed class for message types

Available Agent Commands

Command	Description
`wifi_on` / `wifi_off`	Toggle Wi-Fi
`bluetooth_on` / `bluetooth_off`	Toggle Bluetooth
`volume_up` / `volume_down` / `volume_max` / `volume_mute`	Volume control
`brightness_up` / `brightness_down` / `brightness_max`	Brightness control
`open_settings`	Open the relevant settings screen
`restart_suggestion`	Suggest a reboot

Compose Mode: Supported Apps

Telegram · WhatsApp · SMS · Gmail · Phone (call)

UI Design

Font ≥ 18sp — large, readable
Accent color #00A8E0 (AT&T Blue)
Chat bubbles: user on the right (blue), AI on the left (grey)
Large microphone button for voice input
Status overlays: "Listening…" · "Thinking…" · "Fixing…"
Red border for scam warnings
High contrast for visually impaired users

Permissions

INTERNET, ACCESS_WIFI_STATE, CHANGE_WIFI_STATE,
BLUETOOTH, BLUETOOTH_ADMIN, BLUETOOTH_CONNECT,
MODIFY_AUDIO_SETTINGS, RECORD_AUDIO, WRITE_SETTINGS,
BIND_ACCESSIBILITY_SERVICE

Setup

Add your OpenAI key to local.properties:
```
OPENAI_API_KEY=sk-proj-...
```
Open the project in Android Studio
Sync Gradle → Run
On the device: Settings → Accessibility → MeemawAssist → enable

🛡️ MeemawDefender

Background scam-protection service — monitors SMS, notifications, and on-screen text in real time using GPT-4o-mini.

How It Works

Source	Trigger
ScreenMonitor	AccessibilityService reads on-screen text every 4 seconds
NotificationReceiver	Intercepts all incoming notifications (SMS, Gmail, messengers)
SmsReceiver	BroadcastReceiver on incoming SMS

When a threat is detected — shows a full-screen red block and sends an alert to the dashboard.

Dashboard (Node.js + MongoDB Atlas)

cd defender
node server.js   # http://localhost:3000

For external access — ngrok:

ngrok http 3000

MongoDB Atlas Integration

All alerts are stored in MongoDB Atlas (cloud):

Feature	Description
TTL Index	Alerts auto-deleted after 30 days
Aggregation Pipeline	`GET /api/analytics` — threat stats by type, avg/max score
Full-text Search	`GET /api/search?q=anydesk` — search across all alerts
Upsert Config	Settings (email, name) stored as a singleton document

Configure .env in the defender/ folder:

MONGODB_URI=mongodb+srv://user:password@cluster.mongodb.net/meemawdefender

API Endpoints

Method	URL	Description
`GET`	`/api/ping`	Heartbeat from the Android app
`GET`	`/api/status`	Connection status (30s timeout)
`POST`	`/api/alert`	New alert from the phone
`GET`	`/api/alerts`	Last 10 alerts
`GET`	`/api/analytics`	Threat type statistics (Atlas Aggregation)
`GET`	`/api/search?q=`	Search alerts
`GET/POST`	`/api/settings`	Settings (email, name, active flag)

Android → Dashboard via USB

adb reverse tcp:3000 tcp:3000
adb shell am start -n com.meemaw.defender/.MainActivity
# In the app: Server URL = http://127.0.0.1:3000 → Save

Test Without SMS

adb shell "am broadcast -n com.meemaw.defender/.TestTriggerReceiver \
  -a com.meemaw.defender.DEMO \
  --es text 'install anydesk and share the nine digit code' \
  --es source sms"

🧩 Meemaw Assist — Chrome Extension (MV3)

A React + Vite Chrome extension that helps elderly users complete tasks in the browser. It takes a screenshot of the active tab, asks Google Gemini to identify the single next small step, draws a large numbered red arrow on the target element, and reads the instruction aloud via ElevenLabs.

Folder: exe/meemaw-assist

What It Does

Screenshot → next step. The service worker captures the visible tab and sends the PNG to Gemini. Gemini returns exactly ONE action (click / type / choose) with a bounding box, target text, and role.
Snap-to-DOM arrow. The content script snaps the arrow to the real DOM element using the target text + coordinates.
Voice. The instruction is spoken via ElevenLabs (with an in-memory cache).
Diagnose mode. When there's nothing on screen to analyze — a gentle one-question-at-a-time chat flow with tap-to-reply buttons.
Multilingual: EN / SK / DE.
Accessibility: large text, high contrast, light/dark theme, voice input.

Tech Stack

React 19 + Vite 8 + TailwindCSS 4
Manifest V3 (popup, module service worker, content script on <all_urls>)
Google Gemini — vision + reasoning
ElevenLabs — TTS
Web Speech API — voice input
chrome.storage.sync — settings

Setup

API keys go in .env in the exe/ folder:
```
VITE_GEMINI_API_KEY=AIza...
VITE_ELEVENLABS_API_KEY=sk_...
VITE_ELEVENLABS_VOICE_ID=JBFqnCBsd6RMkjVDRZzb
```
Get a Gemini key at https://aistudio.google.com/apikey. If the key is missing, mock mode activates and the badge shows Mock · <reason>.

Build:

cd exe/meemaw-assist
npm install
npm run build

Load into Chrome:
- Open chrome://extensions
- Enable Developer mode
- Load unpacked → select exe/meemaw-assist/dist
- Pin the Meemaw icon to the toolbar
Use it:
- Open any normal website (e.g. gmail.com)
- Click the Meemaw icon
- Type or speak the goal ("send an email to my daughter")
- Press the big green button — an arrow and spoken instruction appear on the page
- Press Done after completing the step — the extension screenshots again and shows the next one

Badge at the top of the guide: AI · Gemini — real model in use; Mock · — fallback.

Dev Mode

npm run watch

Vite rebuilds dist/ on every change. After each change, hit the reload icon on the Meemaw card in chrome://extensions.

File Structure

File	Role
`public/manifest.json`	MV3 manifest
`public/steps.json`	Optional pre-baked scenarios for scripted flows
`src/background.js`	Service worker: tab capture, Gemini calls, TTS prefetch, session state
`src/content.js`	Draws the numbered arrow and plays audio on the page
`src/precision-engine.js`	Snaps Gemini's target text / bbox to a real DOM element
`src/services/openaiService.js`	Gemini vision client (legacy name)
`src/services/diagnoseService.js`	Gemini text chat — diagnose mode
`src/services/ttsService.js`	ElevenLabs TTS with in-memory cache
`src/Popup.jsx`	React popup UI (home / guide / done views)
`src/components/VoiceInput.jsx`	Mic button + Web Speech API recognition
`src/components/StepPreview.jsx`	Annotated screenshot preview
`src/components/SettingsPanel.jsx`	Language, text size, high contrast, voice
`src/settings.js`	`chrome.storage.sync` wrapper
`src/i18n.js`	EN / SK / DE strings

Permissions

activeTab, tabs, scripting, storage, host_permissions: <all_urls> — the minimum needed to read the active tab URL, call chrome.tabs.captureVisibleTab, and inject the overlay content script.

Privacy

Screenshots are sent directly from the service worker to https://generativelanguage.googleapis.com (Gemini).
Instruction text is sent to https://api.elevenlabs.io for speech.
No backend, no analytics, no third-party hops.
Keys are inlined into the build at compile time. Don't publish to the Chrome Web Store without moving keys behind a backend proxy.

Troubleshooting

Symptom	Likely cause
Badge `Mock · no_api_key`	`VITE_GEMINI_API_KEY` missing or doesn't start with `AIza`
Badge `Mock · http_400 / 403`	Gemini key invalid or quota exhausted
Arrow points to the wrong place	`precision-engine.js` couldn't match `target_text` — reload the page or rephrase the goal
"This page can't be assisted"	You're on `chrome://`, Chrome Web Store, or a PDF — switch to a regular site
No voice	`VITE_ELEVENLABS_API_KEY` missing or voice disabled in Settings

⚠️ The popup cannot capture your screen when opened via npm run dev — chrome.tabs.captureVisibleTab only works for an installed extension. Always load the built dist/.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.vscode		.vscode
HK		HK
app		app
defender		defender
exe		exe
gradle/wrapper		gradle/wrapper
.gitignore		.gitignore
README.md		README.md
README_V2.md		README_V2.md
SETUP.md		SETUP.md
add_contact.sh		add_contact.sh
after-text2.png		after-text2.png
app-log.txt		app-log.txt
app-log2.txt		app-log2.txt
app-log3.txt		app-log3.txt
app-log4.txt		app-log4.txt
assist-scam-final.png		assist-scam-final.png
assist-scam-test.png		assist-scam-test.png
assist-scam-test2.png		assist-scam-test2.png
assist-showme-final.png		assist-showme-final.png
assist-showme-final2.png		assist-showme-final2.png
assist-sms-polish-final.png		assist-sms-polish-final.png
assist-sms-polish-final2.png		assist-sms-polish-final2.png
build.gradle.kts		build.gradle.kts
call-test.png		call-test.png
camera.png		camera.png
contacts.txt		contacts.txt
defender-block.png		defender-block.png
defender-emu.png		defender-emu.png
defender-emu2.png		defender-emu2.png
defender-emu3.png		defender-emu3.png
defender_main_ui.xml		defender_main_ui.xml
defender_ui.xml		defender_ui.xml
emu5556.log		emu5556.log
emu5556.log.err		emu5556.log.err
emulator-live.png		emulator-live.png
gradle.properties		gradle.properties
guide-log.txt		guide-log.txt
launch.log		launch.log
launch.ps1		launch.ps1
m1.xml		m1.xml
meemaw-after-ok.png		meemaw-after-ok.png
meemaw-after-shot.png		meemaw-after-shot.png
meemaw-camera.png		meemaw-camera.png
meemaw-clean-state.png		meemaw-clean-state.png
meemaw-device.png		meemaw-device.png
meemaw-home.png		meemaw-home.png
meemaw-live.png		meemaw-live.png
meemaw-result.png		meemaw-result.png
meemaw-retest-after-shot.png		meemaw-retest-after-shot.png
meemaw_after_ok_ui.xml		meemaw_after_ok_ui.xml
meemaw_after_shot_ui.xml		meemaw_after_shot_ui.xml
meemaw_camera_ui.xml		meemaw_camera_ui.xml
meemaw_clean_ui.xml		meemaw_clean_ui.xml
meemaw_ready_ui.xml		meemaw_ready_ui.xml
meemaw_retest_after_shot_ui.xml		meemaw_retest_after_shot_ui.xml
meemaw_scam_keyboard.xml		meemaw_scam_keyboard.xml
meemaw_test_ui.xml		meemaw_test_ui.xml
meemaw_ui.xml		meemaw_ui.xml
overlay-wifi.png		overlay-wifi.png
overlay-wifi10.png		overlay-wifi10.png
overlay-wifi11.png		overlay-wifi11.png
overlay-wifi2.png		overlay-wifi2.png
overlay-wifi3.png		overlay-wifi3.png
overlay-wifi4.png		overlay-wifi4.png
overlay-wifi5.png		overlay-wifi5.png
overlay-wifi6.png		overlay-wifi6.png
overlay-wifi7.png		overlay-wifi7.png
overlay-wifi8.png		overlay-wifi8.png
overlay-wifi9.png		overlay-wifi9.png
phone-current.png		phone-current.png
phone_current_ui.xml		phone_current_ui.xml
picker.png		picker.png
picker_after_register.xml		picker_after_register.xml
pk.xml		pk.xml
router-back.jpg		router-back.jpg
scam_send.xml		scam_send.xml
seed-contacts.ps1		seed-contacts.ps1
seed2.ps1		seed2.ps1
seed3.log		seed3.log
seed3.ps1		seed3.ps1
settings.gradle.kts		settings.gradle.kts
showme-ok.png		showme-ok.png
showme-result.png		showme-result.png
showme-result2.png		showme-result2.png
showme-result3.png		showme-result3.png
showme-router.png		showme-router.png
showme-router2.png		showme-router2.png
showme_attach_ui.xml		showme_attach_ui.xml
showme_attached.xml		showme_attached.xml
showme_before_send.xml		showme_before_send.xml
sms-polished.png		sms-polished.png
sms-test-en.png		sms-test-en.png
sms-test.png		sms-test.png
sms-test1.png		sms-test1.png
t1-call-test.png		t1-call-test.png
t1.xml		t1.xml
t2.xml		t2.xml
t3.xml		t3.xml
t4.xml		t4.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MeemawAssist 🤖💙

Operating Modes

Tech Stack

Project Structure

Available Agent Commands

Compose Mode: Supported Apps

UI Design

Permissions

Setup

🛡️ MeemawDefender

How It Works

Dashboard (Node.js + MongoDB Atlas)

MongoDB Atlas Integration

API Endpoints

Android → Dashboard via USB

Test Without SMS

🧩 Meemaw Assist — Chrome Extension (MV3)

What It Does

Tech Stack

Setup

Dev Mode

File Structure

Permissions

Privacy

Troubleshooting

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MeemawAssist 🤖💙

Operating Modes

Tech Stack

Project Structure

Available Agent Commands

Compose Mode: Supported Apps

UI Design

Permissions

Setup

🛡️ MeemawDefender

How It Works

Dashboard (Node.js + MongoDB Atlas)

MongoDB Atlas Integration

API Endpoints

Android → Dashboard via USB

Test Without SMS

🧩 Meemaw Assist — Chrome Extension (MV3)

What It Does

Tech Stack

Setup

Dev Mode

File Structure

Permissions

Privacy

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages