Agentic ✧ Gemma Inference for Android System Intelligence
Click to watch: The ASI trailer.
GHOST is not your typical AI chat app. Gemma Hosted Operating System Terminal is the intelligence layer your Android already had but needed an operator. Most "on-device AI" is a chatbot with no body — it doesn't know what phone it's running on, what time it is, how bright the room is, or what's playing. GHOST does. Every response is grounded in real hardware state: battery, temperature, light, RAM, network, now-playing. Personal assistant, as advertised.
No subscription No data leaves your device Runs on any Android with NPU/GPU capable of LiteRT-LM (Qualcomm, Tensor, Exynos) MIT Licensed
✧ Gemma — The Ghost in the Shell A full Android app running Gemma 4 natively via LiteRT-LM.
Sees images (share from Gallery, or capture) Hears audio (hold mic button, or wake word) Reads text
Always-on foreground service. Summoned by a shake. Present in your notification shade. Knows the room. Tool use: web search, app launch, clipboard, alarms, system info — all on-device. Diary mode: Every 12 hours, Gemma reflects on your day and writes a first-person entry to your Google Calendar. Private. Local. Yours.
✧ GHOST · Agentic Gemma Inference
Δ 👾 ∇
✧ Gemma: [Response] [Copy] [Read Again]
Responses appear as a persistent notification with TTS readout. One tap. No unlock required.
Zero-latency context: Background KV cache pre-warming keeps Gemma primed with your latest sensor state before you even open your mouth.
- Download the latest APK from Releases.
- Install and grant permissions (overlay, notifications, accessibility).
- Download a Gemma 4 model (via Google AI Edge Gallery or manually place
.litertlmvariant in app storage). - Shake to summon.
The hardware caught up. A mid-range Android in 2026 carries more raw compute than the servers that ran GPT-2. The intelligence was always going to land here — on the device, in your pocket, offline-capable, sovereign. GHOST is what happens when you stop treating the phone as a terminal for someone else's cloud and start treating it as the computer it actually is.
"It only affects computers. And I am a motherfucking ghost." — Epsilon, Red vs Blue
Gemma 4 native via LiteRT-LM Sensor telemetry fusion (battery, temp, lux, RAM, now-playing) Tool use: alarms, apps, clipboard, system info Diary mode via Google Calendar cron Notification HUD with TTS GHOST branding + v4.0.0 Wake word: "Hey Ghost" Termux pipe (GHOST in Shell) Auto model downloader DroidRun agentic control App store release
- Repository:
vNeeL-code/GHOST - Support: Buy me a coffee
- Devlogs: tumblr
Intelligence emerges from Integration, not Automation. But integration can be automated
