GemX-v0.1.0

@Avaneesh40585 Avaneesh40585 released this 23 May 21:21

v0.1.0 Initial Release Notes

Welcome to the first release of GemX, a fully local, native chat application for Apple Silicon Macs built around the multimodal capabilities of Gemma 4. Using Apple's MLX framework and the mlx-vlm runtime, GemX goes beyond standard text streaming to support native vision, document parsing, and agentic web research directly on your machine.

Key Features in this Release

  • Native Multimodal Vision: Paste or drag-and-drop up to four images per message. The mlx-vlm backend ensures Gemma 4 processes these locally through its integrated image encoder.
  • Integrated Web Research: GemX performs multi-step web research (searching, fetching the top URLs, and synthesizing the results) using Tavily, with DuckDuckGo as a fallback. The model cites its sources as clickable links so you can verify them.
  • Local Voice Transcription: Speak directly to the model using an on-device Whisper model powered by WebGPU, ensuring audio data never leaves your computer.
  • Document Context: Attach PDFs, DOCX files, or raw code files. The application extracts the text locally and injects it cleanly as context for your prompt.
  • Model Hot-Swapping: Switch seamlessly between four quantized Gemma 4 variants (ranging from the E2B to the 31B model) mid-conversation without restarting the application or losing chat history.
  • Native UI Elements: The chat interface supports full GitHub Flavored Markdown, LaTeX math rendering, syntax-highlighted code blocks, collapsible reasoning blocks, and a persistent sidebar featuring pinned chats and auto-generated titles.
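
The search-then-fetch flow with a fallback provider described under Integrated Web Research can be sketched roughly as follows. This is an illustration only, not GemX's actual code: `search_with_fallback`, `gather_sources`, and the injected `providers` and `fetch` callables are hypothetical names, and real Tavily or DuckDuckGo clients would need to be wrapped to match the assumed `query -> list of URLs` shape.

```python
from typing import Callable, Iterable

# Assumed shapes for illustration: a search provider maps a query to ranked
# URLs, and a fetcher maps a URL to page text. Neither is GemX's real API.
SearchFn = Callable[[str], list[str]]
FetchFn = Callable[[str], str]


def search_with_fallback(query: str, providers: Iterable[SearchFn]) -> list[str]:
    """Return URLs from the first provider that succeeds with a non-empty list."""
    for provider in providers:
        try:
            urls = provider(query)
        except Exception:
            continue  # provider unavailable (e.g. no API key); try the next one
        if urls:
            return urls
    return []


def gather_sources(
    query: str, providers: Iterable[SearchFn], fetch: FetchFn, top_k: int = 3
) -> list[dict]:
    """Fetch the top_k result URLs and number the ones that load, so a
    synthesized answer can cite them as [1], [2], ..."""
    sources: list[dict] = []
    for url in search_with_fallback(query, providers)[:top_k]:
        try:
            text = fetch(url)
        except Exception:
            continue  # skip pages that fail to load
        sources.append({"id": len(sources) + 1, "url": url, "text": text})
    return sources
```

Passing the providers as an ordered list keeps the fallback order explicit: the primary search is tried first and the next provider is consulted only if it raises or returns nothing.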

First-Launch Automation

The first time you open the application, GemX handles the backend setup automatically. It locates your Python installation, provisions an isolated virtual environment, installs the MLX backend dependencies, and downloads your selected model weights from Hugging Face.
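
For readers curious what those steps amount to, here is a minimal sketch of the equivalent command sequence. It is not GemX's actual setup code. The function names, the environment path, and the model repository id passed in are hypothetical, and it assumes that `huggingface_hub` is available in the environment once `mlx-vlm` is installed. The sketch only builds the commands and does not run them.

```python
import shutil
import sys
from pathlib import Path


def find_python() -> str:
    """Return a Python 3 interpreter from PATH, else the current interpreter."""
    return shutil.which("python3") or sys.executable


def plan_first_launch(env_dir: Path, model_repo: str) -> list[list[str]]:
    """Build the setup commands in order, without executing them."""
    venv_python = str(env_dir / "bin" / "python")  # macOS/Linux venv layout
    # Assumption: huggingface_hub is importable after installing mlx-vlm.
    download = (
        "from huggingface_hub import snapshot_download; "
        f"snapshot_download(repo_id={model_repo!r})"
    )
    return [
        [find_python(), "-m", "venv", str(env_dir)],       # isolated environment
        [venv_python, "-m", "pip", "install", "mlx-vlm"],  # backend dependencies
        [venv_python, "-c", download],                     # model weights
    ]
```

Each entry could then be run in order with `subprocess.run(cmd, check=True)`, which stops at the first failing step.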

Please see the README in the main repository for complete installation instructions, system requirements, model memory specifications, and troubleshooting steps.