Skip to content
This repository was archived by the owner on Apr 1, 2026. It is now read-only.

v1.2.3

Choose a tag to compare

@RamboRogers RamboRogers released this 20 Jul 04:32
· 16 commits to main since this release

v1.2.3 - Real-Time Model Status & Model Support (July 19 2025)
πŸš€ Real-Time Status Monitoring
βœ… Live Model Status - Added real-time status tracking for model loading, including download progress.
πŸ“Š Detailed Status View - See download percentage, speed, and ETA directly in the UI.

πŸ§ͺ New API Test Console
βœ… Built-in API Testing - Added dedicated API Test tab in the admin interface for single-turn testing
🎯 Model Selection - Test any loaded model with customizable parameters (temperature, max tokens, system messages)
πŸ“Š Response Analytics - View response time, token count, and detailed statistics
πŸ“ Test History - Keep track of recent API tests with timestamps and performance metrics
⚑ Quick Validation - Perfect for testing model responses and API functionality πŸš€ Comprehensive Model Ecosystem
βœ… 15+ New Verified Models - Added support for trending MLX models including SmolLM3, Kimi-K2, Gemma-3n, and more
🧠 Trillion-Parameter Support - Added detection for ultra-large models like Kimi-K2-Instruct (1.02T parameters)
🎯 Enhanced Model Discovery - Improved trending models endpoint with curated high-performance models
πŸ” Smart Multimodal Detection - Fixed classification for models like Gemma-3n to properly show as "Multimodal"
🎨 New Verified Tested Models

SmolLM3-3B-4bit - Multilingual 481M parameter model with 8-language support
Kimi-Dev-72B-4bit-DWQ - Large reasoning model with advanced capabilities
Kimi-K2-Instruct-4bit - Ultra-large 1.02T parameter instruction-tuned model
Llama-3.2-3B-Instruct-4bit - Meta's instruction-following model with 502M parameters
Gemma-2-9B-it-4bit - Google's advanced reasoning model with 1.44B parameters
Qwen3-30B-A3B-4bit-DWQ - MoE model with 30B total/3B active parameters
Gemma-3n-E4B-it-MLX-4bit - Advanced multimodal model with image/audio/text capabilities
πŸ”§ Technical Improvements

🎯 Improved Model Type Classification - Enhanced detection for multimodal models with image-text-to-text capabilities
πŸ“Š Expanded Parameter Patterns - Added support for trillion-scale model memory estimation
πŸ§ͺ Comprehensive Test Suite - Added dedicated test scripts for all new models with streaming/non-streaming validation
πŸ”„ Install-Load Workflow - Updated all tests to follow proper MLX-GUI model lifecycle (install β†’ load β†’ use)