v1.2.3 - Real-Time Model Status & Model Support (July 19, 2025)
Real-Time Status Monitoring
Live Model Status - Added real-time status tracking for model loading, including download progress.
Detailed Status View - See download percentage, speed, and ETA directly in the UI.
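The three values shown in the status view (percentage, speed, ETA) can all be derived from a byte count and elapsed time. The following is a minimal sketch of that arithmetic only; the `DownloadStatus` type and `download_status` function are hypothetical names for illustration and are not taken from MLX-GUI's code, and it uses average speed since the start of the download rather than any smoothing the real UI may apply.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DownloadStatus:
    """Hypothetical container for the values a status view might display."""
    percent: float                 # 0.0 to 100.0
    speed_bytes_per_s: float       # average speed since the download started
    eta_seconds: Optional[float]   # None when no estimate is possible yet


def download_status(downloaded_bytes: int, total_bytes: int,
                    elapsed_seconds: float) -> DownloadStatus:
    """Derive percentage, average speed, and ETA from raw progress counters."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")

    # Clamp so a slightly stale or overshooting counter never reports >100%.
    downloaded = min(max(downloaded_bytes, 0), total_bytes)
    percent = 100.0 * downloaded / total_bytes

    # Average speed; zero until some time has elapsed.
    speed = downloaded / elapsed_seconds if elapsed_seconds > 0 else 0.0

    remaining = total_bytes - downloaded
    if remaining == 0:
        eta: Optional[float] = 0.0
    elif speed > 0:
        eta = remaining / speed
    else:
        eta = None  # nothing downloaded yet, so no meaningful estimate

    return DownloadStatus(percent, speed, eta)
```

For example, 250 bytes of 1000 after 5 seconds gives 25% complete at 50 bytes/s, with 15 seconds remaining.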
New API Test Console
Built-in API Testing - Added a dedicated API Test tab in the admin interface for single-turn testing
Model Selection - Test any loaded model with customizable parameters (temperature, max tokens, system messages)
Response Analytics - View response time, token count, and detailed statistics
Test History - Keep track of recent API tests with timestamps and performance metrics
Quick Validation - Quickly check model responses and API functionality
Comprehensive Model Ecosystem
15+ New Verified Models - Added support for trending MLX models including SmolLM3, Kimi-K2, Gemma-3n, and more
Trillion-Parameter Support - Added detection for ultra-large models like Kimi-K2-Instruct (1.02T parameters)
Enhanced Model Discovery - Improved trending models endpoint with curated high-performance models
Smart Multimodal Detection - Fixed classification for models like Gemma-3n so they are properly shown as "Multimodal"
New Verified and Tested Models
SmolLM3-3B-4bit - Multilingual 481M parameter model with 8-language support
Kimi-Dev-72B-4bit-DWQ - Large reasoning model with advanced capabilities
Kimi-K2-Instruct-4bit - Ultra-large 1.02T parameter instruction-tuned model
Llama-3.2-3B-Instruct-4bit - Meta's instruction-following model with 502M parameters
Gemma-2-9B-it-4bit - Google's advanced reasoning model with 1.44B parameters
Qwen3-30B-A3B-4bit-DWQ - MoE model with 30B total/3B active parameters
Gemma-3n-E4B-it-MLX-4bit - Advanced multimodal model with image/audio/text capabilities
Technical Improvements
Improved Model Type Classification - Enhanced detection for multimodal models with image-text-to-text capabilities
Expanded Parameter Patterns - Added support for trillion-scale model memory estimation
Comprehensive Test Suite - Added dedicated test scripts for all new models with streaming/non-streaming validation
Install-Load Workflow - Updated all tests to follow proper MLX-GUI model lifecycle (install → load → use)
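To illustrate what a trillion-scale parameter pattern and memory estimate might involve, here is a rough sketch. It is not MLX-GUI's actual implementation: the regular expressions and the function names `parse_param_count`, `parse_bits`, and `estimate_weight_bytes` are assumptions made for this example. The estimate covers raw weight storage only (parameters × bits ÷ 8) and ignores quantization scales, activations, and KV cache, so it should be read as a lower bound.

```python
import re
from typing import Optional

# Multipliers for size suffixes; "T" is what makes trillion-scale names parse.
_UNITS = {"M": 10**6, "B": 10**9, "T": 10**12}

# A number followed by M/B/T that is not glued to other letters or digits,
# so "30B" matches but "A3B", "4bit", and the "3" in "SmolLM3" do not.
_PARAM_RE = re.compile(
    r"(?<![A-Za-z0-9.])(\d+(?:\.\d+)?)([MBT])(?![A-Za-z0-9])", re.IGNORECASE
)
_BITS_RE = re.compile(r"(\d+)-?bit", re.IGNORECASE)


def parse_param_count(name: str) -> Optional[int]:
    """Return the first parameter count found in a model name, or None."""
    match = _PARAM_RE.search(name)
    if match is None:
        return None
    value, unit = float(match.group(1)), match.group(2).upper()
    return int(round(value * _UNITS[unit]))


def parse_bits(name: str, default: int = 16) -> int:
    """Return the quantization width from a name such as '...-4bit'."""
    match = _BITS_RE.search(name)
    return int(match.group(1)) if match else default


def estimate_weight_bytes(param_count: int, bits: int) -> int:
    """Lower-bound weight storage: parameters * bits / 8, in bytes."""
    return param_count * bits // 8
```

Under these assumptions, a 1.02T-parameter model at 4 bits needs at least 510 GB for weights alone. Names that carry no size token, such as Kimi-K2-Instruct-4bit, return `None` from `parse_param_count`, so a real implementation would need another source (for example model metadata) for those.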