cryptopoly / ChaosEngineAI Sponsor Star 5 Code Issues Pull requests Local AI workstation — discover, run, chat, benchmark, and generate images from open-weight models. DFlash/DDTree speculative decoding, five cache compression strategies (RotorQuant, TriAttention, TurboQuant, ChaosEngine), MLX + llama.cpp + vLLM backends. desktop-app python machine-learning typescript ai image-generation mlx tauri huggingface apple-silicon openai-api cache-compression llm stable-diffusion llama-cpp vllm local-ai gguf speculative-decoding dflash Updated Apr 17, 2026 Python