v0.3.1

github-actions released this 03 May 15:51 · 189 commits to main since this release

Changed

  • Model format handling now accepts GCUF as a compatibility alias for canonical GGUF.
  • Architecture inspection now supports local GGUF files, MLX models in local directories that contain a config.json, and local-server MLX IDs that point back to HF-style repos, including IDs with a leading slash such as /lmstudio-community/...-MLX-6bit.
  • Architecture inspection now uses a layered pipeline: exact Transformers construction first, then format-aware config/header fallback with explicit provenance and accuracy metadata.
  • Config fallback now normalizes nested decoder configs, estimates dense decoder, multimodal projector, and MoE structures, respects tied embeddings, and returns a clear unsupported error when required dimensions are missing.
  • GPTQ, AWQ, SafeTensors, MLX, and GGUF inspection targets now route through the appropriate exact, config-backed, or header-only strategy without downloading weight tensors.
  • Architecture cache entries now include inspector metadata and invalidate stale zero-parameter root-only results.
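
Example

The alias handling, layered pipeline, and config fallback described above can be pictured with a minimal sketch. This is not the project's code: every name here (`normalize_format`, `InspectionResult`, `inspect_exact`, `inspect_from_config`, `UnsupportedArchitecture`, the `exact_param_count` key) is hypothetical, and the parameter estimate is a deliberately simplified dense-decoder formula that ignores grouped-query attention, gated MLPs, norms, and biases. The `text_config` and `tie_word_embeddings` keys follow common Hugging Face config conventions; whether the release reads exactly these keys is an assumption.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical alias table: "gcuf" resolves to canonical "gguf".
FORMAT_ALIASES = {"gcuf": "gguf"}


def normalize_format(name: str) -> str:
    """Lower-case a format name and resolve compatibility aliases."""
    key = name.strip().lower()
    return FORMAT_ALIASES.get(key, key)


@dataclass
class InspectionResult:
    param_count: int
    provenance: str  # which strategy produced the result
    accuracy: str    # "exact" or "estimated"


class UnsupportedArchitecture(Exception):
    """Raised when no strategy can describe the model."""


def inspect_exact(config: dict) -> Optional[InspectionResult]:
    """Stand-in for exact construction; returns None when unavailable."""
    if "exact_param_count" in config:  # hypothetical marker key
        return InspectionResult(config["exact_param_count"], "exact", "exact")
    return None


def inspect_from_config(config: dict) -> Optional[InspectionResult]:
    """Simplified dense-decoder estimate from config dimensions."""
    # Normalize nested decoder configs (e.g. multimodal wrappers).
    cfg = config.get("text_config", config)
    required = ("hidden_size", "num_hidden_layers", "vocab_size")
    missing = [k for k in required if k not in cfg]
    if missing:
        raise UnsupportedArchitecture(f"missing required dimensions: {missing}")
    h = cfg["hidden_size"]
    n = cfg["num_hidden_layers"]
    v = cfg["vocab_size"]
    inter = cfg.get("intermediate_size", 4 * h)
    # Attention: q, k, v, o projections. MLP: up and down projections.
    per_layer = 4 * h * h + 2 * h * inter
    embed = v * h
    # Tied embeddings share one matrix between input and output.
    tied = cfg.get("tie_word_embeddings", False)
    total = n * per_layer + embed * (1 if tied else 2)
    return InspectionResult(total, "config", "estimated")


def inspect(config: dict) -> InspectionResult:
    """Try the exact strategy first, then fall back to the config estimate."""
    for strategy in (inspect_exact, inspect_from_config):
        result = strategy(config)
        if result is not None:
            return result
    raise UnsupportedArchitecture("no inspection strategy applied")
```

In this sketch, a caller can check `result.provenance` and `result.accuracy` to decide how much to trust a parameter count, which is the role the release notes assign to the provenance and accuracy metadata.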