prep for mlx-swift 0.29.1 #411
Conversation
Package.swift (outdated)
```diff
     ],
     dependencies: [
-        .package(url: "https://github.com/ml-explore/mlx-swift", .upToNextMinor(from: "0.25.5")),
+        .package(url: "https://github.com/ml-explore/mlx-swift", branch: "mlx-0291"),
```
For now just point to the branch.
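Once an mlx-swift 0.29.1 tag exists, the branch reference can presumably go back to a pinned version range. A minimal manifest sketch, assuming a future "0.29.1" tag (not released at the time of this PR):

```swift
// swift-tools-version:5.9
import PackageDescription

// Sketch: once an mlx-swift 0.29.1 tag exists, the temporary branch
// dependency can return to a version pin. "0.29.1" is an assumption here.
let package = Package(
    name: "Example",
    dependencies: [
        .package(url: "https://github.com/ml-explore/mlx-swift", .upToNextMinor(from: "0.29.1"))
    ]
)
```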
```diff
+    static public let gpt_oss_20b_MXFP4_Q8 = ModelConfiguration(
+        id: "mlx-community/gpt-oss-20b-MXFP4-Q8",
+        defaultPrompt: "Why is the sky blue?"
+    )
```
The MXFP4 quantization is now supported. This model was used to test that and the quantized KV cache.
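As a hedged sketch (not part of this diff), exercising the test model might look like the following; `LLMModelFactory.loadContainer` and `ModelContainer` are assumed from the surrounding codebase, and generation details are omitted:

```swift
import MLXLLM
import MLXLMCommon

// Sketch: load the MXFP4-quantized test model by id. Assumes the
// LLMModelFactory API from this repo; generation code is omitted.
func loadTestModel() async throws -> ModelContainer {
    let config = ModelConfiguration(
        id: "mlx-community/gpt-oss-20b-MXFP4-Q8",
        defaultPrompt: "Why is the sky blue?"
    )
    return try await LLMModelFactory.shared.loadContainer(configuration: config)
}
```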
```diff
 class QuantizedSwitchLinear: SwitchLinear, Quantized {
     @ModuleInfo(key: "scales") var scales: MLXArray
-    @ModuleInfo(key: "biases") var biases: MLXArray
+    @ModuleInfo(key: "biases") var biases: MLXArray?
```
Biases are now optional.
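A minimal sketch of the pattern (not the repo's actual `QuantizedSwitchLinear`): with an optional `@ModuleInfo` parameter, a checkpoint that ships no `biases` tensor (such as MXFP4 weights) loads without a missing-key error, while affine-quantized checkpoints still populate it.

```swift
import MLX
import MLXNN

// Sketch only: an optional @ModuleInfo parameter lets `biases`
// be absent from the checkpoint.
class ExampleQuantized: Module {
    @ModuleInfo(key: "scales") var scales: MLXArray
    @ModuleInfo(key: "biases") var biases: MLXArray?

    init(scales: MLXArray, biases: MLXArray? = nil) {
        self._scales.wrappedValue = scales
        self._biases.wrappedValue = biases
        super.init()
    }
}
```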
```diff
     public let bits: Int
-    public var quantMethod: String? = nil
-    public var linearClass: String? = nil
-    public var quantizationMode: String? = nil
```
These were defined so that they could be skipped (below). We can just skip them directly.
```diff
+            // additional keys that are not layer instructions, see
+            // mlx-community/bitnet-b1.58-2B-4T-4bit
+            case "quant_method", "linear_class", "quantization_mode": continue
```
Skip directly.
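A self-contained sketch of the skip-directly pattern in plain Codable (names are hypothetical, and per-layer values are simplified to `Bool`): when walking a quantization dictionary with dynamic per-layer keys, the known metadata keys are filtered in the loop instead of being modeled as stored properties.

```swift
import Foundation

// Hypothetical dynamic coding key for walking a JSON object
// whose keys are arbitrary per-layer paths.
struct DynamicKey: CodingKey {
    let stringValue: String
    let intValue: Int? = nil
    init?(stringValue: String) { self.stringValue = stringValue }
    init?(intValue: Int) { return nil }
}

// Sketch: decode per-layer flags, skipping metadata keys inline.
// See mlx-community/bitnet-b1.58-2B-4T-4bit for a config that carries them.
func decodePerLayer(from decoder: Decoder) throws -> [String: Bool] {
    let container = try decoder.container(keyedBy: DynamicKey.self)
    var layers: [String: Bool] = [:]
    for key in container.allKeys {
        switch key.stringValue {
        // additional keys that are not layer instructions
        case "quant_method", "linear_class", "quantization_mode":
            continue
        default:
            layers[key.stringValue] = try container.decode(Bool.self, forKey: key)
        }
    }
    return layers
}
```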
```diff
     /// - Returns: Quantized tuples (keys, values) as ((weight, scales, biases), (weight, scales, biases))
     func updateQuantized(keys: MLXArray, values: MLXArray) -> (
-        (MLXArray, MLXArray, MLXArray), (MLXArray, MLXArray, MLXArray)
+        (MLXArray, MLXArray, MLXArray?), (MLXArray, MLXArray, MLXArray?)
```
Support for optional biases.
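A small illustration of consuming the widened return type (hypothetical helper, not in the diff); with MXFP4-style quantization the third tuple element is expected to be nil:

```swift
import MLX

// Sketch: unpack one (weight, scales, biases) triple where biases may be nil.
func describe(_ q: (MLXArray, MLXArray, MLXArray?)) -> String {
    let (weight, scales, biases) = q
    if let biases {
        return "affine: weight \(weight.shape), scales \(scales.shape), biases \(biases.shape)"
    }
    return "scale-only: weight \(weight.shape), scales \(scales.shape)"
}
```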
```diff
-        case groupSize = "group_size"
-        case bits
-    }
-}
```
We can just use the standard configuration for this.
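For reference, a standard group-size/bits configuration of the kind the comment refers to might look like the following; the type name is illustrative, not necessarily the repo's actual declaration:

```swift
// Sketch of a shared quantization configuration with snake_case JSON keys,
// removing the need for per-model copies of these CodingKeys.
public struct QuantizationConfig: Codable, Sendable {
    public let groupSize: Int
    public let bits: Int

    enum CodingKeys: String, CodingKey {
        case groupSize = "group_size"
        case bits
    }
}
```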
```diff
         let fullPath = "language_model.\(path)"
-        if weights["\(fullPath).scales"] != nil && weights["\(fullPath).biases"] != nil
+        if weights["\(fullPath).scales"] != nil
+            && weights["\(fullPath).weight"]?.dtype == .uint32
```
Biases are now optional -- handle that case. I am not sure this is strictly required, since the load() method handles this, though not with this exact logic.
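The new condition in helper form (hypothetical function, mirroring the diff): the presence of scales plus a uint32-packed weight marks a layer as quantized, without depending on a biases tensor.

```swift
import MLX

// Sketch: detect a quantized layer from flattened weight paths.
func isQuantizedLayer(_ weights: [String: MLXArray], path: String) -> Bool {
    weights["\(path).scales"] != nil
        && weights["\(path).weight"]?.dtype == .uint32
}
```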
Looks great. You added mode support in the QKV cache, which is a step beyond mlx-lm 😄
- handle changes in quantization
Support for mlx-swift 0.29.1. In particular, this contains updates for the changes in quantization.