Skip to content

Releases: ml-explore/mlx-swift

0.29.1

16 Oct 18:14
072b684

Choose a tag to compare

Integrate mlx 0.29.1

NOTE

This change contains some breaking API changes in the area of quantization. Specifically:

  • the quantized / dequantized methods now take a mode parameter (not breaking)
  • the biases result from quantized is now optional, e.g. (wq: MLXArray, scales: MLXArray, biases: MLXArray?)

We are keeping the same semver here to match with python mlx. Although the change is breaking, it will likely be limited to implementations of quantized layers, e.g. QuantizedLinear, or other code that uses quantization directly. mlx-swift-examples will have a synchronized release to reflect this change.

If you need to make a similar change, consider the changes from QuantizedLinear:

The properties changed from this:

    public let scales: MLXArray
    public let biases: MLXArray

to:

    public let mode: QuantizationMode
    public let scales: MLXArray
    public let biases: MLXArray?

A mode with parameter with a default value was added where needed: mode: QuantizationMode = .affine and the mode parameter was used in calls to the quantization APIs:

        var x = quantizedMatmul(
            x,
            weight,
            scales: scales,
            biases: biases,
            transpose: true,
            groupSize: groupSize,
            bits: bits,
            mode: mode
        )

and the Quantizable protocol was updated to have a mode parameter (protocol methods can't have default values):

    /// Return the module as a quantized representation
    func toQuantized(groupSize: Int, bits: Int, mode: QuantizationMode) -> Module

What's Changed

New Contributors

Full Changelog: 0.25.6...0.29.1

0.25.6

14 Jul 22:42
96c0f69

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 0.25.5...0.25.6

0.25.5

24 Jun 20:01
b79c74c

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 0.25.4...0.25.5

0.25.4

02 Jun 15:22
b94473a

Choose a tag to compare

What's Changed

Full Changelog: 0.25.3...0.25.4

0.25.3

16 May 23:39
d3f89b1

Choose a tag to compare

What's Changed

  • update README and examples by @davidkoski in #232
  • manage object lifetime in the face of more aggressive inlining and optimizations by @davidkoski in #239
  • if no error handler set produce a fatal error (like the default used to be) by @davidkoski in #238

Full Changelog: 0.25.2...0.25.3

0.25.2

07 May 23:11
a8f7411

Choose a tag to compare

What's Changed

Full Changelog: 0.25.1...0.25.2

0.25.1

02 May 17:03
5976316

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 0.23.1...0.25.1

0.23.1

24 Mar 20:28
939b2ad

Choose a tag to compare

Significant Changes

What's Changed

New Contributors

Full Changelog: 0.21.3...0.23.1

0.21.3

10 Feb 20:57
b990c58

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 0.21.2...0.21.3

0.21.2

05 Dec 16:20
70dbb62

Choose a tag to compare

What's Changed

Full Changelog: 0.21.1...0.21.2