address issues that prevent using composition for layers like LoRA #177
base: main
Conversation

davidkoski commented Dec 17, 2024
- see exploration of LoRA using composition: ml-explore/mlx-swift-examples#167
- also fixes an issue where quantize() could quantize an already-quantized layer!
```diff
@@ -73,7 +73,7 @@ open class Linear: Module, UnaryLayer, Quantizable {
     public let weight: MLXArray
     public let bias: MLXArray?

-    public var shape: (Int, Int) {
+    open var shape: (Int, Int) {
```
A lot of the changes here are `public` -> `open`, to allow subclasses to override.
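A minimal standalone sketch of the distinction (illustrative types, not the actual MLX code): `open` members can be overridden from another module, while `public` members can only be used, not overridden.

```swift
// Hypothetical example: `open` makes the property overridable from
// outside the defining module; `public` alone would not.
open class BaseLinear {
    open var shape: (Int, Int) { (4, 8) }
}

// A subclass -- e.g. a wrapper layer in a downstream package -- can now
// override the property.
final class TransposedLinear: BaseLinear {
    override var shape: (Int, Int) { (8, 4) }
}
```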
```diff
-    /// ``unfreeze(recursive:keys:strict:)``.
-    public private(set) var noGrad = Set<String>()
+    /// See ``noGrad()``
+    private var _noGrad = Set<String>()
```
This is a breaking change, but the property is unlikely to be used directly (there are methods to manipulate it). Subclasses can't override stored properties (they can override computed properties and methods), so the stored property is replaced with methods that can be overridden.
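A sketch of the pattern being described (names are illustrative, not the exact MLX API): the stored set becomes private, and access goes through `open` methods that subclasses can override.

```swift
// Illustrative sketch: stored properties cannot be overridden by a
// subclass, but `open` accessor methods can be.
open class ModuleSketch {
    private var _noGrad = Set<String>()

    // Subclasses may override these to customize behavior.
    open func noGrad() -> Set<String> { _noGrad }
    open func setNoGrad(_ values: Set<String>) { _noGrad = values }
}
```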
```swift
/// - Parameters:
///   - key: module key, see ``ModuleInfo``
///   - value: the replacement module
open func updateModule(key: String, _ value: Any) throws {
```
Primarily exposed for subclasses to override (see test case)
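A self-contained sketch of why this matters (simplified types, hypothetical names): because the method is `open`, a subclass can intercept module replacement, e.g. to record or remap keys, while delegating the real work to `super`.

```swift
// Illustrative sketch of overriding an `open` update method.
open class ContainerSketch {
    var modules: [String: Any] = [:]
    open func updateModule(key: String, _ value: Any) throws {
        modules[key] = value
    }
}

final class RecordingContainer: ContainerSketch {
    var updatedKeys: [String] = []
    override func updateModule(key: String, _ value: Any) throws {
        updatedKeys.append(key)  // observe the replacement
        try super.updateModule(key: key, value)
    }
}
```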
```diff
@@ -922,7 +969,7 @@ extension Module: Updatable, Evaluatable {
 /// ### See Also
 /// - <doc:layers>
 /// - ``Sequential``
-public protocol UnaryLayer {
+public protocol UnaryLayer: Module {
```
Per observation in ml-explore/mlx-swift-examples#167
```swift
}

public func quantizeSingle(layer: Module, groupSize: Int = 64, bits: Int = 4) -> Quantized? {
    if layer is Quantized {
```
Observed in the past and in the same area -- do not quantize already quantized layers.
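A minimal standalone sketch of the guard (simplified stand-in types, not the MLX ones): once a layer already conforms to the quantized marker protocol, return `nil` rather than quantizing it a second time.

```swift
// Illustrative types standing in for Module / Quantized / QuantizedLinear.
protocol QuantizedSketch {}
class LayerSketch {}
class QuantizedLinearSketch: LayerSketch, QuantizedSketch {}

func quantizeSingleSketch(layer: LayerSketch) -> QuantizedSketch? {
    if layer is QuantizedSketch {
        return nil  // already quantized -- do not quantize again
    }
    return QuantizedLinearSketch()
}
```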
```swift
@ModuleInfo(key: "query_proj") public var queryProjection: UnaryLayer
@ModuleInfo(key: "key_proj") public var keyProjection: UnaryLayer
@ModuleInfo(key: "value_proj") public var valueProjection: UnaryLayer
@ModuleInfo(key: "out_proj") public var outProjection: UnaryLayer
```
This is how we would change e.g. attention layers if we wanted to use composition for LoRA.
```swift
open override var shape: (Int, Int) {
    let shape = weight.shape2
    return (shape.0, shape.1 * 32 / bits)
}
```
This is also a breaking change, but IMHO it was broken before: it returned the shape of the quantized (packed) arrays, which was not useful.
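A worked example of the computation above, extracted into a standalone helper (hypothetical name): with 4-bit quantization each `UInt32` packs 32 / 4 = 8 values, so a packed width of 128 corresponds to an unquantized width of 128 * 32 / 4 = 1024.

```swift
// Same arithmetic as the overridden `shape` above, as a free function
// for illustration.
func dequantizedShape(packed: (Int, Int), bits: Int) -> (Int, Int) {
    (packed.0, packed.1 * 32 / bits)
}
```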
```swift
// like Linear/QuantizedLinear) but this verifies that it can
// be written this way.

class LoRA: Module, UnaryLayer {
```
A simple implementation of LoRA using composition; see ml-explore/mlx-swift-examples#167. This isn't meant as a reference implementation (though it may grow into one), but rather a test of the subclassing support to make sure we have it all covered.
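A hedged, self-contained sketch of the LoRA-by-composition idea, with `Double` standing in for `MLXArray` and a plain protocol standing in for `UnaryLayer`: the wrapper owns the (frozen) base layer plus a small trainable delta, and computes `base(x) + scale * delta(x)`.

```swift
// Toy stand-ins for UnaryLayer / Linear; not the MLX types.
protocol UnarySketch { func callAsFunction(_ x: Double) -> Double }

struct LinearSketch: UnarySketch {
    var w: Double
    func callAsFunction(_ x: Double) -> Double { w * x }
}

struct LoRASketch: UnarySketch {
    var base: UnarySketch   // wrapped layer, left untouched
    var delta: UnarySketch  // low-rank adapter stand-in
    var scale: Double
    func callAsFunction(_ x: Double) -> Double {
        base(x) + scale * delta(x)
    }
}
```

Because the wrapper conforms to the same protocol as the layer it wraps, it can be dropped into any protocol-typed slot (like the `query_proj`/`key_proj` properties shown earlier) without the containing module knowing the difference.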