Make LLMModelFactory and VLMModelFactory inits public #226

ibrahimcetin · 2025-03-07T11:45:05Z

This PR makes LLMModelFactory and VLMModelFactory inits public.

This PR is not ready yet, and I want to discuss a few points:

I created a shared instance on the registry types. Is that a good approach? At first I thought init() should create the default registry, but I don't think that is a good approach because people are going to expect an empty registry.
Currently there are two ModelRegistry classes that are identical. I think we should merge them into one in the MLXLMCommon package. Maybe we can make it a base class and create LLMRegistry and VLMRegistry separately as subclasses of ModelRegistry. (Same for ModelTypeRegistry.)
I didn't make this change, but we could use String(describing:) for ProcessorTypeRegistry registration. For example:

public func registerProcessorType<T>(
    _ type: T.Type,
    creator: @Sendable @escaping (
        URL,
        any Tokenizer
    ) throws -> any UserInputProcessor
) {
    let typeName = String(describing: type)
    lock.withLock {
        creators[typeName] = creator
    }
}

This is just an illustration, but using String(describing:) would prevent typos in the registered names and make the library more type safe, IMO.
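A minimal, self-contained sketch of that idea follows. The `Processor` protocol, `TypeKeyedRegistry`, and `DemoProcessor` are stand-ins invented for illustration (the real registry uses `UserInputProcessor` and also passes a tokenizer to the creator); only `String(describing:)` and `NSLock` are real APIs here.

```swift
import Foundation

// Stand-in for the library's UserInputProcessor protocol (assumption).
protocol Processor {}

// Hypothetical processor type used only for this example.
struct DemoProcessor: Processor {
    let configURL: URL
}

// Hypothetical registry keyed by type name instead of a hand-written string.
final class TypeKeyedRegistry: @unchecked Sendable {
    typealias Creator = @Sendable (URL) throws -> any Processor

    private let lock = NSLock()
    private var creators = [String: Creator]()

    // The key is derived from the type, so it cannot be misspelled at the call site.
    func registerProcessorType<T>(_ type: T.Type, creator: @escaping Creator) {
        let typeName = String(describing: type)
        lock.withLock { creators[typeName] = creator }
    }

    // Look up a creator by the name found in a configuration file.
    func createProcessor(named name: String, configURL: URL) throws -> (any Processor)? {
        let creator = lock.withLock { creators[name] }
        return try creator?(configURL)
    }
}

let registry = TypeKeyedRegistry()
registry.registerProcessorType(DemoProcessor.self) { url in DemoProcessor(configURL: url) }

let url = URL(fileURLWithPath: "/tmp/preprocessor_config.json")
let processor = try registry.createProcessor(named: "DemoProcessor", configURL: url)
print(processor is DemoProcessor)
```

The lookup still uses a plain string because the name comes from a configuration file; only the registration side benefits from the type-derived key.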

In conclusion, I want to be sure this approach is acceptable. If so, I will do the same for the remaining parts.

davidkoski · 2025-03-07T18:52:14Z

Libraries/MLXVLM/VLMModelFactory.swift

+
+        instance.registerProcessorType("PaliGemmaProcessor", creator: create(PaliGemmaProcessorConfiguration.self, PaligGemmaProcessor.init))
+        instance.registerProcessorType("Qwen2VLProcessor", creator: create(Qwen2VLProcessorConfiguration.self, Qwen2VLProcessor.init))
+        instance.registerProcessorType("Idefics3Processor", creator: create(Idefics3ProcessorConfiguration.self, Idefics3Processor.init))


This makes sense to me and I agree with your thinking in the description. Two thoughts on improvements:

This could look more like a table (like the deleted code below), and I think it will be more obvious for people to edit.

Perhaps the table should be extracted as a private let at the top level. We can put all the tables at the top of the file so it is even more obvious what people need to edit to register these built-in models.
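As a rough illustration of the table idea, the defaults could live in one file-level constant that the shared registry is built from. Everything below uses simplified stand-in types (`Processor`, `ProcessorRegistry`, and the two demo processors are hypothetical); the real creators also take a configuration URL and a tokenizer.

```swift
import Foundation

// Stand-ins for the library types (assumptions for illustration only).
protocol Processor {}
struct AlphaProcessor: Processor {}
struct BetaProcessor: Processor {}

typealias Creator = @Sendable () -> any Processor

// The "table": one entry per built-in type, at the top of the file,
// so contributors can see exactly what to edit.
private let defaultProcessorCreators: [String: Creator] = [
    "AlphaProcessor": { AlphaProcessor() },
    "BetaProcessor": { BetaProcessor() },
]

final class ProcessorRegistry: @unchecked Sendable {
    // Shared instance pre-populated from the table above.
    static let shared = ProcessorRegistry(creators: defaultProcessorCreators)

    private let lock = NSLock()
    private var creators: [String: Creator]

    // Creates an empty registry.
    init() { creators = [:] }

    // Creates a registry from a table of creators.
    init(creators: [String: Creator]) { self.creators = creators }

    // Names currently registered, sorted for stable output.
    var registeredNames: [String] {
        lock.withLock { creators.keys.sorted() }
    }
}

print(ProcessorRegistry.shared.registeredNames)  // ["AlphaProcessor", "BetaProcessor"]
print(ProcessorRegistry().registeredNames)       // []
```

With this layout, `init()` stays empty as people would expect, and `shared` is the only place that pulls in the defaults.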

I wonder if we would want a way to copy shared in case you wanted a private registry but starting with the default types? Or perhaps you just use shared in that case. The use case I am imagining is you have a chat application that can use any of these shared models but you also have a private model in your app. In that case you want a merge and updating the global would be fine.

If you provided a framework / swiftpm library and wanted to register your types perhaps you would have a static method that took a VLMModelFactory and updated the registries on that. That would allow you to add to a pre-populated registry or an empty one.
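One way to picture this: a third-party package exposes a single static entry point that accepts a factory and registers its own types on it. The sketch below uses simplified stand-ins (`ModelFactory`, `TypeRegistry`, and `MyPackage` are hypothetical names, not the library's API, and the registered strings are only examples) to show that the same call works on a pre-populated factory and on an empty one.

```swift
import Foundation

// Simplified stand-in for a type registry (hypothetical).
final class TypeRegistry: @unchecked Sendable {
    private let lock = NSLock()
    private var names: Set<String>

    init(names: Set<String> = []) { self.names = names }

    func register(_ name: String) {
        lock.withLock { _ = names.insert(name) }
    }

    // Registered names, sorted for stable output.
    var all: [String] { lock.withLock { names.sorted() } }
}

// Simplified stand-in for VLMModelFactory with a public init (hypothetical).
final class ModelFactory: Sendable {
    let typeRegistry: TypeRegistry
    init(typeRegistry: TypeRegistry = TypeRegistry()) { self.typeRegistry = typeRegistry }
}

// What a framework or SwiftPM library might ship.
enum MyPackage {
    static func registerTypes(on factory: ModelFactory) {
        factory.typeRegistry.register("my_private_model")
    }
}

let prepopulated = ModelFactory(typeRegistry: TypeRegistry(names: ["paligemma", "qwen2_vl"]))
let empty = ModelFactory()

MyPackage.registerTypes(on: prepopulated)
MyPackage.registerTypes(on: empty)

print(prepopulated.typeRegistry.all)  // ["my_private_model", "paligemma", "qwen2_vl"]
print(empty.typeRegistry.all)         // ["my_private_model"]
```

Because the caller chooses which factory to pass in, the package never has to decide between mutating a global and building a private registry.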

So I think we don't need a way to copy() a registry -- if something comes up later we can revisit.

See what you think about moving the default registry to a table at the top of the file.

davidkoski · 2025-03-07T18:55:36Z

I created a shared instance on the registry types. Is that a good approach? At first I thought init() should create the default registry, but I don't think that is a good approach because people are going to expect an empty registry.

See comment in the code -- I think shared is reasonable and it lets people construct different variations.

Currently there are two ModelRegistry classes that are identical. I think we should merge them into one in the MLXLMCommon package. Maybe we can make it a base class and create LLMRegistry and VLMRegistry separately as subclasses of ModelRegistry. (Same for ModelTypeRegistry.)

A base class makes sense, something like AbstractModelRegistry and the VLM/LLM variants just subclass that. I think the piece that blocked this in the past was how to handle the default registry, but a shared static might deal with that.

Right now the ModelRegistry is a place where the static model configurations live, e.g. qwen2VL2BInstruct4Bit and I think that is useful.

I didn't make this change, but we could use String(describing:) for ProcessorTypeRegistry registration. For example:

public func registerProcessorType<T>(
        _ type: T.Type,
        creator: @Sendable @escaping (
            URL,
            any Tokenizer
        ) throws -> any UserInputProcessor

So this would be called like this:

registry.registerProcessorType(PaligGemmaProcessor.self) { url, tokenizer in
    let config = try JSONDecoder().decode(PaliGemmaProcessorConfiguration.self, from: Data(contentsOf: url))
    return PaligGemmaProcessor(config, tokenizer: tokenizer)
}

It looks like String(describing:) gives just the type name without the module (good in this case):

  2> import Foundation
  3> Date.self
$R1: Foundation.Date.Type = Foundation.Date
  4> String(describing: Date.self)
$R2: String = "Date"

A few concerns:

I think we still need the String version of this call to handle cases where the type name and the string in the config don't match. I don't know for certain that this will happen, as the string seems to be a Python type name, but I think we should be prepared for it.
There isn't a tie between the type passed and the actual processor that is created. Perhaps we can add one with an adjustment to the signature (below).

public func registerProcessorType<T>(
    _ type: T.Type,
    creator: @Sendable @escaping (
        URL,
        any Tokenizer
    ) throws -> T
)

If we can enforce the type name and the type matching like that I think this looks valuable.
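A hedged sketch of how both concerns could be addressed together: constrain `T` so the creator must return the type whose name is used as the key, and keep a string-keyed overload for names that do not match a Swift type. `Processor`, `Registry`, and `DemoProcessor` are simplified stand-ins, and the creator takes no arguments here to keep the example short.

```swift
import Foundation

// Stand-ins for UserInputProcessor and a concrete processor (assumptions).
protocol Processor {}
struct DemoProcessor: Processor {}

// Hypothetical registry with both registration styles.
final class Registry: @unchecked Sendable {
    typealias Creator = @Sendable () throws -> any Processor

    private let lock = NSLock()
    private var creators = [String: Creator]()

    // String-keyed overload: still needed when the name in the config
    // file differs from the Swift type name.
    func registerProcessorType(_ name: String, creator: @escaping Creator) {
        lock.withLock { creators[name] = creator }
    }

    // Type-keyed overload: the creator must return T, so the key and the
    // created type cannot drift apart.
    func registerProcessorType<T: Processor>(
        _ type: T.Type,
        creator: @Sendable @escaping () throws -> T
    ) {
        registerProcessorType(String(describing: type)) { try creator() }
    }

    func create(_ name: String) throws -> (any Processor)? {
        let creator = lock.withLock { creators[name] }
        return try creator?()
    }
}

let registry = Registry()
registry.registerProcessorType(DemoProcessor.self) { DemoProcessor() }
registry.registerProcessorType("LegacyPythonName") { DemoProcessor() }

let typed = try registry.create("DemoProcessor")
let legacy = try registry.create("LegacyPythonName")
print(typed is DemoProcessor, legacy is DemoProcessor)
```

With the generic overload, a creator that returns some other processor type is rejected at compile time, which is the tie between name and type asked for above.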

ibrahimcetin · 2025-03-07T22:44:57Z

Right now the ModelRegistry is a place where the static model configurations live, e.g. qwen2VL2BInstruct4Bit and I think that is useful.

I agree. To clarify my thinking, I would like to add some example code here:

Whenever we add new features to ModelRegistry, we have to duplicate them in the current implementation. If we move it to MLXLMCommon, we can define LLMRegistry and VLMRegistry in MLXLLM and MLXVLM respectively.

// in MLXLMCommon
// `open` (not just `public`) so MLXLLM and MLXVLM can subclass it from another module.
open class ModelRegistry: @unchecked Sendable {
    /// Creates an empty registry.
    public init() {
        registry = Dictionary()
    }

    /// Creates a new registry from the given model configurations.
    public init(modelConfigurations: [ModelConfiguration]) {
        registry = Dictionary(uniqueKeysWithValues: modelConfigurations.map { ($0.name, $0) })
    }

    private let lock = NSLock()
    private var registry: Dictionary<String, ModelConfiguration>

    public func register(configurations: [ModelConfiguration]) {
        lock.withLock {
            for c in configurations {
                registry[c.name] = c
            }
        }
    }

    public func configuration(id: String) -> ModelConfiguration {
        lock.withLock {
            if let c = registry[id] {
                return c
            } else {
                return ModelConfiguration(id: id)
            }
        }
    }

    public var models: some Collection<ModelConfiguration> & Sendable {
        lock.withLock {
            return registry.values
        }
    }
}

// in MLXVLM
public class VLMRegistry: ModelRegistry {
    /// Shared instance with default model configurations.
    public static let shared = ModelRegistry(modelConfigurations: all())

    static private func all() -> [ModelConfiguration] {
        [
            paligemma3bMix448_8bit,
            qwen2VL2BInstruct4Bit,
        ]
    }

    static public let paligemma3bMix448_8bit = ModelConfiguration(
        id: "mlx-community/paligemma-3b-mix-448-8bit",
        defaultPrompt: "Describe the image in English"
    )

    static public let qwen2VL2BInstruct4Bit = ModelConfiguration(
        id: "mlx-community/Qwen2-VL-2B-Instruct-4bit",
        defaultPrompt: "Describe the image in English"
    )

    static public let smolvlminstruct4bit = ModelConfiguration(
        id: "mlx-community/SmolVLM-Instruct-4bit",
        defaultPrompt: "Describe the image in English"
    )
}

// in MLXLLM
class LLMRegistry: ModelRegistry {
    // Same as MLXVLM
}

This is just to avoid code duplication. What do you think?

davidkoski · 2025-03-07T23:29:35Z

This is just to avoid code duplication. What do you think?

Yes, looks great!

ibrahimcetin · 2025-03-08T02:24:23Z

@davidkoski Thanks for the feedback. I want to limit this PR to these changes, and it is ready now. I will move ModelRegistry and ModelTypeRegistry to MLXLMCommon after this PR is merged. Then I will add a contains(id:) method to ModelRegistry, as discussed in #224.
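The contains(id:) method is not part of this PR. One possible shape, assuming the registry keeps its dictionary behind a lock as in the example code earlier in this thread, with a stand-in `ModelConfiguration` so the snippet is self-contained:

```swift
import Foundation

// Stand-in for the library's ModelConfiguration (assumption).
struct ModelConfiguration: Sendable {
    let id: String
    var name: String { id }
}

final class ModelRegistry: @unchecked Sendable {
    private let lock = NSLock()
    private var registry: [String: ModelConfiguration]

    init(modelConfigurations: [ModelConfiguration] = []) {
        registry = Dictionary(uniqueKeysWithValues: modelConfigurations.map { ($0.name, $0) })
    }

    // Falls back to a fresh configuration for unknown ids, so the return
    // value alone cannot tell callers whether an id was registered.
    func configuration(id: String) -> ModelConfiguration {
        lock.withLock { registry[id] ?? ModelConfiguration(id: id) }
    }

    // Hypothetical addition: report whether an id is registered.
    func contains(id: String) -> Bool {
        lock.withLock { registry[id] != nil }
    }
}

let registry = ModelRegistry(modelConfigurations: [
    ModelConfiguration(id: "mlx-community/Qwen2-VL-2B-Instruct-4bit")
])
print(registry.contains(id: "mlx-community/Qwen2-VL-2B-Instruct-4bit"))  // true
print(registry.contains(id: "not-registered"))                           // false
```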

davidkoski · 2025-03-08T05:45:15Z

CI failed on the swift-format check. Can you please run the formatter as described here:

https://github.com/ml-explore/mlx-swift-examples/blob/main/CONTRIBUTING.md#pull-requests

ibrahimcetin · 2025-03-08T05:53:27Z

@davidkoski Done

davidkoski

Thank you for the additions!

ibrahimcetin added 3 commits March 7, 2025 11:02

Make LLMModelFactory and VLMModelFactory init public

7569337

Make ModelRegistry init public

8d6c71d

Make ProcessorTypeRegistry init public

19b71f2

davidkoski reviewed Mar 7, 2025

ibrahimcetin added 2 commits March 8, 2025 01:10

Add init(creators:) to ProcessorTypeRegistry and update corresponding…

0bf5cd2

… codes

Remove unnecessary code

c9faed9

ibrahimcetin added 2 commits March 8, 2025 05:15

Make ModelTypeRegistry init public in LLMModelFactory

e602abb

Make ModelTypeRegistry init public in VLMModelFactory

ce163e3

Run formatter

bedb51a

davidkoski approved these changes Mar 8, 2025

davidkoski merged commit 3885b92 into ml-explore:main Mar 8, 2025
3 checks passed
