fix: inject the HubApi #209

matiasvillaverde · 2025-02-20T14:44:08Z

Improved Progress Tracking:

The PR adds structured progress reporting with specific percentages for each loading phase (download, config loading, weight loading, tokenizer loading)
Instead of just forwarding the download progress, it breaks down the entire loading process into meaningful stages
For example: 0-30% for download, 30-50% for config loading, 50-80% for weights, and 80-100% for tokenizer

Custom Model Loading Support:

Adds direct support for loading models from custom file paths with the directURL property
This allows loading models from local directories without requiring them to be in the Hub structure

Consistent HubApi Injection:

The PR makes HubApi injection more consistent across different factory methods
Ensures HubApi is passed to all relevant model creation functions

davidkoski · 2025-02-26T18:56:53Z

Can you explain more what the intent is here?

I see some code to give custom Progress for the different parts of loading the model -- does this override the download progress? I am not sure if these nest properly because it isn't set as the "current"

I am not sure what is meant by injecting Hub -- it is already taken as a parameter.

Thanks!

matiasvillaverde · 2025-02-28T10:24:55Z

@davidkoski It is a small change to better track loading progress and to support local URLs.

The PR doesn't override the download progress - it includes it as part of a larger progress structure, with the download being the first 30% of the total progress if I am not mistaken. The approach should nest properly as it's creating a new Progress object with a total unit count of 100 and updating it as each phase completes.

The "injecting Hub" part refers to ensuring the HubApi instance is properly passed through to all the necessary functions that need it, particularly in cases like the StableDiffusion module where it was inconsistently passed.

davidkoski · 2025-03-04T19:49:33Z

Libraries/StableDiffusion/Load.swift

-/// - ``presetSDXLTurbo``
-/// - ``presetStableDiffusion21Base``
+/// - presetSDXLTurbo
+/// - presetStableDiffusion21Base


Why remove the backquotes? With them we get links to the symbols

davidkoski · 2025-03-04T20:58:24Z

The PR doesn't override the download progress - it includes it as part of a larger progress structure, with the download being the first 30% of the total progress if I am not mistaken. The approach should nest properly as it's creating a new Progress object with a total unit count of 100 and updating it as each phase completes.

Got it -- I just ran with a download and could see the progress going.

davidkoski · 2025-03-04T20:59:04Z

Libraries/MLXLLM/LLMModelFactory.swift

    public static let shared = LLMModelFactory()
-
-    /// registry of model type, e.g. configuration value `llama` -> configuration and init methods


Why remove documentation?

davidkoski · 2025-03-04T21:00:16Z

Libraries/MLXLLM/LLMModelFactory.swift


-        // load the generic config to unerstand which model and how to load the weights


I think this comment (minus maybe the typo) is still good to keep around -- the two phase config loading is otherwise a bit mysterious.

davidkoski · 2025-03-04T21:02:54Z

Libraries/StableDiffusion/Load.swift

-/// ``textToImageGenerator(hub:configuration:)`` or
-/// ``imageToImageGenerator(hub:configuration:)`` to produce the ``ImageGenerator``.
+/// For custom model locations, use:
+/// swift /// let config = StableDiffusionConfiguration.custom( ///     baseURL: URL(fileURLWithPath: "/path/to/model"), ///     isXL: true  // true for SDXL, false for base model /// ) ///


This comment looks mangled -- it should probably be something like:

/// ```swift /// let config ... /// ```

davidkoski · 2025-03-04T21:03:54Z

Libraries/StableDiffusion/StableDiffusion.swift

@@ -127,9 +127,11 @@ public actor ModelContainer<M> {

    /// create a ``ModelContainer`` that supports ``TextToImageGenerator``
    static public func createTextToImageGenerator(
-        configuration: StableDiffusionConfiguration, loadConfiguration: LoadConfiguration = .init()
+        hub: HubApi = HubApi(),


I see -- this is the one you mentioned that was missing 👍

davidkoski

Let me know why some documentation is removed (I think it is probably good to have) and why some links to symbols (the double backquotes) are removed. I would prefer to restore these if there isn't a compelling reason.

Thanks!

matiasvillaverde added 4 commits February 20, 2025 15:40

fix: inject the HubApi

7dfec6e

refactor: report progress when loading weights

600667c

feat: stable diffusion configuration can be loaded from a local URL

d8f5605

feat: report progress while loading the vlm model

e2a2a5b

davidkoski reviewed Mar 4, 2025

View reviewed changes

davidkoski requested changes Mar 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: inject the HubApi #209

fix: inject the HubApi #209

matiasvillaverde commented Feb 20, 2025 •

edited

Loading

davidkoski commented Feb 26, 2025

matiasvillaverde commented Feb 28, 2025

davidkoski Mar 4, 2025

davidkoski commented Mar 4, 2025

davidkoski Mar 4, 2025

davidkoski Mar 4, 2025

davidkoski Mar 4, 2025

davidkoski Mar 4, 2025

davidkoski left a comment

		public static let shared = LLMModelFactory()

		/// registry of model type, e.g. configuration value `llama` -> configuration and init methods


		// load the generic config to unerstand which model and how to load the weights

fix: inject the HubApi #209

Are you sure you want to change the base?

fix: inject the HubApi #209

Conversation

matiasvillaverde commented Feb 20, 2025 • edited Loading

davidkoski commented Feb 26, 2025

matiasvillaverde commented Feb 28, 2025

davidkoski Mar 4, 2025

Choose a reason for hiding this comment

davidkoski commented Mar 4, 2025

davidkoski Mar 4, 2025

Choose a reason for hiding this comment

davidkoski Mar 4, 2025

Choose a reason for hiding this comment

davidkoski Mar 4, 2025

Choose a reason for hiding this comment

davidkoski Mar 4, 2025

Choose a reason for hiding this comment

davidkoski left a comment

Choose a reason for hiding this comment

matiasvillaverde commented Feb 20, 2025 •

edited

Loading