
Models fail to load with keyNotFound error #218

Open
atdrendel opened this issue Feb 28, 2025 · 4 comments
@atdrendel

Related to #214, I believe.

Some models fail to load with keyNotFound errors. The Qwen1.5 and Qwen2.5 errors were solved in #210, but some errors still remain:

  • Phi-3.5-MoE-instruct-4bit: keyNotFound(base: "SuScaledRotaryEmbedding", key: "_freqs")
  • OpenELM-270M-Instruct: keyNotFound(base: "Linear", key: "weight")
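
A minimal, self-contained sketch of the failure mode (this is not the actual MLX Swift API; the type and function names below are illustrative): strict parameter-update validation checks every key the module reports against the loaded weights and errors on the first one it cannot find, which is roughly how a non-parameter buffer like _freqs, or an output projection that is never loaded, surfaces as keyNotFound.

    import Foundation

    // Illustrative error mirroring the messages in the report above.
    enum UpdateError: Error, CustomStringConvertible {
        case keyNotFound(base: String, key: String)

        var description: String {
            switch self {
            case .keyNotFound(let base, let key):
                return "keyNotFound(base: \"\(base)\", key: \"\(key)\")"
            }
        }
    }

    // Require every key the module expects to be present in the loaded weights.
    func validate(moduleName: String, expectedKeys: [String], loadedWeights: [String: [Float]]) throws {
        for key in expectedKeys {
            guard loadedWeights[key] != nil else {
                throw UpdateError.keyNotFound(base: moduleName, key: key)
            }
        }
    }

    // "_freqs" is a constant buffer that never appears in the checkpoint, so strict
    // validation fails in the same shape as the first error listed above.
    do {
        try validate(
            moduleName: "SuScaledRotaryEmbedding",
            expectedKeys: ["_freqs"],
            loadedWeights: [:]
        )
    } catch {
        print(error)  // keyNotFound(base: "SuScaledRotaryEmbedding", key: "_freqs")
    }
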
@davidkoski davidkoski self-assigned this Feb 28, 2025
@davidkoski
Collaborator

I will take a look -- probably a variation on the same issue.

davidkoski added a commit to davidkoski/mlx-swift that referenced this issue Mar 7, 2025
- do not fail parameter update validation for "invalid" keys (e.g. _freqs)
@davidkoski
Collaborator

Phi-3.5-MoE-instruct-4bit is interesting -- it has a rope layer, SuScaledRotaryEmbedding, whose _freqs value is not meant to be a parameter. Indeed it is not marked with @ParameterInfo; it is just a constant value.

    po self.rope.parameters()
    ▿ [
    ]
      - contents : 0 elements

and that is because:

    public func parameters() -> ModuleParameters {
        filterMap(filter: Self.filterValidParameters, map: Self.mapParameters())
    }

filters out parameters whose names have a leading underscore. I think we should probably not consider those for validation either.

That will be fixed with ml-explore/mlx-swift#200
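
A rough sketch of the idea, assuming validation walks the module's keys (the helper and names below are illustrative, not the real mlx-swift internals): keys with a leading underscore mark constants that parameters() already hides, so update validation can skip them the same way.

    // Illustrative helper, not the actual mlx-swift change: treat keys with a
    // leading underscore (like "_freqs") as non-parameter constants.
    func isValidParameterKey(_ key: String) -> Bool {
        !key.hasPrefix("_")
    }

    // Only the remaining keys would be checked against the loaded weights.
    let moduleKeys = ["_freqs", "weight", "bias"]
    let keysToValidate = moduleKeys.filter(isValidParameterKey)
    print(keysToValidate)  // ["weight", "bias"]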

@davidkoski
Collaborator

OpenELM-270M-Instruct has a couple of cases where layers should be optional but are not:

        var out = transformer(inputs, cache: cache)
        if shareInputOutputLayers {
            out = matmul(out, transformer.embedTokens.weight.T)
        } else {
            out = lmHead(out)
        }

lmHead is always present but is ignored depending on this config value. It should be:

        var out = transformer(inputs, cache: cache)
        if let lmHead {
            out = lmHead(out)
        } else {
            out = matmul(out, transformer.embedTokens.weight.T)
        }

There are some other layers in the model set up the same way.
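
A minimal, self-contained sketch of the init-side counterpart (the type and property names are stand-ins, not the actual OpenELM implementation in mlx-swift-examples): only create lmHead when the input/output embeddings are not shared, so a checkpoint without a separate output projection no longer trips validation.

    // Stand-in for a real linear layer; only its presence matters for the example.
    struct LinearSketch {
        var inputDimensions: Int
        var outputDimensions: Int
    }

    final class OpenELMSketch {
        let shareInputOutputLayers: Bool
        // nil when the output projection reuses the embedding weights.
        let lmHead: LinearSketch?

        init(hiddenSize: Int, vocabSize: Int, shareInputOutputLayers: Bool) {
            self.shareInputOutputLayers = shareInputOutputLayers
            self.lmHead = shareInputOutputLayers
                ? nil
                : LinearSketch(inputDimensions: hiddenSize, outputDimensions: vocabSize)
        }
    }

    // With sharing enabled, lmHead is nil and the `if let lmHead` branch above
    // falls through to the shared-embedding matmul.
    let model = OpenELMSketch(hiddenSize: 1280, vocabSize: 32000, shareInputOutputLayers: true)
    print(model.lmHead == nil)  // true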

davidkoski added a commit that referenced this issue Mar 7, 2025
- OpenELM had optional layers that were always created
- see #214
@davidkoski
Collaborator

I inspected the rest of the models for similar patterns but didn't see any.
