
Prefer chat_template.json for chat template #184

Merged: 2 commits merged into huggingface:main on Feb 27, 2025

Conversation

Opened by @DePasqualeOrg (Contributor)

@FL33TW00D (Collaborator):

Looks good to me @pcuenca

@pcuenca (Member) commented Feb 26, 2025:

Taking a look now

@pcuenca (Member) left a comment:

Thanks for another great contribution @DePasqualeOrg!

There are a couple of subtleties about chat templates that I'm willing to ignore (unless I'm missing something important), and some minor code changes that I think could make the code a bit cleaner – although this could admittedly be subjective.

Comment on lines 208 to 214:

    let tokenizerVocab = try hubApi.configuration(fileURL: modelFolder.appending(path: "tokenizer.json"))
    let configs = Configurations(
        modelConfig: modelConfig,
        tokenizerConfig: updatedConfig,
        tokenizerData: tokenizerVocab
    )
    return configs

@pcuenca (Member):

Not a fan of repeating this code block and the return here, inside the nested ifs.

I'd recommend we write a helper function to potentially update the chat template, such as:

    func updatedTokenizerConfig(tokenizerConfig: Config?, chatTemplateConfig: Config?) -> Config? {
        guard
            let chatTemplateConfig = chatTemplateConfig,
            let overrideChatTemplate = chatTemplateConfig.chatTemplate?.stringValue else {
            return tokenizerConfig
        }

        var configDict = tokenizerConfig?.dictionary ?? [:]
        configDict["chat_template"] = overrideChatTemplate
        return Config(configDict)
    }

And then we can just use this before the return:

        let configs = Configurations(
            modelConfig: modelConfig,
            tokenizerConfig: updatedTokenizerConfig(tokenizerConfig: tokenizerConfig, chatTemplateConfig: chatTemplateConfig),
            tokenizerData: tokenizerVocab
        )
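
For illustration only (this assembly is not verbatim from the PR; modelConfig, tokenizerConfig, and modelFolder are assumed from the surrounding function), the loading path could then collapse to a single unconditional return:

    // Hypothetical assembly of the pieces quoted in this thread: read
    // chat_template.json if present, let the helper decide whether to override
    // the tokenizer config, and return once instead of inside nested ifs.
    let chatTemplateConfig = try? hubApi.configuration(fileURL: modelFolder.appending(path: "chat_template.json"))
    let tokenizerVocab = try hubApi.configuration(fileURL: modelFolder.appending(path: "tokenizer.json"))
    return Configurations(
        modelConfig: modelConfig,
        tokenizerConfig: updatedTokenizerConfig(tokenizerConfig: tokenizerConfig, chatTemplateConfig: chatTemplateConfig),
        tokenizerData: tokenizerVocab
    )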

Comment on lines 214 to 215:

    XCTAssertTrue(qwen2VLEncoded == qwen2_5VLEncoded)
    XCTAssertTrue(qwen2VLDecoded == qwen2_5VLDecoded && qwen2_5VLDecoded == expectedOutput)

@pcuenca (Member):

Suggested change:

    -XCTAssertTrue(qwen2VLEncoded == qwen2_5VLEncoded)
    -XCTAssertTrue(qwen2VLDecoded == qwen2_5VLDecoded && qwen2_5VLDecoded == expectedOutput)
    +XCTAssertEqual(qwen2VLEncoded, qwen2_5VLEncoded, "Encoded sequences should be equal")
    +XCTAssertEqual(qwen2VLDecoded, qwen2_5VLDecoded, "Decoded sequences should be equal")
    +XCTAssertEqual(qwen2_5VLDecoded, expectedOutput, "Decoded should match expected")

Nit: these should provide better error messages; XCTAssertEqual reports both values on failure.

Comment on lines 195 to 196:

    // Check for chat_template.json, which contains the preferred chat template for vision language models
    if let chatTemplateConfig = try? hubApi.configuration(fileURL: modelFolder.appending(path: "chat_template.json")) {

@pcuenca (Member):

Technically, this is not the same algorithm used in transformers. IIRC, if we instantiate a tokenizer from a repo where the tokenizer has a chat template and a different chat_template.json, the template from the tokenizer will still be used. However, if we instantiate a processor, then the chat_template.json will be used.

I'm willing to diverge from this behaviour, given that:

  • Chat template divergence should not be expected (it's possibly a mistake if both templates differ).
  • Processor and tokenizer templates should be synced at some point.
  • There is no processor abstraction in swift-transformers.

cc @Rocketknight1 in case I'm missing some weird edge case.
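
For context on what this file contains: chat_template.json is simply a JSON object whose chat_template key holds the Jinja template string (the same field the code below checks for). A toy illustration of the shape only; the template shown here is made up, and the real Qwen2.5-VL template is much longer:

    {
      "chat_template": "{% for message in messages %}{{ message['role'] }}: {{ message['content'] }}\n{% endfor %}"
    }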

    // Check for chat_template.json, which contains the preferred chat template for vision language models
    if let chatTemplateConfig = try? hubApi.configuration(fileURL: modelFolder.appending(path: "chat_template.json")) {
        // If chat_template.json exists and contains a chat_template field, use it to override the tokenizer_config
        if let chatTemplate = chatTemplateConfig.chatTemplate?.stringValue {

@pcuenca (Member):

Technically, this could be an array too, but that format is discouraged.
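
If support for the array form were ever needed, a small resolver could branch on the value's shape. The following is only a sketch, not code from this PR, and it rests on two assumptions flagged in the comments: the multi-template array format used by transformers, and an arrayValue accessor on Config alongside the stringValue used above.

    // Sketch: resolve a chat template whether chat_template is a string or,
    // in the discouraged form, an array of {"name": ..., "template": ...}
    // entries (the multi-template format from transformers). Assumes Config
    // exposes `arrayValue` in addition to `stringValue`.
    func resolvedChatTemplate(from config: Config) -> String? {
        // Common case: a single template stored as a plain string.
        if let template = config.chatTemplate?.stringValue {
            return template
        }
        // Array form: prefer the entry named "default".
        if let templates = config.chatTemplate?.arrayValue {
            for entry in templates where entry.name?.stringValue == "default" {
                return entry.template?.stringValue
            }
        }
        return nil
    }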

@DePasqualeOrg (Contributor, Author):

Thanks for your suggestions, @pcuenca. I made some changes based on your input.

@pcuenca merged commit be855fa into huggingface:main on Feb 27, 2025. 1 check passed.
@DePasqualeOrg (Contributor, Author):

We'll need a new version tag in this repo to pick this up in mlx-swift-examples so that we can make Qwen 2.5 VL work with the correct chat template.
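
For readers following along: once the tag exists, picking it up in a consuming project is a one-line dependency bump in Package.swift. A sketch with a placeholder version, since the actual tag isn't named in this thread:

    // In the consuming project's Package.swift (e.g. mlx-swift-examples).
    // "1.0.0" is a placeholder; substitute the tag that includes this change.
    .package(url: "https://github.com/huggingface/swift-transformers", from: "1.0.0"),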

@pcuenca (Member) commented Mar 3, 2025:

Added.
