-
Notifications
You must be signed in to change notification settings - Fork 266
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Monkeypatch for Qwen2.5-VL #552
base: main
Are you sure you want to change the base?
Conversation
I made some fixes to the multimodal convergence test. However the float32 convergence test is failing:
These are small difference, but I wonder if they show up in this test instead of the other convergence tests because the multimodal-rope implementation is slightly different? Could you look into it? Thanks - other than this the PR looks close |
@tyler-romero Thanks for reviewing my PR and helping out with testing.
Does it only fail on Qwen2.5-VL? I have checked, the The discrepancy might be related to PR #412. I'm not sure why, but #412 only changes the
Should we just change the |
For Qwen2.5VL, we dont have this same issue:
Because it was added after this change to huggingface (so there arent multiple versions to support) So we should track whatever that modification was and update our patch to support it correctly. |
I haven't done any change in the code. Somehow the tests passed on my laptop (main git branch of transformers: 6b55046).
|
In this PR, the forward function of Qwen2.5-VL has already been changed to match the My theory is: Before #412, Qwen2-VL failed on |
Summary
Qwen2.5-VL was released a few days ago. This PR aims to add support for it.
There are some small changes in its architecture compared to Qwen2-VL.
Details
As of February 1, the latest git version of
transformers
(62db3e6) supports it, but it's not yet in the latest release oftransformers
.Needs help with the tests, as I got OOMs on my laptop GPU. I have tried LoRA with PR, which works great.
Testing Done
Can't run the tests on my hardware, got OOMs.
make test
to ensure correctnessmake checkstyle
to ensure code stylemake test-convergence
to ensure convergence