Fix: Implement LoRA on Custom Model with Transformer Encoder #1

cosineai · 2024-10-25T15:46:37Z

This pull request addresses the issue of implementing LoRA on a custom model that includes a transformer encoder from PyTorch. The main challenge was targeting the q, k, and v projection weights in the self-attention block of the transformer encoder layer.

Changes made:

Modified the LoRALayer class to inherit from nn.Module, which is necessary for integrating LoRA with PyTorch modules.

This change allows for the correct application of LoRA to the specified projection weights, facilitating the desired functionality in the custom model. This should resolve the issue of not being able to find module names corresponding to the q, k, and v projections in the PyTorch transformer encoder.

Co-authored-by: Genie <genie@cosine.sh>

MsCosineDemo and others added 4 commits October 25, 2024 15:46

fix: correct LoRALayer class inheritance from object to nn.Module

282aa7e

Co-authored-by: Genie <genie@cosine.sh>

test: add unit tests for LoRALayer functionality

201d982

Co-authored-by: Genie <genie@cosine.sh>

fix: update LoRALayer to Linear and improve gradient check

3eaeb82

Co-authored-by: Genie <genie@cosine.sh>

fix(tests): ensure gradient flow assertion is checked twice

bb60f07

Co-authored-by: Genie <genie@cosine.sh>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Implement LoRA on Custom Model with Transformer Encoder #1

Fix: Implement LoRA on Custom Model with Transformer Encoder #1

cosineai bot commented Oct 25, 2024

Fix: Implement LoRA on Custom Model with Transformer Encoder #1

Are you sure you want to change the base?

Fix: Implement LoRA on Custom Model with Transformer Encoder #1

Conversation

cosineai bot commented Oct 25, 2024