
Support Low Rank Adaptation (LoRA). #745

Merged: 8 commits merged into NVIDIA:main on Apr 16, 2024

Conversation

@mingxu1067 (Collaborator) commented Apr 2, 2024

  • Implemented LoRA and related tests.
  • Added LoRA scope control to TransformerLayer and MultiHeadAttention.
  • LoRA implementation details:
  • Only len(axis) <= 5 and len(features) <= 5 are supported.
  • When features has multiple dimensions, LoRA transforms the last dimension only.
    For example, with X of shape (B, S, Hin), features = (3, Hout), and axis = (2,), LoRA computes (B, S, Hin) x (Hin, 3, rank) = (B, S, 3, rank), then (B, S, 3, rank) x (3, rank, Hout) = (B, S, 3, Hout); see the sketch after this list.
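
A minimal JAX sketch of the contraction in the example above; the shapes follow the example, while the variable names (lora_a, lora_b) and the zero initialization of the up-projection are illustrative assumptions, not the actual TransformerEngine kernels.

import jax
import jax.numpy as jnp

B, S, Hin, Hout, rank = 2, 16, 64, 32, 4
key = jax.random.PRNGKey(0)
x = jax.random.normal(key, (B, S, Hin))          # input, features on axis 2
lora_a = jax.random.normal(key, (Hin, 3, rank))  # down-projection kernel
lora_b = jnp.zeros((3, rank, Hout))              # up-projection kernel, zero-initialized

# (B, S, Hin) x (Hin, 3, rank) -> (B, S, 3, rank)
mid = jnp.einsum('bsh,hgr->bsgr', x, lora_a)
# (B, S, 3, rank) x (3, rank, Hout) -> (B, S, 3, Hout)
out = jnp.einsum('bsgr,gro->bsgo', mid, lora_b)
print(out.shape)  # (2, 16, 3, 32)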

@zlsh80826 (Collaborator) commented:

Generally LGTM!

SCOPE_EX_OUTPUT_PROJ = 'exclude_output_proj'
SCOPE_EX_MLP = 'exclude_mlp'

assert scope in [

Collaborator:
I noticed that low_rank_adaptation_scope is expected to be a string, which could confuse users about whether to pass None or the string 'None'. It would be better to handle None explicitly: either accept None and convert it to the string 'None', or raise an error telling the user to pass the string 'None'.

Collaborator Author:

This is a good point, thank you for bringing it up. Added handling for None.
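
A minimal sketch of one way such None handling could look, assuming the scope constants quoted above; SCOPE_NONE, its 'none' value, and canonicalize_lora_scope are hypothetical names for illustration, not necessarily the merged implementation.

SCOPE_NONE = 'none'  # hypothetical constant for "no LoRA applied"
SCOPE_EX_OUTPUT_PROJ = 'exclude_output_proj'
SCOPE_EX_MLP = 'exclude_mlp'

def canonicalize_lora_scope(scope):
    """Accept Python None as an alias for the no-LoRA scope, then validate."""
    if scope is None:
        scope = SCOPE_NONE
    allowed = [SCOPE_NONE, SCOPE_EX_OUTPUT_PROJ, SCOPE_EX_MLP]
    assert scope in allowed, (
        f"low_rank_adaptation_scope must be one of {allowed} or None, got {scope!r}")
    return scope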

8 commits, each Signed-off-by: Ming Huang <mingh@nvidia.com>
@mingxu1067 (Collaborator Author) commented:

/te-ci jax

@denera (Collaborator) left a comment:

LGTM!

@yhtang commented Apr 16, 2024

When could this get merged? I will cherry-pick this into the JAX 24.04 NGC release once it is merged into TE.

@denera merged commit 7c1828f into NVIDIA:main on Apr 16, 2024
15 checks passed
@denera (Collaborator) commented Apr 16, 2024

@yhtang Just merged. Thanks for the heads up!

pggPL pushed a commit to pggPL/TransformerEngine that referenced this pull request May 9, 2024
pggPL pushed a commit to pggPL/TransformerEngine that referenced this pull request May 15, 2024
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
pggPL pushed a commit to pggPL/TransformerEngine that referenced this pull request May 16, 2024
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
pggPL pushed a commit to pggPL/TransformerEngine that referenced this pull request May 23, 2024
Signed-off-by: Pawel Gadzinski <pgadzinski@nvidia.com>
4 participants