
[lora] support lora for Gemini #5001

Open. Fridge003 wants to merge 8 commits into base branch feature/lora.
Conversation

@Fridge003 (Contributor) commented on Nov 1, 2023

📌 Checklist before creating the PR

  • I have created an issue for this PR for traceability
  • The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

resolved #4929

📝 What does this PR do?

  • Support the LoRA feature for the Gemini plugin (see the usage sketch below)
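
For reviewers unfamiliar with the intended workflow, a hypothetical usage sketch follows. booster.enable_lora and its argument names come from the feature/lora base branch and are assumptions here, not the confirmed interface of this PR; the rest uses the standard colossalai, peft, and transformers APIs.

import colossalai
from colossalai.booster import Booster
from colossalai.booster.plugin import GeminiPlugin
from colossalai.nn.optimizer import HybridAdam
from peft import LoraConfig
from transformers import AutoModelForCausalLM

# Set up the distributed environment (a single process is enough for a demo).
colossalai.launch_from_torch(config={})

plugin = GeminiPlugin()
booster = Booster(plugin=plugin)

# Wrap the model with LoRA adapters before boosting; afterwards only the
# adapter weights keep requires_grad=True.
model = AutoModelForCausalLM.from_pretrained("gpt2")
lora_config = LoraConfig(task_type="CAUSAL_LM", r=8, lora_alpha=32, lora_dropout=0.1)
model = booster.enable_lora(model, lora_config=lora_config)

# Boost and train as with any other plugin, optimizing only trainable params.
optimizer = HybridAdam((p for p in model.parameters() if p.requires_grad), lr=1e-4)
model, optimizer, *_ = booster.boost(model, optimizer)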

💥 Checklist before requesting a review

  • I have linked my PR to an issue (instruction)
  • My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • I have performed a self-review of my code
  • I have added thorough tests
  • I have added docstrings for all the functions/methods I implemented


@Fridge003 force-pushed the feature/lora branch 6 times, most recently from 69136b6 to 3bf7e1c on November 8, 2023
@Fridge003 marked this pull request as ready for review on November 9, 2023
@Fridge003 requested a review from ver217 on November 9, 2023
peft_config.base_model_name_or_path = peft_model.base_model.model.__dict__.get("name_or_path", None)

inference_mode = peft_config.inference_mode
peft_config.inference_mode = True
Member:
Why set inference mode to True before saving?

Contributor (author):
PEFT sets it to True when saving, and I didn't change this behavior.
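
For context, a minimal sketch of the save pattern being discussed, continuing the diff excerpt above. It assumes peft's standard PeftConfig.save_pretrained API; checkpoint_dir is a hypothetical path:

# PEFT writes adapter_config.json with inference_mode=True, so the live
# value is stashed and restored around the save instead of being mutated
# permanently.
inference_mode = peft_config.inference_mode
peft_config.inference_mode = True
try:
    peft_config.save_pretrained(checkpoint_dir)  # hypothetical target directory
finally:
    peft_config.inference_mode = inference_mode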

@@ -142,7 +142,6 @@ def _flatten_grad_args(args) -> Tuple[list, list, List[bool], TreeSpec]:
             grad_args.append(arg)
         else:
             other_args.append(arg)
-    assert len(grad_args) > 0
Member:
Why remove the assertion? If there are no grad args, the post-backward hook may not be triggered.

Contributor (author):
There are plenty of parameters with requires_grad set to False when LoRA is enabled, so len(grad_args) often equals zero during training.
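
To see this concretely, here is a self-contained sketch using the standard peft API (the toy module and its names are hypothetical):

import torch.nn as nn
from peft import LoraConfig, get_peft_model

# Toy module; "proj" is the layer LoRA wraps.
class Toy(nn.Module):
    def __init__(self):
        super().__init__()
        self.proj = nn.Linear(16, 16)
        self.out = nn.Linear(16, 4)

    def forward(self, x):
        return self.out(self.proj(x))

model = get_peft_model(Toy(), LoraConfig(target_modules=["proj"], r=4, lora_alpha=8))

# Base weights are frozen; only lora_A / lora_B require grad, so a hook
# that keeps only tensors with requires_grad=True can easily end up empty.
for name, param in model.named_parameters():
    print(name, param.requires_grad)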

tests/test_lora/test_gemini_lora.py: outdated review thread, resolved
@xs1997zju commented:

Any update on this work?

@KaiLv69 (Contributor) commented on Jun 1, 2024

Looking forward to the feature implemented in this PR!
