
[Inference]Lazy Init Support #5785

Merged: 14 commits into hpcaitech:main on Jun 27, 2024

Conversation

@LRY89757 (Contributor) commented Jun 6, 2024

This PR adds lazy init support for Llama 2, Llama 3, and (partially) Baichuan.

NOTE: for the Baichuan model, the lm_head weight loaded via lazy init differs slightly from the weight loaded by transformers, which makes the outputs diverge from the transformers outputs.
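A small diagnostic sketch for the mismatch described above (a hypothetical helper, not part of this PR; `lazy_model` stands for a model loaded through the lazy-init path, `ref_model` for a plain transformers load):

```python
import torch

def lm_head_max_abs_diff(lazy_model: torch.nn.Module, ref_model: torch.nn.Module) -> float:
    """Max absolute elementwise difference between the two models' lm_head weights."""
    a = lazy_model.lm_head.weight.detach().float()
    b = ref_model.lm_head.weight.detach().float()
    return (a - b).abs().max().item()
```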

Lazy init supports both local and remote model paths.
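A minimal usage sketch, assuming ColossalAI's `colossalai.lazy.LazyInitContext`; the exact wiring into the inference engine added by this PR may differ, and the model path shown is a placeholder:

```python
from transformers import LlamaConfig, LlamaForCausalLM
from colossalai.lazy import LazyInitContext

# Either a remote Hugging Face Hub id or a local directory should work here.
model_path = "meta-llama/Llama-2-7b-hf"  # or e.g. "/data/checkpoints/llama2-7b"

config = LlamaConfig.from_pretrained(model_path)
# Build the module tree lazily: parameters record how to construct
# themselves instead of allocating real weight storage up front.
with LazyInitContext():
    model = LlamaForCausalLM(config)
# Real (possibly sharded) weights are loaded later, when the model is
# materialized by the checkpoint loader / inference engine.
```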

Lazy init speeds up model loading considerably. For Llama 70B in fp32 with tensor parallelism = 4, loading takes only 3.3 minutes; without lazy init, the program freezes because the full weights exceed available CPU memory.
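For intuition about the memory claim: 70B parameters in fp32 are roughly 280 GB of weights, which would otherwise have to be fully materialized in host RAM before sharding. PyTorch's meta device illustrates the general mechanism lazy init builds on, where tensors carry shape and dtype but no storage (an illustrative sketch, not this PR's implementation):

```python
import torch
from transformers import LlamaConfig, LlamaForCausalLM

# Default (7B-sized) config; the same construction works for 70B because
# no weight storage is allocated on the meta device.
config = LlamaConfig()
with torch.device("meta"):
    model = LlamaForCausalLM(config)

n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params / 1e9:.1f}B params; fp32 would need {n_params * 4 / 2**30:.0f} GiB if materialized")
```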

@LRY89757 requested a review from a team as a code owner June 6, 2024 03:16
@Edenzzzz enabled auto-merge (squash) June 18, 2024 05:47
@Edenzzzz disabled auto-merge June 18, 2024 05:47
@LRY89757 closed this Jun 24, 2024
@LRY89757 reopened this Jun 24, 2024
@LRY89757 closed this Jun 25, 2024
@LRY89757 reopened this Jun 25, 2024
@LRY89757 closed this Jun 27, 2024
@LRY89757 reopened this Jun 27, 2024
@LRY89757 merged commit 3c7cda0 into hpcaitech:main Jun 27, 2024
7 of 8 checks passed