Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multi-node Tensor Parallelism #550

Merged
merged 7 commits into from
Jun 18, 2024
Merged

Multi-node Tensor Parallelism #550

merged 7 commits into from
Jun 18, 2024

Conversation

Ying1123
Copy link
Member

@Ying1123 Ying1123 commented Jun 16, 2024

Similar to torchrun, enter command into each node with respective node-rank.

GLOO_SOCKET_IFNAME=eth0 python3 -m sglang.launch_server --model-path meta-llama/Meta-Llama-3-70B-Instruct --tp 16 --nccl-init-addr 172.16.5.82:20000 --nnodes 2 --node-rank 0
GLOO_SOCKET_IFNAME=eth0 python3 -m sglang.launch_server --model-path meta-llama/Meta-Llama-3-70B-Instruct --tp 16 --nccl-init-addr 172.16.5.82:20000 --nnodes 2 --node-rank 1

@Ying1123 Ying1123 marked this pull request as draft June 16, 2024 03:46
@Ying1123 Ying1123 force-pushed the multinode-tp branch 13 times, most recently from 3cfeb85 to f993650 Compare June 16, 2024 09:49
@Ying1123 Ying1123 force-pushed the multinode-tp branch 4 times, most recently from 2a0f15b to b35de42 Compare June 16, 2024 10:17
@Ying1123 Ying1123 force-pushed the multinode-tp branch 4 times, most recently from e071cac to 0d994b3 Compare June 16, 2024 10:34
@Ying1123 Ying1123 changed the title Multi-node Serving Multi-node Tensor Parallelism Jun 16, 2024
python/sglang/launch_rpyc_service.py Outdated Show resolved Hide resolved
python/sglang/srt/server.py Show resolved Hide resolved
python/sglang/srt/server_args.py Outdated Show resolved Hide resolved
python/sglang/srt/utils.py Outdated Show resolved Hide resolved
python/sglang/srt/utils.py Outdated Show resolved Hide resolved
@Ying1123 Ying1123 marked this pull request as ready for review June 18, 2024 03:40
@Ying1123 Ying1123 merged commit 09593e9 into main Jun 18, 2024
@Ying1123 Ying1123 deleted the multinode-tp branch June 18, 2024 03:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants