Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DBRX support #337

Merged
merged 4 commits into from
Mar 28, 2024
Merged

DBRX support #337

merged 4 commits into from
Mar 28, 2024

Conversation

hnyls2002
Copy link
Collaborator

No description provided.

@benihime91
Copy link

benihime91 commented Mar 28, 2024

How much minimum GPU mem is required for loading this model? I tried on 2xH100 with the following command, but i am running into OOM

CUDA_VISIBLE_DEVICES=0,1 python3 -m sglang.launch_server --model-path databricks/dbrx-instruct  --port 30000 --tp-size 2 --random-seed 42 --trust-remote-code

@hnyls2002
Copy link
Collaborator Author

@benihime91 It's a 134B model and the model weight is 246GB. So it takes at least 4 * 80G GPUs to load this.

@benihime91
Copy link

Ahhh 😢 should've tried with 4xH100

@Ying1123 Ying1123 merged commit 2af565b into main Mar 28, 2024
@Ying1123 Ying1123 deleted the dbrx branch March 28, 2024 17:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants