-
Notifications
You must be signed in to change notification settings - Fork 417
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] Error in loading Qwen2-57B-A14B-Instruct
#1251
by LucienShui
was closed Aug 28, 2024
5 tasks done
[Bug] 0.2.14 version. ValueError: malformed node or string: None
#1245
by lss15151161
was closed Aug 28, 2024
5 tasks done
[Bug] AttributeError: 'ScheduleBatch' object has no attribute 'sample' WHEN I DO Benchmarking
#1241
by ArtificialZeng
was closed Aug 28, 2024
5 tasks
[Bug] subprocess.CalledProcessError: Command '['/usr/bin/gcc', '/tmp/tmpx4yubctp/main.c', '-O3', '-shared', '-fPIC', '-o', '/tmp/tmpx4yubctp/cuda_utils.cpython-310-x86_64-linux-gnu.so', '-lcuda', '-L/home/adminad/anaconda3/envs/py10/lib/python3.10/site-packages/triton/backends/nvidia/lib'
await-response
#1240
by ArtificialZeng
was closed Sep 22, 2024
5 tasks done
在A6000上启动,14bqwen1.5,发现有问题,多GPU启动,只能用1张卡或者2张卡,如果设置3,4,5,6会报错,
#1220
by yawzhe
was closed Aug 26, 2024
3 of 5 tasks
[Feature] add option to use liger triton kernel
await-response
#1216
by binarycrayon
was closed Sep 1, 2024
2 tasks done
[Feature] Use Embedding/Generation Model to get its Generation/Emebedding
#1200
by zhaochenyang20
was closed Aug 27, 2024
2 tasks done
[Bug] Bad outputs with fp8 quantization at high RPS
bug
Something isn't working
#1195
by siddhatiwari
was closed Sep 21, 2024
5 tasks done
[Bug] vllm updated its get_model function
#1183
by zhaochenyang20
was closed Aug 26, 2024
5 tasks done
[Help wanted] Does RadixAttention have anything to do with attention?
#1181
by Wanglongzhi2001
was closed Aug 22, 2024
[Bug] Dynamic FP8 quantization fails due to incorrect tensor shape
#1178
by qeternity
was closed Aug 28, 2024
5 tasks done
[Bug] Empty Something isn't working
top_logprobs
in LogProbs Output for Meta-Llama-3.1-8B-Instruct Model when Using OpenAI Compatible API
bug
#1176
by GuanghaoYe
was closed Sep 22, 2024
5 tasks done
[Feature] SGLang using JSON as template config file needs improve
#1172
by zhang001122
was closed Aug 21, 2024
2 tasks done
[Feature] In Sglang ,Is chunked-prefill use fused(prefill+decode) batch?
#1162
by CSEEduanyu
was closed Aug 20, 2024
2 tasks done
[Bug] Gemma-2-9b-it produces garbage output
#1160
by Quang-elec44
was closed Aug 20, 2024
5 tasks done
[Feature] support W8A8(FP8) and KV Cache FP8 for DeepSeek V2
feature
#1156
by zhyncs
was closed Sep 1, 2024
2 tasks done