Issues: sgl-project/sglang
Issues list
Is it possible to define the prompts for KV caching up-front?
#401
opened Apr 29, 2024 by
timothylimyl
ImportError: cannot import name 'get_cuda_stream' from 'triton.runtime.jit' In triton-nightly(V100)
#383
opened Apr 23, 2024 by
nenomigami
How does or does sglang support multiple completions / samples given the same prompt?
#379
opened Apr 22, 2024 by
wenting-zhao
Loading Chat Template in a more flexible way?
Label: good first issue (Good for newcomers)
#376
opened Apr 21, 2024 by
for-just-we
Inference does not work in sglang with regex?
Label: bug (Something isn't working)
#361
opened Apr 11, 2024 by
randomcodelookup
Potential Bug? Confusion about "need_vision" in llava implementation
#341
opened Apr 1, 2024 by
fedshyvtest
Add Default Timeout to urllib.request.urlopen Calls to Prevent Potential Hanging
#339
opened Mar 29, 2024 by
alessiodallapiazza
[Bug] llava-v1.6-34b can not enable Tensor Parallelism, server can not start
#330
opened Mar 25, 2024 by
lss15151161