sgl-project / sglang Public

Fork 175
Star 2.7k



Issues: sgl-project/sglang

Development Roadmap

#157 opened Feb 7, 2024 by Ying1123

Open 14

Add SGLang usage examples

#166 opened Feb 8, 2024 by Ying1123

Open 6

Trouble Shooting

#548 opened Jun 14, 2024 by Ying1123

Open

Labels 12 Milestones 0


164 Open 114 Closed


Issues list

Is it possible to define the prompts for KV caching up-front?

#401 opened Apr 29, 2024 by timothylimyl

Support InternVL 1.5

#398 opened Apr 26, 2024 by themrzmaster

Prefill out of memory occur when deployed with servers

#393 opened Apr 25, 2024 by for-just-we

vLLM import error

#391 opened Apr 24, 2024 by jlin816

Regenerate benchmark results for latest vLLM

#389 opened Apr 24, 2024 by nilesh-c

Logprobs are almost the same for all choices

#388 opened Apr 23, 2024 by tom-doerr

Switch to non gated models

#387 opened Apr 23, 2024 by tom-doerr

ImportError: cannot import name 'get_cuda_stream' from 'triton.runtime.jit' In triton-nightly(V100)

#383 opened Apr 23, 2024 by nenomigami

How does or does sglang support multiple completions / samples given the same prompt?

#379 opened Apr 22, 2024 by wenting-zhao

Loading Chat Template in a more flexible way? (label: good first issue)

#376 opened Apr 21, 2024 by for-just-we

Loading a BNB 4 bit model + adapter

#374 opened Apr 19, 2024 by timothelaborie

VLLM version (label: high priority)

#373 opened Apr 19, 2024 by eaubin

JSON decoding result don't match regex

#371 opened Apr 17, 2024 by DouHappy

Inference does not work in sglang with regex? (label: bug)

#361 opened Apr 11, 2024 by randomcodelookup

About parameter max_tokens

#360 opened Apr 11, 2024 by for-just-we

Don't get API response when sending images

#357 opened Apr 9, 2024 by tom-doerr

compatibility issues and memory leak problems --enable-flashinfer

#356 opened Apr 9, 2024 by pj-ml

Beam Search Support (label: enhancement)

#353 opened Apr 8, 2024 by LiquidGunay

Beginner here, how to load the model from checkpoint

#349 opened Apr 5, 2024 by HoqueMahmudul

Potential Bug? Confusion about "need_vision" in llava implementation

#341 opened Apr 1, 2024 by fedshyvtest

Add Default Timeout to urllib.request.urlopen Calls to Prevent Potential Hanging

#339 opened Mar 29, 2024 by alessiodallapiazza

will support multi-loras inference？

#334 opened Mar 28, 2024 by skykiseki

Allow OPTIONS Method on Http Server and add Cors headers.

#333 opened Mar 28, 2024 by kseyhan

[Bug] llava-v1.6-34b can not enable Tensor Parallelism, server can not start

#330 opened Mar 25, 2024 by lss15151161

Supports the InternVL multimodal large model

#328 opened Mar 24, 2024 by exceedzhang
