sgl-project / sglang Public

Notifications You must be signed in to change notification settings
Fork 180
Star 2.8k

Code
Issues 160
Pull requests 4
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: sgl-project/sglang

Development Roadmap

#157 opened Feb 7, 2024 by Ying1123

Open 14

Add SGLang usage examples

#166 opened Feb 8, 2024 by Ying1123

Open 7

Trouble Shooting

#548 opened Jun 14, 2024 by Ying1123

Open

Labels 12 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

160 Open 126 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Optimize abort request handling

#481 opened May 27, 2024 by Ying1123

KeyError: 'response', Error with fastchat.serve.sglang_worker and release v0.1.16 when using multimodal model

#479 opened May 26, 2024 by dhgarcia

TypeError: cannot unpack non-iterable NoneType object

#474 opened May 25, 2024 by pseudotensor

../aten/src/ATen/native/cuda/Indexing.cu:1236: indexSelectSmallIndex: block: [40,0,0], thread: [26,0,0] Assertion srcIndex < srcSelectDimSize failed.

#473 opened May 25, 2024 by pseudotensor

Dependency conflict with LLaVA

#464 opened May 23, 2024 by itay1542

Device-side assertion triggered on Batch.prepare_for_decode, release v0.1.16

#461 opened May 22, 2024 by noah-kim-theori

Type of self.variables

#455 opened May 20, 2024 by ChuyueSun

DBRX not working

#454 opened May 20, 2024 by Ying1123

Trace OpenAI backend usage

#453 opened May 19, 2024 by Ying1123

2 tasks

Regex generation causes 37x lower performance

#450 opened May 18, 2024 by Gintasz

OOM CUDA error on 8 * L4 machine when launching sglang server

#445 opened May 15, 2024 by mounamokaddem

Does sglang do automatic batching?

#444 opened May 15, 2024 by vedantroy

I can't use the OpenAI endpoint with images?

#443 opened May 14, 2024 by vedantroy

/generate request possibly hanging when CUDA out of memory is thrown

#435 opened May 13, 2024 by Gintasz

Support for multimodal models

#421 opened May 12, 2024 by babla9

Llama-3 regex generation can get stuck in infinite generation beyond max_tokens and crash server (reproduction example)

#414 opened May 8, 2024 by Gintasz

run python3 test_httpserver_llava.py get ValueError: 64002 is not in list

#413 opened May 8, 2024 by Aurorana

LLaVA-v1.6 RuntimeError in llava image encoding

#409 opened May 4, 2024 by lukashelff

Choices functionality breaking with images

#408 opened May 1, 2024 by dexius-ram-depop

Please add Phi3 support

#407 opened May 1, 2024 by Curiosity007

Repetitive zeros for structuring JSON (float type)

#405 opened Apr 30, 2024 by timothylimyl

no batch run when using openai's format for calling.

#404 opened Apr 30, 2024 by xjw00654

Import Errors occurring even when dependencies are installed

#403 opened Apr 29, 2024 by david-vectorflow

How does RadixAttention implements multi-head/multi-query/grouped-query attention.

#402 opened Apr 29, 2024 by Griffintaur

Is it possible to define the prompts for KV caching up-front?

#401 opened Apr 29, 2024 by timothylimyl

Previous 1 2 3 4 5 6 7 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly