-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Pull requests: triton-inference-server/server
Author
Label
Milestones
Reviews
Assignee
Sort
Pull requests list
ci: Support BF16 data type in TensorRT backend
#7310
opened May 31, 2024 by
pskiran1
Loading…
6 of 20 tasks
build: Update vllm version to v0.4.3 (latest)
module: backends
Issues related to the backends
PR: build
Changes that affect the build system or external dependencies
#7309
opened May 31, 2024 by
oandreeva-nv
Loading…
6 of 20 tasks
fix: Fix L0_input_validation--base
PR: fix
A bug fix
#7304
opened May 30, 2024 by
yinggeh
Loading…
8 of 20 tasks
Fix gRPC streaming non-decoupled segfault if sending response and final flag separately
#7265
opened May 24, 2024 by
kthui
Loading…
Remove unnecessary wait in case of failed stub creation
#7192
opened May 7, 2024 by
indrajit96
Loading…
Raise MLFlow error when env TRITON_MODEL_REPO not set
#7147
opened Apr 22, 2024 by
JonasGoebel
Loading…
[Windows] Support CPU shared memory (Client/Frontend)
#7048
opened Mar 27, 2024 by
fpetrini15
Loading…
Adding a readiness matrix of the various first party Backends
#6912
opened Feb 23, 2024 by
zeryx
Loading…
Fix inference command sample in README.md
investigating
The developement team is investigating this issue
#6868
opened Feb 6, 2024 by
jasoncwik
Loading…
Fix some small typos
investigating
The developement team is investigating this issue
#6857
opened Feb 1, 2024 by
seiteta
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.