Skip to content
@LLM-Serve

LLM-Serve

Popular repositories Loading

  1. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  2. TensorRT-LLM TensorRT-LLM Public

    Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

    C++

  3. nvbandwidth nvbandwidth Public

    Forked from NVIDIA/nvbandwidth

    A tool for bandwidth measurements on NVIDIA GPUs.

    C++

  4. lightllm lightllm Public

    Forked from ModelTC/lightllm

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python

Repositories

Showing 4 of 4 repositories
  • TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

    LLM-Serve/TensorRT-LLM’s past year of commit activity
    C++ 0 Apache-2.0 841 0 0 Updated Dec 11, 2023
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    LLM-Serve/vllm’s past year of commit activity
    Python 0 Apache-2.0 3,402 0 0 Updated Dec 11, 2023
  • lightllm Public Forked from ModelTC/lightllm

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    LLM-Serve/lightllm’s past year of commit activity
    Python 0 Apache-2.0 181 0 0 Updated Dec 8, 2023
  • nvbandwidth Public Forked from NVIDIA/nvbandwidth

    A tool for bandwidth measurements on NVIDIA GPUs.

    LLM-Serve/nvbandwidth’s past year of commit activity
    C++ 0 Apache-2.0 24 0 0 Updated Nov 21, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…