Skip to content
@FMInference

Foundation Model Inference

Inference Systems for Foundation Models

Pinned Loading

  1. FlexGen FlexGen Public

    Running large language models on a single GPU for throughput-oriented scenarios.

    Python 9.1k 531

Repositories

Showing 3 of 3 repositories
  • H2O Public

    [NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

    FMInference/H2O’s past year of commit activity
    Python 329 30 28 2 Updated Jul 26, 2024
  • FlexGen Public

    Running large language models on a single GPU for throughput-oriented scenarios.

    FMInference/FlexGen’s past year of commit activity
    Python 9,091 Apache-2.0 531 50 (3 issues need help) 8 Updated Jul 24, 2024
  • DejaVu Public
    FMInference/DejaVu’s past year of commit activity
    Python 253 31 22 1 Updated Apr 2, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…