Skip to content
View martincai's full-sized avatar
  • Microsoft
  • Sammamish, WA

Organizations

@microsoft

Block or report martincai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 26,560 3,891 Updated Sep 13, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 88,948 6,956 Updated Sep 13, 2024

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

C++ 1,058 215 Updated Sep 11, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,685 4,045 Updated Sep 12, 2024