MonadKai
🎯 Focusing

Starred repositories

12 results for forked starred repositories

A blazing fast inference solution for text embeddings models

Rust 3 2 Updated Aug 26, 2024

MTEB: Massive Text Embedding Benchmark with Spanish datasets

Python 3 Updated Feb 19, 2024

C++ 1 Updated May 22, 2024

Flash Attention in ~100 lines of CUDA (forward pass only)

Cuda 8 1 Updated Mar 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 247 9 Updated Aug 27, 2024

Large Language Model Text Generation Inference

Python 10 3 Updated Apr 26, 2024

An ecosystem of Rust libraries for working with large language models

Rust 10 2 Updated Oct 2, 2023

Papers and their code for AI systems

5 Updated Dec 11, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 9 1 Updated Nov 7, 2023

TensorFlow Serving based on an encrypted model, protecting model files from being stolen

C++ 37 10 Updated Aug 11, 2022

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Python 245 28 Updated Dec 18, 2023

Shadowsocksr client using electron

JavaScript 1,723 510 Updated Jun 5, 2020