Language
Sort by: Recently starred
Starred repositories
A blazing fast inference solution for text embeddings models
jina-ai / mteb-es
Forked from embeddings-benchmark/mtebMTEB: Massive Text Embedding Benchmark with Spanish datasets
Flash Attention in ~100 lines of CUDA (forward pass only)
neuralmagic / nm-vllm
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Large Language Model Text Generation Inference
BloopAI / llm
Forked from oppiliappan/llmAn ecosystem of Rust libraries for working with large language models
paper and its code for AI System
gameofdimension / vllm
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Laiye-Tech / serving
Forked from tensorflow/servingTensorFlow Serving based on encrypted model, protect model files from being stolen | 基于加密模型的 TensorFlow Serving ,保护模型文件免于被盗取
PKU-DAIR / Hetu
Forked from Hsword/HetuA high-performance distributed deep learning system targeting large-scale and automated distributed training.
shadowsocksrr / electron-ssr
Forked from Akkariiin/electron-ssrShadowsocksr client using electron