LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,515 198 Updated Oct 18, 2024

kpu / kenlm

KenLM: Faster and Smaller Language Model Queries

C++ 2,501 512 Updated Jul 30, 2024

hao-ai-lab / LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,130 67 Updated Oct 14, 2024

shibing624 / text2vec

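text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.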

Python 4,440 395 Updated Sep 8, 2024

2dust / clashN

A Clash client for Windows that supports Mihomo

C# 4,810 596 Updated Jun 29, 2024

WinVector / Examples

Various examples for different articles

HTML 163 92 Updated Oct 16, 2024

shawroad / CoSENT_Pytorch

CoSENT, STS, SentenceBERT

Python 161 21 Updated Jul 10, 2023

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 10,323 2,310 Updated Oct 19, 2024

DefTruth / Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,673 182 Updated Oct 15, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,440 956 Updated Oct 15, 2024

dvlab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,616 271 Updated Aug 14, 2024

OpenInterpreter / open-interpreter

A natural language interface for computers

Python 52,635 4,649 Updated Oct 15, 2024

ztxz16 / fastllm

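A pure C++ cross-platform LLM acceleration library with Python bindings. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile devices.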

C++ 3,299 336 Updated Oct 15, 2024

HeZez

Stars

mem0ai / mem0

AdeelH / pytorch-multi-class-focal-loss

gpu-mode / lectures

doccano / doccano

unslothai / unsloth

eosphoros-ai / DB-GPT

joonspk-research / generative_agents

deepseek-ai / DeepSeek-V2

morecry / CharacterEval

CLUEbenchmark / SuperCLUE-Role

Leymore / ruozhiba

keezen / ntk_alibi

meta-llama / llama3

vllm-project / vllm

owenliang / qwen-vllm

youngyangyang04 / leetcode-master

zihaohe123 / speak-turn-emb-dialog-act-clf

ModelTC / lightllm