Skip to content
View stgzr's full-sized avatar
  • Alibaba
  • Hangzhou

Block or report stgzr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results
Python 6,310 481 Updated Oct 14, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 48,229 6,889 Updated Oct 16, 2024

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,299 210 Updated Oct 15, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,565 570 Updated Oct 16, 2024

The official repo of INF-34B models trained by INF Technology.

Python 34 1 Updated Jul 25, 2024

A generative speech model for daily dialogue.

Python 31,519 3,432 Updated Oct 16, 2024

Collection of training data management explorations for large language models

274 28 Updated Aug 2, 2024

Implementation for MatMul-free LM.

Python 2,905 182 Updated Sep 19, 2024

Tile primitives for speedy kernels

Cuda 1,537 60 Updated Oct 16, 2024

cuDF - GPU DataFrame Library

C++ 8,347 890 Updated Oct 16, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 26,581 2,937 Updated Oct 16, 2024

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 334 21 Updated Oct 16, 2024

Brand new TTS solution

Python 13,385 998 Updated Oct 11, 2024
Python 54 2 Updated Aug 21, 2024

A PyTorch Native LLM Training Framework

Python 634 32 Updated Aug 25, 2024

Animation engine for explanatory math videos

Python 67,896 6,067 Updated Oct 15, 2024

Ring attention implementation with flash attention

Python 557 43 Updated Oct 8, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,254 154 Updated Jun 25, 2024

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 881 63 Updated Sep 13, 2024

Transformers with Arbitrarily Large Context

Python 627 48 Updated Aug 12, 2024

A natural language interface for computers

Python 52,601 4,644 Updated Oct 15, 2024
Python 7,110 550 Updated Oct 15, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,105 845 Updated Jul 1, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,538 260 Updated Sep 29, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,510 198 Updated Oct 16, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,422 950 Updated Oct 15, 2024
Python 1,189 172 Updated Sep 19, 2024

Official inference library for Mistral models

Jupyter Notebook 9,632 848 Updated Oct 16, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,483 4,223 Updated Oct 16, 2024

Mamba SSM architecture

Python 12,876 1,092 Updated Oct 13, 2024
Next