feifeibear

Pinned

  1. hpcaitech/ColossalAI Public

    Making large AI models cheaper, faster and more accessible

    Python, 38.6k stars, 4.3k forks

  2. Tencent/TurboTransformers Public

    A fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc.) on CPU and GPU.

    C++, 1.5k stars, 196 forks

  3. Tencent/PatrickStar Public

    PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.

    Python, 747 stars, 58 forks

  4. LLMSpeculativeSampling Public

    Fast inference from large language models via speculative decoding

    Python, 508 stars, 48 forks

  5. xdit-project/xDiT Public

    xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

    Python, 482 stars, 40 forks

  6. long-context-attention Public

    USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

    Python, 309 stars, 18 forks