Skip to content
View ynych's full-sized avatar
Block or Report

Block or report ynych

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SGLang is yet another fast serving framework for large language models and vision language models.

Python 2,900 190 Updated Jul 24, 2024

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

Python 1,280 95 Updated Jul 10, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 20,868 1,976 Updated Jul 16, 2024

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 77,819 7,100 Updated Jul 24, 2024

Simple samples for TensorRT programming

Python 1,435 334 Updated Jul 3, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 89,658 14,174 Updated Jul 24, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,545 3,362 Updated Jul 24, 2024

how to optimize some algorithm in cuda.

Cuda 1,261 104 Updated Jul 24, 2024

compiler learning resources collect.

Python 1,965 311 Updated May 27, 2024

how to learn PyTorch and OneFlow

295 18 Updated Mar 22, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,531 1,216 Updated Dec 6, 2023

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Python 2,530 230 Updated Jul 24, 2024

A toolkit to run Ray applications on Kubernetes

Go 1,052 347 Updated Jul 24, 2024

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,020 122 Updated Jun 25, 2024

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 978 79 Updated Jul 2, 2024

🐚 OpenDevin: Code Less, Make More

Python 28,860 3,340 Updated Jul 24, 2024

✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.

TypeScript 2,013 151 Updated Apr 29, 2024

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 8,886 795 Updated Jul 24, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,053 139 Updated Jul 23, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,044 134 Updated Jun 25, 2024

Official Implementation of EAGLE-1 and EAGLE-2

Python 674 65 Updated Jul 23, 2024

Explorations into some recent techniques surrounding speculative decoding

Python 176 14 Updated Oct 9, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 33,991 3,979 Updated Jul 24, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 27,624 3,394 Updated Jul 24, 2024

ApacheCN 深度学习译文集

JavaScript 761 198 Updated Mar 28, 2023

🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.

Python 17,278 1,208 Updated Jul 21, 2024

Awesome AI GPTs, OpenAI GPTs, GPT-4, ChatGPT, GPTs, Prompts, plugins, Prompts leaking

1,021 131 Updated Jun 27, 2024

Mamba SSM architecture

Python 11,880 988 Updated Jul 24, 2024
Next