Lists (1)
Sort Name ascending (A-Z)
Stars
Applied AI experiments and examples for PyTorch
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
Simple and fast low-bit matmul kernels in CUDA / Triton
A repository of Maker Skill Trees and templates to make your own.
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
libco is a coroutine library which is widely used in wechat back-end service. It has been running on tens of thousands of machines since 2013.
✨ rudimentary simulation of the three-body problem
Shattered Pixel Dungeon is an open-source traditional roguelike dungeon crawler with randomized levels and enemies, and hundreds of items to collect and use. It's based on the source code of Pixel …
[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm
Deploying LLMs offline on the NVIDIA Jetson platform marks the dawn of a new era in embodied intelligence, where devices can function independently without continuous internet access.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
SGLang is a fast serving framework for large language models and vision language models.
[TMLR 2024] Efficient Large Language Models: A Survey
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.