Skip to content
View qibabaidu's full-sized avatar
Block or Report

Block or report qibabaidu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 168 27 Updated Jun 21, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 738 59 Updated Jun 23, 2024

PyTorch for building ML systems. Iterable, debuggable, multi-cloud, 100% reproducible across research and production.

Python 924 36 Updated Jun 21, 2024

Introduction to WASM assembly

WebAssembly 69 4 Updated Jan 29, 2023

🚀 The leading Wasm Runtime supporting WASIX, WASI and Emscripten

Rust 18,139 762 Updated Jun 22, 2024

Incremental bundler and build system optimized for JavaScript and TypeScript, written in Rust – including Turbopack and Turborepo.

Rust 25,422 1,719 Updated Jun 22, 2024
Python 1,426 120 Updated Apr 27, 2023

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,392 4,342 Updated Jun 22, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,369 668 Updated Jun 21, 2024

An ecosystem of Rust libraries for working with large language models

Rust 6,043 349 Updated Mar 23, 2024

Standalone pre-training recipe with JAX+Flax

Python 31 4 Updated Apr 3, 2023

Accelerate your training with this open-source library. Optimize performance with streamlined training and serving options with JAX. 🚀

Python 164 19 Updated Jun 22, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 5,872 506 Updated Jan 5, 2024

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 424 64 Updated Jun 17, 2024

🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧

Rust 7,068 246 Updated Jun 16, 2024

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.

Rust 7,606 364 Updated Jun 22, 2024

A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.

C++ 3,256 1,070 Updated Jun 23, 2024

Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 18,637 1,279 Updated Jun 22, 2024

🔍 Tiny, full-text search engine for static websites built with Rust and Wasm

Rust 2,682 84 Updated Oct 18, 2023

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 34,815 4,671 Updated Jun 23, 2024

This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.

Rust 26,607 1,571 Updated Jun 21, 2024

Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/

Python 1,892 131 Updated Jun 22, 2024

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your d…

Python 14,373 1,695 Updated Jun 22, 2024

The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

Python 6,745 762 Updated Jun 23, 2024

Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.

Python 9,168 581 Updated Jun 17, 2024

LLM inference in C/C++

C++ 60,609 8,639 Updated Jun 23, 2024

🦜🔗 Build context-aware reasoning applications

Python 87,816 13,728 Updated Jun 23, 2024

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Go 74,989 5,604 Updated Jun 22, 2024

🪢 Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 4,316 395 Updated Jun 21, 2024
Next