kuizhiqing

Organizations

@PaddlePaddle @kubeflow

Starred repositories

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 752 38 Updated Jul 13, 2024

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 56,652 29,105 Updated Jul 11, 2024

JobSet: a k8s native API for distributed ML training and HPC workloads

Go 116 38 Updated Jul 15, 2024

A curated list of great puzzles

713 36 Updated May 30, 2019

Gemma 2B with 10M context length using Infini-attention.

Python 892 56 Updated May 12, 2024

CUDA checkpoint and restore utility

Cuda 169 8 Updated Apr 17, 2024

A list of AI autonomous agents

8,541 599 Updated Jun 20, 2024

LLM training in simple, raw C/CUDA

Cuda 21,964 2,416 Updated Jul 16, 2024

Bitcoin Core integration/staging tree

C++ 77,236 35,646 Updated Jul 17, 2024

Cross-platform asynchronous I/O

C 23,611 3,553 Updated Jul 17, 2024
Python 1,126 157 Updated Jul 17, 2024

😎 Awesome lists about all kinds of interesting topics

311,425 27,063 Updated Jul 17, 2024

Universal LLM Deployment Engine with ML Compilation

Python 17,804 1,420 Updated Jul 17, 2024

🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformer…

C++ 21,901 1,679 Updated Jul 17, 2024

A library for building fast, reliable and evolvable network services.

Rust 20,437 1,109 Updated Jul 16, 2024

Open weights LLM from Google DeepMind.

Jupyter Notebook 2,258 276 Updated Jun 28, 2024

distributed trainer for LLMs

Python 502 73 Updated May 20, 2024
TypeScript 3,570 576 Updated Jul 3, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 23,029 2,372 Updated Jul 17, 2024

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.

Python 1,455 101 Updated Jun 19, 2024

🔥Highlighting the top ML papers every week.

9,415 543 Updated Jul 15, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,074 3,271 Updated Jul 17, 2024

To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 4,226 225 Updated Jul 3, 2024

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,437 144 Updated Jul 16, 2024

A curated list of software and architecture related design patterns.

37,764 2,777 Updated Jun 11, 2024

Curated list of Go design patterns, recipes and idioms

Go 24,645 2,171 Updated May 14, 2024

MLX: An array framework for Apple silicon

C++ 15,795 900 Updated Jul 16, 2024

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 34,545 5,150 Updated Jul 6, 2024

Python-based open-source framework for developing quantitative trading platforms

Python 24,031 8,514 Updated Jul 8, 2024