wkcn
🐳
Tell Your World 🎵
  • China

Highlights

  • Pro

Organizations

@apache @dmlc @MiraiTeam @SYSU-IARC


pytorch memory track code

Python 989 155 Updated May 4, 2021

PyTorch native quantization and sparsity for training and inference

Python 652 84 Updated Sep 4, 2024

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 5,743 402 Updated Aug 27, 2024

Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip

Python 1,219 82 Updated Sep 3, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,044 5,611 Updated Aug 19, 2024
C++ 4 1 Updated Apr 23, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,263 187 Updated Sep 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,869 3,774 Updated Sep 3, 2024

A massively parallel, high-level programming language

Rust 17,147 421 Updated Sep 3, 2024

Open-source edition of the textbook "Fundamentals of Computer Architecture" (Hu Weiwu et al., 3rd edition)

TeX 2,971 284 Updated Sep 14, 2023

Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics include Zero-shot accuracy, Linear Probe, Image retrieval, and KN…

Jupyter Notebook 32 Updated Jul 29, 2024

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 317 29 Updated Sep 3, 2024

GeoGuessrAI, Predict the coordinates of a given street view image 🗺️

Python 1 Updated May 31, 2024

Tower of the Sorcerer for Windows Kai (改): Modifier of game variables and improvement of game experience

Assembly 42 Updated Sep 3, 2024

TinyChatEngine: On-Device LLM Inference Library

C++ 682 67 Updated Jul 4, 2024

📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

2,368 154 Updated Sep 1, 2024

What would you do with 1000 H100s...

Jupyter Notebook 813 48 Updated Jan 10, 2024

Decompilation of The Legend of Zelda: Breath of the Wild (Switch 1.5.0)

C++ 1,516 104 Updated Aug 31, 2024

Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

Python 386 23 Updated Apr 1, 2024

ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).

Python 158 14 Updated Aug 27, 2024

Image2Emoji, Predict the most relevant emoji for a given image 🐱

Jupyter Notebook 1 Updated Apr 6, 2024

For peering into the brain of CLIP and other vision models!

TypeScript 4 Updated Apr 27, 2024

Minimalist ML framework for Rust

Rust 14,993 875 Updated Sep 2, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 91,882 14,601 Updated Sep 3, 2024

Machine Learning Engineering Open Book

Python 10,615 641 Updated Sep 2, 2024

Puzzles for learning Triton

Jupyter Notebook 943 59 Updated Jul 17, 2024

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 263 29 Updated Aug 19, 2024

Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

Python 26,838 1,439 Updated Sep 1, 2024

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,697 335 Updated Jul 31, 2024