- USTC
- Hefei, China
- UTC +08:00
- @XianYa_YiZhi
- https://www.zhihu.com/people/zhe-philosophy
Stars
Efficiently computes derivatives of numpy code.
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
A package designed to produce logos of Chinese colleges.
Fast and memory-efficient exact attention
A curated list for Efficient Large Language Models
This repository contains the experimental PyTorch native float8 training UX
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
Efficient GPU support for LLM inference with x-bit quantization (e.g., FP6, FP5).
A collection of resources on controllable generation with text-to-image diffusion models.
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
What would you do with 1000 H100s...
⭐️ A proxy scraper made using Protractor | Proxy list updates every three hours 🔥
🎉 CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated sporadically: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Awesome LLM compression research papers and tools.
PyTorch emulation library for Microscaling (MX)-compatible data formats
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
⚡ Dynamically generated stats for your github readmes