Efficiently computes derivatives of numpy code.

Python 6,875 905 Updated May 25, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 26,838 3,317 Updated Jul 18, 2024

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Go 79,154 6,037 Updated Jul 19, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,399 2,551 Updated Jul 13, 2024

A package designed to produce logos of Chinese colleges.

TeX 32 6 Updated May 27, 2024

Your image is almost there!

Python 6,920 404 Updated Jul 14, 2024

Fast and memory-efficient exact attention

Python 12,458 1,107 Updated Jul 19, 2024

A curated list for Efficient Large Language Models

Python 975 74 Updated Jul 16, 2024

This repository contains the experimental PyTorch native float8 training UX

Python 194 18 Updated Jul 18, 2024

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

12,824 1,347 Updated Feb 13, 2023

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Python 349 11 Updated Jul 19, 2024

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Cuda 160 12 Updated May 28, 2024

The official Meta Llama 3 GitHub site

Python 23,357 2,505 Updated Jul 17, 2024

A collection of resources on controllable generation with text-to-image diffusion models.

769 22 Updated Jul 19, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,014 136 Updated Jul 16, 2024

What would you do with 1000 H100s...

Jupyter Notebook 790 48 Updated Jan 10, 2024

⭐️ A proxy scraper made using Protractor | Proxy list updates every three hours 🔥

JavaScript 398 66 Updated Jul 19, 2024

🎉 CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated sporadically: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Cuda 891 85 Updated Jul 19, 2024

Puzzles for learning Triton

Jupyter Notebook 873 51 Updated Jul 17, 2024

A tutorial for CUDA&PyTorch

C++ 100 21 Updated Feb 7, 2024

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!

741 42 Updated Jul 16, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,524 822 Updated Jul 18, 2024

Awesome LLM compression research papers and tools.

891 54 Updated Jul 17, 2024

A guide to README file syntax, i.e. an introduction to GitHub Flavored Markdown syntax

6,762 7,256 Updated Mar 8, 2023

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 131 15 Updated May 29, 2024

Examples for MS-AMP package.

Shell 24 10 Updated Apr 13, 2024

Microsoft Automatic Mixed Precision Library

Python 484 35 Updated Apr 8, 2024

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

1,998 183 Updated Jul 8, 2024

⚡ Dynamically generated stats for your github readmes

JavaScript 66,814 21,753 Updated Jul 18, 2024

leaked prompts of GPTs

27,816 3,744 Updated Jul 9, 2024