Skip to content
View ryantd's full-sized avatar
🏎️
🏎️

Organizations

@kubeflow
Block or Report

Block or report ryantd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

783 results for source starred repositories
Clear filter

Universal Python binding for the LMDB 'Lightning' Database

C 627 102 Updated Jul 1, 2024

Vector Search Engine base on BRPC + FAISS

C++ 144 50 Updated Oct 21, 2019

LLM training in simple, raw C/CUDA

Cuda 22,276 2,465 Updated Jul 25, 2024

Grok open release

Python 49,203 8,311 Updated May 29, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,191 493 Updated Jul 11, 2024

A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.

Python 117 9 Updated Dec 8, 2023

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,799 803 Updated Jul 1, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,768 508 Updated May 31, 2024

Make huge neural nets fit in memory

Python 2,667 271 Updated Apr 26, 2020

Provide Python access to the NVML library for GPU diagnostics

Python 207 31 Updated Jul 16, 2024

Building blocks for foundation models.

278 10 Updated Jan 3, 2024

Code repository for the paper - "Matryoshka Representation Learning"

Jupyter Notebook 361 17 Updated Feb 19, 2024

pytorch-profiler

Python 47 8 Updated Jun 1, 2023

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 473 35 Updated Jul 10, 2024

Efficient AI Inference & Serving

Python 447 25 Updated Jan 8, 2024

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 175 15 Updated Apr 24, 2024
Python 846 83 Updated Jul 24, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,101 195 Updated Jul 21, 2024

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 392 8 Updated Jul 4, 2024
Python 8,229 479 Updated Jan 27, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,692 407 Updated Jul 15, 2024

A collection of memory efficient attention operators implemented in the Triton language.

Python 187 15 Updated Jun 5, 2024

[TMLR 2024] Efficient Large Language Models: A Survey

872 73 Updated Jul 25, 2024
Python 1,133 160 Updated Jul 25, 2024

Awesome machine learning model compression research papers, quantization, tools, and learning material.

456 62 Updated May 8, 2024

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,238 1,048 Updated Jun 21, 2024

MLX: An array framework for Apple silicon

C++ 15,946 906 Updated Jul 25, 2024

Mamba SSM architecture

Python 11,901 994 Updated Jul 24, 2024

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,727 193 Updated Jul 20, 2024
Next