Organizations

@kubeflow

Starred repositories

This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.

Python 67 7 Updated Apr 12, 2024

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 5,387 490 Updated Jul 13, 2024

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Jupyter Notebook 4,466 482 Updated Jul 16, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,824 174 Updated Jul 24, 2024

GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under al…

Python 345 22 Updated Apr 10, 2024

A simple and effective LLM pruning approach.

Python 576 69 Updated Jul 9, 2024

Simple gRPC benchmarking and load testing tool

Go 2,950 264 Updated Jul 1, 2024

An efficient implementation of a rate limiter for asyncio.

Python 465 20 Updated Jul 25, 2024

A state-of-the-art-level open visual language model | multimodal pre-trained model

Python 5,702 391 Updated May 29, 2024

Serving multiple LoRA fine-tuned LLMs as one

Python 903 41 Updated May 8, 2024

Fast inference engine for Transformer models

C++ 3,090 274 Updated Jul 25, 2024

Puck is a high-performance ANN search engine

Jupyter Notebook 320 36 Updated Jun 4, 2024

Robust recipes to align language models with human and AI preferences

Python 4,273 364 Updated Jul 17, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,356 2,003 Updated Jul 14, 2024

Adala: Autonomous DAta (Labeling) Agent framework

Python 870 69 Updated Jul 25, 2024

🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ 1,234 49 Updated Jul 25, 2024

Tutorial for Porting PyTorch Transformer Models to Candle (Rust)

Rust 214 12 Updated Jul 22, 2024

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.

Rust 7,947 383 Updated Jul 25, 2024

Python library for invisible image watermarks (blind image watermarks)

Python 1,529 139 Updated Sep 23, 2023

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python 1,297 91 Updated Oct 31, 2023

Sparsity-aware deep learning inference runtime for CPUs

Python 2,945 167 Updated Jul 19, 2024

LLM training code for Databricks foundation models

Python 3,885 509 Updated Jul 25, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,675 834 Updated Jul 25, 2024

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Python 207 18 Updated Feb 23, 2024

Playing Pokemon Red with Reinforcement Learning

Jupyter Notebook 6,771 609 Updated Jul 9, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.

Python 747 81 Updated Jul 19, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,049 135 Updated Jun 25, 2024