Xiaoyu Zhai (ryantd) 🏎️
Senior MLE @kwai, @StevensInstituteOfTechnology Alumni

49 followers · 253 following

Starred repositories

microsoft / Moonlit

This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.

Python · 67 stars · 7 forks · Updated Apr 12, 2024

bobby-he / simplified_transformers

Python · 278 stars · 24 forks · Updated Nov 13, 2023

pytorch-labs / gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python · 5,387 stars · 490 forks · Updated Jul 13, 2024

hao-ai-lab / LookaheadDecoding

Python · 1,053 stars · 63 forks · Updated Feb 14, 2024

Deci-AI / super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Jupyter Notebook · 4,466 stars · 482 forks · Updated Jul 16, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python · 1,824 stars · 174 forks · Updated Jul 24, 2024

GPT-Fathom / GPT-Fathom

GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under al…

Python · 345 stars · 22 forks · Updated Apr 10, 2024

locuslab / wanda

A simple and effective LLM pruning approach.

Python · 576 stars · 69 forks · Updated Jul 9, 2024

bojand / ghz

Simple gRPC benchmarking and load testing tool

Go · 2,950 stars · 264 forks · Updated Jul 1, 2024

mjpieters / aiolimiter

An efficient implementation of a rate limiter for asyncio.

Python · 465 stars · 20 forks · Updated Jul 25, 2024

THUDM / CogVLM

a state-of-the-art-level open visual language model | multimodal pre-trained model

Python · 5,702 stars · 391 forks · Updated May 29, 2024

punica-ai / punica

Serving multiple LoRA finetuned LLM as one

Python · 903 stars · 41 forks · Updated May 8, 2024

OpenNMT / CTranslate2

Fast inference engine for Transformer models

C++ · 3,090 stars · 274 forks · Updated Jul 25, 2024

microsoft / DeepSpeed-Kernels

C++ · 45 stars · 9 forks · Updated May 22, 2024

baidu / puck

Puck is a high-performance ANN search engine

Jupyter Notebook · 320 stars · 36 forks · Updated Jun 4, 2024

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python · 4,273 stars · 364 forks · Updated Jul 17, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python · 18,356 stars · 2,003 forks · Updated Jul 14, 2024

HumanSignal / Adala

Adala: Autonomous DAta (Labeling) Agent framework

Python · 870 stars · 69 forks · Updated Jul 25, 2024

spotify / voyager

🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ · 1,234 stars · 49 forks · Updated Jul 25, 2024

ToluClassics / candle-tutorial

Tutorial for Porting PyTorch Transformer Models to Candle (Rust)

Rust · 214 stars · 12 forks · Updated Jul 22, 2024

tracel-ai / burn

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.

Rust · 7,947 stars · 383 forks · Updated Jul 25, 2024

ShieldMnt / invisible-watermark

python library for invisible image watermark (blind image watermark)

Python · 1,529 stars · 139 forks · Updated Sep 23, 2023

THUDM / AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python · 1,297 stars · 91 forks · Updated Oct 31, 2023

neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Python · 2,945 stars · 167 forks · Updated Jul 19, 2024

mosaicml / llm-foundry

LLM training code for Databricks foundation models

Python · 3,885 stars · 509 forks · Updated Jul 25, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ · 7,675 stars · 834 forks · Updated Jul 25, 2024

LLM-Tuning-Safety / LLMs-Finetuning-Safety

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Python · 207 stars · 18 forks · Updated Feb 23, 2024