Skip to content
View Xiaoyang-Wang's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report Xiaoyang-Wang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RewardBench: the first evaluation tool for reward models.

Python 344 40 Updated Aug 28, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 16,418 1,265 Updated Sep 5, 2024
Jupyter Notebook 113 18 Updated Jan 16, 2024

LLM training in simple, raw C/CUDA

Cuda 23,168 2,571 Updated Aug 26, 2024

Minimalistic large language model 3D-parallelism training

Python 1,086 104 Updated Sep 5, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,439 469 Updated Jan 8, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,985 194 Updated Sep 3, 2024

A PyTorch Native LLM Training Framework

Python 567 27 Updated Aug 25, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,171 915 Updated Aug 29, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 8,313 677 Updated Sep 5, 2024

pickleDB is an open source key-value store using Python's json module.

Python 908 127 Updated Jun 15, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 32,918 5,572 Updated Sep 5, 2024

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 4,248 765 Updated Mar 5, 2024

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 6,467 459 Updated Sep 5, 2024

Supercharge Your Model Training

Python 5,111 413 Updated Sep 4, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 30,203 3,721 Updated Sep 5, 2024

Robust recipes to align language models with human and AI preferences

Python 4,436 385 Updated Aug 20, 2024

Diffusion Reinforcement Learning Library

Python 169 7 Updated Feb 13, 2024
Python 2,483 299 Updated May 19, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 15,136 1,013 Updated Sep 5, 2024

SAM: Sharpness-Aware Minimization (PyTorch)

Python 1,733 195 Updated Feb 21, 2024

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,422 221 Updated Sep 5, 2024

Curated list of data science interview questions and answers

3,119 717 Updated Aug 16, 2024

Machine Learning and Computer Vision Engineer - Technical Interview Questions

2,866 485 Updated May 22, 2024

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

8,965 1,469 Updated Aug 31, 2023

Development repository for the Triton language and compiler

C++ 12,466 1,510 Updated Sep 5, 2024

[ICLR 2023 Spotlight] Code release for "Dirichlet-based Uncertainty Calibration for Active Domain Adaptation"

Python 27 Updated Mar 5, 2023
Scala 2 Updated May 7, 2021
Next