Skip to content
View yechenzhi's full-sized avatar
Block or Report

Block or report yechenzhi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

SGLang is yet another fast serving framework for large language models and vision language models.

Python 3,567 219 Updated Jul 30, 2024
Jupyter Notebook 207 12 Updated Jul 22, 2024

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 3,431 365 Updated Apr 11, 2024

LLM101n: Let's build a Storyteller

26,145 1,396 Updated Jul 29, 2024

Go ahead and axolotl questions

Python 7,102 775 Updated Jul 31, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,114 195 Updated Jul 21, 2024

Custom data types and layouts for training and inference

Python 448 57 Updated Jul 31, 2024

Self-Explore to avoid ️the p️️it! Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards

Python 33 1 Updated May 4, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,690 311 Updated Jul 31, 2024

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,656 671 Updated Jan 14, 2024

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,377 484 Updated Jul 16, 2024

A framework for few-shot evaluation of language models.

Python 5,999 1,592 Updated Jul 30, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,399 492 Updated Jul 30, 2024

Train transformer language models with reinforcement learning.

Python 8,901 1,096 Updated Jul 30, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,429 358 Updated Jul 30, 2024

Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients

Python 23 6 Updated Mar 20, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,263 115 Updated Jun 13, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 905 79 Updated May 8, 2024

A Toolkit for Distributional Control of Generative Models

Python 68 4 Updated Sep 4, 2023

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 648 36 Updated May 30, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 1,914 149 Updated May 23, 2024

[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Python 57 1 Updated Apr 30, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,004 575 Updated Jul 26, 2024

A framework for prompt tuning using Intent-based Prompt Calibration

Python 1,929 157 Updated Jul 28, 2024
Python 7,036 544 Updated Jul 25, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,172 421 Updated Jul 25, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,862 114 Updated May 15, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 785 45 Updated Jul 30, 2024

Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.

Python 1,154 156 Updated Sep 14, 2021

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,275 71 Updated Apr 11, 2024
Next