Skip to content
View luxinyu1's full-sized avatar
🎰
Waiting for the very pulse of the machine...
🎰
Waiting for the very pulse of the machine...

Highlights

  • Pro
Block or Report

Block or report luxinyu1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Training Sparse Autoencoders on Language Models

HTML 230 76 Updated Jul 19, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,787 170 Updated Jul 18, 2024

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,445 321 Updated Jul 16, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 26,780 3,316 Updated Jul 18, 2024

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Jupyter Notebook 217 33 Updated Jul 16, 2024

Go ahead and axolotl questions

Python 6,948 762 Updated Jul 18, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,361 432 Updated May 3, 2024

Collection of papers for scalable automated alignment.

38 5 Updated Jul 4, 2024

RewardBench: the first evaluation tool for reward models.

Python 294 30 Updated Jul 18, 2024

An educational resource to help anyone learn deep reinforcement learning.

Python 9,837 2,182 Updated Apr 17, 2024

Scalable toolkit for efficient model alignment

Python 443 48 Updated Jul 19, 2024

For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.

Jupyter Notebook 24 3 Updated Jul 18, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 528 30 Updated Jul 18, 2024
Python 1,379 118 Updated Jul 18, 2024

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 3,795 340 Updated Jul 18, 2024

A series of large language models trained from scratch by developers @01-ai

Python 7,498 458 Updated Jul 18, 2024
Python 486 39 Updated Feb 5, 2024
Python 67 3 Updated Jul 18, 2024

Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"

Python 36 6 Updated Mar 22, 2024

Paper list of multi-agent reinforcement learning (MARL)

3,883 717 Updated Jul 18, 2024
Python 64 9 Updated Jul 2, 2024

Self-playing Adversarial Language Game Enhances LLM Reasoning

Python 76 8 Updated Jul 2, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,560 46 Updated Jul 18, 2024

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

363 19 Updated Feb 7, 2024

[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future

248 8 Updated Jul 4, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,069 194 Updated Jun 24, 2024
JavaScript 251 33 Updated Jul 18, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,324 359 Updated Jul 18, 2024

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Python 69 7 Updated Jun 24, 2024

Simple and efficient pytorch-native transformer training and inference (batched)

Python 45 2 Updated Apr 2, 2024
Next