Skip to content
View luxinyu1's full-sized avatar
🎰
Waiting for the very pulse of the machine...
🎰
Waiting for the very pulse of the machine...

Highlights

  • Pro

Block or report luxinyu1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 395 48 Updated Sep 1, 2024
Python 1,170 157 Updated Aug 31, 2024

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 1,740 255 Updated May 25, 2024

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 552 71 Updated Aug 29, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,199 818 Updated Aug 21, 2024

A native PyTorch Library for large model training

Python 1,501 137 Updated Aug 30, 2024
TeX 58 3 Updated Aug 20, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,199 396 Updated May 24, 2024

Training Sparse Autoencoders on Language Models

HTML 351 94 Updated Sep 1, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,966 192 Updated Sep 1, 2024

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,691 335 Updated Jul 31, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 29,958 3,696 Updated Sep 2, 2024

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Jupyter Notebook 256 44 Updated Aug 16, 2024

Go ahead and axolotl questions

Python 7,422 799 Updated Sep 2, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,553 441 Updated May 3, 2024

Collection of papers for scalable automated alignment.

44 5 Updated Aug 11, 2024

RewardBench: the first evaluation tool for reward models.

Python 339 40 Updated Aug 28, 2024

An educational resource to help anyone learn deep reinforcement learning.

Python 9,937 2,192 Updated Aug 5, 2024

Scalable toolkit for efficient model alignment

Python 499 52 Updated Sep 1, 2024

For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.

Jupyter Notebook 28 5 Updated Aug 29, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 619 36 Updated Aug 22, 2024
Python 1,477 129 Updated Aug 6, 2024

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 3,928 356 Updated Aug 30, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,571 463 Updated Aug 22, 2024
Python 494 39 Updated Feb 5, 2024
Python 73 5 Updated Aug 17, 2024

Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"

Python 65 7 Updated Mar 22, 2024

Paper list of multi-agent reinforcement learning (MARL)

3,941 718 Updated Jul 18, 2024
Python 64 9 Updated Jul 2, 2024

Self-playing Adversarial Language Game Enhances LLM Reasoning

Python 78 8 Updated Jul 2, 2024
Next