HuangLK

Starred repositories

Fcitx5 input method framework and engines ported to Android

Kotlin 2,915 173 Updated Oct 7, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 166 9 Updated Jun 7, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,113 225 Updated Oct 8, 2024
Python 153 11 Updated Oct 8, 2024

An automated pipeline for evaluating LLMs for role-playing.

Python 122 3 Updated Sep 14, 2024

A natural language interface for computers

Python 52,507 4,630 Updated Sep 26, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,004 336 Updated Oct 7, 2024

LLM101n: Let's build a Storyteller

29,206 1,599 Updated Aug 1, 2024

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Python 159 17 Updated May 23, 2024

The ultimate Vim configuration (vimrc)

Vim Script 30,624 7,287 Updated Oct 6, 2024

Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 10,143 748 Updated Oct 7, 2024

GPT Meet Zotero.

TypeScript 4,997 200 Updated Sep 23, 2024

Ongoing research training transformer models at scale

Python 10,193 2,291 Updated Oct 7, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,481 115 Updated Oct 7, 2024

RewardBench: the first evaluation tool for reward models.

Python 379 48 Updated Oct 6, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 767 60 Updated Jul 1, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,080 840 Updated Jul 1, 2024

A unified evaluation framework for large language models

Python 2,407 179 Updated Sep 12, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 2,151 212 Updated Oct 6, 2024

A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to copy code and launch discussions about the problems you hav…

Python 45 2 Updated Jul 4, 2023

nanobind: tiny and efficient C++/Python bindings

C++ 2,306 193 Updated Oct 8, 2024
Python 310 16 Updated Jul 16, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,064 169 Updated Aug 11, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,722 455 Updated May 3, 2024

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,844 774 Updated Aug 24, 2023

LlamaIndex is a data framework for your LLM applications

Python 35,917 5,096 Updated Oct 8, 2024

MOSS-RLHF

Python 1,276 98 Updated Mar 3, 2024

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

Makefile 105 3 Updated Oct 27, 2023