Skip to content
View HuangLK's full-sized avatar
Block or Report

Block or report HuangLK

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

A natural language interface for computers

Python 51,165 4,472 Updated Jul 26, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 3,373 210 Updated Jul 29, 2024

LLM101n: Let's build a Storyteller

25,967 1,379 Updated Jul 29, 2024

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Python 111 13 Updated May 23, 2024

The ultimate Vim configuration (vimrc)

Vim Script 30,427 7,262 Updated May 27, 2024

Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 9,708 730 Updated Jul 29, 2024

GPT Meet Zotero.

TypeScript 4,634 191 Updated Jun 23, 2024

Ongoing research training transformer models at scale

Python 9,524 2,150 Updated Jul 29, 2024

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Python 1,204 78 Updated Jul 29, 2024

RewardBench: the first evaluation tool for reward models.

Python 305 35 Updated Jul 26, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 747 58 Updated Jul 1, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,814 805 Updated Jul 1, 2024

A unified evaluation framework for large language models

Python 2,290 178 Updated Jul 25, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,839 175 Updated Jul 29, 2024

A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to copy code and launch discussions about the problems you hav…

Python 43 2 Updated Jul 4, 2023

nanobind: tiny and efficient C++/Python bindings

C++ 2,183 173 Updated Jul 29, 2024
Python 305 17 Updated Jul 16, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 1,912 148 Updated May 23, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,419 437 Updated May 3, 2024

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,633 761 Updated Aug 24, 2023

LlamaIndex is a data framework for your LLM applications

Python 34,015 4,791 Updated Jul 29, 2024

MOSS-RLHF

Python 1,235 93 Updated Mar 3, 2024

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

Makefile 89 3 Updated Oct 27, 2023

Train transformer language models with reinforcement learning.

Python 8,892 1,092 Updated Jul 29, 2024

更纯粹、更高压缩率的Tokenizer

Python 432 22 Updated Apr 19, 2024

Apache OpenDAL: access data freely.

Rust 3,074 425 Updated Jul 29, 2024

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,112 710 Updated Jul 16, 2024
Next