- Cambridge, MA
- https://scholar.harvard.edu/kli
Highlights
- Pro
Block or Report
Block or report likenneth
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Robust recipes to align language models with human and AI preferences
👨💻 An awesome and curated list of best code-LLM for research.
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
800,000 step-level correctness labels on LLM solutions to MATH problems
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training
corl-team / CORL
Forked from tinkoff-ai/CORLHigh-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
A framework for few-shot evaluation of language models.
A library for advanced large language model reasoning
Code for the paper Fine-Tuning Language Models from Human Preferences
A modular RL library to fine-tune language models to human preferences
Methods and Implements of Deep Clustering
"DeepDPM: Deep Clustering With An Unknown Number of Clusters" [Ronen, Finder, and Freifeld, CVPR 2022]
Hierarchical community detection by recursive bi-partitioning
A library for efficient similarity search and clustering of dense vectors.