- The Hong Kong University of Science and Technology
- jxhe.github.io
- @junxian_he
Stars
ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
OpenPI dataset for tracking entities in open-domain procedural text
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Entropy-Based Sampling and Parallel CoT Decoding
O1 Replication Journey: A Strategic Progress Report – Part I
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
Source code for Self-Evaluation Guided MCTS for online DPO.
🙌 OpenHands: Code Less, Make More
The model, data and code for the visual GUI Agent SeeClick
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
Recipes to train reward models for RLHF.
Math-specific large language models from the Qwen2 series.
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
✨✨Latest Advances in Multimodal Large Language Models
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
A benchmark that challenges language models to code solutions for scientific problems
A curated collection of resources for LLM mathematical reasoning, most screened by @tongyx361 to ensure high quality and accompanied by concise, carefully written descriptions to help readers g…
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models
DSPy: The framework for programming—not prompting—foundation models
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)