Skip to content
View jxhe's full-sized avatar

Organizations

@asyml

Block or report jxhe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

C 374 84 Updated Jul 25, 2024

OpenPI dataset for tracking entities in open domain procedural text

Python 21 2 Updated Aug 13, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 926 66 Updated Nov 2, 2024

Entropy Based Sampling and Parallel CoT Decoding

TypeScript 2,916 305 Updated Oct 31, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,154 28 Updated Oct 28, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,093 450 Updated Oct 10, 2024

[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Python 72 4 Updated Aug 5, 2024

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 76 6 Updated Oct 24, 2024

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 180 22 Updated Aug 6, 2024

🙌 OpenHands: Code Less, Make More

Python 33,489 3,836 Updated Nov 2, 2024
Python 305 24 Updated Sep 26, 2024

The model, data and code for the visual GUI Agent SeeClick

HTML 210 11 Updated Aug 27, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 469 53 Updated Oct 23, 2024

Recipes to train reward model for RLHF.

Python 780 65 Updated Sep 23, 2024

A series of math-specific large language models of our Qwen2 series.

Python 570 56 Updated Oct 29, 2024

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Python 968 72 Updated Feb 22, 2024

LLMs + Lean, on your laptop or in the cloud

Lean 122 14 Updated Oct 23, 2024
Jupyter Notebook 299 22 Updated Jul 22, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,444 795 Updated Oct 29, 2024

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 855 58 Updated Sep 25, 2024

LLM101n: Let's build a Storyteller

29,583 1,620 Updated Aug 1, 2024

A benchmark that challenges language models to code solutions for scientific problems

Python 84 9 Updated Oct 28, 2024

Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers g…

76 2 Updated Jul 12, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,820 111 Updated Jul 29, 2024

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Python 220 3 Updated Oct 2, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 18,375 1,412 Updated Nov 2, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 33,574 4,120 Updated Nov 2, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)

Python 2,394 234 Updated Nov 1, 2024
Python 441 44 Updated Oct 28, 2024
Next