Skip to content
View yxchng's full-sized avatar

Block or report yxchng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 10 1 Updated Nov 8, 2024

Code for studying the super weight in LLM

Jupyter Notebook 3 Updated Nov 11, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,803 447 Updated Jun 22, 2024

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 183 8 Updated Nov 12, 2024

A Video Tokenizer Evaluation Dataset

Python 39 1 Updated Nov 6, 2024

A suite of image and video neural tokenizers

Python 707 16 Updated Nov 8, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,617 128 Updated Nov 12, 2024

Robust recipes to align language models with human and AI preferences

Python 4,670 406 Updated Oct 7, 2024

✨✨Latest Papers and Datasets on Mobile and PC Agent

23 4 Updated Nov 7, 2024

Collect some World Models for Autonomous Driving papers.

526 15 Updated Nov 11, 2024

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

147 5 Updated Nov 9, 2024
Python 51 5 Updated Oct 28, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,181 214 Updated Nov 6, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 509 51 Updated Aug 30, 2024

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 433 37 Updated Nov 11, 2024

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,138 63 Updated Nov 7, 2024

The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"

Jupyter Notebook 76 Updated Oct 12, 2024

Code for Quiet-STaR

Python 648 88 Updated Aug 21, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,272 34 Updated Oct 28, 2024

Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"

Python 333 39 Updated Mar 4, 2024

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 639 30 Updated Nov 8, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 4,579 343 Updated Nov 5, 2024

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,472 174 Updated Sep 30, 2024

Building blocks for foundation models.

391 15 Updated Jan 3, 2024

Automated Design of Agentic Systems

Python 1,021 149 Updated Nov 6, 2024

A paper list of some recent works about Token Compress for Vit and VLM

133 4 Updated Nov 11, 2024
Python 31 5 Updated Sep 14, 2024

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Python 179 11 Updated Oct 12, 2024
Next