
MINT-1T: A one trillion token multimodal interleaved dataset.

627 11 Updated Jul 31, 2024

Summaries of ICML 2024 papers

2 Updated Jul 31, 2024
Jupyter Notebook 7 Updated Jul 31, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 6,474 323 Updated Aug 1, 2024

Documentation that simply works

HTML 19,468 3,417 Updated Jul 29, 2024

A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.

TypeScript 46 22 Updated Jun 3, 2024

GRUtopia: Dream General Robots in a City at Scale

Python 390 11 Updated Jul 26, 2024
Python 150 9 Updated May 31, 2024

RoleInteract: Evaluating the Social Interaction of Role-Playing Agents

Python 38 4 Updated May 27, 2024

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

18 1 Updated Jul 31, 2024
1 Updated Apr 15, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,044 56 Updated Jul 30, 2024

DEEM: Official implementation of "Diffusion Models Serve as the Eyes of Large Language Models for Image Perception".

Python 10 Updated Jul 22, 2024

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 362 32 Updated Feb 1, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,169 72 Updated Jul 30, 2024

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 656 43 Updated Jul 10, 2024

This is the official implementation of the paper "Needle In A Multimodal Haystack"

Python 66 4 Updated Jul 4, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,617 99 Updated Jul 26, 2024

Must-read Papers on LLM Agents.

1,522 80 Updated Jul 8, 2024

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

412 22 Updated Jul 8, 2024

Grab GPUs

Python 42 5 Updated Feb 25, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 149 6 Updated Jul 24, 2024

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 199 6 Updated May 28, 2024

An RLHF Infrastructure for Vision-Language Models

Python 71 4 Updated Jun 12, 2024

LLM101n: Let's build a Storyteller

26,282 1,408 Updated Aug 1, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model approaching GPT-4o performance.

Python 4,537 352 Updated Aug 1, 2024

LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath

Python 9,119 710 Updated Jul 16, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,724 753 Updated May 19, 2024

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Python 229 7 Updated Jun 25, 2024