A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷 Providing higher-quality, richer, and more "digestible" data for large models!

Python 2,914 175 Updated Nov 13, 2024

GAIR-NLP / anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 675 36 Updated Aug 5, 2024

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,833 112 Updated Jul 29, 2024

agentic-learning-ai-lab / procreate-diffusion

Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"

Python 34 Updated Nov 8, 2024

baaivision / DIVA

Diffusion Feedback Helps CLIP See Better

Python 214 11 Updated Aug 24, 2024

apple / corenet

CoreNet: A library for training deep neural networks

Jupyter Notebook 6,980 541 Updated Oct 14, 2024

CarperAI / OpenELM

Evolution Through Large Models

Python 695 85 Updated Nov 15, 2023

mihirp1998 / VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 212 14 Updated Aug 19, 2024

ShareGPT4Omni / ShareGPT4V

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Python 152 4 Updated Jul 1, 2024

TencentARC / SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 743 56 Updated Oct 11, 2024

yichengchen24 / ACP

Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Python 21 Updated Jul 1, 2024

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 540 23 Updated Nov 9, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,755 113 Updated Oct 30, 2024

tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,262 336 Updated Jun 28, 2024

TiankaiHang / Min-SNR-Diffusion-Training

[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy

Python 226 6 Updated Apr 19, 2024

FoundationVision / VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,255 315 Updated Oct 6, 2024

mira-space / Mira

Python 344 14 Updated Oct 21, 2024

LukeForeverYoung / UReader

Python 122 7 Updated Feb 13, 2024

zhuyiche / llava-phi

Python 367 38 Updated May 1, 2024

dvlab-research / LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 732 43 Updated Jul 29, 2024

LargeWorldModel / LWM

Large World Model -- Modeling Text and Video with Millions Context

Python 7,148 552 Updated Oct 19, 2024

dvlab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,629 273 Updated Aug 14, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 22,226 2,174 Updated Aug 9, 2024

DEAKSoftware / Synergy-Binaries

Download the latest stable Synergy binaries.

Python 1,221 117 Updated Nov 1, 2024

yuyang-shi / dsbm-pytorch

PyTorch Implementation of Diffusion Schrodinger Bridge Matching

Python 119 5 Updated May 28, 2023

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,330 567 Updated May 31, 2024

Zhipeng Huang hzphzp
