CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation

UC Berkeley · Microsoft Azure AI · Zoom · UNC Chapel Hill

Code Release Checklist (coming soon)

  • Code
  • Checkpoints (7B, 14B)
  • Demos

Introduction

We present CoDi-2, a versatile and interactive Multimodal Large Language Model (MLLM) that can follow complex multimodal interleaved instructions, conduct in-context learning (ICL), reason, chat, and edit in an any-to-any input-output modality paradigm. By aligning modalities with language for both encoding and generation, CoDi-2 empowers Large Language Models (LLMs) not only to understand complex modality-interleaved instructions and in-context examples, but also to autoregressively generate grounded and coherent multimodal outputs in the continuous feature space. To train CoDi-2, we build a large-scale generation dataset encompassing in-context multimodal instructions across text, vision, and audio. CoDi-2 demonstrates a wide range of zero-shot capabilities for multimodal generation, such as in-context learning, reasoning, and compositional any-to-any generation through multi-round interactive conversation. CoDi-2 surpasses previous domain-specific models on tasks such as subject-driven image generation, vision transformation, and audio editing. CoDi-2 marks a substantial step toward a comprehensive multimodal foundation model adept at interpreting in-context language-vision-audio interleaved instructions and producing multimodal outputs.
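Since the code and checkpoints have not been released yet, the sketch below only illustrates the high-level flow described above: modality features are projected into the LLM embedding space, interleaved with text embeddings, processed autoregressively, and mapped back to continuous features that would condition modality decoders (e.g. diffusion models). All module names, dimensions, and interfaces here are illustrative assumptions, not the actual CoDi-2 implementation.

```python
# Hypothetical sketch of the any-to-any pipeline; not the released CoDi-2 code.
import torch
import torch.nn as nn


class ModalityProjector(nn.Module):
    """Maps a modality encoder's features into the LLM embedding space."""

    def __init__(self, feat_dim: int, llm_dim: int):
        super().__init__()
        self.proj = nn.Linear(feat_dim, llm_dim)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (batch, num_tokens, feat_dim) -> (batch, num_tokens, llm_dim)
        return self.proj(feats)


class CoDi2Sketch(nn.Module):
    """Toy stand-in: encode -> interleave with text -> autoregressive backbone
    -> continuous output features for downstream (e.g. diffusion) decoders."""

    def __init__(self, llm_dim: int = 512, image_feat_dim: int = 768,
                 audio_feat_dim: int = 256):
        super().__init__()
        self.image_in = ModalityProjector(image_feat_dim, llm_dim)
        self.audio_in = ModalityProjector(audio_feat_dim, llm_dim)
        # A tiny Transformer standing in for the LLM backbone.
        layer = nn.TransformerEncoderLayer(d_model=llm_dim, nhead=8,
                                           batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, num_layers=2)
        # Heads mapping hidden states back to decoder-conditioning features.
        self.image_out = nn.Linear(llm_dim, image_feat_dim)
        self.audio_out = nn.Linear(llm_dim, audio_feat_dim)

    def forward(self, text_emb, image_feats, audio_feats):
        # Interleave text, image, and audio tokens into one sequence.
        seq = torch.cat([text_emb,
                         self.image_in(image_feats),
                         self.audio_in(audio_feats)], dim=1)
        hidden = self.backbone(seq)
        # The last hidden state would condition the modality decoders.
        return self.image_out(hidden[:, -1:]), self.audio_out(hidden[:, -1:])


if __name__ == "__main__":
    model = CoDi2Sketch()
    text = torch.randn(1, 16, 512)   # placeholder text embeddings
    image = torch.randn(1, 32, 768)  # placeholder image encoder features
    audio = torch.randn(1, 8, 256)   # placeholder audio encoder features
    img_cond, aud_cond = model(text, image, audio)
    print(img_cond.shape, aud_cond.shape)  # (1, 1, 768) (1, 1, 256)
```

In the paper, the generated continuous features are fed to pretrained image and audio diffusion decoders; the toy linear heads above merely stand in for that interface.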