Skip to content
View ilovecv's full-sized avatar

Block or report ilovecv

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 118 Updated Oct 9, 2024
Python 1,595 114 Updated Nov 8, 2024

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,421 119 Updated Jul 17, 2024
HTML 25 1 Updated Aug 2, 2024

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,159 189 Updated Nov 7, 2024

Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

83 2 Updated Jul 16, 2024

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral

Python 459 29 Updated Aug 12, 2024

This project is the official implementation of 'Diffir: Efficient diffusion model for image restoration', ICCV2023

Jupyter Notebook 462 20 Updated Aug 25, 2024

Scaling Diffusion Transformers with Mixture of Experts

Python 202 8 Updated Sep 9, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,851 129 Updated Jul 2, 2024

TryOnDiffusion: A Tale of Two UNets Implementation

Jupyter Notebook 354 44 Updated Nov 8, 2024

【NeurIPS 2024】Dense Connector for MLLMs

Python 133 5 Updated Oct 14, 2024

Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 799 43 Updated Oct 5, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,018 379 Updated Aug 7, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,973 1,134 Updated Sep 24, 2024

VisionLLM Series

Python 902 26 Updated Oct 18, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 737 56 Updated Oct 11, 2024

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 588 28 Updated Oct 14, 2024
Python 109 6 Updated Jul 12, 2024

Kolors Team

Python 3,825 262 Updated Sep 4, 2024

Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Jupyter Notebook 113 3 Updated Sep 21, 2024

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 367 9 Updated Sep 2, 2024

Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models

Python 191 29 Updated Jul 9, 2023

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Python 197 11 Updated Apr 3, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 676 36 Updated Aug 5, 2024

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Python 502 29 Updated Jul 16, 2024

ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023

Jupyter Notebook 118 10 Updated Nov 8, 2023

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Python 247 15 Updated Jul 21, 2024

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Python 536 27 Updated Jul 5, 2024

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Python 227 3 Updated Oct 2, 2024
Next