hzphzp
  • University of Science and Technology of China


OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 2,563 190 Updated Nov 11, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 702 26 Updated Nov 11, 2024

We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.

Python 11 Updated Aug 30, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,987 158 Updated Oct 31, 2024

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 2,914 175 Updated Nov 13, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 675 36 Updated Aug 5, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,833 112 Updated Jul 29, 2024

Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"

Python 34 Updated Nov 8, 2024

Diffusion Feedback Helps CLIP See Better

Python 214 11 Updated Aug 24, 2024

CoreNet: A library for training deep neural networks

Jupyter Notebook 6,980 541 Updated Oct 14, 2024

Evolution Through Large Models

Python 695 85 Updated Nov 15, 2023

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 212 14 Updated Aug 19, 2024

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Python 152 4 Updated Jul 1, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 743 56 Updated Oct 11, 2024

Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Python 21 Updated Jul 1, 2024

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 540 23 Updated Nov 9, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,755 113 Updated Oct 30, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,262 336 Updated Jun 28, 2024

[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy

Python 226 6 Updated Apr 19, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,255 315 Updated Oct 6, 2024
Python 344 14 Updated Oct 21, 2024
Python 122 7 Updated Feb 13, 2024
Python 367 38 Updated May 1, 2024

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 732 43 Updated Jul 29, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,148 552 Updated Oct 19, 2024

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,629 273 Updated Aug 14, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,226 2,174 Updated Aug 9, 2024

Download the latest stable Synergy binaries.

Python 1,221 117 Updated Nov 1, 2024

PyTorch Implementation of Diffusion Schrodinger Bridge Matching

Python 119 5 Updated May 28, 2023

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,330 567 Updated May 31, 2024