Skip to content
View ymzhang0319's full-sized avatar
🌴
On vacation
🌴
On vacation

Highlights

  • Pro
Block or Report

Block or report ymzhang0319

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
15 stars written in Jupyter Notebook
Clear filter

A latent text-to-image diffusion model

Jupyter Notebook 67,042 10,027 Updated Jun 18, 2024

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 6,796 1,045 Updated Aug 6, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 4,784 312 Updated Jun 28, 2024
Jupyter Notebook 3,007 282 Updated May 14, 2024
Jupyter Notebook 1,658 161 Updated Apr 18, 2024

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

Jupyter Notebook 1,271 91 Updated Oct 18, 2022

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,089 205 Updated May 21, 2023

[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting

Jupyter Notebook 459 72 Updated Jul 26, 2021

Neural Rendering with Attention: An Incremental Improvement for Anime Character Animation

Jupyter Notebook 416 21 Updated Apr 15, 2023

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Jupyter Notebook 367 28 Updated Aug 1, 2024

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 224 15 Updated Mar 14, 2024

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter Notebook 137 12 Updated Mar 25, 2024
Jupyter Notebook 24 4 Updated Dec 22, 2023
Jupyter Notebook 12 4 Updated Feb 17, 2024
Jupyter Notebook 9 Updated Jun 28, 2024