- Shanghai, China
- https://ymzhang0319.github.io/
Highlights
- Pro
Block or Report
Block or report ymzhang0319
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language: Jupyter Notebook
Sort by: Most stars
A latent text-to-image diffusion model
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
Neural Rendering with Attention: An Incremental Improvement for Anime Character Animation
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"