[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. An image generation model that lets users freely supply control signals and produces natural, harmonious results from multiple controls!

Python 91 1 Updated Jul 5, 2024

apple / ml-mgie

Python 3,817 249 Updated Mar 15, 2024

yichengchen24 / ACP

Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Python 19 Updated Jul 1, 2024

JacobChalk / TIM

Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"

Python 31 3 Updated Aug 2, 2024

YuanGongND / cav-mae

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 219 20 Updated Mar 20, 2024

open-mmlab / StyleShot

StyleShot: A SnapShot on Any Style. A model that can transfer any style onto any content, generating high-quality, personalized stylized images without per-image fine-tuning!

Python 149 8 Updated Jul 5, 2024

NVlabs / edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Python 451 16 Updated May 30, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 30,622 3,523 Updated Aug 10, 2024

open-mmlab / PowerPaint

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 474 29 Updated Jul 30, 2024

Vill-Lab / 2023-AAAI-SDMIA

Code for the AAAI-accepted paper "Similarity Distribution based Membership Inference Attack on Person Re-Identification".

Python 10 1 Updated Nov 6, 2023

open-mmlab / FoleyCrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝

Python 349 25 Updated Jul 26, 2024

camenduru / FoleyCrafter-jupyter

Jupyter Notebook 9 Updated Jun 28, 2024

leiurayer / downkyi

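downkyi (哔哩下载姬), a video download tool for the Bilibili website. Supports batch downloading, 8K, HDR, and Dolby Vision, and provides a toolbox (audio/video extraction, watermark removal, etc.).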

C# 20,265 2,231 Updated Aug 3, 2024

jianzongwu / MotionBooth

The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"

Python 79 7 Updated Jul 31, 2024

donahowe / AutoStudio

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Jupyter Notebook 367 28 Updated Aug 1, 2024

ToonCrafter / ToonCrafter

a research paper for generative cartoon interpolation

Python 4,986 410 Updated Jun 1, 2024

Chen-and-Sim / ChordNova

ChordNova is a powerful open-source chord progression analysis and generation program with unprecedentedly detailed control over chord trait parameters, well beyond mainstream software. Ru…

C++ 711 80 Updated Jan 10, 2024

magenta / midi-ddsp

Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)

Python 299 18 Updated Nov 30, 2022

Sound2Synth / Sound2Synth

Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation

Python 73 11 Updated Jul 28, 2022

Yiming ymzhang0319

Stars

archinetai / audio-ai-timeline

Tencent / MimicMotion

sdaqo / anipy-cli

haoheliu / audioldm_eval

JongSuk1 / AVCap

liuxubo717 / V-ACT

open-mmlab / Live2Diff

NVlabs / edm

aik2mlj / polyffusion

UNITES-Lab / MoE-RBench

kwatcharasupat / bandit

open-mmlab / AnyControl