Skip to content
View ymzhang0319's full-sized avatar
🌴
On vacation
🌴
On vacation

Highlights

  • Pro
Block or Report

Block or report ymzhang0319

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A timeline of the latest AI models for audio generation, starting in 2023!

1,876 67 Updated Jan 4, 2024

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 1,330 100 Updated Jul 17, 2024

Little tool in python to watch and download anime from the terminal (the better way to watch anime). Also applicable as an API

Python 250 38 Updated Aug 7, 2024

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 283 30 Updated Jun 2, 2024
Python 6 Updated Jul 13, 2024

Visually-Aware Audio Captioning

Python 39 Updated Mar 3, 2023

Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.

Python 131 10 Updated Jul 22, 2024

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,240 130 Updated Mar 16, 2024

Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls

Python 70 7 Updated Jul 16, 2024

[ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"

Python 7 Updated Jul 1, 2024

BandIt: Cinematic Audio Source Separation

Python 73 3 Updated Jul 19, 2024

[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. 一个支持用户自由输入控制信号的图像生成模型,能够根据多种控制生成自然和谐的结果!

Python 91 1 Updated Jul 5, 2024
Python 3,817 249 Updated Mar 15, 2024

Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Python 19 Updated Jul 1, 2024

Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"

Python 31 3 Updated Aug 2, 2024

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 219 20 Updated Mar 20, 2024

StyleShot: A SnapShot on Any Style. 一款可以迁移任意风格到任意内容的模型,无需针对图片微调,即能生成高质量的个性风格化图片!

Python 149 8 Updated Jul 5, 2024

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Python 451 16 Updated May 30, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 30,622 3,523 Updated Aug 10, 2024

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 474 29 Updated Jul 30, 2024

code for AAAI accepted paper Similarity Distribution based Membership Inference Attack on Person Re-Identification.

Python 10 1 Updated Nov 6, 2023

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

Python 349 25 Updated Jul 26, 2024
Jupyter Notebook 9 Updated Jun 28, 2024

哔哩下载姬downkyi,哔哩哔哩网站视频下载工具,支持批量下载,支持8K、HDR、杜比视界,提供工具箱(音视频提取、去水印等)。

C# 20,265 2,231 Updated Aug 3, 2024

The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"

Python 79 7 Updated Jul 31, 2024

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Jupyter Notebook 367 28 Updated Aug 1, 2024

a research paper for generative cartoon interpolation

Python 4,986 410 Updated Jun 1, 2024

ChordNova is a powerful open-source chord progression analysis plus generation software with unprecedentedly detailed control over chord trait parameters, that is way above mainstream softwares. Ru…

C++ 711 80 Updated Jan 10, 2024

Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)

Python 299 18 Updated Nov 30, 2022

Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation

Python 73 11 Updated Jul 28, 2022
Next