Skip to content
View nojiyoon's full-sized avatar

Block or report nojiyoon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Image-to-Image Translation in PyTorch

Python 22,615 6,266 Updated May 14, 2024

Implementation of the Wave-U-Net for audio source separation

Python 818 177 Updated Mar 24, 2023

Modeling, training, eval, and inference code for OLMo

Python 4,308 421 Updated Aug 27, 2024

A simple notebook demonstrating prompt-based music generation via Mubert API

Jupyter Notebook 2,731 243 Updated May 4, 2023

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 10,753 787 Updated Jul 18, 2024

Get a ChatGPT plugin up and running in under 5 minutes!

Python 4,250 740 Updated Jan 30, 2024

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 20,997 3,676 Updated Jul 4, 2024

PhotoMaker [CVPR 2024]

Jupyter Notebook 9,229 737 Updated Aug 15, 2024

[CVPR2024] Make Your Dream A Vlog

Python 406 41 Updated Mar 19, 2024

リアルタイムボイスチェンジャー Realtime Voice Changer

Python 15,804 1,703 Updated Aug 27, 2024

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 559 40 Updated Aug 21, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,609 1,224 Updated Dec 6, 2023

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,656 249 Updated Jun 24, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,076 4,108 Updated Aug 19, 2024

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

TypeScript 1,579 169 Updated Aug 19, 2024
Jupyter Notebook 515 63 Updated Jul 25, 2023

This is the official repository for M2UGen

Jupyter Notebook 435 38 Updated May 8, 2024

MU-LLaMA: Music Understanding Large Language Model

Python 221 16 Updated Mar 25, 2024

Singing Voice Conversion via diffusion model

Jupyter Notebook 2,614 799 Updated Jul 10, 2023

Instant voice cloning by MIT and MyShell.

Python 28,096 2,752 Updated Aug 21, 2024

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,237 952 Updated Jul 26, 2024

so-vits-svc fork with realtime support, improved interface and more features.

Python 8,646 1,149 Updated Aug 21, 2024
Python 9,185 1,194 Updated Aug 27, 2024

End-to-End Speech Processing Toolkit

Python 8,233 2,142 Updated Aug 27, 2024

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,062 104 Updated May 10, 2024

Audio super resolution using neural networks

Python 1,143 205 Updated Oct 24, 2023

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 11,526 2,155 Updated Jun 26, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,312 929 Updated Aug 5, 2024

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Shell 24,922 3,116 Updated Aug 15, 2024
Next