Jackson-Kang

🎯

Focusing

Minsu Kang Jackson-Kang

🎯

Focusing

Standing on the shoulders of "Giants".

162 followers · 302 following

NCSOFT AI
Seongnam, Republic of Korea
https://www.linkedin.com/in/minsu-kang-54a43b212/

Achievements

Organizations

Stars

cantabile-kwok / vec2wav2.0

Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995

Python 46 4 Updated Nov 11, 2024

liutaocode / TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 286 21 Updated Nov 15, 2024

Plachtaa / seed-vc

State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning

Python 608 66 Updated Nov 15, 2024

supertone-inc / super-monotonic-align

Python 124 9 Updated Sep 19, 2024

maum-ai / phaseaug

ICASSP 2023 Accepted

Python 190 14 Updated May 6, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,352 1,135 Updated Oct 14, 2024

IDEA-Research / Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,102 103 Updated Nov 3, 2024

GILHANS010 / goyulmusicacademy

goyulmusicacademy

HTML 1 Updated Aug 9, 2024

openvpi / SingingVocoders

A collection of neural vocoders suitable for singing voice synthesis tasks.

Python 101 9 Updated Sep 10, 2024

jishengpeng / ControlSpeech

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

Python 196 7 Updated Sep 3, 2024

dongzhuoyao / awesome-flow-matching

A summary of related works about flow matching, stochastic interpolants

335 10 Updated Jul 29, 2024

clu0 / unet.cu

UNet diffusion model in pure CUDA

Cuda 584 28 Updated Jun 28, 2024

ditto-tts / ditto-tts.github.io

Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

HTML 30 1 Updated Aug 21, 2024

ming024 / SpeechLLM_Survey

Codebase for benchmarking several open-sourced SpeechLLM models

4 Updated Jun 2, 2024

line / LibriTTS-P

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

114 2 Updated Jun 13, 2024

ldzhangyx / instruct-MusicGen

The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning".

Python 71 3 Updated Sep 2, 2024

VincentStimper / normalizing-flows

PyTorch implementation of normalizing flow models

Python 718 108 Updated Aug 25, 2024

interactiveaudiolab / ppgs

High-Fidelity Neural Phonetic Posteriorgrams

Python 95 6 Updated Nov 6, 2024

hrnoh24 / stream-vc

An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)

Python 111 7 Updated Jul 30, 2024

mintisan / awesome-kan

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold N…

2,562 234 Updated Nov 6, 2024