Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

Python 181 19 Updated Jul 3, 2024

facebookresearch / audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 398 44 Updated Aug 6, 2024

argmaxinc / WhisperKit

On-device Speech Recognition for Apple Silicon

Swift 3,065 254 Updated Aug 21, 2024

IAHispano / Applio

VITS-based Voice Conversion focused on simplicity, quality and performance.

Python 1,444 241 Updated Aug 25, 2024

Vaibhavs10 / open-tts-tracker

1,072 67 Updated Jun 21, 2024

kkoutini / PaSST

Efficient Training of Audio Transformers with Patchout

Python 292 49 Updated Jan 12, 2024

wavmark / wavmark

AI-based Audio Watermarking Tool

Python 208 28 Updated Jan 7, 2024

abreuwallace / Stochastic-Restoration-GAN

Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks in Pytorch

Python 5 Updated Dec 19, 2023

dechamps / APO

Some random notes about Windows Audio Processing Objects (APOs).

60 4 Updated May 29, 2022

AIoT-MLSys-Lab / Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

921 78 Updated Aug 22, 2024

ExistentialAudio / AEC3

Forked from shapedbyiris/AEC3

AEC3 Extracted From WebRTC

C++ 1 2 Updated Jul 7, 2022

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,609 366 Updated Aug 10, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,401 373 Updated Aug 22, 2024

microsoft / Llama-2-Onnx

Python 1,010 90 Updated Jan 4, 2024

SociallyIneptWeeb / AICoverGen

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Python 1,000 233 Updated Jul 29, 2024

prosodylab / prosobeast-annotation-tool

Python 40 1 Updated Feb 16, 2022

ga642381 / Speech-Prompts-Adapters

This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.

103 5 Updated Aug 4, 2023

huggingface / distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,439 258 Updated Jul 12, 2024

KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

Python 998 87 Updated Feb 13, 2024

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 2,458 227 Updated Jul 15, 2024

apple / ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Python 16,625 913 Updated Aug 23, 2024

MingjieChen / EasyVC

A toolkit for any-to-any encoder-decoder voice conversion systems

Python 79 8 Updated Aug 10, 2023

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 22,377 3,381 Updated Aug 17, 2024

Weiren weirenlan

Starred repositories

mendableai / firecrawl

pipecat-ai / pipecat

bootphon / phonemizer

charlax / professional-programming

JasonSWFu / VQscore

jcurtis4207 / Juce-Plugins

karpathy / minbpe

hayeong0 / Diff-HierVC