hhlcorpusant

Follow

hhlcorpusant

Follow

1 follower · 2 following

Stars

collabora / WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,799 207 Updated Jun 18, 2024

AudiogenAI / agc

Audiogen Codec

Python 118 11 Updated Jul 9, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,195 3,812 Updated Sep 17, 2024

Stability-AI / stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 140 15 Updated Jul 25, 2024

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 2,549 237 Updated Jul 15, 2024

horseee / DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 764 36 Updated Jun 27, 2024

facebookresearch / mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,197 1,203 Updated Jul 23, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,976 1,559 Updated Sep 27, 2024

a43992899 / MARBLE-Benchmark

Music Audio Representation Benchmark for Universal Evaluation

Python 84 4 Updated May 16, 2024

researchmm / MM-Diffusion

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Python 388 22 Updated Jun 5, 2024

XinhaoMei / WavCaps

This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.

Python 196 11 Updated Jul 25, 2024

Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,703 705 Updated Jul 3, 2024

SayaSS / vits-finetuning

Fine-Tuning your VITS model using a pre-trained model

Python 546 86 Updated May 2, 2023

state-spaces / mamba

Mamba SSM architecture

Python 12,699 1,064 Updated Sep 26, 2024

mhrice / RemFx

General Purpose Audio Effect Removal

Python 92 4 Updated Aug 31, 2023

csteinmetz1 / dasp-pytorch

Differentiable audio signal processors in PyTorch

Python 225 5 Updated Dec 4, 2023

RustAudio / dasp

The fundamentals for Digital Audio Signal Processing. Formerly `sample`.

Rust 870 63 Updated Mar 26, 2024

Fictionarry / ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,016 133 Updated Jul 12, 2024

yakami129 / VirtualWife

VirtualWife是一个虚拟数字人项目，支持B站直播，支持openai、ollama

Python 1,497 271 Updated May 20, 2024

jiran214 / langup-ai

AGI 社交网络 Bot. BiliBili | 直播聊天数字人 | 视频@自动回复 | 私信bot | 终端聊天 | 语音交互

Python 545 108 Updated Mar 30, 2024

Ikaros-521 / AI-Vtuber

Forked from sandboxdream/AI-Vtuber

AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】直播中与观众实时互动或直接在本地进行聊…

Python 2,875 436 Updated Sep 27, 2024

invoke-ai / InvokeAI

InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. Th…

TypeScript 23,034 2,381 Updated Sep 27, 2024

shutterstock / shutterstock-cli

A command-line utility that allows you to interact with the Shutterstock public API.

Python 6 2 Updated May 26, 2023

whwu95 / Cap4Video

【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

Python 227 16 Updated Sep 12, 2024

danieljf24 / awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

583 66 Updated Oct 20, 2023

ArrowLuo / CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 852 121 Updated Apr 12, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 33,836 4,119 Updated Aug 16, 2024

descriptinc / audiotools

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Python 220 37 Updated Jul 22, 2024

adobe-research / DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Python 360 45 Updated May 30, 2023

yqzhishen / onnxcrepe

ONNX deployment of the CREPE pitch tracker

Python 20 1 Updated Oct 27, 2022