Skip to content
View hhlcorpusant's full-sized avatar

Block or report hhlcorpusant

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,799 207 Updated Jun 18, 2024

Audiogen Codec

Python 118 11 Updated Jul 9, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,195 3,812 Updated Sep 17, 2024

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 140 15 Updated Jul 25, 2024

Generative models for conditional audio generation

Python 2,549 237 Updated Jul 15, 2024

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 764 36 Updated Jun 27, 2024

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,197 1,203 Updated Jul 23, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,976 1,559 Updated Sep 27, 2024

Music Audio Representation Benchmark for Universal Evaluation

Python 84 4 Updated May 16, 2024

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Python 388 22 Updated Jun 5, 2024

This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.

Python 196 11 Updated Jul 25, 2024

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,703 705 Updated Jul 3, 2024

Fine-Tuning your VITS model using a pre-trained model

Python 546 86 Updated May 2, 2023

Mamba SSM architecture

Python 12,699 1,064 Updated Sep 26, 2024

General Purpose Audio Effect Removal

Python 92 4 Updated Aug 31, 2023

Differentiable audio signal processors in PyTorch

Python 225 5 Updated Dec 4, 2023

The fundamentals for Digital Audio Signal Processing. Formerly `sample`.

Rust 870 63 Updated Mar 26, 2024

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,016 133 Updated Jul 12, 2024

VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama

Python 1,497 271 Updated May 20, 2024

AGI 社交网络 Bot. BiliBili | 直播聊天数字人 | 视频@自动回复 | 私信bot | 终端聊天 | 语音交互

Python 545 108 Updated Mar 30, 2024

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…

Python 2,875 436 Updated Sep 27, 2024

InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. Th…

TypeScript 23,034 2,381 Updated Sep 27, 2024

A command-line utility that allows you to interact with the Shutterstock public API.

Python 6 2 Updated May 26, 2023

【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

Python 227 16 Updated Sep 12, 2024

A curated list of deep learning resources for video-text retrieval.

583 66 Updated Oct 20, 2023

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 852 121 Updated Apr 12, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 33,836 4,119 Updated Aug 16, 2024

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Python 220 37 Updated Jul 22, 2024

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Python 360 45 Updated May 30, 2023

ONNX deployment of the CREPE pitch tracker

Python 20 1 Updated Oct 27, 2022
Next