Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 3,740 204 Updated Aug 27, 2024

sarulab-speech / UTMOSv2

UTokyo-SaruLab MOS Prediction System

Python 44 5 Updated Jul 28, 2024

showlab / Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 514 24 Updated Aug 28, 2024

qiuk2 / AAR

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python 49 4 Updated Aug 24, 2024

RoyJames / room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Shell 378 28 Updated Apr 23, 2024

keonlee9420 / evaluate-zero-shot-tts

Evaluation Protocol for Large-Scale Zero-Shot TTS Literature

Python 38 2 Updated Aug 19, 2024

unilight / seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

Python 79 9 Updated Jul 5, 2024

bfs18 / e2_tts

Python 35 5 Updated Aug 23, 2024

ChristophReich1996 / Dirac-GAN

PyTorch reimplementation of the DiracGAN proposed in the paper "Which Training Methods for GANs do actually Converge?" [ICML 2018].

Python 18 5 Updated Jul 12, 2021

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 6,247 552 Updated Aug 27, 2024

tianweiy / DMD2

Python 406 23 Updated Jul 10, 2024

yangdongchao / SimpleSpeech

The open source code for SimpleSpeech series

Python 66 4 Updated Aug 19, 2024

soham97 / awesome-sound_event_detection

Reading list for research topics in Sound AI

159 8 Updated Aug 8, 2024

RicherMans / Dasheng

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Python 33 2 Updated Aug 13, 2024

yeungchenwa / Recommendations-Diffusion-Text-Image

A paper collection of recent diffusion models for text-image generation tasks, e.g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten ge…

178 4 Updated Aug 2, 2024