dariadiatlova

🦄

Daria Diatlova dariadiatlova

🦄

voice dl researcher

44 followers · 40 following

@deepvk
Saint-Petersburg
https://www.linkedin.com/in/daria-diatlova-09b589184/

Achievements

Organizations

Lists (9)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

jishengpeng / ControlSpeech

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

Python 182 6 Updated Sep 3, 2024

kyutai-labs / moshi

Python 5,665 420 Updated Sep 27, 2024

arxyzan / data2vec-pytorch

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Python 168 26 Updated May 30, 2023

huggingface / dataspeech

Python 277 37 Updated Sep 3, 2024

gudgud96 / frechet-audio-distance

A lightweight library for Frechet Audio Distance calculation.

Python 231 23 Updated Sep 4, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 4,258 427 Updated Sep 23, 2024

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,675 250 Updated Sep 25, 2024

jishengpeng / WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 671 39 Updated Sep 21, 2024

deepvk / metr

🚜 METR: Message Enhanced Tree-Ring

Jupyter Notebook 10 Updated Aug 19, 2024

JunyiPeng00 / SLT22_MultiHead-Factorized-Attentive-Pooling

An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification

Python 11 2 Updated Sep 22, 2024

Lamomal / s3prl_correlation

Forked from s3prl/s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Python 8 1 Updated Aug 22, 2022

bytedance / SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Python 992 76 Updated Sep 24, 2024

facebookresearch / libri-light

dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.

Python 472 76 Updated Jul 11, 2023

Helw150 / levanter

Forked from stanford-crfm/levanter

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 8 Updated Jun 16, 2024

Daria Diatlova dariadiatlova

Organizations

Lists (9)

datasets

etts

music

plc

speech-datasets

speech enhancement

tts

vc

vocoders

Stars