Skip to content
View dariadiatlova's full-sized avatar
🦄
🦄

Organizations

@deepvk

Block or report dariadiatlova

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

Python 182 6 Updated Sep 3, 2024
Python 5,665 420 Updated Sep 27, 2024

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Python 168 26 Updated May 30, 2023
Python 277 37 Updated Sep 3, 2024

A lightweight library for Frechet Audio Distance calculation.

Python 231 23 Updated Sep 4, 2024

Inference and training library for high-quality TTS models.

Python 4,258 427 Updated Sep 23, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,675 250 Updated Sep 25, 2024

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 671 39 Updated Sep 21, 2024

🚜 METR: Message Enhanced Tree-Ring

Jupyter Notebook 10 Updated Aug 19, 2024

An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification

Python 11 2 Updated Sep 22, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Python 8 1 Updated Aug 22, 2022

SALMONN: Speech Audio Language Music Open Neural Network

Python 992 76 Updated Sep 24, 2024

dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.

Python 472 76 Updated Jul 11, 2023

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 8 Updated Jun 16, 2024

The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 95 5 Updated Sep 3, 2024

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

Python 172 9 Updated Aug 20, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,574 1,223 Updated Aug 21, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,469 200 Updated Aug 1, 2024

Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))

Python 26 1 Updated Aug 6, 2024

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Python 1,392 158 Updated Sep 25, 2024

Official PyTorch implementation of AdaFlow

Jupyter Notebook 15 Updated Jun 4, 2024

BLSP-Emo: Towards Empathetic Large Speech-Language Models

Python 34 2 Updated Jun 7, 2024

Implementation of TTS model based on NVIDIA P-Flow TTS Paper

Python 65 5 Updated May 12, 2024

Official release of StyleTalk dataset.

54 2 Updated Jul 1, 2024

Pytorch implementation of BigVSAN

Python 196 16 Updated Mar 23, 2024

A summary of related works about flow matching, stochastic interpolants

281 10 Updated Jul 29, 2024

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 222 42 Updated Apr 29, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 636 80 Updated Sep 23, 2024

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023

Python 196 13 Updated Mar 13, 2023

[AAAI 2024] Code for CTX-vec2wav in UniCATS

Python 117 16 Updated Jun 11, 2024
Next