This repo is a fork, containing the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Python 1 Updated Sep 28, 2023

guyyariv / TempoTokens

This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Python 98 10 Updated Apr 23, 2024

facebookresearch / nevergrad

A Python toolbox for performing gradient-free optimization

Python 3,918 352 Updated Aug 19, 2024

unilight / seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

Python 79 9 Updated Jul 5, 2024

slp-rl / AudioToken

Forked from guyyariv/AudioToken

This repo is a fork from the official PyTorch implementation of "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation" (Interspeech 2023)

Python 5 Updated Jun 25, 2023

slp-rl / SpokenStoryCloze

A spoken version of the textual story cloze benchmark

12 1 Updated Aug 6, 2023

BerlinerA / DSVAE-NES

This repository contains the official PyTorch implementation of the paper: "Learning Discrete Structured VAE using NES".

Python 4 4 Updated May 3, 2022

guyyariv / AudioToken

This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

Python 74 3 Updated Jun 18, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,502 2,063 Updated Jul 18, 2024

gallilmaimon / DISSC

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730

Python 121 9 Updated Dec 8, 2023

slp-rl / aero

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Python 193 26 Updated Jul 14, 2024

slp-rl / SLM-Discrete-Representations

This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Modeling" (ICASSP 2023)

Python 17 1 Updated Jan 3, 2023

facebookresearch / diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python 1,269 157 Updated Apr 3, 2023

RoySheffer / im2wav

Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation

Python 102 9 Updated Jan 18, 2023

Arnontu / DeepAudioWaveformPrior

Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441

Python 8 Updated Oct 25, 2022

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,380 303 Updated Jan 4, 2024

gallilmaimon / LUNATC

This is the official implementation of "A Universal Adversarial Policy for Text Classifiers", Neural Networks (2022), https://doi.org/10.1016/j.neunet.2022.06.018

Python 9 Updated Aug 23, 2022

slp-rl / SC-PhASE

This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)

Python 27 2 Updated Aug 8, 2022

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 32,969 3,977 Updated Aug 16, 2024

SolomidHero / speech-regeneration-enhancer

PyTorch implementation of the paper "High Fidelity Speech Regeneration With Application to Speech Enhancement"

Python 15 1 Updated May 8, 2021

YannickJadoul / Parselmouth

Praat in Python, the Pythonic way

C++ 1,045 114 Updated Aug 15, 2024

Yossi Adi adiyoss

Stars

mir-aidj / all-in-one

guyyariv / vLMIG

slp-rl / HebTTS

YuanGongND / ltu

isjakewong / awesome-discrete-diffusion-models

ShovalMessica / NAST

microsoft / CodeBERT

junegunn / fzf

romkatv / powerlevel10k

slp-rl / TempoTokens