Skip to content
View adiyoss's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report adiyoss

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

All-In-One Music Structure Analyzer

Python 391 36 Updated May 9, 2024

This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Generation

Python 14 Updated Jul 1, 2024

The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"

Python 59 3 Updated Jul 21, 2024

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 351 29 Updated Apr 24, 2024

A curated list for awesome discrete diffusion models resources.

36 1 Updated Apr 11, 2023

Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11037

Python 40 2 Updated Jul 2, 2024

CodeBERT

Python 2,131 439 Updated Jul 9, 2023

🌸 A command-line fuzzy finder

Go 63,354 2,365 Updated Aug 25, 2024

A Zsh theme

Shell 45,213 2,148 Updated Aug 21, 2024

This repo is a fork, containing the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Python 1 Updated Sep 28, 2023

This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Python 98 10 Updated Apr 23, 2024

A Python toolbox for performing gradient-free optimization

Python 3,918 352 Updated Aug 19, 2024

A sequence-to-sequence voice conversion toolkit.

Python 79 9 Updated Jul 5, 2024

This repo is a fork from the official PyTorch implementation of "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation" (Interspeech 2023)

Python 5 Updated Jun 25, 2023

A spoken version of the textual story cloze benchmark

12 1 Updated Aug 6, 2023

This repository contains the official PyTorch implementation of the paper: "Learning Discrete Structured VAE using NES".

Python 4 4 Updated May 3, 2022

This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

Python 74 3 Updated Jun 18, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,502 2,063 Updated Jul 18, 2024

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730

Python 121 9 Updated Dec 8, 2023

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Python 193 26 Updated Jul 14, 2024

This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language Modeling" (ICASSP 2023)

Python 17 1 Updated Jan 3, 2023

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python 1,269 157 Updated Apr 3, 2023

Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation

Python 102 9 Updated Jan 18, 2023

Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441

Python 8 Updated Oct 25, 2022

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,380 303 Updated Jan 4, 2024

This is the official implementation of "A Universal Adversarial Policy for Text Classifiers", Neural Networks (2022), https://doi.org/10.1016/j.neunet.2022.06.018

Python 9 Updated Aug 23, 2022

This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)

Python 27 2 Updated Aug 8, 2022

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 32,969 3,977 Updated Aug 16, 2024

Pytorch implementation of paper "High Fidelity Speech Regeneration With Application to Speech Enhancement"

Python 15 1 Updated May 8, 2021

Praat in Python, the Pythonic way

C++ 1,045 114 Updated Aug 15, 2024
Next