Skip to content
View loretoparisi's full-sized avatar
🐍
NightShift
🐍
NightShift

Organizations

@Musixmatchdev @musixmatchresearch
Block or Report

Block or report loretoparisi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Audio Generation

Audio synthesis
28 repositories

Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Python 186 35 Updated Apr 2, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 32,570 3,930 Updated Aug 6, 2024

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

C 424 31 Updated Jul 1, 2022

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 130,358 25,909 Updated Aug 9, 2024

A repository for demos illustrating features of the Web Speech API. See https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API for more details.

JavaScript 1,416 732 Updated Sep 10, 2022

Experiments with Hugging Face 🔬 🤗

Python 44 6 Updated Jun 17, 2024
Python 49 14 Updated May 31, 2023

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Python 1,268 178 Updated Jul 30, 2024

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 3,194 249 Updated Aug 9, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 24,519 5,067 Updated Aug 9, 2024

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,294 362 Updated Apr 3, 2024

A fast, local neural text to speech system

C++ 5,391 382 Updated Aug 7, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 34,132 4,047 Updated Jul 10, 2024

Port of OpenAI's Whisper model in C/C++

C++ 33,587 3,402 Updated Aug 9, 2024

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,411 430 Updated Jun 10, 2024

The code for the bark-voicecloning model. Training and inference.

Python 627 106 Updated Sep 13, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,383 2,055 Updated Jul 18, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,398 252 Updated Jul 12, 2024
Python 361 29 Updated Nov 6, 2023

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,542 361 Updated Jul 31, 2024
Jupyter Notebook 103 17 Updated Jul 17, 2024

Unofficial implementation of NVIDIA P-Flow TTS paper

Python 200 28 Updated Jul 1, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,650 193 Updated Jun 18, 2024

On-device Speech Recognition for Apple Silicon

Swift 3,026 250 Updated Aug 7, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,310 715 Updated Jun 24, 2024

React Native Expo wrapper for the Swift WhisperKit library

Kotlin 7 Updated Jul 12, 2024

A PyTorch-based Speech Toolkit

Python 8,367 1,346 Updated Aug 5, 2024

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 389 42 Updated Aug 6, 2024