loretoparisi

🐍

NightShift

Loreto Parisi loretoparisi

🐍

NightShift

MSc Computer Engineering, Machine Learning at @musixmatch

397 followers · 1.7k following

@Musixmatchdev
Italy, Bologna
@loretoparisi

Achievements

x2 x2

Achievements

x2 x2

Organizations

Block or Report

Block or report loretoparisi

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Audio Generation

Audio synthesis

28 repositories

galgreshler / Catch-A-Waveform

Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Python 186 35 Updated Apr 2, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 32,570 3,930 Updated Aug 6, 2024

petewarden / spchcat

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

C 424 31 Updated Jul 1, 2022

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 130,358 25,909 Updated Aug 9, 2024

mdn / web-speech-api

A repository for demos illustrating features of the Web Speech API. See https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API for more details.

JavaScript 1,416 732 Updated Sep 10, 2022

loretoparisi / hf-experiments

Experiments with Hugging Face 🔬 🤗

Python 44 6 Updated Jun 17, 2024

acids-ircam / cached_conv

Python 49 14 Updated May 31, 2023

acids-ircam / RAVE

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Python 1,268 178 Updated Jul 30, 2024

spotify / basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 3,194 249 Updated Aug 9, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 24,519 5,067 Updated Aug 9, 2024

sanchit-gandhi / whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,294 362 Updated Apr 3, 2024

rhasspy / piper

A fast, local neural text to speech system

C++ 5,391 382 Updated Aug 7, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 34,132 4,047 Updated Jul 10, 2024

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ 33,587 3,402 Updated Aug 9, 2024

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,411 430 Updated Jun 10, 2024

gitmylo / bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

Python 627 106 Updated Sep 13, 2023

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,383 2,055 Updated Jul 18, 2024