loretoparisi

🐍

NightShift

Loreto Parisi loretoparisi

🐍

NightShift

MSc Computer Engineering, Machine Learning at @musixmatch

396 followers · 1.7k following

@Musixmatchdev
Italy, Bologna
@loretoparisi

Achievements

x2 x2

Achievements

x2 x2

Organizations

Block or Report

Block or report loretoparisi

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Audio

43 repositories

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 5,509 726 Updated Jul 12, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 31,958 3,837 Updated Jul 8, 2024

coqui-ai / STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

C++ 2,199 265 Updated Mar 11, 2024

petewarden / spchcat

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

C 422 31 Updated Jul 1, 2022

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 129,183 25,612 Updated Jul 12, 2024

mdn / web-speech-api

A repository for demos illustrating features of the Web Speech API. See https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API for more details.

JavaScript 1,415 731 Updated Sep 10, 2022

loretoparisi / hf-experiments

Experiments with Hugging Face 🔬 🤗

Python 44 6 Updated Jun 17, 2024

acids-ircam / cached_conv

Python 49 14 Updated May 31, 2023

acids-ircam / RAVE

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Python 1,254 176 Updated Apr 22, 2024

adobe-research / MetaAF

Control adaptive filters with neural networks.

Python 214 39 Updated Oct 4, 2023

wangyu / rethink-audio-fsl

Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)

Python 40 6 Updated May 24, 2022

spotify / basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 3,138 242 Updated Jul 11, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 64,470 7,511 Updated Jul 2, 2024

sony / DiffRoll

PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model

Jupyter Notebook 64 11 Updated Dec 6, 2023

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,305 298 Updated Jan 4, 2024

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ 33,081 3,311 Updated Jul 12, 2024

adefossez / sdx23

SDX23 startkit for the Demucs baselines.

Python 21 1 Updated Mar 3, 2023

loretoparisi / waveform.js

Waveform generation from audio file

JavaScript 4 Updated Jan 24, 2020

ggerganov / ggwave

Tiny data-over-sound library

C++ 1,895 141 Updated Feb 3, 2024

sanchit-gandhi / whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,252 358 Updated Apr 3, 2024

rhasspy / piper

A fast, local neural text to speech system

C++ 5,095 357 Updated Jul 11, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 33,799 4,011 Updated Jul 10, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,207 2,025 Updated Jun 19, 2024

ffmpegwasm / ffmpeg.wasm

FFmpeg for browser, powered by WebAssembly

C 13,504 779 Updated Jul 11, 2024

alfg / ffprobe-wasm

A Web-based FFProbe. Powered by FFmpeg, Vue and Web Assembly!

Vue 139 33 Updated Dec 13, 2023

yangdongchao / UniAudio

The Open Source Code of UniAudio

Python 479 31 Updated May 3, 2024

facebookresearch / demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 7,931 989 Updated Apr 24, 2024

huggingface / distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,348 243 Updated Jul 12, 2024

KoeAI / LLVC

Python 356 30 Updated Nov 6, 2023

minzwon / musicfm

Python 146 4 Updated Feb 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loreto Parisi loretoparisi

Achievements

Achievements

Organizations

Block or report loretoparisi

Audio

pyannote / pyannote-audio

coqui-ai / TTS

coqui-ai / STT

petewarden / spchcat

huggingface / transformers

mdn / web-speech-api

loretoparisi / hf-experiments

acids-ircam / cached_conv

acids-ircam / RAVE

adobe-research / MetaAF

wangyu / rethink-audio-fsl

spotify / basic-pitch

openai / whisper

sony / DiffRoll

facebookresearch / encodec

ggerganov / whisper.cpp

adefossez / sdx23

loretoparisi / waveform.js

ggerganov / ggwave

sanchit-gandhi / whisper-jax

rhasspy / piper

suno-ai / bark

facebookresearch / audiocraft

ffmpegwasm / ffmpeg.wasm

alfg / ffprobe-wasm

yangdongchao / UniAudio

facebookresearch / demucs

huggingface / distil-whisper

KoeAI / LLVC

minzwon / musicfm