loretoparisi
🐍 NightShift

Organizations

@Musixmatchdev @musixmatchresearch
Stars

Automatic Speech Recognition

Speech to Text
9 repositories

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ · 24,770 stars · 3,924 forks · Updated Jun 22, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python · 129,185 stars · 25,613 forks · Updated Jul 12, 2024

A repository for demos illustrating features of the Web Speech API. See https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API for more details.

JavaScript · 1,415 stars · 731 forks · Updated Sep 10, 2022

Experiments with Hugging Face 🔬 🤗

Python · 44 stars · 6 forks · Updated Jun 17, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python · 64,476 stars · 7,513 forks · Updated Jul 2, 2024

Port of OpenAI's Whisper model in C/C++

C++ · 33,083 stars · 3,311 forks · Updated Jul 12, 2024

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook · 4,252 stars · 358 forks · Updated Apr 3, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook · 10,540 stars · 1,018 forks · Updated Jun 26, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python · 3,348 stars · 243 forks · Updated Jul 12, 2024