Skip to content
View abdouaziz's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report abdouaziz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Vector (and Scalar) Quantization, in Pytorch

Python 2,480 198 Updated Sep 26, 2024

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Python 1,358 85 Updated Sep 15, 2024
Python 6,120 457 Updated Oct 9, 2024
62 1 Updated Jan 15, 2024

Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995

22 Updated Sep 27, 2024

Notebooks for the Practicals at the Deep Learning Indaba 2024.

Jupyter Notebook 47 44 Updated Sep 3, 2024

lina-speech : linear attention based text-to-speech

Jupyter Notebook 117 10 Updated Jun 3, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,454 304 Updated Jan 4, 2024
Jupyter Notebook 140 15 Updated Jan 7, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,152 107 Updated Jul 11, 2024

This is a simple ComfyUI custom TTS node based on Parler_tts.

Python 32 3 Updated Aug 10, 2024

A list of scripts/notebooks I'd like to keep handy

Jupyter Notebook 7 1 Updated Aug 15, 2024

Wolof Dataset for Open LLM Fine-Tuning

Python 3 1 Updated Sep 21, 2024

Deep learning for audio processing

Jupyter Notebook 573 102 Updated Oct 5, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Python 331 18 Updated Aug 7, 2024

asyncio (PEP 3156) Redis support

Python 2,299 336 Updated Feb 20, 2023

Deep Learning Audio Course, 2023

Jupyter Notebook 71 3 Updated Oct 3, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,200 334 Updated Sep 27, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,123 404 Updated Oct 9, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,458 293 Updated Oct 3, 2024
Python 6 1 Updated Aug 20, 2024

🦝 OpenAPI plugin for generating API reference docs in Docusaurus v3.

TypeScript 668 230 Updated Oct 9, 2024

Easy to maintain open source documentation websites.

TypeScript 56,041 8,408 Updated Oct 9, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 12,052 1,925 Updated Oct 9, 2024

The n-gram Language Model

C 1,319 93 Updated Aug 5, 2024
Python 186 22 Updated May 30, 2024
Python 42 14 Updated Feb 13, 2022

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Python 325 53 Updated Oct 1, 2024

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 154 9 Updated Apr 20, 2024
Next