Starred repositories (weirenlan)

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 9,276 699 Updated Aug 25, 2024

Open Source framework for voice and multimodal conversational AI

Python 2,892 216 Updated Aug 23, 2024

Simple text to phones converter for multiple languages

Python 1,181 165 Updated Aug 1, 2024

A collection of learning resources for curious software engineers

Python 46,150 3,699 Updated Aug 20, 2024
Python 27 3 Updated Jun 16, 2024

Audio Plugins created using C++ and Juce Framework

C++ 69 9 Updated Aug 26, 2021

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,954 820 Updated Jul 1, 2024

Official PyTorch implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

Python 181 19 Updated Jul 3, 2024

Localized watermarking for AI-generated speech audio, with state-of-the-art robustness and a very fast detector

Python 398 44 Updated Aug 6, 2024

On-device Speech Recognition for Apple Silicon

Swift 3,065 254 Updated Aug 21, 2024

VITS-based Voice Conversion focused on simplicity, quality and performance.

Python 1,444 241 Updated Aug 25, 2024

Efficient Training of Audio Transformers with Patchout

Python 292 49 Updated Jan 12, 2024

AI-based Audio Watermarking Tool

Python 208 28 Updated Jan 7, 2024

Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks in PyTorch

Python 5 Updated Dec 19, 2023

Some random notes about Windows Audio Processing Objects (APOs).

60 4 Updated May 29, 2022

[TMLR 2024] Efficient Large Language Models: A Survey

921 78 Updated Aug 22, 2024

AEC3 Extracted From WebRTC

C++ 1 2 Updated Jul 7, 2022

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,609 366 Updated Aug 10, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,401 373 Updated Aug 22, 2024
Python 1,010 90 Updated Jan 4, 2024

A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.

Python 1,000 233 Updated Jul 29, 2024

This repository surveys papers focusing on Prompting and Adapters for Speech Processing.

103 5 Updated Aug 4, 2023

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,439 258 Updated Jul 12, 2024

Audio processing using PyTorch 1D convolution networks

Python 998 87 Updated Feb 13, 2024

Generative models for conditional audio generation

Python 2,458 227 Updated Jul 15, 2024

Stable Diffusion with Core ML on Apple Silicon

Python 16,625 913 Updated Aug 23, 2024

A toolkit for any-to-any encoder-decoder voice conversion systems

Python 79 8 Updated Aug 10, 2023

Easily train a good VC model with voice data <= 10 mins!

Python 22,377 3,381 Updated Aug 17, 2024