Skip to content
View Ryu1845's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Ryu1845

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.

Python 709 27 Updated Sep 25, 2024

YAAPT Pitch Tracking function in PyTorch

Python 6 Updated Jul 19, 2024

Public code release associated with SceneScript.

Python 12 Updated Sep 26, 2024

Unofficial implementation

Python 3 1 Updated Sep 26, 2024
Python 49 2 Updated Sep 27, 2024

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Python 77 3 Updated Sep 20, 2024

Diffusion-based singing voice pitch correction

Python 87 14 Updated Sep 20, 2024
Python 107 5 Updated Sep 27, 2024

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 105 2 Updated Sep 27, 2024
C 393 32 Updated Sep 20, 2024

Computes the Energy Sliced Wasserstein Loss between two distributions. An optimal-transporty-energyish vibe distribution matching loss/regulariser.

Python 6 Updated Sep 24, 2024

PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.

Python 40 1 Updated Sep 23, 2024

phoneme tokenizer and grapheme-to-phoneme model for 8k languages

Python 141 13 Updated Jun 9, 2023

This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory features, for over 7000 languages.

Python 6 Updated Sep 23, 2024

16-fold memory access reduction with nearly no loss

Python 42 1 Updated Aug 18, 2024

The source code for the Interspeech 2024 paper "Lightweight Transducer Based on Frame Level Criterion".

Python 7 1 Updated Sep 23, 2024

Writing FLUX in Triton

Python 21 7 Updated Sep 22, 2024
Python 3 Updated Sep 22, 2024

A dialect of Lisp that's embedded in Python

Python 5,087 372 Updated Sep 22, 2024

[INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset

Python 6 Updated Sep 5, 2024

An all-purpose window upscaler for Windows 10/11.

HLSL 9,113 481 Updated Sep 19, 2024
HTML 82 1 Updated Sep 23, 2024

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system in 275+ supported cars.

Python 49,582 9,007 Updated Sep 27, 2024

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

Python 37 2 Updated Sep 20, 2024

an architecture for neural network inference in real-time audio applications

C++ 88 2 Updated Sep 23, 2024

The reproduce training process for Moshi

Python 69 5 Updated Sep 20, 2024

EDM-HSE is an open audio dataset featuring 8000 house music drum loops.

2 Updated Sep 17, 2024

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Jupyter Notebook 114 10 Updated Nov 12, 2022

A very simple BERT implementation in PyTorch, which only depends on PyTorch itself.

Python 7 Updated Sep 21, 2024
Python 2 Updated Sep 17, 2024
Next