Mikezz1

Fused SwiGLU Triton kernels

Python 1 Updated Jan 25, 2024

SoftVC VITS Singing Voice Conversion

Python 24,983 4,708 Updated Nov 11, 2023

SoftVC VITS Singing Voice Conversion

Python 547 83 Updated Apr 4, 2023

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,217 174 Updated Jul 19, 2024

This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implementations of 1D, 2D, and 3D convolutions with different kern…

Python 313 22 Updated Jul 9, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,055 93 Updated Jul 11, 2024

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 735 69 Updated Jul 15, 2024

Tools for handling speech data in machine learning projects.

Python 913 207 Updated Jul 26, 2024

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Python 358 13 Updated Jun 3, 2024

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in PyTorch

Python 1,980 49 Updated Jun 15, 2024

Implementation of Rotary Embeddings, from the Roformer paper, in PyTorch

Python 489 39 Updated Jul 2, 2024

Sparsity-aware deep learning inference runtime for CPUs

Python 2,950 168 Updated Jul 19, 2024

A PyTorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 584 47 Updated Sep 13, 2023

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,438 381 Updated Jul 30, 2024

Python 283 11 Updated Jun 21, 2024

A course in reinforcement learning in the wild

Jupyter Notebook 5,828 1,679 Updated May 12, 2024

This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Jo…

75 7 Updated Dec 20, 2023

An open source dataset for source separation

Python 351 66 Updated Feb 9, 2024

The PyTorch-based audio source separation toolkit for researchers

Python 2,185 419 Updated Jul 19, 2024

Conformer-based Metric GAN for speech enhancement

Python 290 55 Updated May 3, 2024

Deep Learning for Speech

Jupyter Notebook 71 5 Updated Dec 5, 2023

✍️ A way to integrate LaTeX, VS Code, and Inkscape in macOS

Python 315 20 Updated May 17, 2024

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…

Python 311 15 Updated Aug 1, 2024

An easy to use PyTorch to TensorRT converter

Python 4,506 670 Updated Jun 17, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 32,406 3,902 Updated Jul 25, 2024

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in PyTorch.

Python 71 7 Updated Sep 25, 2023

Convmelspec: Convertible Melspectrograms via 1D Convolutions

Python 124 8 Updated May 13, 2024

Port of OpenAI's Whisper model in C/C++

C++ 33,447 3,384 Updated Jul 31, 2024