Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Python 346 13 Updated Jun 3, 2024

Ambiscaper: a tool for automatic dataset generation and annotation of reverberant Ambisonics audio. Originally forked from https://github.com/justinsalamon/scaper

Python 18 6 Updated Sep 14, 2018
Jupyter Notebook 17 4 Updated Jul 3, 2024

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Python 477 39 Updated Jul 2, 2024

Rotary Transformer

Python 751 45 Updated Mar 21, 2022

Domain Generalization Semantic Segmentation

Python 8 1 Updated Sep 21, 2023

A PyTorch-based Speech Toolkit

Python 8,290 1,333 Updated Jul 22, 2024

Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.

Python 440 43 Updated Jan 18, 2023

Unsupervised domain adaptation for conversational speech enhancement using RemixIT

Jupyter Notebook 51 5 Updated Apr 25, 2023

Code for the paper: "Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information"

Python 21 5 Updated Oct 10, 2021

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 30,862 4,651 Updated Jul 22, 2024

AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023

Python 22 1 Updated Aug 27, 2023

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

Python 536 119 Updated May 16, 2023

CMT: Convolutional Neural Networks Meet Vision Transformers

Python 113 15 Updated Nov 11, 2021

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 13,374 2,023 Updated Jul 15, 2024

Reformer, the efficient Transformer, in Pytorch

Python 2,082 255 Updated Jun 21, 2023

PyTorch implementation of SENet

Python 2,260 440 Updated Mar 2, 2021

Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch

Python 40 9 Updated Jun 4, 2020

Squeeze-and-Excitation Networks

Cuda 3,347 834 Updated Feb 25, 2019

Datasets, Transforms and Models specific to Computer Vision

Python 15,786 6,888 Updated Jul 23, 2024

This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.

Python 84 18 Updated May 31, 2022

My system for the DCASE 2022 Task 3 Sound Event Localization and Detection.

Jupyter Notebook 8 Updated Nov 12, 2022
Python 25 5 Updated Sep 12, 2022

Example MATLAB/Octave scripts to perform ambisonic encoding of microphone array signals

MATLAB 31 4 Updated Oct 4, 2023

Pytorch implementation of "Learning Filter Basis for Convolutional Neural Network Compression" ICCV2019

Python 16 3 Updated Mar 28, 2022

A Repository of Room Responses and 360 Videos of a Variable Acoustics Lab

HTML 43 Updated Mar 14, 2023

PyTorch implementations of several SOTA backbone deep neural networks (such as ResNet, ResNeXt, RegNet) on one-dimensional (1D) signal/time-series data.

Python 400 96 Updated Feb 7, 2022