Code accompanying my PyTorch study notes; click to view the PyTorch notes online e-book

Python 1,221 270 Updated Dec 5, 2020

Python implementation of OMLSA+IMCRA algorithm for speech enhancement.

Python 50 17 Updated Jun 29, 2021

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,779 184 Updated Jul 13, 2024

Code for DCASE 2020 task 1a and task 1b.

Python 85 28 Updated Jan 20, 2022

Code repository for the paper Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs

Python 28 9 Updated May 19, 2022

Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks

Python 64 11 Updated Mar 24, 2023

A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]

Python 35 13 Updated Aug 1, 2024

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

Python 69 8 Updated Jun 20, 2024

Neural Network based Sound Source Localization Models

Python 31 9 Updated Aug 29, 2023

Dual-Input Neural Networks

Python 6 1 Updated Oct 24, 2023

Baseline method for sound event localization task of DCASE 2022 challenge

Python 47 21 Updated Jun 21, 2022

Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional recurrent neural network

Python 329 65 Updated Nov 21, 2022
Python 34 3 Updated Nov 12, 2021

Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.

Jupyter Notebook 369 103 Updated Oct 26, 2019

Easy-to-use beamformers for multi-channel speech separation/enhancement

Python 173 48 Updated Jan 26, 2021

A timeline of the latest AI models for audio generation, starting in 2023!

1,874 67 Updated Jan 4, 2024

Can Neural Networks Crack Sudoku?

Python 822 133 Updated Feb 17, 2023

An LSTM for voice activity detection. In fact, this is a homework assignment that I didn't expect.

Python 13 Updated Dec 3, 2020

PyTorch implementation of the U-Net for image semantic segmentation with high quality images

Python 8,796 2,427 Updated May 29, 2024

FcaNet: Frequency Channel Attention Networks

Python 473 100 Updated Mar 11, 2021

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

MATLAB 695 149 Updated Dec 1, 2020

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

Python 96 12 Updated Mar 24, 2023

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,400 430 Updated Jun 10, 2024

Microphone array speech generator (MASG) for room acoustics

Jupyter Notebook 32 12 Updated Jan 2, 2020

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,578 918 Updated Apr 23, 2024

Source code for "FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control"

Python 136 21 Updated Mar 25, 2024

A toolkit for symbolic music generation

Python 422 49 Updated Jan 25, 2024

Official Implementation of "Multitrack Music Transformer" (ICASSP 2023)

Python 132 23 Updated Mar 14, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,350 5,471 Updated Aug 2, 2024