Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
-
Updated
Jun 9, 2021 - MATLAB
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
关于语音信号声源定位DOA估计所用的一些传统算法
Spectral Subtraction, Wiener Filtering, MMSE
Efficient voice activity detection algorithm using long-term speech information
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
ASPP: Binaural Speech Enhancement with Atomic Speech Presence Probability Estimation
Classifying AI Synthesised Voice and Human Voice using Machine Learning by Spectral and Cepstral Analysis. Also classified different TTS(Text-to-Speech) engines for different AI synthesized Voice. Published Paper for the whole art of work. Link Given below.
A real-time analyzer to detect normal speech/abusive speech/noise
Classifying sound signals as Links, Midden or Rechts using features computed using a Mel-Frequency filterbank, summing the power of the frequency-domain in the relevant filters. Dynamic Time Warping is used to find proper alignment between the unknown word and several labelled exemplars per word we are looking for. Then, k nearest neighbours tel…
wSTMI: A speech intelligibility prediction algorithm for noisy and processed speech
MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating the speech transmission quality.
Python and Matlab code for segmentation of field recordings
This project is written by MATLAB R2020b for speech watermarking suitable for content authentication. Firstly, 4 folders are made by names of "original", "watermark", "extract" and "attack". Then 4 wav files are copied to "original" folder. Finally "final_1.m" can be run.
Some simulation macros related to signal processing
Exemplary simulation toolchain combining FADE, TASCAR, and openMHA for aided speech recognition performance predictions in complex auditory scenes
Signal Processing design to improve SPEECH signal quality through spectrum by mainly using Filtration Techniques (IEEE Report & MATLAB code).
Add a description, image, and links to the speech topic page so that developers can more easily learn about it.
To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."