Phoneme recognition using the pre-trained models Wav2vec2, HuBERT and WavLM. In this project, we compare three self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022), each pre-trained on a corpus of English speech, and use them in various ways to perform phoneme recognition for different languages wi…
deep-learning
speech-recognition
speech-processing
asr
common-voice
self-supervised-learning
huggingface
wandb
huggingface-transformers
phone-recognition
Updated May 9, 2022 - Python