#

wav2vec2

Here are 116 public repositories matching this topic...

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Nov 20, 2024
Python

s3prl

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Updated Nov 16, 2024
Python

audeering / w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

deep-learning valence arousal onnx speech-emotion-recognition dominance transformer-models wav2vec2 msp-podcast

Updated May 22, 2023
Jupyter Notebook

oliverguhr / wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

pyaudio speech speech-recognition speech-to-text asr wav2vec wav2vec2

Updated Feb 4, 2024
Python

vid2cleantxt

pszemraj / vid2cleantxt

Python API & command-line tool to easily transcribe speech-based video files into clean text

Updated Oct 29, 2024
Jupyter Notebook

habla-liaa / ser-with-w2v2

Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'

deep-learning tensorflow speech speech-emotion-recognition wav2vec2

Updated Dec 23, 2021
Jupyter Notebook

khanld / ASR-Wav2vec-Finetune

⚡ Finetune Wa2vec 2.0 For Speech Recognition

pytorch speech-recognition speech-to-text asr huggingface vietnamese-speech-recognition wav2vec2 finetune-wav2vec

Updated Nov 7, 2023
Python

inboxpraveen / LLM-Minutes-of-Meeting

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where we'll be open for contributions to enable real-time meeting transcription! 🚀

python nlp natural-language-processing web translation transformers web-application speech-recognition speech-to-text whisper meeting-minutes webapplication minutes-of-meeting huggingface huggingface-transformers wav2vec2 llm whisper-ai llm-inference

Updated Jun 10, 2024
Python

ASR

vietai / ASR

End-to-End Vietnamese Speech Recognition using wav2vec 2.0

asr pretrained-weights ctc-loss asr-model end-to-end-speech-recognition wav2vec2

Updated Sep 3, 2021

thevasudevgupta / gsoc-wav2vec2

GSoC'2021 | TensorFlow implementation of Wav2Vec2

tensorflow gsoc speech-to-text librispeech-dataset wav2vec2

Updated Jan 11, 2022
Jupyter Notebook

tuanio / noisy-student-training-asr

Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem

machine-learning deep-learning pytorch speech-recognition semi-supervised-learning data-augmentation nst conformer pretrained noisy-student wav2vec2 aped

Updated Sep 14, 2023
Python

Telegram-Zalo / zac2022-lyric-alignment

Solution for Zalo AI Challenge 2022 - Lyrics Alignment

deep-learning vietnamese pytorch dynamic-programming forced-alignment wav2vec2 music-alignment

Updated Dec 5, 2022
Python

lstrgar / self-supervised-phone-segmentation

Phoneme segmentation using pre-trained speech models

deep-learning speech-segmentation self-supervised-learning speech-technology hubert wav2vec2

Updated Nov 4, 2022
Python

MiniASR

vectominist / MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

minimal pytorch speech-recognition asr ctc fairseq speech-representation hubert wav2vec2 s3prl

Updated Dec 6, 2022
Jupyter Notebook

mmakiuchi / multimodal_emotion_recognition

Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in the ASRU 2021 conference.

emotion-recognition speech-emotion-recognition text-emotion-detection disentanglement-learning wav2vec2 asru2021

Updated Sep 14, 2021
Python

pooya-mohammadi / audio-classification-pytorch

In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number of classes and the input dataset.

python deep-learning transformers pytorch lstm audio-classification wav2vec2 deep-utils

Updated Nov 23, 2024
Jupyter Notebook

Hamtech-ai / wav2vec2-fa

fine-tune Wav2vec2. an ASR model released by Facebook

nlp transformer speech-to-text asr asr-model huggingface wav2vec2

Updated Dec 11, 2021
Jupyter Notebook

mt-upc / SHAS

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

speech speech-to-text audio-segmentation speech-translation wav2vec2

Updated Feb 9, 2023
Python

ECNU-Cross-Innovation-Lab / ShiftSER

[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations

speech-emotion-recognition hubert wav2vec2

Updated Dec 18, 2023
Python

HarunoriKawano / Wav2vec2.0

Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.

pytorch speech-recognition wav2vec2

Updated May 19, 2023
Python

Improve this page

Add a description, image, and links to the wav2vec2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the wav2vec2 topic, visit your repo's landing page and select "manage topics."