Skip to content

Latest commit

 

History

History

workshop-2022

IndicWav2Vec-Workshop-2022

In this workshop, we will finetune wav2vec2 style model from scratch on custom asr data using Fairseq library. We will also train an n-gram language model using Kenlm library. Finally we will export it to Huggingface's format and deploy it as a web app using Gradio.

Poster link here

Notebooks

Details and step-by-step walkthrough of training, exporting and deploying Indicwav2vec models have been outlined in the Jupyter Notebook.

Slides

Discussion on topics like ASR Pipeline, Wav2Vec2 Architecture, Role of LM in ASR, Mining Parallel Data, etc. can be found here

For any queries related to workshop, create a new Github issues with the label, workshop-2022 or add to the Q&A tab in Discussions.

Note: Video recordings of the workshop will be made available by 7th, August, 2022.