Indian Accent Speech Recognition

Traditional ASR (Signal Analysis, MFCC, DTW, HMM & Language Modelling) and DNNs (Custom Models & Baidu DeepSpeech Model) on Indian Accent Speech

<< Uploaded the pre-trained model owing to requests >>
The generated trie file is uploaded to the pre-trained-models directory, so you can skip the KenLM Toolkit step.

For the context and theory behind this project, head over to my blog:
https://towardsdatascience.com/indian-accent-speech-recognition-2d433eb7edac

How to Use?

Starter code for using the model is given in Starter.ipynb. You can run it in Google Colab if you upload the 3 files (given in params) to your Google Drive.

Install DeepSpeech 0.6.1
Download the pre-trained model (.pbmm), language model and trie file.
Download instructions are given in the pre-trained-models folder. After downloading, pass the files as arguments:

!deepspeech --model speech/output_graph.pbmm --lm speech/lm.binary --trie speech/trie --audio /content/06_M_artic_01_004.wav
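
The same transcription can also be driven from Python instead of the command line. The following is a minimal sketch, assuming the deepspeech 0.6.1 Python package and numpy are installed. read_wav_int16 and transcribe are hypothetical helper names, and the beam width and language-model weights are assumed defaults, not values confirmed by this repository.

```python
import sys
import wave
from array import array


def read_wav_int16(path):
    """Return (sample_rate, samples) for a mono, 16-bit PCM WAV file."""
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1:
            raise ValueError("expected mono audio, got %d channels" % wav.getnchannels())
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit samples")
        rate = wav.getframerate()
        samples = array("h", wav.readframes(wav.getnframes()))
    # WAV data is little-endian; swap bytes on big-endian machines.
    if sys.byteorder == "big":
        samples.byteswap()
    return rate, samples


def transcribe(model_path, lm_path, trie_path, audio_path,
               beam_width=500, lm_alpha=0.75, lm_beta=1.85):
    """Transcribe one WAV file (decoder settings are assumed defaults)."""
    # Imported lazily so the WAV helper above works without these packages.
    import numpy as np
    from deepspeech import Model

    ds = Model(model_path, beam_width)
    ds.enableDecoderWithLM(lm_path, trie_path, lm_alpha, lm_beta)

    rate, samples = read_wav_int16(audio_path)
    if rate != ds.sampleRate():
        raise ValueError("audio is %d Hz but the model expects %d Hz"
                         % (rate, ds.sampleRate()))
    return ds.stt(np.array(samples, dtype=np.int16))
```

With the paths from the command above, the call would be transcribe("speech/output_graph.pbmm", "speech/lm.binary", "speech/trie", "/content/06_M_artic_01_004.wav"). Checking the WAV format first gives a readable error instead of a poor transcript when the audio has the wrong sample rate or channel count.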

If you run into an issue while loading the pre-trained model, it is most likely due to your DeepSpeech version; make sure it is 0.6.1, as listed above.
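
A quick way to rule out a version mismatch is to check which deepspeech package is installed before loading the model. This is only a sketch: installed_version and version_problem are hypothetical helper names, and the expected version is simply the one named in this README.

```python
EXPECTED_VERSION = "0.6.1"  # the version this README asks you to install


def installed_version(package):
    """Return the installed version string of a package, or None if absent."""
    try:
        from importlib import metadata  # available on Python 3.8+
    except ImportError:
        metadata = None
    if metadata is not None:
        try:
            return metadata.version(package)
        except metadata.PackageNotFoundError:
            return None
    # Fallback for older Python versions; pkg_resources ships with setuptools.
    import pkg_resources
    try:
        return pkg_resources.get_distribution(package).version
    except pkg_resources.DistributionNotFound:
        return None


def version_problem(installed, expected=EXPECTED_VERSION):
    """Return a warning message, or None when the versions match."""
    if installed is None:
        return "deepspeech is not installed; this README expects %s" % expected
    if installed != expected:
        return "deepspeech %s is installed; this README expects %s" % (installed, expected)
    return None


print(version_problem(installed_version("deepspeech")) or "deepspeech version OK")
```

If a warning is printed, reinstalling the version named above is the first thing to try.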

Data Source/ Training Data:

Indic TTS Project: Downloaded 50+ GB of the Indic TTS voice database from the Speech and Music Technology Lab, IIT Madras, which comprises 10000+ spoken sentences from 20+ states (both male and female native speakers).

https://www.iitm.ac.in/donlab/tts/index.php

You can also record your own audio or let an ebook reader app read a document aloud, but I found that this does not yield enough data to train such a heavy model. I therefore requested support from the Speech Lab at IIT Madras, who kindly granted access to their voice database.


parmarjh/Indian-Accent-Speech-Recognition
