Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi-ASR
Center for Analysis and Design of Intelligent Agents, Language and Voice Lab
Reykjavik University
Samrómur ASR is a collection of scripts and recipes for training an ASR system with the Kaldi-ASR toolkit.
Samrómur speech corpus is an open and accessible database of voices that everyone is free to use when developing software in Icelandic. The database consists of sentences and audio clips from the reading of those sentences as well as metadata. Each entry in the database contains WAV audio clips and the corresponding text file. Samrómur speech corpus will be available for download soon on CLARIN-IS. For more information about the dataset visit https://samromur.is/gagnasafn.
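As a rough illustration of how WAV/text pairs like these are typically turned into a Kaldi data directory, here is a minimal Python sketch. It is not part of the recipes in this repository. The flat layout (each `<id>.wav` next to a `<id>.txt`), the function name `build_kaldi_data_dir`, and the choice to treat every utterance as its own speaker are all assumptions made for the example; the actual corpus layout and metadata may differ. Only the output file names (`wav.scp`, `text`, `utt2spk`) follow standard Kaldi conventions.

```python
from pathlib import Path


def build_kaldi_data_dir(corpus_dir, out_dir):
    """Write wav.scp, text and utt2spk for paired <id>.wav / <id>.txt files.

    Assumption: the corpus is a flat directory of WAV clips, each with a
    same-named .txt transcript. This layout is hypothetical.
    """
    corpus_dir, out_dir = Path(corpus_dir), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    entries = []
    for wav in corpus_dir.glob("*.wav"):
        txt = wav.with_suffix(".txt")
        if not txt.is_file():
            continue  # skip clips that have no transcript
        # Collapse all whitespace so each transcript fits on one line.
        transcript = " ".join(txt.read_text(encoding="utf-8").split())
        if transcript:
            entries.append((wav.stem, wav.resolve(), transcript))

    # Kaldi expects its data files to be sorted by utterance ID.
    entries.sort(key=lambda entry: entry[0])

    with open(out_dir / "wav.scp", "w", encoding="utf-8") as wav_scp, \
         open(out_dir / "text", "w", encoding="utf-8") as text, \
         open(out_dir / "utt2spk", "w", encoding="utf-8") as utt2spk:
        for utt_id, wav_path, transcript in entries:
            wav_scp.write(f"{utt_id} {wav_path}\n")
            text.write(f"{utt_id} {transcript}\n")
            # Assumption: no speaker metadata is used, so each utterance
            # is mapped to itself as a stand-in speaker ID.
            utt2spk.write(f"{utt_id} {utt_id}\n")

    return [utt_id for utt_id, _, _ in entries]
```

Calling `build_kaldi_data_dir("path/to/clips", "data/train")` (both paths are placeholders) returns the list of utterance IDs it wrote. If the corpus metadata provides speaker IDs, they should be used in `utt2spk` instead of the per-utterance fallback shown here.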
You can use these guides for reference even if you do not use Terra (a cloud cluster at LVL).
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
Pull requests are welcome. For significant changes, please open an issue first to discuss what you would like to change. For more information, please take a look at the LVL Software Development Guidelines.
🌟 PLEASE STAR THIS REPO IF YOU FOUND SOMETHING INTERESTING 🌟