Skip to content

Automatic Speech Recognition (ASR) system for Samrómur speech corpus using Kaldi-ASR

License

Notifications You must be signed in to change notification settings

xbsdsongnan/samromur-asr

 
 

Repository files navigation

LVL Samrómur ASR

Cover Image

Automatic Speech Recognition (ASR) system for Samrómur speech corpus using Kaldi-ASR
Center for Analysis and Design of Intelligent Agents, Language and Voice Lab
Reykjavik University

Table of Contents

Click to expand

1. Introduction

Samrómur ASR is a collection of scripts and recipes for the training of an ASR environment using the Kaldi-ASR toolkit.

2. The Dataset

Samrómur speech corpus is an open and accessible database of voices that everyone is free to use when developing software in Icelandic. The database consists of sentences and audio clips from the reading of those sentences as well as metadata. Each entry in the database contains WAV audio clips and the corresponding text file. Samrómur speech corpus will be available for download soon on CLARIN-IS. For more information about the dataset visit https://samromur.is/gagnasafn.

3. Setup

You can use these guides for reference even though you do not use Terra (a cloud cluster at LVL).

4. License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

5. References

6. Contributing

Pull requests are welcome. For significant changes, please open an issue first to discuss what you would like to change. For more information, please take a look at LVL Software Development Guidelines.

7. Contributors

Become a contributor

🌟 PLEASE STAR THIS REPO IF YOU FOUND SOMETHING INTERESTING 🌟

About

Automatic Speech Recognition (ASR) system for Samrómur speech corpus using Kaldi-ASR

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Shell 100.0%