Skip to content

Commit

Permalink
Merge pull request SeanNaren#107 from SeanNaren/features
Browse files Browse the repository at this point in the history
Added features list
  • Loading branch information
Sean Naren committed Jun 29, 2017
2 parents a52830f + dcdd850 commit fb9452c
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,16 @@
Implementation of DeepSpeech2 using [Baidu Warp-CTC](https://github.com/baidu-research/warp-ctc).
Creates a network based on the [DeepSpeech2](http:https://arxiv.org/pdf/1512.02595v1.pdf) architecture, trained with the CTC activation function.

## Features

* Train DeepSpeech, configurable RNN types and architectures with multi-gpu support.
* Language model support using kenlm (WIP right now, currently no instructions to build a LM yet).
* Multiple dataset downloaders, support for AN4, TED, Voxforge and Librispeech. Datasets can be merged, support for custom datasets included.
* Noise injection for online training to improve noise robustness.
* Audio augmentation to improve noise robustness.
* Easy start/stop capabilities in the event of crash or hard stop during training.
* Visdom/Tensorboard support for visualising training graphs.

# Installation

Several libraries are needed to be installed for training to work. I will assume that everything is being installed in
Expand Down

0 comments on commit fb9452c

Please sign in to comment.