Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
corpus.py		corpus.py
eval.py		eval.py
eval_utils.py		eval_utils.py
preprocess.py		preprocess.py
record.py		record.py
record_utils.py		record_utils.py
speech_input.py		speech_input.py
speech_model.py		speech_model.py
train.py		train.py
vocabulary.py		vocabulary.py

Repository files navigation

speechT

An opensource speech-to-text software written in tensorflow.

Python 3 is required.

Architecture

Currently speechT is based on the Wav2Letter paper and the CTC loss function.

The speech corpus from https://www.openslr.org/12/ is automatically downloaded.
Note: The corpus is about 30GB!

Training

The data must be preprocessed before training

python3 preprocess.py

Then, to run the training, execute

python3 train.py

Important flags
--data_dir to specify the data directory to download speech corpus to (defaults to ./data/)
--train_dir to specify the train directory to save checkpoints and vocabulary to (defaults to ./train/)

Testing

Not yet implemented.

Live usage

Not yet implemented.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speechT

Architecture

Training

Testing

Live usage

About

Releases 1

Packages

Contributors 2

Languages

License

louiskirsch/speechT

Folders and files

Latest commit

History

Repository files navigation

speechT

Architecture

Training

Testing

Live usage

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages