This repository contains the code for the paper "Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition" (more features coming soon).
```
@inproceedings{wang-etal-2022-iq,
  author    = {Liming Wang and Siyuan Feng and Mark Hasegawa-Johnson and Chang D. Yoo},
  title     = {Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition},
  booktitle = {Annual Meeting of the Association for Computational Linguistics},
  year      = {2022}
}
```
Dependencies:
- ZeroSpeech 2021 baseline system
- UnsupSeg
- BEER
- Other dependencies are listed in `requirements.txt`
Simply run `bash run.sh` for the small datasets we provide. To reproduce the results in the paper, please download the full datasets and convert them to the same format as the small datasets with the following steps:
- Prepare the datasets. Download the LibriSpeech dataset and manually cut out the spoken word segments using the information provided in `resources/librispeech_word/librispeech_word.json`. Also download the TIMIT dataset, convert its audio files to .wav, and create the metadata files as done in `resources/TIMIT/test_subset`.
- Modify the paths and variables in `run.sh` and `configs/librispeech_word.conf`.
- Run `bash run.sh`.
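For the word-cutting step above, a minimal sketch of extracting one segment from a LibriSpeech utterance is shown below, using only the Python standard library. The metadata entry (`audio`, `word`, `begin`, `end` keys, with times in seconds) is a hypothetical example; consult `resources/librispeech_word/librispeech_word.json` for the actual schema. The demo synthesizes a one-second wav file so it is self-contained.

```python
import struct
import wave

def cut_segment(in_path, out_path, start_s, end_s):
    """Copy the frames in [start_s, end_s) from one wav file into a new wav file."""
    with wave.open(in_path, "rb") as wav_in:
        params = wav_in.getparams()
        rate = params.framerate
        wav_in.setpos(int(start_s * rate))                 # seek to the segment start
        frames = wav_in.readframes(int((end_s - start_s) * rate))
    with wave.open(out_path, "wb") as wav_out:
        wav_out.setparams(params)                          # nframes is fixed up on close
        wav_out.writeframes(frames)

# Demo: synthesize a 1 s, 16 kHz mono utterance of silence.
rate = 16000
with wave.open("utt.wav", "wb") as f:
    f.setnchannels(1)
    f.setsampwidth(2)
    f.setframerate(rate)
    f.writeframes(struct.pack("<" + "h" * rate, *([0] * rate)))

# Hypothetical metadata entry mirroring what librispeech_word.json might contain.
entry = {"audio": "utt.wav", "word": "example", "begin": 0.25, "end": 0.75}
cut_segment(entry["audio"], "example.wav", entry["begin"], entry["end"])

with wave.open("example.wav", "rb") as f:
    print(f.getnframes() / f.getframerate())  # prints 0.5 (the segment duration)
```

The real pipeline would loop over all entries in the JSON file and write one wav per word segment.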