Skip to content
/ st Public
forked from dqqcasia/st

End-to-end Speech Translation

Notifications You must be signed in to change notification settings

mingxuan/st

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

End-to-end Speech Translation

This repository is the official implementation of the following papers:

Requirements

  • Python 3
  • Tensorflow 1.15
  • Required packages are listed here.

To install requirements:

pip install -r requirements.txt

Data Preprocessing

LUT

python3 -m st.tools.dataset configs_template/lut.yaml

COSTT

python3 -m st.tools.dataset configs_template/costt.yaml

Train

LUT

python3 -m st.bin.run_lut -m train -c configs_template/lut.yaml

COSTT

python3 -m st.bin.run_costt -m train -c configs_template/costt.yaml

Decode

LUT

python3 -m st.bin.run_lut -m infer -c configs_template/lut.yaml

COSTT

python3 -m st.bin.run_costt -m infer -c configs_template/costt.yaml

Citations

Please consider citing our papers in your publications if the project helps your research. BibTeX reference is as follows.

@inproceedings{dong2021listen,
  title={Listen, Understand and Translate: Triple Supervision Decouples End-to-end Speech-to-text Translation},
  author={Qianqian Dong, Rong Ye, Mingxuan Wang, Hao Zhou, Shuang Xu, Bo Xu, Lei Li},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  year={2021}
}
@inproceedings{dong2021consecutive,
  title={Consecutive Decoding for Speech-to-text Translation},
  author={Qianqian Dong, Mingxuan Wang, Hao Zhou, Shuang Xu, Bo Xu, Lei Li},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  year={2021}
}

About

End-to-end Speech Translation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.6%
  • Other 0.4%