Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq

Kuchaiev, Oleksii; Ginsburg, Boris; Gitman, Igor; Lavrukhin, Vitaly; Li, Jason; Nguyen, Huyen; Case, Carl; Micikevicius, Paulius

Computer Science > Computation and Language

arXiv:1805.10387 (cs)

[Submitted on 25 May 2018 (v1), last revised 21 Nov 2018 (this version, v2)]

Title:Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq

Authors:Oleksii Kuchaiev, Boris Ginsburg, Igor Gitman, Vitaly Lavrukhin, Jason Li, Huyen Nguyen, Carl Case, Paulius Micikevicius

View PDF

Abstract:We present OpenSeq2Seq - a TensorFlow-based toolkit for training sequence-to-sequence models that features distributed and mixed-precision training. Benchmarks on machine translation and speech recognition tasks show that models built using OpenSeq2Seq give state-of-the-art performance at 1.5-3x less training time. OpenSeq2Seq currently provides building blocks for models that solve a wide range of tasks including neural machine translation, automatic speech recognition, and speech synthesis.

Comments:	Presented at Workshop for Natural Language Processing Open Source Software (NLP-OSS), co-located with ACL2018
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1805.10387 [cs.CL]
	(or arXiv:1805.10387v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1805.10387

Submission history

From: Oleksii Kuchaiev [view email]
[v1] Fri, 25 May 2018 22:54:38 UTC (270 KB)
[v2] Wed, 21 Nov 2018 17:48:55 UTC (289 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2018-05

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Oleksii Kuchaiev
Boris Ginsburg
Igor Gitman
Vitaly Lavrukhin
Carl Case

…

export BibTeX citation

Computer Science > Computation and Language

Title:Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators