Speech Translation and the End-to-End Promise: Taking Stock of Where We Are

Sperber, Matthias; Paulik, Matthias

Computer Science > Computation and Language

arXiv:2004.06358 (cs)

[Submitted on 14 Apr 2020]

Title:Speech Translation and the End-to-End Promise: Taking Stock of Where We Are

Authors:Matthias Sperber, Matthias Paulik

View PDF

Abstract:Over its three decade history, speech translation has experienced several shifts in its primary research themes; moving from loosely coupled cascades of speech recognition and machine translation, to exploring questions of tight coupling, and finally to end-to-end models that have recently attracted much attention. This paper provides a brief survey of these developments, along with a discussion of the main challenges of traditional approaches which stem from committing to intermediate representations from the speech recognizer, and from training cascaded models separately towards different objectives.
Recent end-to-end modeling techniques promise a principled way of overcoming these issues by allowing joint training of all model components and removing the need for explicit intermediate representations. However, a closer look reveals that many end-to-end models fall short of solving these issues, due to compromises made to address data scarcity. This paper provides a unifying categorization and nomenclature that covers both traditional and recent approaches and that may help researchers by highlighting both trade-offs and open research questions.

Comments:	ACL 2020 theme track
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2004.06358 [cs.CL]
	(or arXiv:2004.06358v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2004.06358

Submission history

From: Matthias Sperber [view email]
[v1] Tue, 14 Apr 2020 08:43:51 UTC (72 KB)

Computer Science > Computation and Language

Title:Speech Translation and the End-to-End Promise: Taking Stock of Where We Are

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Speech Translation and the End-to-End Promise: Taking Stock of Where We Are

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators