Neuron Interaction Based Representation Composition for Neural Machine Translation

Li, Jian; Wang, Xing; Yang, Baosong; Shi, Shuming; Lyu, Michael R.; Tu, Zhaopeng

arXiv:1911.09877 (cs)

[Submitted on 22 Nov 2019]

Abstract: Recent NLP studies reveal that substantial linguistic information can be attributed to single neurons, i.e., individual dimensions of the representation vectors. We hypothesize that modeling strong interactions among neurons helps to better capture complex information by composing the linguistic properties embedded in individual neurons. Starting from this intuition, we propose a novel approach to compose representations learned by different components in neural machine translation (e.g., multi-layer networks or multi-head attention), based on modeling strong interactions among neurons in the representation vectors. Specifically, we leverage bilinear pooling to model pairwise multiplicative interactions among individual neurons, and a low-rank approximation to make the model computationally feasible. We further propose extended bilinear pooling to incorporate first-order representations. Experiments on WMT14 English-German and English-French translation tasks show that our model consistently improves performance over the state-of-the-art Transformer baseline. Further analyses demonstrate that our approach indeed captures more syntactic and semantic information as expected.
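
The composition idea in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the function names, the factor matrices `U`, `V`, `P`, and the toy dimensions are all assumptions chosen for illustration. The sketch shows that a rank-constrained bilinear form can be evaluated without building the full pairwise interaction tensor, and that appending a constant 1 to each input (one common way to realize "extended" bilinear pooling, assumed here) lets first-order terms survive alongside the pairwise products.

```python
import random

def matvec_t(M, x):
    """Return M^T x, where M is a list of len(x) rows with equal column counts."""
    rows, cols = len(M), len(M[0])
    assert rows == len(x), "row count of M must match len(x)"
    return [sum(M[i][j] * x[i] for i in range(rows)) for j in range(cols)]

def low_rank_bilinear(x, y, U, V, P):
    """Low-rank bilinear pooling: z = P^T ((U^T x) * (V^T y)), elementwise '*'."""
    ux = matvec_t(U, x)                      # project x to rank r
    vy = matvec_t(V, y)                      # project y to rank r
    h = [a * b for a, b in zip(ux, vy)]      # multiplicative interactions
    return matvec_t(P, h)                    # project to output size o

def full_bilinear(x, y, U, V, P):
    """Reference: z_k = x^T W_k y with W_k = sum_j P[j][k] * U[:, j] V[:, j]^T."""
    d_x, d_y, r, o = len(x), len(y), len(P), len(P[0])
    out = []
    for k in range(o):
        W_k = [[sum(P[j][k] * U[a][j] * V[b][j] for j in range(r))
                for b in range(d_y)] for a in range(d_x)]
        out.append(sum(x[a] * W_k[a][b] * y[b]
                       for a in range(d_x) for b in range(d_y)))
    return out

def extended_low_rank_bilinear(x, y, U, V, P):
    """Assumed extension: append 1 to each input so first-order terms are kept.

    U and V must therefore have len(x) + 1 and len(y) + 1 rows respectively.
    """
    return low_rank_bilinear(x + [1.0], y + [1.0], U, V, P)

def random_matrix(rows, cols, rng):
    """Hypothetical helper: a rows x cols matrix of uniform values in [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

# Toy sizes (assumptions): input size d, rank r, output size o.
rng = random.Random(0)
d, r, o = 4, 3, 2
x = [rng.uniform(-1.0, 1.0) for _ in range(d)]
y = [rng.uniform(-1.0, 1.0) for _ in range(d)]
U, V, P = random_matrix(d, r, rng), random_matrix(d, r, rng), random_matrix(r, o, rng)

z_low = low_rank_bilinear(x, y, U, V, P)
z_full = full_bilinear(x, y, U, V, P)
```

In this sketch `z_low` and `z_full` agree up to floating-point error, while the low-rank path stores only two d-by-r matrices and one r-by-o matrix instead of o separate d-by-d interaction matrices. With the plain bilinear form, an all-zero `y` forces the output to zero; the extended variant still produces a term that depends on `x` alone.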

Comments:	AAAI 2020
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1911.09877 [cs.CL]
	(or arXiv:1911.09877v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1911.09877

Submission history

From: Jian Li
[v1] Fri, 22 Nov 2019 06:38:42 UTC (215 KB)
