ERNIE: Enhanced Language Representation with Informative Entities

Zhang, Zhengyan; Han, Xu; Liu, Zhiyuan; Jiang, Xin; Sun, Maosong; Liu, Qun

arXiv:1905.07129 (cs)

[Submitted on 17 May 2019 (v1), last revised 4 Jun 2019 (this version, v3)]

Abstract: Neural language representation models such as BERT, pre-trained on large-scale corpora, capture rich semantic patterns from plain text and can be fine-tuned to consistently improve performance on various NLP tasks. However, existing pre-trained language models rarely incorporate knowledge graphs (KGs), which can provide rich structured knowledge facts for better language understanding. We argue that informative entities in KGs can enhance language representation with external knowledge. In this paper, we utilize both large-scale textual corpora and KGs to train an enhanced language representation model (ERNIE), which can take full advantage of lexical, syntactic, and knowledge information simultaneously. The experimental results demonstrate that ERNIE achieves significant improvements on various knowledge-driven tasks while remaining comparable with the state-of-the-art model BERT on other common NLP tasks. The source code of this paper can be obtained from this https URL.

Comments:	Accepted by ACL 2019
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1905.07129 [cs.CL]
	(or arXiv:1905.07129v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1905.07129

Submission history

From: Zhengyan Zhang
[v1] Fri, 17 May 2019 06:24:16 UTC (1,571 KB)
[v2] Sun, 26 May 2019 02:42:16 UTC (1,728 KB)
[v3] Tue, 4 Jun 2019 11:35:58 UTC (1,742 KB)
