Revisiting Pre-Trained Models for Chinese Natural Language Processing

Cui, Yiming; Che, Wanxiang; Liu, Ting; Qin, Bing; Wang, Shijin; Hu, Guoping

doi:10.18653/v1/2020.findings-emnlp.58

Computer Science > Computation and Language

arXiv:2004.13922 (cs)

[Submitted on 29 Apr 2020 (v1), last revised 2 Nov 2020 (this version, v2)]

Title:Revisiting Pre-Trained Models for Chinese Natural Language Processing

Authors:Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Shijin Wang, Guoping Hu

View PDF

Abstract:Bidirectional Encoder Representations from Transformers (BERT) has shown marvelous improvements across various NLP tasks, and consecutive variants have been proposed to further improve the performance of the pre-trained language models. In this paper, we target on revisiting Chinese pre-trained language models to examine their effectiveness in a non-English language and release the Chinese pre-trained language model series to the community. We also propose a simple but effective model called MacBERT, which improves upon RoBERTa in several ways, especially the masking strategy that adopts MLM as correction (Mac). We carried out extensive experiments on eight Chinese NLP tasks to revisit the existing pre-trained language models as well as the proposed MacBERT. Experimental results show that MacBERT could achieve state-of-the-art performances on many NLP tasks, and we also ablate details with several findings that may help future research. Resources available: this https URL

Comments:	12 pages, to appear at Findings of EMNLP 2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2004.13922 [cs.CL]
	(or arXiv:2004.13922v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2004.13922
Related DOI:	https://doi.org/10.18653/v1/2020.findings-emnlp.58

Submission history

From: Yiming Cui [view email]
[v1] Wed, 29 Apr 2020 02:08:30 UTC (226 KB)
[v2] Mon, 2 Nov 2020 06:27:52 UTC (51 KB)

Full-text links:

Access Paper:

view license

Current browse context:

< prev | next >

new | recent | 2020-04

Change to browse by:

cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yiming Cui
Wanxiang Che
Ting Liu
Bing Qin
Shijin Wang

…

export BibTeX citation

Computer Science > Computation and Language

Title:Revisiting Pre-Trained Models for Chinese Natural Language Processing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Revisiting Pre-Trained Models for Chinese Natural Language Processing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators