Improving Attributed Text Generation of Large Language Models via Preference Learning

Li, Dongfang; Sun, Zetian; Hu, Baotian; Liu, Zhenyu; Hu, Xinshuo; Liu, Xuebo; Zhang, Min

Computer Science > Computation and Language

arXiv:2403.18381 (cs)

[Submitted on 27 Mar 2024]

Title:Improving Attributed Text Generation of Large Language Models via Preference Learning

Authors:Dongfang Li, Zetian Sun, Baotian Hu, Zhenyu Liu, Xinshuo Hu, Xuebo Liu, Min Zhang

View PDF HTML (experimental)

Abstract:Large language models have been widely adopted in natural language processing, yet they face the challenge of generating unreliable content. Recent works aim to reduce misinformation and hallucinations by resorting to attribution as a means to provide evidence (i.e., citations). However, current attribution methods usually focus on the retrieval stage and automatic evaluation that neglect mirroring the citation mechanisms in human scholarly writing to bolster credibility. In this paper, we address these challenges by modelling the attribution task as preference learning and introducing an Automatic Preference Optimization (APO) framework. First, we create a curated collection for post-training with 6,330 examples by collecting and filtering from existing datasets. Second, considering the high cost of labelling preference data, we further propose an automatic method to synthesize attribution preference data resulting in 95,263 pairs. Moreover, inspired by the human citation process, we further propose a progressive preference optimization method by leveraging fine-grained information. Extensive experiments on three datasets (i.e., ASQA, StrategyQA, and ELI5) demonstrate that APO achieves state-of-the-art citation F1 with higher answer quality.

Comments:	23 pages, 15 tables, 2 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.18381 [cs.CL]
	(or arXiv:2403.18381v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.18381

Submission history

From: Dongfang Li [view email]
[v1] Wed, 27 Mar 2024 09:19:13 UTC (351 KB)

Computer Science > Computation and Language

Title:Improving Attributed Text Generation of Large Language Models via Preference Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Improving Attributed Text Generation of Large Language Models via Preference Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators