Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

Liu, Yinhong; Zhou, Han; Guo, Zhijiang; Shareghi, Ehsan; Vulić, Ivan; Korhonen, Anna; Collier, Nigel

Computer Science > Computation and Language

arXiv:2403.16950 (cs)

[Submitted on 25 Mar 2024 (v1), last revised 10 Aug 2024 (this version, v3)]

Title:Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

Authors:Yinhong Liu, Han Zhou, Zhijiang Guo, Ehsan Shareghi, Ivan Vulić, Anna Korhonen, Nigel Collier

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have demonstrated promising capabilities as automatic evaluators in assessing the quality of generated natural language. However, LLMs still exhibit biases in evaluation and often struggle to generate coherent evaluations that align with human assessments. In this work, we first conduct a systematic study of the misalignment between LLM evaluators and human judgement, revealing that existing calibration methods aimed at mitigating biases are insufficient for effectively aligning LLM evaluators. Inspired by the use of preference data in RLHF, we formulate the evaluation as a ranking problem and introduce Pairwise-preference Search (PairS), an uncertainty-guided search method that employs LLMs to conduct pairwise comparisons and efficiently ranks candidate texts. PairS achieves state-of-the-art performance on representative evaluation tasks and demonstrates significant improvements over direct scoring. Furthermore, we provide insights into the role of pairwise preference in quantifying the transitivity of LLMs and demonstrate how PairS benefits from calibration.

Comments:	This paper has been accepted by COLM 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2403.16950 [cs.CL]
	(or arXiv:2403.16950v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.16950

Submission history

From: Yinhong Liu [view email]
[v1] Mon, 25 Mar 2024 17:11:28 UTC (3,373 KB)
[v2] Tue, 26 Mar 2024 02:28:42 UTC (3,373 KB)
[v3] Sat, 10 Aug 2024 15:42:51 UTC (1,266 KB)

Computer Science > Computation and Language

Title:Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators