Large Language Models Can Be Easily Distracted by Irrelevant Context

Shi, Freda; Chen, Xinyun; Misra, Kanishka; Scales, Nathan; Dohan, David; Chi, Ed; Schärli, Nathanael; Zhou, Denny

arXiv:2302.00093 (cs)

[Submitted on 31 Jan 2023 (v1), last revised 6 Jun 2023 (this version, v3)]

Abstract: Large language models have achieved impressive performance on various natural language processing tasks. However, so far they have been evaluated primarily on benchmarks where all information in the input context is relevant for solving the task. In this work, we investigate the distractibility of large language models, i.e., how the model problem-solving accuracy can be influenced by irrelevant context. In particular, we introduce Grade-School Math with Irrelevant Context (GSM-IC), an arithmetic reasoning dataset with irrelevant information in the problem description. We use this benchmark to measure the distractibility of cutting-edge prompting techniques for large language models, and find that the model performance is dramatically decreased when irrelevant information is included. We also identify several approaches for mitigating this deficiency, such as decoding with self-consistency and adding to the prompt an instruction that tells the language model to ignore the irrelevant information.
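
The two mitigations named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the instruction wording, the function names, and the stub sampler are all hypothetical, and `sample_answer` stands in for whatever model call returns one final answer per sampled reasoning path. Self-consistency is shown here in its simplest form, a majority vote over the sampled final answers.

```python
from collections import Counter
from typing import Callable, List

# Hypothetical wording: the abstract does not give the exact instruction text.
IGNORE_INSTRUCTION = (
    "Solve the problem. Ignore any information that is irrelevant to the question."
)


def build_prompt(problem: str, instruction: str = IGNORE_INSTRUCTION) -> str:
    """Prepend an 'ignore irrelevant information' instruction to the problem."""
    return f"{instruction}\n\nQ: {problem}\nA:"


def self_consistent_answer(
    sample_answer: Callable[[str], str], prompt: str, num_samples: int = 5
) -> str:
    """Sample several final answers and return the most frequent one."""
    answers = [sample_answer(prompt) for _ in range(num_samples)]
    # most_common(1) returns [(answer, count)]; ties resolve to the first seen.
    return Counter(answers).most_common(1)[0][0]


def make_stub_sampler(canned: List[str]) -> Callable[[str], str]:
    """Hypothetical stand-in for a model: replays canned answers in order."""
    replay = iter(canned)

    def sample(prompt: str) -> str:
        return next(replay)

    return sample


if __name__ == "__main__":
    # Illustrative problem with one irrelevant sentence (made up for this sketch).
    problem = (
        "Ann has 3 boxes with 6 pencils each. Her brother is 12 years old. "
        "How many pencils does Ann have?"
    )
    sampler = make_stub_sampler(["18", "30", "18", "18", "12"])
    print(self_consistent_answer(sampler, build_prompt(problem), num_samples=5))
```

In practice the stub would be replaced by a sampling call to a language model with a nonzero temperature, so that the sampled reasoning paths differ before the vote is taken.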

Comments:	Published in ICML 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2302.00093 [cs.CL]
	(or arXiv:2302.00093v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2302.00093

Submission history

From: Xinyun Chen
[v1] Tue, 31 Jan 2023 20:48:57 UTC (1,140 KB)
[v2] Mon, 13 Feb 2023 20:08:59 UTC (1,138 KB)
[v3] Tue, 6 Jun 2023 08:36:20 UTC (1,142 KB)
