The Unreasonable Effectiveness of Easy Training Data for Hard Tasks

Hase, Peter; Bansal, Mohit; Clark, Peter; Wiegreffe, Sarah

arXiv:2401.06751v1 (cs)

[Submitted on 12 Jan 2024 (this version), latest version 5 Jun 2024 (v2)]

Abstract: How can we train models to perform well on hard test data when hard training data is by definition difficult to label correctly? This question has been termed the scalable oversight problem and has drawn increasing attention as language models have continually improved. In this paper, we present the surprising conclusion that current language models often generalize relatively well from easy to hard data, even performing as well as "oracle" models trained on hard data. We demonstrate this kind of easy-to-hard generalization using simple training methods like in-context learning, linear classifier heads, and QLoRA for seven different measures of datapoint hardness, including six empirically diverse human hardness measures (like grade level) and one model-based measure (loss-based). Furthermore, we show that even if one cares most about model performance on hard data, it can be better to collect and train on easy data rather than hard data, since hard data is generally noisier and costlier to collect. Our experiments use open models up to 70b in size and four publicly available question-answering datasets with questions ranging in difficulty from 3rd grade science questions to college level STEM questions and general-knowledge trivia. We conclude that easy-to-hard generalization in LMs is surprisingly strong for the tasks studied, suggesting the scalable oversight problem may be easier than previously thought. Our code is available at this https URL

Comments:	22 pages, 20 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2401.06751 [cs.CL]
	(or arXiv:2401.06751v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.06751

Submission history

From: Peter Hase
[v1] Fri, 12 Jan 2024 18:36:29 UTC (1,397 KB)
[v2] Wed, 5 Jun 2024 14:10:11 UTC (1,315 KB)
