Self-Instruct: Aligning Language Models with Self-Generated Instructions

Wang, Yizhong; Kordi, Yeganeh; Mishra, Swaroop; Liu, Alisa; Smith, Noah A.; Khashabi, Daniel; Hajishirzi, Hannaneh

Computer Science > Computation and Language

arXiv:2212.10560v2 (cs)

[Submitted on 20 Dec 2022 (v1), last revised 25 May 2023 (this version, v2)]

Title:Self-Instruct: Aligning Language Models with Self-Generated Instructions

Authors:Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi

View PDF

Abstract:Large "instruction-tuned" language models (i.e., finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is often limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We introduce Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models by bootstrapping off their own generations. Our pipeline generates instructions, input, and output samples from a language model, then filters invalid or similar ones before using them to finetune the original model. Applying our method to the vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT-001, which was trained with private user data and human annotations. For further evaluation, we curate a set of expert-written instructions for novel tasks, and show through human evaluation that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT-001. Self-Instruct provides an almost annotation-free method for aligning pre-trained language models with instructions, and we release our large synthetic dataset to facilitate future studies on instruction tuning. Our code and data are available at this https URL.

Comments:	ACL 2023 camera ready, 23 pages, 9 figures, 11 tables
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2212.10560 [cs.CL]
	(or arXiv:2212.10560v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.10560

Submission history

From: Yizhong Wang [view email]
[v1] Tue, 20 Dec 2022 18:59:19 UTC (4,072 KB)
[v2] Thu, 25 May 2023 23:50:07 UTC (7,954 KB)

Computer Science > Computation and Language

Title:Self-Instruct: Aligning Language Models with Self-Generated Instructions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Self-Instruct: Aligning Language Models with Self-Generated Instructions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators