Emergent and Predictable Memorization in Large Language Models

Biderman, Stella; Prashanth, USVSN Sai; Sutawika, Lintang; Schoelkopf, Hailey; Anthony, Quentin; Purohit, Shivanshu; Raff, Edward

Computer Science > Computation and Language

arXiv:2304.11158 (cs)

[Submitted on 21 Apr 2023 (v1), last revised 31 May 2023 (this version, v2)]

Title:Emergent and Predictable Memorization in Large Language Models

Authors:Stella Biderman, USVSN Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit, Edward Raff

View PDF

Abstract:Memorization, or the tendency of large language models (LLMs) to output entire sequences from their training data verbatim, is a key concern for safely deploying language models. In particular, it is vital to minimize a model's memorization of sensitive datapoints such as those containing personal identifiable information (PII). The prevalence of such undesirable memorization can pose issues for model trainers, and may even require discarding an otherwise functional model. We therefore seek to predict which sequences will be memorized before a large model's full train-time by extrapolating the memorization behavior of lower-compute trial runs. We measure memorization of the Pythia model suite and plot scaling laws for forecasting memorization, allowing us to provide equi-compute recommendations to maximize the reliability (recall) of such predictions. We additionally provide further novel discoveries on the distribution of memorization scores across models and data. We release all code and data necessary to reproduce the results in this paper at this https URL

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2304.11158 [cs.CL]
	(or arXiv:2304.11158v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2304.11158

Submission history

From: Hailey Schoelkopf [view email]
[v1] Fri, 21 Apr 2023 17:58:31 UTC (6,352 KB)
[v2] Wed, 31 May 2023 19:09:45 UTC (13,432 KB)

Computer Science > Computation and Language

Title:Emergent and Predictable Memorization in Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Emergent and Predictable Memorization in Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators