TRAK: Attributing Model Behavior at Scale

Park, Sung Min; Georgiev, Kristian; Ilyas, Andrew; Leclerc, Guillaume; Madry, Aleksander

arXiv:2303.14186 (stat)

[Submitted on 24 Mar 2023 (v1), last revised 3 Apr 2023 (this version, v2)]

Abstract: The goal of data attribution is to trace model predictions back to training data. Despite a long line of work towards this goal, existing approaches to data attribution tend to force users to choose between computational tractability and efficacy. That is, computationally tractable methods can struggle with accurately attributing model predictions in non-convex settings (e.g., in the context of deep neural networks), while methods that are effective in such regimes require training thousands of models, which makes them impractical for large models or datasets.
In this work, we introduce TRAK (Tracing with the Randomly-projected After Kernel), a data attribution method that is both effective and computationally tractable for large-scale, differentiable models. In particular, by leveraging only a handful of trained models, TRAK can match the performance of attribution methods that require training thousands of models. We demonstrate the utility of TRAK across various modalities and scales: image classifiers trained on ImageNet, vision-language models (CLIP), and language models (BERT and mT5). We provide code for using TRAK (and reproducing our work) at this https URL.
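
The abstract does not describe the algorithm itself, so the following is only a loose illustration of the general idea the name hints at: compress per-example gradients with a random projection, then score each training example by its similarity to a test example in the projected space. It is plain projected-gradient similarity, not the paper's estimator, and it omits any further steps TRAK takes. All function names (`make_projection`, `project`, `attribution_scores`) and the toy gradient vectors are hypothetical, chosen here for illustration.

```python
import random


def make_projection(dim_in, dim_out, seed=0):
    """Build a Gaussian random matrix (dim_in x dim_out), scaled by 1/sqrt(dim_out)."""
    rng = random.Random(seed)
    scale = dim_out ** -0.5
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(dim_out)]
            for _ in range(dim_in)]


def project(grad, proj):
    """Map a gradient vector of length dim_in to a feature vector of length dim_out."""
    dim_out = len(proj[0])
    return [sum(g * row[j] for g, row in zip(grad, proj)) for j in range(dim_out)]


def attribution_scores(train_grads, test_grad, proj):
    """Score each training example by the inner product of projected gradients."""
    test_feat = project(test_grad, proj)
    scores = []
    for grad in train_grads:
        feat = project(grad, proj)
        scores.append(sum(a * b for a, b in zip(test_feat, feat)))
    return scores


# Toy gradients (assumed, not from the paper): the first training example has the
# same gradient as the test example, the second has the opposite gradient, and
# the third has a zero gradient.
test_grad = [1.0, -2.0, 0.5, 3.0]
train_grads = [test_grad, [-v for v in test_grad], [0.0] * 4]

proj = make_projection(dim_in=4, dim_out=2, seed=0)
scores = attribution_scores(train_grads, test_grad, proj)
print(scores)
```

In this toy setup the training example whose gradient matches the test gradient receives a positive score, the opposing one a negative score of equal magnitude, and the zero-gradient one a score of zero. The projection keeps the feature dimension small, which is what would make such comparisons affordable when the raw gradient has millions of entries.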

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2303.14186 [stat.ML]
	(or arXiv:2303.14186v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2303.14186

Submission history

From: Andrew Ilyas
[v1] Fri, 24 Mar 2023 17:56:22 UTC (16,397 KB)
[v2] Mon, 3 Apr 2023 17:37:50 UTC (16,380 KB)
