What is it for a Machine Learning Model to Have a Capability?

Jacqueline Harding; Nathaniel Sharadin

What is it for a Machine Learning Model to Have a Capability?

British Journal for the Philosophy of Science (forthcoming) Copy BIBT_EX

Abstract

What can contemporary machine learning (ML) models do? Given the proliferation of ML models in society, answering this question matters to a variety of stakeholders, both public and private. The evaluation of models' capabilities is rapidly emerging as a key subfield of modern ML, buoyed by regulatory attention and government grants. Despite this, the notion of an ML model possessing a capability has not been interrogated: what are we saying when we say that a model is able to do something? And what sorts of evidence bear upon this question? In this paper, we aim to answer these questions, using the capabilities of large language models (LLMs) as a running example. Drawing on the large philosophical literature on abilities, we develop an account of ML models' capabilities which can be usefully applied to the nascent science of model evaluation. Our core proposal is a conditional analysis of model abilities (CAMA): crudely, a machine learning model has a capability to X just when it would reliably succeed at doing X if it 'tried'. The main contribution of the paper is making this proposal precise in the context of ML, resulting in an operationalisation of CAMA applicable to LLMs. We then put CAMA to work, showing that it can help make sense of various features of ML model evaluation practice, as well as suggest procedures for performing fair inter-model comparisons.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Edit

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

View on PhilPapers

Author Profiles

Nathaniel Sharadin

University of Hong Kong

Jacqueline Harding

Stanford University

Archival history

Archival date: 2024-05-14
View all versions

Keywords

Large Language Model Abilities Benchmarks Machine Learning Artificial Intelligence Performance/Competence Distinction Model Evaluation Artificial Agency

Reprint years

DOI

10.1086/732153

Analytics

Added to PP
2024-05-14

Downloads
570 (#39,210)

6 months
382 (#3,474)

Historical graph of downloads since first upload

This graph includes both downloads from PhilArchive and clicks on external links on PhilPapers.

How can I increase my downloads?

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

What is it for a Machine Learning Model to Have a Capability?

Abstract

Author Profiles

Archival history

Categories

Keywords

Reprint years

DOI

Analytics