Measuring progress in robotics: Benchmarking and the ‘measure-target confusion’

Vincent C. Müller

Measuring progress in robotics: Benchmarking and the ‘measure-target confusion’

In Fabio Bonsignorio, John Hallam, Elena Messina & Angel P. Del Pobil (eds.), Metrics of sensory motor coordination and integration in robots and animals. Springer. pp. 169-179 (2019) Copy BIBT_EX

Abstract

While it is often said that robotics should aspire to reproducible and measurable results that allow benchmarking, I argue that a focus on benchmarking can be a hindrance for progress in robotics. The reason is what I call the ‘measure-target confusion’, the confusion between a measure of progress and the target of progress. Progress on a benchmark (the measure) is not identical to scientific or technological progress (the target). In the past, several academic disciplines have been led into pursuing only reproducible and measurable ‘scientific’ results – robotics should be careful to follow that line because results that can be benchmarked must be specific and context-dependent, but robotics targets whole complex systems for a broad variety of contexts. While it is extremely valuable to improve benchmarks to reduce the distance be- tween measure and target, the general problem to measure progress towards more intelligent machines (the target) will not be solved by benchmarks alone; we need a balanced approach with sophisticated benchmarks, plus real-life testing, plus qualitative judgment.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Edit

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

View on PhilPapers

Author's Profile

Vincent C. Müller

Universität Erlangen-Nürnberg

Archival history

First archival date: 2018-06-09
Latest version: 2 (2019-04-30)
View all versions

Keywords

artificial intelligence benchmark Campbell's law intelligence test measure-target confusion robotics science

Reprint years

Analytics

Added to PP
2018-06-09

Downloads
645 (#33,397)

6 months
119 (#41,593)

Historical graph of downloads since first upload

This graph includes both downloads from PhilArchive and clicks on external links on PhilPapers.

How can I increase my downloads?

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Measuring progress in robotics: Benchmarking and the ‘measure-target confusion’

Abstract

Author's Profile

Archival history

Categories

Keywords

Reprint years

Analytics