Optimizing the Dice Score and Jaccard Index for Medical Image Segmentation: Theory & Practice

Bertels, Jeroen; Eelbode, Tom; Berman, Maxim; Vandermeulen, Dirk; Maes, Frederik; Bisschops, Raf; Blaschko, Matthew

doi:10.1007/978-3-030-32245-8_11

arXiv:1911.01685 (cs)

[Submitted on 5 Nov 2019]

Abstract: The Dice score and Jaccard index are commonly used metrics for the evaluation of segmentation tasks in medical imaging. Convolutional neural networks trained for image segmentation tasks are usually optimized for (weighted) cross-entropy. This introduces an adverse discrepancy between the learning optimization objective (the loss) and the end target metric. Recent works in computer vision have proposed soft surrogates to alleviate this discrepancy and directly optimize the desired metric, either through relaxations (soft-Dice, soft-Jaccard) or submodular optimization (Lovász-softmax). The aim of this study is two-fold. First, we investigate the theoretical differences in a risk minimization framework and question the existence of a weighted cross-entropy loss with weights theoretically optimized to surrogate Dice or Jaccard. Second, we empirically investigate the behavior of the aforementioned loss functions w.r.t. evaluation with Dice score and Jaccard index on five medical segmentation tasks. Through the application of relative approximation bounds, we show that all surrogates are equivalent up to a multiplicative factor, and that no optimal weighting of cross-entropy exists to approximate Dice or Jaccard measures. We validate these findings empirically and show that, while it is important to opt for one of the target metric surrogates rather than a cross-entropy-based loss, the choice of the surrogate does not make a statistically significant difference on a wide range of medical segmentation tasks.
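
To make the "relaxation" surrogates mentioned in the abstract concrete, the following is a minimal NumPy sketch of one common form of the soft-Dice and soft-Jaccard scores for a single binary foreground class. It is an illustration under stated assumptions, not the authors' implementation: the function names, the `eps` smoothing term, and the flattening of inputs are hypothetical choices, and the paper's exact formulation may differ. The Lovász-softmax surrogate is not shown.

```python
import numpy as np


def soft_dice(probs, target, eps=1e-7):
    """Soft Dice score: 2 * sum(p * y) / (sum(p) + sum(y)).

    probs  : predicted foreground probabilities in [0, 1] (assumed)
    target : binary ground-truth mask (assumed)
    eps    : illustrative smoothing term, added to the denominator only
             (so two empty masks score 0 here, not 1)
    """
    p = np.asarray(probs, dtype=float).ravel()
    y = np.asarray(target, dtype=float).ravel()
    intersection = np.sum(p * y)
    total = np.sum(p) + np.sum(y)
    return 2.0 * intersection / (total + eps)


def soft_jaccard(probs, target, eps=1e-7):
    """Soft Jaccard index: sum(p * y) / (sum(p) + sum(y) - sum(p * y))."""
    p = np.asarray(probs, dtype=float).ravel()
    y = np.asarray(target, dtype=float).ravel()
    intersection = np.sum(p * y)
    total = np.sum(p) + np.sum(y)
    return intersection / (total - intersection + eps)


def soft_dice_loss(probs, target):
    """Loss to minimize: 1 - soft Dice score."""
    return 1.0 - soft_dice(probs, target)


def soft_jaccard_loss(probs, target):
    """Loss to minimize: 1 - soft Jaccard index."""
    return 1.0 - soft_jaccard(probs, target)


if __name__ == "__main__":
    probs = [0.9, 0.8, 0.2, 0.1]   # toy predictions
    target = [1, 1, 0, 0]          # toy ground truth
    d = soft_dice(probs, target)
    j = soft_jaccard(probs, target)
    # The two scores are tied by the identity J = D / (2 - D).
    print(d, j, d / (2.0 - d))
```

Because both scores are built from the same intersection and sum terms, they satisfy J = D / (2 - D), so each is a monotone function of the other. This is one informal way to see why the choice between these two surrogates might matter little in practice; the paper's own argument rests on relative approximation bounds.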

Comments:	MICCAI 2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:1911.01685 [cs.CV]
	(or arXiv:1911.01685v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1911.01685
Journal reference:	LNCS 11765, Springer Nature Switzerland AG 2019
Related DOI:	https://doi.org/10.1007/978-3-030-32245-8_11

Submission history

From: Jeroen Bertels
[v1] Tue, 5 Nov 2019 09:42:25 UTC (1,753 KB)
