Moonshine: Distilling with Cheap Convolutions

Crowley, Elliot J.; Gray, Gavin; Storkey, Amos

Statistics > Machine Learning

arXiv:1711.02613v1 (stat)

[Submitted on 7 Nov 2017 (this version), latest version 17 Jan 2019 (v4)]

Title: Moonshine: Distilling with Cheap Convolutions

Authors: Elliot J. Crowley, Gavin Gray, Amos Storkey

Abstract: Model distillation compresses a trained machine learning model, such as a neural network, into a smaller alternative such that it could be easily deployed in a resource-limited setting. Unfortunately, this requires engineering two architectures: a student architecture smaller than the first teacher architecture but trained to emulate it. In this paper, we present a distillation strategy that produces a student architecture that is a simple transformation of the teacher architecture. Recent model distillation methods allow us to preserve most of the performance of the trained model after replacing convolutional blocks with a cheap alternative. In addition, distillation by attention transfer provides student network performance that is better than training that student architecture directly on data.
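To make "replacing convolutional blocks with a cheap alternative" concrete, the following minimal sketch counts the weights in a standard convolution and in one possible cheaper substitute. The abstract does not say which substitute is used, so the grouped convolution followed by a 1x1 pointwise convolution is an assumption made for illustration. The helper names `conv_params` and `cheap_block_params` and the channel sizes are hypothetical.

```python
def conv_params(c_in, c_out, k, groups=1):
    """Number of weights in a k x k convolution, ignoring the bias term."""
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    # Each output channel only sees c_in / groups input channels.
    return (c_in // groups) * c_out * k * k


def cheap_block_params(c_in, c_out, k, groups):
    """Assumed cheap block: grouped k x k convolution, then 1 x 1 pointwise."""
    grouped = conv_params(c_in, c_in, k, groups=groups)
    pointwise = conv_params(c_in, c_out, 1)  # mixes information across groups
    return grouped + pointwise


# Illustrative sizes only; they are not taken from the paper.
standard = conv_params(64, 64, 3)
cheap = cheap_block_params(64, 64, 3, groups=8)
depthwise = cheap_block_params(64, 64, 3, groups=64)

print(f"standard 3x3:        {standard} weights")
print(f"grouped + pointwise: {cheap} weights")
print(f"depthwise + pointwise: {depthwise} weights")
```

Because the substitute block keeps the same input and output channel counts, applying it to every block yields a student whose layout mirrors the teacher's, which is the sense in which the student is a simple transformation of the teacher.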

Subjects:	Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1711.02613 [stat.ML]
	(or arXiv:1711.02613v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1711.02613

Submission history

From: Gavin Gray
[v1] Tue, 7 Nov 2017 17:21:06 UTC (316 KB)
[v2] Mon, 21 May 2018 11:43:02 UTC (318 KB)
[v3] Mon, 22 Oct 2018 16:47:40 UTC (125 KB)
[v4] Thu, 17 Jan 2019 12:26:19 UTC (124 KB)
