Enabling Binary Neural Network Training on the Edge

Wang, Erwei; Davis, James J.; Moro, Daniele; Zielinski, Piotr; Coelho, Claudionor; Chatterjee, Satrajit; Cheung, Peter Y. K.; Constantinides, George A.

Computer Science > Machine Learning

arXiv:2102.04270v2 (cs)

[Submitted on 8 Feb 2021 (v1), revised 10 Feb 2021 (this version, v2), latest version 24 Sep 2023 (v6)]

Title:Enabling Binary Neural Network Training on the Edge

Authors:Erwei Wang, James J. Davis, Daniele Moro, Piotr Zielinski, Claudionor Coelho, Satrajit Chatterjee, Peter Y. K. Cheung, George A. Constantinides

View PDF

Abstract:The ever-growing computational demands of increasingly complex machine learning models frequently necessitate the use of powerful cloud-based infrastructure for their training. Binary neural networks are known to be promising candidates for on-device inference due to their extreme compute and memory savings over higher-precision alternatives. In this paper, we demonstrate that they are also strongly robust to gradient quantization, thereby making the training of modern models on the edge a practical reality. We introduce a low-cost binary neural network training strategy exhibiting sizable memory footprint reductions and energy savings vs Courbariaux & Bengio's standard approach. Against the latter, we see coincident memory requirement and energy consumption drops of 2--6$\times$, while reaching similar test accuracy in comparable time, across a range of small-scale models trained to classify popular datasets. We also showcase ImageNet training of ResNetE-18, achieving a 3.12$\times$ memory reduction over the aforementioned standard. Such savings will allow for unnecessary cloud offloading to be avoided, reducing latency, increasing energy efficiency and safeguarding privacy.

Subjects:	Machine Learning (cs.LG); Hardware Architecture (cs.AR)
Cite as:	arXiv:2102.04270 [cs.LG]
	(or arXiv:2102.04270v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2102.04270

Submission history

From: Erwei Wang [view email]
[v1] Mon, 8 Feb 2021 15:06:41 UTC (123 KB)
[v2] Wed, 10 Feb 2021 21:57:45 UTC (123 KB)
[v3] Mon, 26 Apr 2021 13:24:13 UTC (123 KB)
[v4] Tue, 8 Jun 2021 17:39:07 UTC (210 KB)
[v5] Sun, 10 Jul 2022 21:29:27 UTC (3,971 KB)
[v6] Sun, 24 Sep 2023 23:07:32 UTC (113 KB)

Computer Science > Machine Learning

Title:Enabling Binary Neural Network Training on the Edge

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Enabling Binary Neural Network Training on the Edge

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators