Efficient Algorithms for Device Placement of DNN Graph Operators

Tarnawski, Jakub; Phanishayee, Amar; Devanur, Nikhil R.; Mahajan, Divya; Paravecino, Fanny Nina

Computer Science > Machine Learning

arXiv:2006.16423v2 (cs)

[Submitted on 29 Jun 2020 (v1), last revised 29 Oct 2020 (this version, v2)]

Title:Efficient Algorithms for Device Placement of DNN Graph Operators

Authors:Jakub Tarnawski, Amar Phanishayee, Nikhil R. Devanur, Divya Mahajan, Fanny Nina Paravecino

View PDF

Abstract:Modern machine learning workloads use large models, with complex structures, that are very expensive to execute. The devices that execute complex models are becoming increasingly heterogeneous as we see a flourishing of domain-specific accelerators being offered as hardware accelerators in addition to CPUs. These trends necessitate distributing the workload across multiple devices. Recent work has shown that significant gains can be obtained with model parallelism, i.e, partitioning a neural network's computational graph onto multiple devices. In particular, this form of parallelism assumes a pipeline of devices, which is fed a stream of samples and yields high throughput for training and inference of DNNs. However, for such settings (large models and multiple heterogeneous devices), we require automated algorithms and toolchains that can partition the ML workload across devices. In this paper, we identify and isolate the structured optimization problem at the core of device placement of DNN operators, for both inference and training, especially in modern pipelined settings. We then provide algorithms that solve this problem to optimality. We demonstrate the applicability and efficiency of our approaches using several contemporary DNN computation graphs.

Comments:	Accepted to NeurIPS 2020
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
Cite as:	arXiv:2006.16423 [cs.LG]
	(or arXiv:2006.16423v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.16423

Submission history

From: Jakub Tarnawski [view email]
[v1] Mon, 29 Jun 2020 22:45:01 UTC (1,233 KB)
[v2] Thu, 29 Oct 2020 19:07:35 UTC (1,230 KB)

Computer Science > Machine Learning

Title:Efficient Algorithms for Device Placement of DNN Graph Operators

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Efficient Algorithms for Device Placement of DNN Graph Operators

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators