On the Expressive Power of Deep Learning: A Tensor Analysis

Cohen, Nadav; Sharir, Or; Shashua, Amnon

arXiv:1509.05009 (cs)

[Submitted on 16 Sep 2015 (v1), last revised 27 May 2016 (this version, v3)]

Abstract: It has long been conjectured that hypothesis spaces suitable for data that is compositional in nature, such as text or images, may be more efficiently represented with deep hierarchical networks than with shallow ones. Despite the vast empirical evidence supporting this belief, theoretical justifications to date are limited. In particular, they do not account for the locality, sharing and pooling constructs of convolutional networks, the most successful deep learning architecture to date. In this work we derive a deep network architecture based on arithmetic circuits that inherently employs locality, sharing and pooling. An equivalence between the networks and hierarchical tensor factorizations is established. We show that a shallow network corresponds to CP (rank-1) decomposition, whereas a deep network corresponds to Hierarchical Tucker decomposition. Using tools from measure theory and matrix algebra, we prove that, apart from a negligible set, all functions that can be implemented by a deep network of polynomial size require exponential size to be realized (or even approximated) by a shallow network. Since log-space computation transforms our networks into SimNets, the result applies directly to a deep learning architecture demonstrating promising empirical performance. The construction and theory developed in this paper shed new light on various practices and ideas employed by the deep learning community.
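
The contrast the abstract draws between CP and Hierarchical Tucker decompositions can be illustrated numerically. The following is a minimal NumPy sketch, not the authors' code: it assumes an order-4 tensor, random Gaussian weights, and illustrative sizes (`M`, `Z`, `r0`, `r1`), and all function names are hypothetical. It also assumes that comparing the rank of a matricization (modes 1 and 3 as rows, modes 2 and 4 as columns) is a fair proxy for the separation the abstract describes. A CP tensor with `Z` terms has matricization rank at most `Z`, while a small two-level hierarchical construction generically reaches the full rank `M * M`.

```python
import numpy as np

rng = np.random.default_rng(0)
M, Z = 4, 3      # mode dimension and number of CP terms (illustrative values)
r0, r1 = 4, 2    # hypothetical widths of the two levels of the deep model


def cp_tensor(weights, factors):
    """Order-N tensor built as a weighted sum of Z rank-1 terms (CP form)."""
    total = 0.0
    for z in range(len(weights)):
        term = weights[z]
        for f in factors:  # each f has shape (Z, M)
            term = np.multiply.outer(term, f[z])
        total = total + term
    return total


def ht_tensor_order4(a0, a1, a2):
    """Order-4 tensor from a two-level hierarchical (Tucker-style) construction."""
    # Level 1: for each pair of adjacent modes, mix r0 rank-1 matrices into r1 matrices.
    phi = [
        np.einsum("ga,ai,aj->gij", a1[j], a0[2 * j], a0[2 * j + 1])
        for j in range(2)
    ]
    # Level 2: combine the two pairs with the top-level weights.
    return np.einsum("g,gij,gkl->ijkl", a2, phi[0], phi[1])


def matricize(t):
    """Arrange modes (1, 3) as rows and modes (2, 4) as columns."""
    m = t.shape[0]
    return t.transpose(0, 2, 1, 3).reshape(m * m, m * m)


# Shallow model: CP decomposition with Z terms.
cp = cp_tensor(
    rng.standard_normal(Z),
    [rng.standard_normal((Z, M)) for _ in range(4)],
)

# Deep model: hierarchical decomposition with random weights.
ht = ht_tensor_order4(
    [rng.standard_normal((r0, M)) for _ in range(4)],
    [rng.standard_normal((r1, r0)) for _ in range(2)],
    rng.standard_normal(r1),
)

rank_cp = np.linalg.matrix_rank(matricize(cp))
rank_ht = np.linalg.matrix_rank(matricize(ht))
print("CP matricization rank:", rank_cp, "(bounded by Z =", Z, ")")
print("HT matricization rank:", rank_ht, "(upper bound M*M =", M * M, ")")
```

Under these assumptions, matching the hierarchical tensor with a CP model would need at least as many terms as the matricization rank, which grows as `M ** (N / 2)` for an order-`N` tensor. This is only a toy view of the exponential gap stated in the abstract.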

Subjects:	Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
Cite as:	arXiv:1509.05009 [cs.NE]
	(or arXiv:1509.05009v3 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1509.05009
Journal reference:	29th Annual Conference on Learning Theory, pp. 698-728, 2016

Submission history

From: Nadav Cohen
[v1] Wed, 16 Sep 2015 19:32:54 UTC (767 KB)
[v2] Sun, 14 Feb 2016 16:31:49 UTC (412 KB)
[v3] Fri, 27 May 2016 19:07:22 UTC (409 KB)
