Beating the Perils of Non-Convexity: Guaranteed Training of Neural Networks using Tensor Methods

Janzamin, Majid; Sedghi, Hanie; Anandkumar, Anima

arXiv:1506.08473 (cs)

[Submitted on 28 Jun 2015 (v1), last revised 12 Jan 2016 (this version, v3)]

Abstract: Training neural networks is a challenging non-convex optimization problem, and backpropagation or gradient descent can get stuck in spurious local optima. We propose a novel algorithm based on tensor decomposition for guaranteed training of two-layer neural networks. We provide risk bounds for our proposed method, with polynomial sample complexity in the relevant parameters, such as the input dimension and the number of neurons. While learning arbitrary target functions is NP-hard, we provide transparent conditions on the function and the input for learnability. Our training method is based on tensor decomposition, which provably converges to the global optimum under a set of mild non-degeneracy conditions. It consists of simple, embarrassingly parallel linear and multi-linear operations, and is competitive with standard stochastic gradient descent (SGD) in terms of computational complexity. Thus, we propose a computationally efficient method with guaranteed risk bounds for training neural networks with one hidden layer.
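
The abstract describes tensor decomposition only at a high level. As a rough illustration of the kind of multi-linear operations involved, the sketch below runs a generic tensor power iteration with deflation on a symmetric third-order tensor whose components are orthonormal. This is not the paper's full procedure: the orthonormality of the components, the positive weights, and the helper names `build_tensor`, `tensor_power_iteration`, and `decompose` are all assumptions made for this example.

```python
import numpy as np


def build_tensor(weights, components):
    """Form T = sum_r weights[r] * a_r (x) a_r (x) a_r, with a_r the columns of `components`."""
    return np.einsum("r,ir,jr,kr->ijk", weights, components, components, components)


def tensor_power_iteration(T, n_restarts=10, n_iters=100, rng=None):
    """Return (eigenvalue, eigenvector) from the best of several random restarts."""
    rng = np.random.default_rng(0) if rng is None else rng
    d = T.shape[0]
    best_val, best_u = -np.inf, None
    for _ in range(n_restarts):
        u = rng.standard_normal(d)
        u /= np.linalg.norm(u)
        for _ in range(n_iters):
            # Multi-linear map T(I, u, u), followed by normalization.
            v = np.einsum("ijk,j,k->i", T, u, u)
            norm = np.linalg.norm(v)
            if norm < 1e-12:
                break
            u = v / norm
        val = float(np.einsum("ijk,i,j,k->", T, u, u, u))
        if val > best_val:
            best_val, best_u = val, u
    return best_val, best_u


def decompose(T, k, rng=None):
    """Extract k components one at a time, deflating the tensor after each."""
    rng = np.random.default_rng(0) if rng is None else rng
    T = T.copy()
    weights, comps = [], []
    for _ in range(k):
        lam, u = tensor_power_iteration(T, rng=rng)
        weights.append(lam)
        comps.append(u)
        # Deflation: subtract the recovered rank-one term.
        T = T - lam * np.einsum("i,j,k->ijk", u, u, u)
    return np.array(weights), np.stack(comps, axis=1)


# Usage on a synthetic tensor (hypothetical data, for illustration only).
rng = np.random.default_rng(1)
A_true, _ = np.linalg.qr(rng.standard_normal((5, 3)))  # orthonormal columns
w_true = np.array([3.0, 2.0, 1.0])
T = build_tensor(w_true, A_true)
w_hat, A_hat = decompose(T, k=3)
```

Each restart is independent of the others, and each update is a single tensor contraction, which is consistent with the abstract's description of the operations as embarrassingly parallel. The components may be recovered in any order, so a comparison against `A_true` should allow for a permutation.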

Comments:	The tensor decomposition analysis is expanded, and the analysis of ridge regression is added for recovering the parameters of the last layer of the neural network
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:1506.08473 [cs.LG]
	(or arXiv:1506.08473v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1506.08473
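
The comments field mentions ridge regression for recovering the last-layer parameters. A minimal sketch of that idea follows, assuming the first-layer weights and biases are already known (for example, estimated in an earlier step) and assuming a sigmoid activation. The function name `fit_last_layer_ridge`, the regularization value, and the synthetic data are hypothetical choices for this example, not details taken from the paper.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def fit_last_layer_ridge(X, y, A1, b1, lam=1e-3):
    """Closed-form ridge regression for the output layer of a one-hidden-layer network.

    X: (n, d) inputs, y: (n,) targets, A1: (d, k) first-layer weights, b1: (k,) biases.
    Returns (a2, b2) such that the prediction is sigmoid(X @ A1 + b1) @ a2 + b2.
    """
    H = sigmoid(X @ A1 + b1)                      # hidden activations, shape (n, k)
    n, k = H.shape
    H1 = np.hstack([H, np.ones((n, 1))])          # append a constant column for the bias
    reg = lam * np.eye(k + 1)
    reg[-1, -1] = 0.0                             # assumption: the bias is not penalized
    theta = np.linalg.solve(H1.T @ H1 + reg, H1.T @ y)
    return theta[:-1], theta[-1]


# Usage on noiseless synthetic data (hypothetical values, for illustration only).
rng = np.random.default_rng(0)
n, d, k = 2000, 4, 3
X = rng.standard_normal((n, d))
A1 = rng.standard_normal((d, k))
b1 = rng.standard_normal(k)
a2_true = np.array([1.5, -2.0, 0.5])
b2_true = 0.3
y = sigmoid(X @ A1 + b1) @ a2_true + b2_true
a2_hat, b2_hat = fit_last_layer_ridge(X, y, A1, b1, lam=1e-8)
```

Once the hidden layer is fixed, the output layer is linear in its parameters, so it can be fit by solving one regularized linear system instead of by iterative descent.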

Submission history

From: Majid Janzamin
[v1] Sun, 28 Jun 2015 23:19:49 UTC (59 KB)
[v2] Mon, 24 Aug 2015 19:31:24 UTC (54 KB)
[v3] Tue, 12 Jan 2016 03:19:42 UTC (285 KB)
