Exploiting Explainable Metrics for Augmented SGD

Hosseini, Mahdi S.; Tuli, Mathieu; Plataniotis, Konstantinos N.

Computer Science > Machine Learning

arXiv:2203.16723 (cs)

[Submitted on 31 Mar 2022]

Title:Exploiting Explainable Metrics for Augmented SGD

Authors:Mahdi S. Hosseini, Mathieu Tuli, Konstantinos N. Plataniotis

View PDF

Abstract:Explaining the generalization characteristics of deep learning is an emerging topic in advanced machine learning. There are several unanswered questions about how learning under stochastic optimization really works and why certain strategies are better than others. In this paper, we address the following question: \textit{can we probe intermediate layers of a deep neural network to identify and quantify the learning quality of each layer?} With this question in mind, we propose new explainability metrics that measure the redundant information in a network's layers using a low-rank factorization framework and quantify a complexity measure that is highly correlated with the generalization performance of a given optimizer, network, and dataset. We subsequently exploit these metrics to augment the Stochastic Gradient Descent (SGD) optimizer by adaptively adjusting the learning rate in each layer to improve in generalization performance. Our augmented SGD -- dubbed RMSGD -- introduces minimal computational overhead compared to SOTA methods and outperforms them by exhibiting strong generalization characteristics across application, architecture, and dataset.

Comments:	Accepted in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR2022)
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.16723 [cs.LG]
	(or arXiv:2203.16723v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.16723

Submission history

From: Mahdi S. Hosseini Dr. [view email]
[v1] Thu, 31 Mar 2022 00:16:44 UTC (7,978 KB)

Computer Science > Machine Learning

Title:Exploiting Explainable Metrics for Augmented SGD

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Exploiting Explainable Metrics for Augmented SGD

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators