Multi-Objective Optimization for Sparse Deep Multi-Task Learning

Hotegni, S. S.; Berkemeier, M.; Peitz, S.

Computer Science > Machine Learning

arXiv:2308.12243 (cs)

[Submitted on 23 Aug 2023 (v1), last revised 26 Mar 2024 (this version, v4)]

Title:Multi-Objective Optimization for Sparse Deep Multi-Task Learning

Authors:S. S. Hotegni, M. Berkemeier, S. Peitz

View PDF HTML (experimental)

Abstract:Different conflicting optimization criteria arise naturally in various Deep Learning scenarios. These can address different main tasks (i.e., in the setting of Multi-Task Learning), but also main and secondary tasks such as loss minimization versus sparsity. The usual approach is a simple weighting of the criteria, which formally only works in the convex setting. In this paper, we present a Multi-Objective Optimization algorithm using a modified Weighted Chebyshev scalarization for training Deep Neural Networks (DNNs) with respect to several tasks. By employing this scalarization technique, the algorithm can identify all optimal solutions of the original problem while reducing its complexity to a sequence of single-objective problems. The simplified problems are then solved using an Augmented Lagrangian method, enabling the use of popular optimization techniques such as Adam and Stochastic Gradient Descent, while efficaciously handling constraints. Our work aims to address the (economical and also ecological) sustainability issue of DNN models, with a particular focus on Deep Multi-Task models, which are typically designed with a very large number of weights to perform equally well on multiple tasks. Through experiments conducted on two Machine Learning datasets, we demonstrate the possibility of adaptively sparsifying the model during training without significantly impacting its performance, if we are willing to apply task-specific adaptations to the network weights. Code is available at this https URL

Comments:	12 pages, 7 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Optimization and Control (math.OC)
Cite as:	arXiv:2308.12243 [cs.LG]
	(or arXiv:2308.12243v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2308.12243

Submission history

From: Sedjro Salomon Hotegni [view email]
[v1] Wed, 23 Aug 2023 16:42:27 UTC (5,598 KB)
[v2] Thu, 12 Oct 2023 15:06:50 UTC (5,777 KB)
[v3] Thu, 25 Jan 2024 17:15:41 UTC (8,057 KB)
[v4] Tue, 26 Mar 2024 15:12:19 UTC (8,057 KB)

Computer Science > Machine Learning

Title:Multi-Objective Optimization for Sparse Deep Multi-Task Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Multi-Objective Optimization for Sparse Deep Multi-Task Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators