Improving Low-Latency Predictions in Multi-Exit Neural Networks via Block-Dependent Losses

This repo contains the PyTorch implementation of our paper, "Improving Low-Latency Predictions in Multi-Exit Neural Networks via Block-Dependent Losses", published in IEEE Transactions on Neural Networks and Learning Systems (TNNLS).

Abstract: As the size of a model increases, making predictions using deep neural networks (DNNs) is becoming more computationally expensive. Multi-exit neural networks are one promising solution: they can flexibly make anytime predictions via early exits, depending on the current test-time budget, which may vary over time in practice (e.g., self-driving cars with dynamically changing speeds). However, the prediction performance at the earlier exits is generally much lower than at the final exit, which becomes a critical issue in low-latency applications with a tight test-time budget. Unlike previous works, in which each block is optimized to minimize the losses of all exits simultaneously, we propose a new method for training multi-exit neural networks by strategically imposing different objectives on individual blocks. The proposed idea, based on grouping and overlapping strategies, improves the prediction performance at the earlier exits without degrading the performance of later ones, making our scheme more suitable for low-latency applications. Extensive experimental results on both image classification and semantic segmentation confirm the advantage of our approach. The proposed idea does not require any modifications to the model architecture and can be easily combined with existing strategies that aim to improve the performance of multi-exit neural networks.
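
To make the contrast in the abstract concrete, the sketch below compares which exit losses each block would be trained on under the conventional setup (every block sees all later exits) and under a grouped, overlapping assignment. It is an illustration only: the function names, the `group_size` and `overlap` parameters, and the exact grouping rule are assumptions made for this example and are not taken from the paper or from this repository's code.

```python
def conventional_assignment(num_exits):
    """Conventional training: block b is optimized for the losses of
    every exit at or after it (exits b, b+1, ..., num_exits - 1)."""
    return [list(range(b, num_exits)) for b in range(num_exits)]


def block_dependent_assignment(num_exits, group_size, overlap=0):
    """Hypothetical grouping rule (an assumption for illustration):
    blocks are split into consecutive groups of `group_size`, and block b
    is optimized only for the exits from b up to the end of its own group,
    plus `overlap` extra exits borrowed from the following group(s)."""
    if num_exits < 1 or group_size < 1 or overlap < 0:
        raise ValueError("num_exits and group_size must be >= 1, overlap >= 0")
    assignment = []
    for b in range(num_exits):
        group_end = (b // group_size + 1) * group_size  # exclusive bound
        last = min(num_exits, group_end + overlap)      # cap at final exit
        assignment.append(list(range(b, last)))
    return assignment


if __name__ == "__main__":
    print("conventional:   ", conventional_assignment(4))
    print("grouped:        ", block_dependent_assignment(4, group_size=2))
    print("grouped+overlap:", block_dependent_assignment(4, group_size=2, overlap=1))
```

Under this assumed rule, early blocks are no longer asked to serve every later exit, which is the intuition behind improving early-exit accuracy. Setting `group_size` equal to the number of exits recovers the conventional assignment.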

Requirements

This code was tested in the following environment:

Ubuntu 18.04
Python 3.7.13
PyTorch 1.12.0
CUDA 11.6

You can install all necessary packages from requirements.txt (or you can use environment.yml in the official code of MSDNet).

pip install -r requirements.txt
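
After installing, it can help to confirm that the active environment roughly matches the versions listed above. The snippet below is a minimal sketch of such a check; the helper name `describe_environment` is hypothetical and is not part of this repository. It reports `None` for PyTorch and CUDA if `torch` is not installed.

```python
import importlib.util
import platform


def describe_environment():
    """Collect the Python version and, if PyTorch is installed,
    the PyTorch version and the CUDA version it was built against."""
    info = {"python": platform.python_version(), "torch": None, "cuda": None}
    # Only import torch if it can be found, so the check never crashes
    # on a machine where the requirements are not installed yet.
    if importlib.util.find_spec("torch") is not None:
        import torch
        info["torch"] = torch.__version__
        info["cuda"] = torch.version.cuda  # None for CPU-only builds
    return info


if __name__ == "__main__":
    for name, version in describe_environment().items():
        print(f"{name}: {version}")
```

Compare the printed values with the tested versions (Python 3.7.13, PyTorch 1.12.0, CUDA 11.6); other versions may work but were not listed as tested.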

Experiments

Experiments can be conducted on two image classification datasets: CIFAR-100 and ImageNet.

How to Run

All parameters required for the experiments are described in args.py; see that file for a detailed description of each parameter.
We provide all training options (for our method and for the baselines) in train.sh and train_imagenet.sh.

# CIFAR-100 dataset

bash train.sh

# ImageNet dataset

bash train_imagenet.sh

Citation

To cite our paper, please use the following BibTeX entry.

@article{han2023improving,
  title={Improving Low-Latency Predictions in Multi-Exit Neural Networks via Block-Dependent Losses},
  author={Han, Dong-Jun and Park, Jungwuk and Ham, Seokil and Lee, Namjin and Moon, Jaekyun},
  journal={IEEE Transactions on Neural Networks and Learning Systems},
  year={2023},
  publisher={IEEE}
}

Acknowledgement

Our code is built upon the implementations at https://github.com/kalviny/MSDNet-PyTorch
