DiVANet-PyTorch

This repository is the official PyTorch implementation of the paper "Single image super-resolution based on directional variance attention network".

Pattern Recognition, 2022. [Paper]

Abstract

Recent advances in single image super-resolution (SISR) explore the power of deep convolutional neural networks (CNNs) to achieve better performance. However, most of the progress has been made by scaling CNN architectures, which usually raises computational demands and memory consumption. This makes modern architectures less applicable in practice. In addition, most CNN-based SR methods do not fully utilize the informative hierarchical features that are helpful for final image recovery. In order to address these issues, we propose a directional variance attention network (DiVANet), a computationally efficient yet accurate network for SISR. Specifically, we introduce a novel directional variance attention (DiVA) mechanism to capture long-range spatial dependencies and exploit inter-channel dependencies simultaneously for more discriminative representations. Furthermore, we propose a residual attention feature group (RAFG) for parallelizing attention and residual block computation. The output of each residual block is linearly fused at the RAFG output to provide access to the whole feature hierarchy. In parallel, DiVA extracts the most relevant features from the network to improve the final output and prevent information loss along the successive operations inside the network. Experimental results demonstrate the superiority of DiVANet over the state of the art on several datasets, while maintaining a relatively low computation and memory footprint.

Requirements

Python 3
PyTorch (0.4.0), torchvision
Numpy, Scipy
Pillow, Scikit-image
h5py
importlib

Dataset

We use the DIV2K dataset for training and the Set5, Set14, B100, and Urban100 datasets for benchmark testing. Prepare the datasets as follows.

Download DIV2K and unzip it into the dataset directory as shown below:

dataset
└── DIV2K
    ├── DIV2K_train_HR
    ├── DIV2K_train_LR_bicubic
    ├── DIV2K_valid_HR
    └── DIV2K_valid_LR_bicubic
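
Before running the conversion step, it can help to confirm that the unzipped folders match the tree above. The following is a minimal sketch, not part of this repository: the helper name missing_div2k_dirs and the default root "dataset" are assumptions made for illustration, and only the directory names come from the tree above.

```python
from pathlib import Path

# Sub-directories listed in the tree above.
EXPECTED_DIRS = (
    "DIV2K_train_HR",
    "DIV2K_train_LR_bicubic",
    "DIV2K_valid_HR",
    "DIV2K_valid_LR_bicubic",
)


def missing_div2k_dirs(dataset_root="dataset"):
    """Return the expected DIV2K sub-directories that are not present.

    Hypothetical helper: the name and default root are illustrative only.
    """
    div2k = Path(dataset_root) / "DIV2K"
    return [name for name in EXPECTED_DIRS if not (div2k / name).is_dir()]


if __name__ == "__main__":
    missing = missing_div2k_dirs()
    if missing:
        print("Missing directories:", ", ".join(missing))
    else:
        print("DIV2K layout looks complete.")
```

An empty result means all four folders are in place.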

To accelerate training, we first convert the training images to h5 format as follows (the h5py module must be installed).

$ python div2h5.py

Other benchmark datasets can be downloaded from Google Drive. As with DIV2K, please put all the datasets in the dataset directory.

Testing

We provide the pretrained models in the checkpoints directory. To test DiVANet on a benchmark dataset:

# Scale factor x2
$ python sample.py --test_data_dir dataset/<dataset> --scale 2 --ckpt_path ./checkpoints/<path>.pth --sample_dir <sample_dir>

# Scale factor x3
$ python sample.py --test_data_dir dataset/<dataset> --scale 3 --ckpt_path ./checkpoints/<path>.pth --sample_dir <sample_dir>

# Scale factor x4
$ python sample.py --test_data_dir dataset/<dataset> --scale 4 --ckpt_path ./checkpoints/<path>.pth --sample_dir <sample_dir>
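
When evaluating all three scales, the commands above can be generated rather than typed by hand. This is a minimal sketch that uses only the flags shown above; the helper build_sample_command is hypothetical, the dataset folder name "Set5" is an assumption about how the benchmark was unpacked, and the checkpoint and output placeholders are left unfilled on purpose.

```python
import shlex


def build_sample_command(dataset, scale, ckpt_path, sample_dir):
    """Build the sample.py argument list using only the flags shown above.

    Hypothetical helper: it only assembles strings and runs nothing.
    """
    return [
        "python", "sample.py",
        "--test_data_dir", f"dataset/{dataset}",
        "--scale", str(scale),
        "--ckpt_path", ckpt_path,
        "--sample_dir", sample_dir,
    ]


if __name__ == "__main__":
    # Placeholder values: substitute the real checkpoint file and output folder.
    for scale in (2, 3, 4):
        cmd = build_sample_command(
            "Set5", scale, "./checkpoints/<path>.pth", "<sample_dir>"
        )
        print(shlex.join(cmd))
```

The returned list can be passed to subprocess.run once the placeholders are replaced with real paths.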

Training

Here are our settings for training DiVANet. Note: we use two GPUs to accommodate a large batch size; if an out-of-memory (OOM) error occurs, please reduce the batch size.

# Scale factor x2
$ python train.py --patch_size 64 --batch_size 64 --max_steps 600000 --lr 0.001 --decay 150000 --scale 2 

# Scale factor x3
$ python train.py --patch_size 64 --batch_size 64 --max_steps 600000 --lr 0.001 --decay 150000 --scale 3 

# Scale factor x4
$ python train.py --patch_size 64 --batch_size 64 --max_steps 600000 --lr 0.001 --decay 150000 --scale 4
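
To get a feel for how --lr, --decay, and --max_steps interact, the sketch below assumes a step-decay schedule in which the learning rate is multiplied by a fixed factor every --decay steps. Both that interpretation and the factor of 0.5 are assumptions made for illustration, not taken from train.py or solver.py; check those files for the actual rule.

```python
def lr_at_step(step, base_lr=0.001, decay=150000, factor=0.5):
    """Learning rate under an assumed step-decay schedule.

    Assumption: the rate is multiplied by `factor` once every `decay` steps.
    The factor of 0.5 is hypothetical and not confirmed by the repository.
    """
    return base_lr * (factor ** (step // decay))


if __name__ == "__main__":
    # Values mirror the training commands above: --lr 0.001 --decay 150000.
    for step in (0, 150000, 300000, 450000, 599999):
        print(step, lr_at_step(step))
```

Under these assumptions, the rate would be reduced three times over the 600000 training steps.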

Results

We achieved state-of-the-art performance on lightweight image SR, denoising, and deblurring. All visual results of DiVANet (BI, BD, and DN) for scale factors x2, x3, and x4 can be downloaded here.

Lightweight Single Image Super-Resolution

Image denoising and deblurring

Citation

@article{behjati2023single,
  title={Single image super-resolution based on directional variance attention network},
  author={Behjati, Parichehr and Rodriguez, Pau and Fern{\'a}ndez, Carles and Hupont, Isabelle and Mehri, Armin and Gonz{\`a}lez, Jordi},
  journal={Pattern Recognition},
  volume={133},
  pages={108997},
  year={2023},
  publisher={Elsevier}
}

Please also see our other works:

Frequency-Based Enhancement Network for Efficient Super-Resolution - IEEE ACCESS, 2022 - [Paper] [Code]
OverNet: Lightweight Multi-Scale Super-Resolution with Overscaling Network - WACV, 2022 - [Paper] [Code]
Hierarchical Residual Attention Network for Single Image Super-Resolution [arXiv]
