U-shape Transformer

This repository is the official PyTorch implementation of U-shape Transformer for Underwater Image Enhancement. (arxiv, Dataset, video demo, visual results). U-shape Transformer achieves state-of-the-art performance in underwater image enhancement task.

🚀 🚀 🚀 News:

2021/11/25 We released our pretrained model, You can download the pretrain models in BaiduYun with the password tdg9 or in Google Drive.
2021/11/24 We released the official code of U-shape Transformer
2021/11/23 We released LSUI dataset, We released a large-scale underwater image (LSUI) dataset including 5004 image pairs, which involve richer underwater scenes (lighting conditions, water types and target categories) and better visual quality reference images than the existing ones. You can download it from [here]. Please contact [email protected] to obtain the download password, and leave your name and organization, we will reply within 48 hours.

The light absorption and scattering of underwater impurities lead to poor underwater imaging quality. The existing data-driven based underwater image enhancement (UIE) techniques suffer from the lack of a large-scale dataset containing various underwater scenes and high-fidelity reference images. Besides, the inconsistent attenuation in different color channels and space areas is not fully considered for boosted enhancement. In this work, we constructed a large-scale underwater image (LSUI) dataset including 5004 image pairs, and reported an U-shape Transformer network where the transformer model is for the first time introduced to the UIE task. The U-shape Transformer is integrated with a channel-wise multi-scale feature fusion transformer (CMSFFT) module and a spatial-wise global feature modeling transformer (SGFMT) module, which reinforce the network's attention to the color channels and space areas with more serious attenuation. Meanwhile, in order to further improve the contrast and saturation, a novel loss function combining RGB, LAB and LCH color spaces is designed following the human vision principle. The extensive experiments on available datasets validate the state-of-the-art performance of the reported technique with more than 2dB superiority.

Training

If you need to train our U-shape transformer from scratch, you need to download our dataset from LSUI (Please contact [email protected] to obtain the download password, and leave your name and organization, we will reply within 48 hours.), and then randomly select 4500 picture pairs as the training set to replace the data folder, and the remaining 504 as the test set to replace the test folder.

Then, run the train.ipynb file with Jupiter notebook, and the trained model weight file will be automatically saved in saved_ Models folder. As described in the paper, we recommend you use L2 loss for the first 600 epochs and L1 loss for the last 200 epochs.

Environmental requirements:

Python 3.7 or a newer version
Pytorch 1.7 0r a newer version
CUDA 10.1 or a newer version
OpenCV 4.5.3 or a newer version
Jupyter Notebook

Or you can install from the requirements.txt using

pip install -r requirements.txt

Testing

For your convience, we provide some example datasets (~20Mb) in ./test. You can download the pretrain models in BaiduYun with the password tdg9 or in Google Drive.

After downloading, extract the pretrained model into the project folder and replace the ./saved_models folder, and then run test.ipynb. The code will use the pretrained model to automatically process all the images in the ./test/input folder and output the results to the ./test/output folder. In addition, the output result will automatically calculate the PSNR value with the reference image.

Results

We achieved state-of-the-art performance on underwater image enhancement task. Detailed results can be found in the paper or our project page

Full-Reference Evaluation (click me)

Non-reference Evaluation

Citation

@misc{peng2021ushape,
  title={U-shape Transformer for Underwater Image Enhancement}, 
  author={Lintao Peng and Chunli Zhu and Liheng Bian},
  year={2021},
  eprint={2111.11843},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
  }

License and Acknowledgement

This project is released under the MIT license. The codes are designed based on pix2pix. We also refer to codes in UCTransNet and TransBTS. Please also follow their licenses. Thanks for their awesome works.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
figs		figs
images/results		images/results
loss		loss
net		net
pytorch-ssim-loss		pytorch-ssim-loss
pytorch_ssim		pytorch_ssim
test		test
utility		utility
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
test.ipynb		test.ipynb
train.ipynb		train.ipynb
trainL1.ipynb		trainL1.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

U-shape Transformer

Contents

Training

Testing

Results

Citation

License and Acknowledgement

About

Releases

Packages

Languages

License

lilubit/U-shape_Transformer_for_Underwater_Image_Enhancement

Folders and files

Latest commit

History

Repository files navigation

U-shape Transformer

Contents

Training

Testing

Results

Citation

License and Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages