[CVPR2022] Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer

Fushun Zhu¹, Shan Zhao², Peng Wang², Hao Wang², Hua Yan¹, Shuaicheng Liu³,¹

1. Sichuan University, 2. Megvii Technology

3. University of Electronic Science and Technology of China

This is the official implementation of Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer (CVPR 2022).

Abstract

We propose a semi-supervised network for wide-angle portraits correction. Wide-angle images often suffer from skew and distortion due to perspective projection, which is especially noticeable in face regions. Previous deep-learning-based approaches need ground-truth correction flow maps for training guidance. However, such labels are expensive because they can only be obtained manually. In this work, we design a semi-supervised scheme and build a high-quality unlabeled dataset with rich scenarios, allowing us to use labeled and unlabeled data simultaneously to improve performance. Specifically, our semi-supervised scheme takes advantage of the consistency mechanism, with several novel components such as direction and range consistency (DRC) and regression consistency (RC). Furthermore, unlike existing methods, we propose the Multi-Scale Swin-Unet (MS-Unet) based on the multi-scale swin transformer block (MSTB), which can simultaneously learn short-distance and long-distance information to avoid artifacts. Extensive experiments demonstrate that the proposed method is superior to state-of-the-art methods and other representative baselines.
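As a rough illustration of how labeled and unlabeled data can be combined through a consistency term, here is a minimal generic sketch. It is not the DRC or RC formulation from the paper: the flow representation, the function names `mean_squared_flow_error` and `semi_supervised_loss`, and the `consistency_weight` parameter are all assumptions made for this example.

```python
from typing import List, Tuple

# Assumed representation: a flow map flattened to one (dx, dy) pair per pixel.
Flow = List[Tuple[float, float]]


def mean_squared_flow_error(pred: Flow, target: Flow) -> float:
    """Average squared Euclidean distance between two flow maps."""
    if len(pred) != len(target):
        raise ValueError("flow maps must have the same number of pixels")
    total = sum(
        (px - tx) ** 2 + (py - ty) ** 2
        for (px, py), (tx, ty) in zip(pred, target)
    )
    return total / len(pred)


def semi_supervised_loss(
    pred_labeled: Flow,
    gt_labeled: Flow,
    pred_unlabeled_a: Flow,
    pred_unlabeled_b: Flow,
    consistency_weight: float = 0.5,  # hypothetical weighting, not from the paper
) -> float:
    """Supervised loss on labeled data plus a weighted consistency loss.

    The consistency term penalizes disagreement between two predictions
    for the same unlabeled image (for example, from two model branches).
    """
    supervised = mean_squared_flow_error(pred_labeled, gt_labeled)
    consistency = mean_squared_flow_error(pred_unlabeled_a, pred_unlabeled_b)
    return supervised + consistency_weight * consistency
```

The supervised term needs ground-truth flow, while the consistency term only needs two predictions for the same unlabeled image, which is what lets unlabeled data contribute to training.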

Presentation Video

[Youtube], [Bilibili]

The pipeline of the semi-supervised wide-angle portraits correction framework with the surrogate task (segmentation)

Note

In this repository, we will release the unlabeled dataset and MegDL implementation of our paper.

Quick Start

All code is tested on Linux.

Installation

Clone the repository
Install dependencies

Dataset

UltraWidePortraits2022
- Unlabeled wide-angle portraits correction dataset
- Download from [GoogleDrive] or [BaiduCloud] (extraction code: 79cw)

Pre-trained model

Training

Testing

Results

Citation

If you find this work helpful, please cite:

@inproceedings{zhu2022semi,
  title={Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer},
  author={Zhu, Fushun and Zhao, Shan and Wang, Peng and Wang, Hao and Yan, Hua and Liu, Shuaicheng},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={19689--19698},
  year={2022}
}


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[CVPR2022] Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer

Fushun Zhu¹, Shan Zhao², Peng Wang², Hao Wang², Hua Yan¹, Shuaicheng Liu^3,1

1. Sichuan University, 2. Megvii Technology

3. University of Electronic Science and Technology of China

Abstract

Presentation Video

Note

Quick Start

Installation

Dataset

Pre-trained model

Training

Testing

Results

Citation

About

Releases

Packages

Contributors 3

Languages

megvii-research/Portraits_Correction

Folders and files

Latest commit

History

Repository files navigation

[CVPR2022] Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer

Fushun Zhu1, Shan Zhao2, Peng Wang2, Hao Wang2, Hua Yan1, Shuaicheng Liu3,1

1. Sichuan University, 2. Megvii Technology

3. University of Electronic Science and Technology of China

Abstract

Presentation Video

Note

Quick Start

Installation

Dataset

Pre-trained model

Training

Testing

Results

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Fushun Zhu¹, Shan Zhao², Peng Wang², Hao Wang², Hua Yan¹, Shuaicheng Liu^3,1

Packages