Harmonizer: Learning to Perform White-Box Image and Video Harmonization

Ke, Zhanghan; Sun, Chunyi; Zhu, Lei; Xu, Ke; Lau, Rynson W. H.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2207.01322 (cs)

[Submitted on 4 Jul 2022 (v1), last revised 20 Jul 2022 (this version, v2)]

Title:Harmonizer: Learning to Perform White-Box Image and Video Harmonization

Authors:Zhanghan Ke, Chunyi Sun, Lei Zhu, Ke Xu, Rynson W.H. Lau

View PDF

Abstract:Recent works on image harmonization solve the problem as a pixel-wise image translation task via large autoencoders. They have unsatisfactory performances and slow inference speeds when dealing with high-resolution images. In this work, we observe that adjusting the input arguments of basic image filters, e.g., brightness and contrast, is sufficient for humans to produce realistic images from the composite ones. Hence, we frame image harmonization as an image-level regression problem to learn the arguments of the filters that humans use for the task. We present a Harmonizer framework for image harmonization. Unlike prior methods that are based on black-box autoencoders, Harmonizer contains a neural network for filter argument prediction and several white-box filters (based on the predicted arguments) for image harmonization. We also introduce a cascade regressor and a dynamic loss strategy for Harmonizer to learn filter arguments more stably and precisely. Since our network only outputs image-level arguments and the filters we used are efficient, Harmonizer is much lighter and faster than existing methods. Comprehensive experiments demonstrate that Harmonizer surpasses existing methods notably, especially with high-resolution inputs. Finally, we apply Harmonizer to video harmonization, which achieves consistent results across frames and 56 fps at 1080P resolution. Code and models are available at: this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2207.01322 [cs.CV]
	(or arXiv:2207.01322v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.01322

Submission history

From: Zhanghan Ke [view email]
[v1] Mon, 4 Jul 2022 10:59:33 UTC (16,118 KB)
[v2] Wed, 20 Jul 2022 09:42:46 UTC (16,121 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Harmonizer: Learning to Perform White-Box Image and Video Harmonization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Harmonizer: Learning to Perform White-Box Image and Video Harmonization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators