Tuning computer vision models with task rewards

Pinto, André Susano; Kolesnikov, Alexander; Shi, Yuge; Beyer, Lucas; Zhai, Xiaohua

Computer Science > Computer Vision and Pattern Recognition

arXiv:2302.08242v1 (cs)

[Submitted on 16 Feb 2023]

Title:Tuning computer vision models with task rewards

Authors:André Susano Pinto, Alexander Kolesnikov, Yuge Shi, Lucas Beyer, Xiaohua Zhai

View PDF

Abstract:Misalignment between model predictions and intended usage can be detrimental for the deployment of computer vision models. The issue is exacerbated when the task involves complex structured outputs, as it becomes harder to design procedures which address this misalignment. In natural language processing, this is often addressed using reinforcement learning techniques that align models with a task reward. We adopt this approach and show its surprising effectiveness across multiple computer vision tasks, such as object detection, panoptic segmentation, colorization and image captioning. We believe this approach has the potential to be widely useful for better aligning models with a diverse range of computer vision tasks.

Comments:	11 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2302.08242 [cs.CV]
	(or arXiv:2302.08242v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2302.08242

Submission history

From: André Susano Pinto [view email]
[v1] Thu, 16 Feb 2023 11:49:48 UTC (5,076 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2023-02

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Tuning computer vision models with task rewards

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Tuning computer vision models with task rewards

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators