DOCK: Detecting Objects by transferring Common-sense Knowledge

Singh, Krishna Kumar; Divvala, Santosh; Farhadi, Ali; Lee, Yong Jae

Computer Science > Computer Vision and Pattern Recognition

arXiv:1804.01077 (cs)

[Submitted on 3 Apr 2018 (v1), last revised 31 Jul 2018 (this version, v2)]

Title:DOCK: Detecting Objects by transferring Common-sense Knowledge

Authors:Krishna Kumar Singh, Santosh Divvala, Ali Farhadi, Yong Jae Lee

View PDF

Abstract:We present a scalable approach for Detecting Objects by transferring Common-sense Knowledge (DOCK) from source to target categories. In our setting, the training data for the source categories have bounding box annotations, while those for the target categories only have image-level annotations. Current state-of-the-art approaches focus on image-level visual or semantic similarity to adapt a detector trained on the source categories to the new target categories. In contrast, our key idea is to (i) use similarity not at the image-level, but rather at the region-level, and (ii) leverage richer common-sense (based on attribute, spatial, etc.) to guide the algorithm towards learning the correct detections. We acquire such common-sense cues automatically from readily-available knowledge bases without any extra human effort. On the challenging MS COCO dataset, we find that common-sense knowledge can substantially improve detection performance over existing transfer-learning baselines.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1804.01077 [cs.CV]
	(or arXiv:1804.01077v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1804.01077
Journal reference:	ECCV, 2018

Submission history

From: Krishna Kumar Singh [view email]
[v1] Tue, 3 Apr 2018 17:41:53 UTC (7,528 KB)
[v2] Tue, 31 Jul 2018 06:42:30 UTC (8,376 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DOCK: Detecting Objects by transferring Common-sense Knowledge

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DOCK: Detecting Objects by transferring Common-sense Knowledge

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators