Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Jain, Ashesh; Zamir, Amir R.; Savarese, Silvio; Saxena, Ashutosh

Computer Science > Computer Vision and Pattern Recognition

arXiv:1511.05298 (cs)

[Submitted on 17 Nov 2015 (v1), last revised 11 Apr 2016 (this version, v3)]

Title:Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Authors:Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena

View PDF

Abstract:Deep Recurrent Neural Network architectures, though remarkably capable at modeling sequences, lack an intuitive high-level spatio-temporal structure. That is while many problems in computer vision inherently have an underlying high-level structure and can benefit from it. Spatio-temporal graphs are a popular tool for imposing such high-level intuitions in the formulation of real world problems. In this paper, we propose an approach for combining the power of high-level spatio-temporal graphs and sequence learning success of Recurrent Neural Networks~(RNNs). We develop a scalable method for casting an arbitrary spatio-temporal graph as a rich RNN mixture that is feedforward, fully differentiable, and jointly trainable. The proposed method is generic and principled as it can be used for transforming any spatio-temporal graph through employing a certain set of well defined steps. The evaluations of the proposed approach on a diverse set of problems, ranging from modeling human motion to object interactions, shows improvement over the state-of-the-art with a large margin. We expect this method to empower new approaches to problem formulation through high-level spatio-temporal graphs and Recurrent Neural Networks.

Comments:	CVPR 2016 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
Cite as:	arXiv:1511.05298 [cs.CV]
	(or arXiv:1511.05298v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1511.05298

Submission history

From: Ashesh Jain [view email]
[v1] Tue, 17 Nov 2015 07:49:58 UTC (4,165 KB)
[v2] Fri, 20 Nov 2015 01:26:23 UTC (4,165 KB)
[v3] Mon, 11 Apr 2016 19:00:24 UTC (2,823 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators