PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

Mao, Hangyu; Zhao, Rui; Li, Ziyue; Xu, Zhiwei; Chen, Hao; Chen, Yiqun; Zhang, Bin; Xiao, Zhen; Zhang, Junge; Yin, Jiangjin

Computer Science > Machine Learning

arXiv:2312.15863v1 (cs)

[Submitted on 26 Dec 2023]

Title:PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

Authors:Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiangjin Yin

View PDF HTML (experimental)

Abstract:Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL. This work studies the former. Specifically, the Perception and Decision-making Interleaving Transformer (PDiT) network is proposed, which cascades two Transformers in a very natural way: the perceiving one focuses on \emph{the environmental perception} by processing the observation at the patch level, whereas the deciding one pays attention to \emph{the decision-making} by conditioning on the history of the desired returns, the perceiver's outputs, and the actions. Such a network design is generally applicable to a lot of deep RL settings, e.g., both the online and offline RL algorithms under environments with either image observations, proprioception observations, or hybrid image-language observations. Extensive experiments show that PDiT can not only achieve superior performance than strong baselines in different settings but also extract explainable feature representations. Our code is available at \url{this https URL}.

Comments:	Proc. of the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024, full paper with oral presentation). Cover our preliminary study: arXiv:2212.14538
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO); Systems and Control (eess.SY)
Cite as:	arXiv:2312.15863 [cs.LG]
	(or arXiv:2312.15863v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.15863

Submission history

From: Hangyu Mao [view email]
[v1] Tue, 26 Dec 2023 03:07:10 UTC (1,476 KB)

Computer Science > Machine Learning

Title:PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators