Hyper-Decision Transformer for Efficient Online Policy Adaptation

Xu, Mengdi; Lu, Yuchen; Shen, Yikang; Zhang, Shun; Zhao, Ding; Gan, Chuang

Computer Science > Machine Learning

arXiv:2304.08487 (cs)

[Submitted on 17 Apr 2023]


Abstract: Decision Transformers (DT) have demonstrated strong performance in offline reinforcement learning settings, but adapting them quickly to unseen tasks remains challenging. To address this challenge, we propose a new framework, the Hyper-Decision Transformer (HDT), which generalizes to novel tasks from a handful of demonstrations in a data- and parameter-efficient manner. To this end, we augment the base DT with an adaptation module whose parameters are initialized by a hyper-network. When encountering an unseen task, the hyper-network takes a handful of demonstrations as input and initializes the adaptation module accordingly. This initialization enables HDT to adapt efficiently to novel tasks by fine-tuning only the adaptation module. We validate HDT's generalization capability on object manipulation tasks. With a single expert demonstration and fine-tuning of only 0.5% of DT parameters, HDT adapts to unseen tasks faster than fine-tuning the whole DT model. Finally, we explore a more challenging setting in which expert actions are not available, and we show that HDT outperforms state-of-the-art baselines by a large margin in task success rate.
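The mechanism described in the abstract, a hyper-network that initializes a small adaptation module attached to a frozen base model, can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the adapter's form, so the residual bottleneck adapter, the linear hyper-network, the fixed-size demonstration embedding, and all names and sizes below (`HyperNetwork`, `adapted_layer`, `hidden`, `bottleneck`, `demo_dim`) are assumptions chosen for illustration. The sketch covers only initialization and parameter counting, not the fine-tuning step.

```python
import random


def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def rand_matrix(rows, cols, rng, scale=0.1):
    """Small random matrix standing in for learned weights."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def count(M):
    """Number of scalar entries in a matrix."""
    return sum(len(row) for row in M)


class HyperNetwork:
    """Hypothetical linear hyper-network: demonstration embedding -> adapter weights."""

    def __init__(self, demo_dim, hidden, bottleneck, rng):
        self.hidden, self.bottleneck = hidden, bottleneck
        n_out = 2 * hidden * bottleneck  # entries of the down and up projections
        self.W = rand_matrix(n_out, demo_dim, rng)

    def init_adapter(self, demo_embedding):
        flat = matvec(self.W, demo_embedding)
        h, b = self.hidden, self.bottleneck
        down = [flat[i * h:(i + 1) * h] for i in range(b)]  # b x h
        offset = b * h
        up = [flat[offset + i * b:offset + (i + 1) * b] for i in range(h)]  # h x b
        return {"down": down, "up": up}


def adapted_layer(base_W, adapter, x):
    """Frozen base layer followed by a residual bottleneck adapter (assumed form)."""
    h = matvec(base_W, x)
    z = [max(0.0, v) for v in matvec(adapter["down"], h)]  # ReLU bottleneck
    delta = matvec(adapter["up"], z)
    return [a + d for a, d in zip(h, delta)]


rng = random.Random(0)
hidden, bottleneck, demo_dim = 16, 2, 4  # toy sizes, not taken from the paper

base_W = rand_matrix(hidden, hidden, rng)  # stands in for frozen DT weights
hyper = HyperNetwork(demo_dim, hidden, bottleneck, rng)

# Stand-in for an encoded demonstration of an unseen task.
demo_embedding = [rng.uniform(-1.0, 1.0) for _ in range(demo_dim)]
adapter = hyper.init_adapter(demo_embedding)

x = [1.0] * hidden
y = adapted_layer(base_W, adapter, x)

trainable = count(adapter["down"]) + count(adapter["up"])
frozen = count(base_W)
print(f"trainable adapter parameters: {trainable}, frozen base parameters: {frozen}")
```

In this toy setting the adapter is a large share of the model because the base layer is tiny; the 0.5% figure in the abstract refers to the full DT, which the sketch does not attempt to reproduce. During adaptation, only the entries of `adapter` would receive gradient updates while `base_W` stays fixed.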

Comments:	ICLR 2023. Project page: this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2304.08487 [cs.LG]
	(or arXiv:2304.08487v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2304.08487

Submission history

From: Chuang Gan
[v1] Mon, 17 Apr 2023 17:59:32 UTC (6,821 KB)
