FedVLN: Privacy-preserving Federated Vision-and-Language Navigation

Zhou, Kaiwen; Wang, Xin Eric

arXiv:2203.14936v3 (cs)

[Submitted on 28 Mar 2022 (v1), last revised 23 Sep 2022 (this version, v3)]

Abstract: Data privacy is a central problem for embodied agents that can perceive the environment, communicate with humans, and act in the real world. While helping humans complete tasks, the agent may observe and process sensitive information of users, such as house environments, human activities, etc. In this work, we introduce privacy-preserving embodied agent learning for the task of Vision-and-Language Navigation (VLN), where an embodied agent navigates house environments by following natural language instructions. We view each house environment as a local client, which shares nothing other than local updates with the cloud server and other clients, and propose a novel federated vision-and-language navigation (FedVLN) framework to protect data privacy during both training and pre-exploration. Particularly, we propose a decentralized training strategy to limit the data of each client to its local model training and a federated pre-exploration method to do partial model aggregation to improve model generalizability to unseen environments. Extensive results on R2R and RxR datasets show that under our FedVLN framework, decentralized VLN models achieve comparable results with centralized training while protecting seen environment privacy, and federated pre-exploration significantly outperforms centralized pre-exploration while preserving unseen environment privacy.
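
The abstract describes clients that share only local updates with a server, and a pre-exploration step that aggregates only part of the model. A minimal sketch of that general idea follows, assuming a size-weighted average in the style of federated averaging. The abstract does not specify the aggregation rule or which parameters are shared, so the function `aggregate_updates`, the parameter names `language_encoder` and `visual_encoder`, and the choice of shared keys are all hypothetical illustrations, not the authors' method.

```python
from typing import Dict, List, Set

# Hypothetical representation: parameter name -> flat list of weights.
Params = Dict[str, List[float]]


def aggregate_updates(
    global_params: Params,
    client_updates: List[Params],
    client_sizes: List[int],
    shared_keys: Set[str],
) -> Params:
    """Add the size-weighted average of client updates to the shared parameters.

    Parameters whose names are not in `shared_keys` are left unchanged,
    which is one simple way to model partial aggregation.
    """
    total = sum(client_sizes)
    # Copy so the caller's global parameters are not mutated.
    new_params = {name: list(values) for name, values in global_params.items()}
    for name in shared_keys:
        for i in range(len(new_params[name])):
            avg_delta = sum(
                update[name][i] * size / total
                for update, size in zip(client_updates, client_sizes)
            )
            new_params[name][i] += avg_delta
    return new_params


# Toy example: two clients (house environments) send updates, never raw data.
global_params = {"language_encoder": [0.0, 0.0], "visual_encoder": [1.0, 1.0]}
client_updates = [
    {"language_encoder": [1.0, 2.0], "visual_encoder": [0.5, 0.5]},
    {"language_encoder": [3.0, 4.0], "visual_encoder": [-0.5, 0.5]},
]
client_sizes = [1, 3]  # assumed local dataset sizes, used as averaging weights

result = aggregate_updates(
    global_params, client_updates, client_sizes, shared_keys={"language_encoder"}
)
print(result)
```

In this toy run only the `language_encoder` entries change on the server; the `visual_encoder` entries stay as they were, since they are excluded from aggregation.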

Comments:	Accepted by ECCV 2022
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2203.14936 [cs.AI]
	(or arXiv:2203.14936v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2203.14936

Submission history

From: Kaiwen Zhou
[v1] Mon, 28 Mar 2022 17:43:35 UTC (3,590 KB)
[v2] Tue, 26 Jul 2022 23:05:24 UTC (5,173 KB)
[v3] Fri, 23 Sep 2022 22:11:05 UTC (10,350 KB)
