Federated Nearest Neighbor Machine Translation

Du, Yichao; Zhang, Zhirui; Wu, Bingzhe; Liu, Lemao; Xu, Tong; Chen, Enhong

Computer Science > Computation and Language

arXiv:2302.12211 (cs)

[Submitted on 23 Feb 2023]

Title:Federated Nearest Neighbor Machine Translation

Authors:Yichao Du, Zhirui Zhang, Bingzhe Wu, Lemao Liu, Tong Xu, Enhong Chen

View PDF

Abstract:To protect user privacy and meet legal regulations, federated learning (FL) is attracting significant attention. Training neural machine translation (NMT) models with traditional FL algorithm (e.g., FedAvg) typically relies on multi-round model-based interactions. However, it is impractical and inefficient for machine translation tasks due to the vast communication overheads and heavy synchronization. In this paper, we propose a novel federated nearest neighbor (FedNN) machine translation framework that, instead of multi-round model-based interactions, leverages one-round memorization-based interaction to share knowledge across different clients to build low-overhead privacy-preserving systems. The whole approach equips the public NMT model trained on large-scale accessible data with a $k$-nearest-neighbor ($$kNN) classifier and integrates the external datastore constructed by private text data in all clients to form the final FL model. A two-phase datastore encryption strategy is introduced to achieve privacy-preserving during this process. Extensive experiments show that FedNN significantly reduces computational and communication costs compared with FedAvg, while maintaining promising performance in different FL settings.

Comments:	ICLR 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2302.12211 [cs.CL]
	(or arXiv:2302.12211v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2302.12211

Submission history

From: Yichao Du [view email]
[v1] Thu, 23 Feb 2023 18:04:07 UTC (1,799 KB)

Computer Science > Computation and Language

Title:Federated Nearest Neighbor Machine Translation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Federated Nearest Neighbor Machine Translation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators