Enhancing Data Privacy in Large Language Models through Private Association Editing

Venditti, Davide; Ruzzetti, Elena Sofia; Xompero, Giancarlo A.; Giannone, Cristina; Favalli, Andrea; Romagnoli, Raniero; Zanzotto, Fabio Massimo

arXiv:2406.18221 (cs)

[Submitted on 26 Jun 2024 (v1), last revised 16 Oct 2024 (this version, v3)]

Abstract: Large language models (LLMs) require a significant redesign in solutions to preserve privacy in data-intensive applications due to their text-generation capabilities. Indeed, LLMs tend to memorize and emit private information when maliciously prompted. In this paper, we introduce Private Association Editing (PAE) as a novel defense approach for private data leakage. PAE is designed to effectively remove Personally Identifiable Information (PII) without retraining the model. Experimental results demonstrate the effectiveness of PAE with respect to alternative baseline methods. We believe PAE will serve as a critical tool in the ongoing effort to protect data privacy in LLMs, encouraging the development of safer models for real-world applications.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.18221 [cs.CL]
	(or arXiv:2406.18221v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.18221

Submission history

From: Elena Sofia Ruzzetti
[v1] Wed, 26 Jun 2024 10:08:47 UTC (8,466 KB)
[v2] Thu, 15 Aug 2024 19:30:09 UTC (8,515 KB)
[v3] Wed, 16 Oct 2024 13:31:05 UTC (8,670 KB)
