You can't pick your neighbors, or can you? When and how to rely on retrieval in the $k$NN-LM

Drozdov, Andrew; Wang, Shufan; Rahimi, Razieh; McCallum, Andrew; Zamani, Hamed; Iyyer, Mohit

Computer Science > Computation and Language

arXiv:2210.15859 (cs)

[Submitted on 28 Oct 2022]

Title:You can't pick your neighbors, or can you? When and how to rely on retrieval in the $k$NN-LM

Authors:Andrew Drozdov, Shufan Wang, Razieh Rahimi, Andrew McCallum, Hamed Zamani, Mohit Iyyer

View PDF

Abstract:Retrieval-enhanced language models (LMs), which condition their predictions on text retrieved from large external datastores, have recently shown significant perplexity improvements compared to standard LMs. One such approach, the $k$NN-LM, interpolates any existing LM's predictions with the output of a $k$-nearest neighbors model and requires no additional training. In this paper, we explore the importance of lexical and semantic matching in the context of items retrieved by $k$NN-LM. We find two trends: (1) the presence of large overlapping $n$-grams between the datastore and evaluation set plays an important factor in strong performance, even when the datastore is derived from the training data; and (2) the $k$NN-LM is most beneficial when retrieved items have high semantic similarity with the query. Based on our analysis, we define a new formulation of the $k$NN-LM that uses retrieval quality to assign the interpolation coefficient. We empirically measure the effectiveness of our approach on two English language modeling datasets, Wikitext-103 and PG-19. Our re-formulation of the $k$NN-LM is beneficial in both cases, and leads to nearly 4% improvement in perplexity on the Wikitext-103 test set.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2210.15859 [cs.CL]
	(or arXiv:2210.15859v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.15859

Submission history

From: Andrew Drozdov [view email]
[v1] Fri, 28 Oct 2022 02:57:40 UTC (5,707 KB)

Computer Science > Computation and Language

Title:You can't pick your neighbors, or can you? When and how to rely on retrieval in the $k$NN-LM

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:You can't pick your neighbors, or can you? When and how to rely on retrieval in the $k$NN-LM

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators