RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models

Huang, Jie; Ping, Wei; Xu, Peng; Shoeybi, Mohammad; Chang, Kevin Chen-Chuan; Catanzaro, Bryan

arXiv:2308.07922 (cs)

[Submitted on 15 Aug 2023 (v1), last revised 19 Aug 2024 (this version, v3)]

Abstract: In this paper, we investigate the in-context learning ability of retrieval-augmented encoder-decoder language models. We first conduct a comprehensive analysis of existing models and identify their limitations in in-context learning, primarily due to a mismatch between pretraining and inference, as well as a restricted context length. To address these issues, we propose RAVEN, a model that combines retrieval-augmented masked language modeling and prefix language modeling. We further introduce Fusion-in-Context Learning to enhance the few-shot performance by enabling the model to leverage more in-context examples without requiring additional training. Through extensive experiments, we demonstrate that our simple yet effective design significantly improves performance, achieving results comparable to the most advanced language models in certain scenarios, despite having substantially fewer parameters. Our work underscores the potential of retrieval-augmented encoder-decoder language models for in-context learning and encourages further research in this direction.

Comments:	COLM 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2308.07922 [cs.CL]
	(or arXiv:2308.07922v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2308.07922

Submission history

From: Jie Huang
[v1] Tue, 15 Aug 2023 17:59:18 UTC (1,642 KB)
[v2] Mon, 1 Apr 2024 06:32:12 UTC (1,687 KB)
[v3] Mon, 19 Aug 2024 05:46:56 UTC (1,687 KB)
