LAB: Large-Scale Alignment for ChatBots

Sudalairaj, Shivchander; Bhandwaldar, Abhishek; Pareja, Aldo; Xu, Kai; Cox, David D.; Srivastava, Akash

Computer Science > Computation and Language

arXiv:2403.01081 (cs)

[Submitted on 2 Mar 2024 (v1), last revised 29 Apr 2024 (this version, v3)]

Title:LAB: Large-Scale Alignment for ChatBots

Authors:Shivchander Sudalairaj, Abhishek Bhandwaldar, Aldo Pareja, Kai Xu, David D. Cox, Akash Srivastava

View PDF HTML (experimental)

Abstract:This work introduces LAB (Large-scale Alignment for chatBots), a novel methodology designed to overcome the scalability challenges in the instruction-tuning phase of large language model (LLM) training. Leveraging a taxonomy-guided synthetic data generation process and a multi-phase tuning framework, LAB significantly reduces reliance on expensive human annotations and proprietary models like GPT-4. We demonstrate that LAB-trained models can achieve competitive performance across several benchmarks compared to models trained with traditional human-annotated or GPT-4 generated synthetic data. Thus offering a scalable, cost-effective solution for enhancing LLM capabilities and instruction-following behaviors without the drawbacks of catastrophic forgetting, marking a step forward in the efficient training of LLMs for a wide range of applications.

Comments:	Corresponding Author: Akash Srivastava. Equal Contribution: Shivchander Sudalairaj, Abhishek Bhandwaldar, Aldo Pareja, Akash Srivastava, Code: this https URL
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2403.01081 [cs.CL]
	(or arXiv:2403.01081v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.01081

Submission history

From: Akash Srivastava [view email]
[v1] Sat, 2 Mar 2024 03:48:37 UTC (1,468 KB)
[v2] Wed, 6 Mar 2024 22:25:44 UTC (1,468 KB)
[v3] Mon, 29 Apr 2024 18:55:34 UTC (1,468 KB)

Computer Science > Computation and Language

Title:LAB: Large-Scale Alignment for ChatBots

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LAB: Large-Scale Alignment for ChatBots

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators