ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting

This repo contains the code for our paper ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting. In this work, we introduce ScaleBiO, the first scalable instantiation of first-order bilevel optimization algorithm, focusing on large-scale LLM data reweighting.

Latest News

[2024-07-30] Preview version - we release the demo data reweighting code and data for gpt2 and Yi-34B on alpaca and alpaca-gpt4 dataset.

Quick Start

Setup

pip install -r requirements.txt

Reweighting

export WANDB_API_KEY=<your_wandb_api_key> and turn on --use_wandb in the script

./run_gpt2.sh

./run_Yi34B.sh

Recommended Hardware Configuration

8x A40/A100 GPUs and 2TB Memory

Citation

If you find this repository useful, please consider giving ⭐ and citing our paper:

@article{pan2024scalebio,
  title={ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting},
  author={Pan, Rui and Zhang, Jipeng and Pan, Xingyuan and Pi, Renjie and Wang, Xiaoyu and Zhang, Tong},
  journal={arXiv preprint arXiv:2406.19976},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
conf		conf
data_tiny		data_tiny
python		python
LICENSE		LICENSE
README.md		README.md
fsdp_config.yaml		fsdp_config.yaml
fsdp_config_gpt2.yaml		fsdp_config_gpt2.yaml
requirements.txt		requirements.txt
run_Yi34B.sh		run_Yi34B.sh
run_gpt2.sh		run_gpt2.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting

Latest News

Quick Start

Setup

Reweighting

Recommended Hardware Configuration

Citation

About

Releases

Packages

Contributors 3

Languages

License

2003pro/ScaleBiO

Folders and files

Latest commit

History

Repository files navigation

ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting

Latest News

Quick Start

Setup

Reweighting

Recommended Hardware Configuration

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages