
MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models

This is the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".

Directory structure

  • base_model: the MeteoRA model.
  • data: the datasets and dataset-processing code.
  • eval: the evaluation code and results.
  • MoELoRA: the MeteoRA module and the adapted PEFT code.

Usage

Preparation

  1. Install necessary packages:
pip install -r requirements.txt
  2. Download all of the BIG-bench benchmark tasks in JSON format (refer to the BIG-bench repository).
  3. Set the bigbench_dataset_dir path in configs/config.yaml.
  4. Prepare the datasets:
cd data
python create_dataset.py --task all

If you just want to create a specific dataset, run:

cd data
python create_dataset.py --task <task_name>
  5. Prepare the composite-n tasks:
python create_composite.py --n <n>

We provide few-shot dataset generation code for n=3, n=5, and n=10. Before generating a composite-n task, make sure that all of its sub-tasks are already included in data/datasets.

  6. Prepare the LoRA adapter checkpoints and the MeteoRA model checkpoint. You can train them yourself or download ours (with LLaMA2 and LLaMA3 as base models) by running:
python download_ckpt.py
  7. Set the remaining paths in configs/config.yaml. Example:
base_model_path: 'meta-llama3/Meta-Llama-3-8B'
meteora_ckpt_path: 'ckpt/llama3_8b/llama3_8b_meteora/top_2'
adapter_dir: 'ckpt/llama3_8b/llama3_8b_peft'
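After editing the config, a quick sanity check like the one below can confirm that every configured path resolves. This is a minimal sketch, not part of the repository; the key names are taken from the example above and from step 3, so adjust them if your config uses different ones.

# check_config.py -- a minimal sketch (not part of this repository) that loads
# configs/config.yaml and reports whether each configured path exists locally.
import os
import yaml  # PyYAML

with open("configs/config.yaml") as f:
    cfg = yaml.safe_load(f)

# Key names follow the example above; base_model_path may be a Hugging Face
# model id rather than a local directory, in which case "not found locally" is expected.
for key in ("base_model_path", "meteora_ckpt_path", "adapter_dir", "bigbench_dataset_dir"):
    value = cfg.get(key)
    status = "found" if value and os.path.exists(value) else "not found locally"
    print(f"{key}: {value} ({status})")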

Evaluation

Running a benchmark with the MeteoRA model:

python eval_model.py --task <task_name> --batch_size <batch_size> 

For example:

python eval_model.py --task composite_10 --batch_size 4 

Note: To run a composite-n task, set a larger temperature value (self.T in MoELoRA/layer.py). As a reference, use 15, 20, and 30 for n=3, n=5, and n=10, respectively.
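For intuition, MeteoRA routes each token to LoRA adapters through a temperature-controlled top-k softmax gate. The sketch below is illustrative only, not the code in MoELoRA/layer.py; it assumes the temperature multiplies the gating logits so that a larger self.T makes the adapter selection sharper, so check layer.py for the exact form used in the repository.

# Illustrative sketch of a temperature-scaled top-k gate; NOT the repository's implementation.
# Assumption: the gate computes softmax(T * logits), so a larger T sharpens adapter selection.
import torch
import torch.nn.functional as F

def gate_top_k(logits: torch.Tensor, T: float, k: int = 2):
    # logits: (num_tokens, num_loras) raw gating scores for each LoRA adapter
    probs = F.softmax(T * logits, dim=-1)                    # temperature-scaled softmax
    weights, indices = torch.topk(probs, k, dim=-1)          # keep the top-k adapters per token
    weights = weights / weights.sum(dim=-1, keepdim=True)    # renormalize over the selected adapters
    return weights, indices

logits = torch.tensor([[1.0, 0.8, 0.1]])
print(gate_top_k(logits, T=1.0))   # low T: the two selected adapters get similar weight
print(gate_top_k(logits, T=30.0))  # high T (as suggested for composite-10): the winner dominates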

Save the evaluation result:

python eval_model.py --task <task_name> --batch_size <batch_size> --save

Debug mode (model output and ground truth will be shown in the console):

python eval_model.py --task <task_name> --batch_size <batch_size> --debug

Running a benchmark with a PEFT model (a single LoRA adapter):

python eval_model.py --task <task_name> --batch_size <batch_size> --model <adapter_name>

Training the MeteoRA model

  1. Prepare the LoRA adapters and their corresponding datasets in JSONL format. Each LoRA adapter must have a corresponding dataset. Place all adapters and datasets in their respective folders, using matching subfolder names (a quick consistency check is sketched after these steps):

    - lora_adapters
          - adapter_name1
          - adapter_name2
          - ...
    - datasets
          - dataset_name1
          - dataset_name2
          - ...
    
  2. Change file paths in run_meteora_train_fsdp.sh.

  3. Train MeteoRA model:

sh run_meteora_train_fsdp.sh
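Because training pairs each adapter with a dataset purely by subfolder name, a small check like the one below can catch mismatches before launching FSDP training. It is a hypothetical helper, not part of the repository, and assumes the lora_adapters/ and datasets/ folders shown above sit in the current directory.

# check_pairs.py -- hypothetical helper, not part of the repository.
# Verifies that every subfolder in lora_adapters/ has a same-named subfolder in datasets/.
import sys
from pathlib import Path

adapters = {p.name for p in Path("lora_adapters").iterdir() if p.is_dir()}
datasets = {p.name for p in Path("datasets").iterdir() if p.is_dir()}

missing_data = sorted(adapters - datasets)
missing_adapters = sorted(datasets - adapters)

if missing_data:
    print("Adapters without a matching dataset:", missing_data)
if missing_adapters:
    print("Datasets without a matching adapter:", missing_adapters)
if not missing_data and not missing_adapters:
    print(f"All {len(adapters)} adapter/dataset pairs match.")
sys.exit(1 if (missing_data or missing_adapters) else 0)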
