
D4MARL: A Dataset and Benchmark for Diverse, Measurable Offline Multi-Agent Reinforcement Learning

Documentation | Installation | Data Download Guide | Usage Guide


Installation Guide

git clone https://github.com/jymh/d4marl.git
conda create -n d4marl python==3.7
conda activate d4marl
cd d4marl
pip install -r requirements.txt
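After installation, a quick sanity check can confirm the environment is consistent. This is a minimal sketch using only standard conda/pip tooling, not D4MARL-specific commands:

# Verify the conda environment and installed dependencies.
conda activate d4marl
python --version    # should report Python 3.7.x
pip check           # reports any broken or conflicting requirements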

Data Download Guide

You can download the demonstration dataset in advance by running:

wget https://d4marl.oss-cn-beijing.aliyuncs.com/demo_files/$map_name/$quality/$quality.hdf5

Here, replace $map_name and $quality with the map and dataset quality you want.
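For example, assuming a map named 3m with an expert quality level exists in the storage bucket (both values here are placeholders, not a confirmed listing):

# Hypothetical example: substitute a real map name and quality level.
map_name=3m
quality=expert
wget https://d4marl.oss-cn-beijing.aliyuncs.com/demo_files/$map_name/$quality/$quality.hdf5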

Alternatively, you can set the download_dataset parameter in the run script, and the dataset will be fetched automatically.

The following sections show how to pre-train a policy with a default or customized configuration on the downloaded dataset.

Usage Guide

  • Here we provide examples of how to use D4MARL.
  • You can train an offline MARL policy by running python run_** (e.g., run_baseline_sc2.py or run_madt_sc2.py, shown below).
  • You can download the demo dataset and train a policy in a single command by setting download_dataset to True (see the sketch after this list).
  • You can customize the algorithm configuration on the visual platform by running streamlit run visualize.py.
  • You can run an evaluation by clicking the compare-methods control on the platform.
  • You can choose the training-curve color for each method by clicking the color button.
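For example, a single command that downloads the demo dataset and starts training might look like the following. This is a sketch based on the script in Train Policy below; the icq value passed to --algorithm is an assumed placeholder:

# Sketch: download the demo dataset and train a baseline in one command.
# "icq" is a placeholder value; pass whichever baseline algorithm you want.
python -u run_baseline_sc2.py \
    --offline_data_dir ./offline_data \
    --download_dataset \
    --algorithm icq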

Train Policy

Example
# Select a runner script based on $mode; --download_dataset fetches the
# demo dataset automatically to start a quick training run.
if [ "$mode" == "baseline" ]; then
    python -u run_baseline_sc2.py \
        --offline_data_dir "$path_to_data" \
        --download_dataset \
        --algorithm "$baseline_algorithm"
elif [ "$mode" == "madt" ]; then
    python -u run_madt_sc2.py \
        --offline_data_dir "$path_to_data" \
        --download_dataset
fi

The above command trains a policy with one of the baseline algorithms (ICQ, BCQ, or CQL) or with MADT. The total number of training steps is 1024, the number of vectorized environments is 1, and steps_per_epoch defaults to 500. If there is no local offline dataset in offline_data_dir, the command automatically downloads the dataset from our online storage.
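If the runner exposes these defaults as command-line flags, overriding them might look like the sketch below. The flag names here are assumptions, not confirmed by the repository; check the script's argument parser for the exact names:

# Hypothetical flag names: confirm them in run_baseline_sc2.py's
# argument parser before use.
python -u run_baseline_sc2.py \
    --offline_data_dir ./offline_data \
    --total_training_steps 1024 \
    --steps_per_epoch 500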

You can also customize the configuration of the offline algorithm by running

streamlit run visualize.py

Here we provide a user interface on which you can choose the specific task and approach to train offline.

We developed a visual training tool, built on the Streamlit platform, that integrates data preparation, hyperparameter configuration, model training, and evaluation of pre-trained models.
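If you run the tool on a remote machine, Streamlit's standard server options (these are generic Streamlit flags, not D4MARL-specific) let you pick a port and skip opening a local browser:

# Standard Streamlit server options: serve on port 8501
# without auto-opening a browser window.
streamlit run visualize.py --server.port 8501 --server.headless true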
