Offline Imitation Learning with Variational Counterfactual Reasoning

This is the code for reproducing the results of the paper Offline Imitation Learning with Variational Counterfactual Reasoning accepted at NeurIPS'2023. The paper can be found here.

Usage

Paper results were collected with Deep Mind Control (and Causal World). Networks are trained using PyTorch 1.4.0 and Python 3.6.

You can also run OILCA on the setting used in DWBC by running main_ivae_model.py and main_ivae_model.py:

python main.py \
    --aux_dim="5" \  
    --epochs="30" \
    --task="cheetah_run"

After the counterfactual model training, you can run the main.py to train the offline policy with the augmented data with the pretrained policy path:

python main_setting_demodice.py \
    --data_path="../dataset/dm_control_suite/" \  
    --env="cheetah_run" \
    --expert-policy-path="../learned_models/BC_all/bc_model_cheetah_run.pkl"

Bibtex

@inproceedings{sun2023offline,
  title     = {Offline Imitation Learning with Variational Counterfactual Reasoning},
  author    = {Sun, Zexu and He, Bowei and Liu, Jinxin and Chen, Xu and Ma, Chen and Zhang, Shuai},
  booktitle = {Proceedings of the 37th Conference on Neural Information Processing Systems},
  year      = {2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
algos		algos
learned_models		learned_models
README.md		README.md
auto_run.sh		auto_run.sh
ivae_exogenous_model.py		ivae_exogenous_model.py
main.py		main.py
main_ivae_model.py		main_ivae_model.py
model.py		model.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Offline Imitation Learning with Variational Counterfactual Reasoning

Usage

Bibtex

About

Releases

Packages

Languages

ZexuSun/OILCA-NeurIPS23

Folders and files

Latest commit

History

Repository files navigation

Offline Imitation Learning with Variational Counterfactual Reasoning

Usage

Bibtex

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages