Skip to content
/ IVR Public

[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

License

Notifications You must be signed in to change notification settings

ryanxhr/IVR

Repository files navigation

Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization

This is the code for reproducing the results of the paper Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization accepted as Notable-top-5% at ICLR'2023.

The discrete version of IVR on Atari datasets can be found at https://github.com/ryanxhr/Discrete_IVR.

Usage

Our code is built on the jax version code of IQL (https://github.com/ikostrikov/implicit_q_learning). Paper reuslts can be reproduced by running ./run_mujoco.sh, ./run_antmaze.sh and ./run_kitchen.sh.

Bibtex

@inproceedings{xu2023offline,
  title  = {Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization},
  author = {Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Victor Wai Kin Chan, Xianyuan Zhan},
  year   = {2023},
  booktitle = {International Conference on Learning Representations},
}

About

[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published