This repository contains the implementation of the WQL algorithm both in tabular domains and Atari games, together with implementation of algorithms we used to confront our results. It uses the Mushroom RL library and OpenAI Gym.
To reproduce our results run the following scripts:
q_learning/run.py
: Run experiments in tabular RL. You can specify the environments, algorithms, policies, update methods and any other hyperparameter of each algorithm used. By default will run all algorithms in all the environments and log the results in the 'tabular_data' directory.dqn/atari/run.py.py
: Run experiments in atari games. You can specify the environments, algorithms, policies, update methods and any other hyperparameter of each algorithm used. By default will run particle DQN in Breakout using posterior sampling policy with MO update and log the results in the 'logs' directory.