Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
config		config
examples		examples
meta-nml		meta-nml
models		models
multiworld		multiworld
scripts		scripts
softlearning		softlearning
tests		tests
.dockerignore		.dockerignore
.env		.env
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
requirements.txt		requirements.txt
setup.py		setup.py

Repository files navigation

MURAL

Official code for MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven RL (ICML 2021)

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning
Kevin Li*, Abhishek Gupta*, Ashwin D Reddy, Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine
International Conference on Machine Learning (ICML) 2021

Website | Paper

Code Structure

The original codebase was based off of standard implementations of Soft Actor-Critic (softlearning) and Model-Agnostic Meta-Learning (pytorch-maml-rl).

Below are the main additions that are relevant to MURAL:

meta-nml/: Implementation of Meta-NML, an amortized version of Normalized Maximum Likelihood for deep neural networks using meta-learning.
- meta-nml/maml/meta_nml.py: Core Meta-NML algorithm
- meta-nml/maml/metadatasets/nml.py: The NML dataset. Given a standard classification dataset with n inputs and labels (for k classes), this will construct a meta-dataset of n*k tasks, each of which involves adapting to one of the input points with an arbitrary label in 1, ..., k.
softlearning/: Library of RL algorithms
- softlearning/algorithms/vice.py: Implementation of MURAL, a classifier RL algorithm that uses Meta-NML for uncertainty-aware rewards
examples/: Example setups for the environments in our paper
- examples/*/variants.py: Hyperparameter and environment settings for each task
scripts/examples/*: Scripts to run MURAL on various environments

Setup Instructions

Clone the repository
Create a conda environment with the required dependencies, and activate it (2 commands):

conda env create -f environment.yml
conda activate mural

Add the necessary paths (2 commands):

pip install -e .
conda develop meta-nml

Install subfolder dependencies (2 commands):

cd meta-nml && pip install -r requirements.txt
cd ../multiworld && pip install -e .

Enable execution for all run scripts:

cd .. && chmod +x scripts/examples/*.sh

Running MURAL

We have included separate scripts for each of the environments in the paper. Use the following commands to run MURAL on the desired environment:

Zigzag Maze: ./scripts/examples/run_zigzag_maze.sh
Spiral Maze: ./scripts/examples/run_spiral_maze.sh
Sawyer Push: ./scripts/examples/run_sawyer_push.sh
Sawyer Pick-and-Place: ./scripts/examples/run_sawyer_pick.sh
Sawyer Door: ./scripts/examples/run_sawyer_door.sh
Ant Locomotion: ./scripts/examples/run_ant_maze.sh
Dexterous Hand: Unfortunately, the code for the Dexterous Hand environment is private and we have been asked not to include it in this submission for the time being.

Common Issues

numpy.ndarray size changed, may indicate binary incompatibility. Expected 88 from C header, got 80 from PyObject

Uninstall and reinstall numpy:

pip uninstall numpy
pip install numpy

TypeError: init() got an unexpected keyword argument 'tags'

Install an earlier gym version:

pip install gym==0.15.4

Missing aiohttp

pip install aiohttp psutil

Acknowledgements

This codebase was built off of the following publicly available repos:

softlearning (implementation of SAC and other common RL algorithms): https://github.com/rail-berkeley/softlearning
multiworld (multitask gym environments for RL): https://github.com/vitchyr/multiworld
pytorch-maml (implementation of Model-Agnostic Meta-Learning): https://github.com/tristandeleu/pytorch-maml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MURAL

Code Structure

Setup Instructions

Running MURAL

Common Issues

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

License

kevintli/mural

Folders and files

Latest commit

History

Repository files navigation

MURAL

Code Structure

Setup Instructions

Running MURAL

Common Issues

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages