imitation/experiments at master · HumanCompatibleAI/imitation

History

Name		Name	Last commit message	Last commit date
parent directory ..
.gitignore		.gitignore
README.md		README.md
bc_benchmark.sh		bc_benchmark.sh
benchmark_and_table.sh		benchmark_and_table.sh
commands.py		commands.py
common.sh		common.sh
convert_traj.py		convert_traj.py
dagger_benchmark.sh		dagger_benchmark.sh
imit_benchmark.sh		imit_benchmark.sh
imit_benchmark_config.csv		imit_benchmark_config.csv
imit_table_cheetahs.csv		imit_table_cheetahs.csv
imit_table_mvp_seals_config.csv		imit_table_mvp_seals_config.csv
rollouts_from_policies.sh		rollouts_from_policies.sh
rollouts_from_policies_config.csv		rollouts_from_policies_config.csv
transfer_learn_benchmark.sh		transfer_learn_benchmark.sh

README.md

Experiment scripts are compatible with Linux and macOS.

(macOS only) macOS compatibility setup

macOS to install some GNU-compatible binaries before all experiments scripts will work.

brew install coreutils gnu-getopt parallel

Scripts

Phase 1: Generate expert demonstrations from models.

Run experiments/rollouts_from_policies.sh. (Rollouts saved in output/train_experts/). Demonstrations are used in Phase 2 for imitation learning.

Phase 2: Train imitation learning.

Run experiments/imit_benchmark.sh --run_name RUN_NAME. To choose AIRL or GAIL, add the --airl and --gail flags (default is GAIL).

To analyze these results, run python -m imitation.scripts.analyze with run_name=RUN_NAME. Analysis can be run even while training is midway (will only show completed imitation learner's results). Example output.

Phase 3: Transfer learning.

Run experiments/transfer_learn_benchmark.sh. To choose AIRL or GAIL, add the --airl and --gail flags (default is GAIL). Transfer rewards are loaded from data/reward_models.

Hyperparameter tuning

Add a named config containing the hyperparameter search space and other settings to src/imitation/scripts/config/parallel.py. (def example_cartpole_rl(): is an example).

Run your hyperparameter tuning experiment using python -m imitation.scripts.parallel with YOUR_NAMED_CONFIG inner_run_name=RUN_NAME.

Analyze imitation learning experiments using python -m imitation.scripts.analyze with run_name=RUN_NAME source_dir=~/ray_results.

View Stable Baselines training stats on TensorBoard (available for regular RL, imitation learning, and transfer learning) using tensorboard --log_dir ~/ray_results. To view only a subset of TensorBoard training progress use imitation.scripts.analyze gather_tb_directories with source_dir=~/ray_results run_name=RUN_NAME.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

experiments

experiments

README.md

(macOS only) macOS compatibility setup

Scripts

Phase 1: Generate expert demonstrations from models.

Phase 2: Train imitation learning.

Phase 3: Transfer learning.

Hyperparameter tuning

Files

experiments

Directory actions

More options

Directory actions

More options

Latest commit

History

experiments

Folders and files

parent directory

README.md

(macOS only) macOS compatibility setup

Scripts

Phase 1: Generate expert demonstrations from models.

Phase 2: Train imitation learning.

Phase 3: Transfer learning.

Hyperparameter tuning