Commit

modify README
dongminlee94 committed Mar 11, 2019
1 parent a026985 commit ad04bf1
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
@@ -82,7 +82,7 @@ We have implemented `APP`, `MaxEnt` using Q-learning as RL step in `MountainCar-

Navigate to `expert_demo.npy` in [lets-do-irl/mountaincar/app/expert_demo](https://github.com/reinforcement-learning-kr/lets-do-irl/tree/master/mountaincar/app/expert_demo) or [lets-do-irl/mountaincar/maxent/expert_demo](https://github.com/reinforcement-learning-kr/lets-do-irl/tree/master/mountaincar/maxent/expert_demo).

- Shape of expert's demonstrations is (20, 130, 3); (number of demonstrations, length of demonstrations, state and action of demonstrations)
+ Shape of expert's demonstrations is (20, 130, 3); (number of demonstrations, length of demonstrations, states and actions of demonstrations)

If you make demonstrations, Navigate to `make_expert.py` in [lets-do-irl/mountaincar/app/expert_demo](https://github.com/reinforcement-learning-kr/lets-do-irl/tree/master/mountaincar/app/expert_demo) or [lets-do-irl/mountaincar/maxent/expert_demo](https://github.com/reinforcement-learning-kr/lets-do-irl/tree/master/mountaincar/maxent/expert_demo).
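
For readers of this hunk, a minimal sketch of how the (20, 130, 3) shape could be unpacked. It assumes the last axis holds the two `MountainCar-v0` state components (position, velocity) followed by one action index; that ordering is inferred from the README wording, not confirmed by it. A zero-filled placeholder array stands in for the real `expert_demo.npy`.

```python
import numpy as np

# Placeholder with the documented shape; in practice this would come from
# np.load("expert_demo.npy") (path relative to the expert_demo directory).
demos = np.zeros((20, 130, 3))

num_demos, demo_len, feature_dim = demos.shape

# Assumption: the first two features are the state (position, velocity)
# and the last feature is the discrete action taken in that state.
states = demos[:, :, :2]   # shape (20, 130, 2)
actions = demos[:, :, 2]   # shape (20, 130)
```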

@@ -142,7 +142,7 @@ We have implemented `GAIL`, `VAIL` using PPO as RL step in `Hopper-v2` environme

Navigate to `expert_demo.p` in [lets-do-irl/mujoco/gail/expert_demo](https://github.com/reinforcement-learning-kr/lets-do-irl/tree/master/mujoco/gail/expert_demo) or [lets-do-irl/mujoco/vail/expert_demo](https://github.com/reinforcement-learning-kr/lets-do-irl/tree/master/mujoco/vail/expert_demo).

- Shape of expert's demonstrations is (50000, 14); (number of demonstrations, state and action of demonstrations)
+ Shape of expert's demonstrations is (50000, 14); (number of demonstrations, states and actions of demonstrations)

We used demonstrations that get scores between about 2200 and 2600 on average.
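
As a rough illustration of the (50000, 14) shape, the sketch below splits each row into state and action. It assumes the 11-dimensional `Hopper-v2` observation comes first, followed by the 3-dimensional action; the column ordering is an assumption based on the README wording. A zero-filled placeholder array stands in for the contents of `expert_demo.p`.

```python
import numpy as np

# Placeholder with the documented shape; the real data would be unpickled
# from expert_demo.p.
demos = np.zeros((50000, 14))

STATE_DIM = 11   # Hopper-v2 observation size
ACTION_DIM = 3   # Hopper-v2 action size

# Assumption: each row is laid out as [state..., action...].
states = demos[:, :STATE_DIM]     # shape (50000, 11)
actions = demos[:, STATE_DIM:]    # shape (50000, 3)
```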
