Skip to content

Issues: benbaarber/rl

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or ⇧ + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Double Deep Q Network (DDQN) algo RL algorithms in the rl::algo module new A new feature
#22 opened Jun 22, 2024 by benbaarber
Bayesian Control Rule exploration Exploration policies in the rl::exploration module new A new feature
#19 opened May 19, 2024 by benbaarber
EXP3 exploration Exploration policies in the rl::exploration module new A new feature
#18 opened May 19, 2024 by benbaarber
Random Network Distillation (RND) exploration Exploration policies in the rl::exploration module new A new feature
#14 opened May 15, 2024 by benbaarber
Noisy Networks exploration Exploration policies in the rl::exploration module new A new feature
#13 opened May 15, 2024 by benbaarber
Thompson Sampling exploration Exploration policies in the rl::exploration module new A new feature
#12 opened May 15, 2024 by benbaarber
Rainbow algo RL algorithms in the rl::algo module new A new feature
#8 opened May 15, 2024 by benbaarber
Asynchronous Advantage Actor-Critic (A3C) algo RL algorithms in the rl::algo module new A new feature
#7 opened May 15, 2024 by benbaarber
Advantage Actor-Critic (A2C) algo RL algorithms in the rl::algo module new A new feature
#6 opened May 15, 2024 by benbaarber
Twin Delayed DDPG (TD3) algo RL algorithms in the rl::algo module new A new feature
#5 opened May 15, 2024 by benbaarber
Trust Region Policy Optimization (TRPO) algo RL algorithms in the rl::algo module new A new feature
#4 opened May 15, 2024 by benbaarber
Soft Actor-Critic (SAC) algo RL algorithms in the rl::algo module new A new feature
#3 opened May 15, 2024 by benbaarber
Proximal Policy Optimization (PPO) algo RL algorithms in the rl::algo module new A new feature
#2 opened May 15, 2024 by benbaarber
ProTip! no:milestone will show everything without a milestone.