hill-a / stable-baselines Public

forked from openai/baselines

Notifications You must be signed in to change notification settings
Fork 728
Star 4.1k

Code
Issues 121
Pull requests 10
Actions
Projects 1
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Issues: hill-a/stable-baselines

Tensorflow 2.0 support?

#366 by heron1 was closed Mar 8, 2020

Closed 20

V3 new backend: PyTorch? and the future of Stable Baselines

#733 by araffin was closed Mar 2, 2021

Closed 10

Labels 19 Milestones 1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

121 Open 824 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[Question] Using PPO1 on a cluster question

Further information is requested

#957 opened Jul 20, 2020 by AlessandroZavoli

[Question] Profiling with custum environment and MPI custom gym env

Issue related to Custom Gym Env

question

Further information is requested

#953 opened Jul 18, 2020 by AlessandroZavoli

Maybe there is one problem in implementing the class PrioritizedReplayBuffer bug

Something isn't working

#941 opened Jul 13, 2020 by UPUPGOO

Pre-Training Problem more information needed

Please fill the issue template completely

windows

#932 opened Jul 8, 2020 by FabioPINO

[question] PPO2 pretrain always resets weights? bug

Something isn't working

question

Further information is requested

#921 opened Jul 2, 2020 by SolaWeng

[question] Specify a prior over action distribution? question

Further information is requested

#903 opened Jun 23, 2020 by juliuskittler

[question] SAC target q nets may be updated many times in each round? bug

Something isn't working

#900 opened Jun 21, 2020 by xuanqing94 v2.10.1

Value Function Normalization enhancement

New feature or request

experimental

Experimental Feature

Discussion about V3

#892 opened Jun 16, 2020 by huvar

Read errors when running PPO1 with MPI question

Further information is requested

#886 opened Jun 8, 2020 by siferati

error while using LstmPolicy question

Further information is requested

windows

#882 opened Jun 4, 2020 by pirate-lofy

[feature request] Add maximum time steps parameter to evaluation function to protect against infinite episodes enhancement

New feature or request

#876 opened Jun 3, 2020 by philwinder

[feature request] Plotting additional info collected by monitor using results_plotter enhancement

New feature or request

#872 opened May 28, 2020 by nisheeth-golakiya

Should TensorboardWriter close its tf.summary.FileWriter? bug

Something isn't working

help wanted

Help from contributors is needed

#855 opened May 14, 2020 by shwang

Adding glossary to the docs documentation

Documentation should be updated

enhancement

New feature or request

Discussion about V3

#853 opened May 13, 2020 by mhtb32

question: multiple reward array ? custom gym env

Issue related to Custom Gym Env

question

Further information is requested

#841 opened May 4, 2020 by greg2paris

PPO2 episode reward drops catastrophically during training custom gym env

Issue related to Custom Gym Env

question

Further information is requested

#837 opened May 1, 2020 by kp368

Using Saved Model as Enemy Policy in Custom Environment (while training in a subprocvecenv) custom gym env

Issue related to Custom Gym Env

question

Further information is requested

#835 opened Apr 30, 2020 by lukepolson

Adding Additional Observations and Actions to Buffer per TimeStep custom gym env

Issue related to Custom Gym Env

question

Further information is requested

#834 opened Apr 30, 2020 by lukepolson

Converting an existing gazebo environment into vectorized environment custom gym env

Issue related to Custom Gym Env

question

Further information is requested

#831 opened Apr 27, 2020 by utsavpatel22

Box observation_space high bound wrongly set by wrap_deepmind openai gym

related to OpenAI Gym interface

question

Further information is requested

#829 opened Apr 25, 2020 by alexpalms

[question] How to get layer activations in Tensorflow question

Further information is requested

#825 opened Apr 24, 2020 by lumelanie

Reverse sign in TD-error of DQN enhancement

New feature or request

#808 opened Apr 16, 2020 by juliuskittler

[PPO2] problems resuming training

#781 opened Apr 3, 2020 by k0rean

[potential bug] HER Replay buffer observations question

Further information is requested

#745 opened Mar 17, 2020 by johannes-dornheim

"Getting Mean Reward in CustomCallBack" Unsupported operand type(s) for /: 'str' and 'int' custom gym env

Issue related to Custom Gym Env

question

Further information is requested

windows

#738 opened Mar 11, 2020 by toksis

Previous 1 2 3 4 5 Next

Previous Next

ProTip! Exclude everything labeled bug with -label:bug.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly