forked from openai/baselines
-
Notifications
You must be signed in to change notification settings - Fork 728
Issues: hill-a/stable-baselines
V3 new backend: PyTorch? and the future of Stable Baselines
#733
by araffin
was closed Mar 2, 2021
Closed
10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Question] Using PPO1 on a cluster
question
Further information is requested
#957
opened Jul 20, 2020 by
AlessandroZavoli
[Question] Profiling with custum environment and MPI
custom gym env
Issue related to Custom Gym Env
question
Further information is requested
#953
opened Jul 18, 2020 by
AlessandroZavoli
Maybe there is one problem in implementing the class PrioritizedReplayBuffer
bug
Something isn't working
#941
opened Jul 13, 2020 by
UPUPGOO
Pre-Training Problem
more information needed
Please fill the issue template completely
windows
#932
opened Jul 8, 2020 by
FabioPINO
[question] PPO2 pretrain always resets weights?
bug
Something isn't working
question
Further information is requested
#921
opened Jul 2, 2020 by
SolaWeng
[question] Specify a prior over action distribution?
question
Further information is requested
#903
opened Jun 23, 2020 by
juliuskittler
Value Function Normalization
enhancement
New feature or request
experimental
Experimental Feature
v3
Discussion about V3
#892
opened Jun 16, 2020 by
huvar
Read errors when running PPO1 with MPI
question
Further information is requested
#886
opened Jun 8, 2020 by
siferati
error while using LstmPolicy
question
Further information is requested
windows
#882
opened Jun 4, 2020 by
pirate-lofy
[feature request] Add maximum time steps parameter to evaluation function to protect against infinite episodes
enhancement
New feature or request
#876
opened Jun 3, 2020 by
philwinder
[feature request] Plotting additional info collected by monitor using results_plotter
enhancement
New feature or request
#872
opened May 28, 2020 by
nisheeth-golakiya
Should Something isn't working
help wanted
Help from contributors is needed
TensorboardWriter
close its tf.summary.FileWriter
?
bug
#855
opened May 14, 2020 by
shwang
Adding glossary to the docs
documentation
Documentation should be updated
enhancement
New feature or request
v3
Discussion about V3
#853
opened May 13, 2020 by
mhtb32
question: multiple reward array ?
custom gym env
Issue related to Custom Gym Env
question
Further information is requested
#841
opened May 4, 2020 by
greg2paris
PPO2 episode reward drops catastrophically during training
custom gym env
Issue related to Custom Gym Env
question
Further information is requested
#837
opened May 1, 2020 by
kp368
Using Saved Model as Enemy Policy in Custom Environment (while training in a subprocvecenv)
custom gym env
Issue related to Custom Gym Env
question
Further information is requested
#835
opened Apr 30, 2020 by
lukepolson
Adding Additional Observations and Actions to Buffer per TimeStep
custom gym env
Issue related to Custom Gym Env
question
Further information is requested
#834
opened Apr 30, 2020 by
lukepolson
Converting an existing gazebo environment into vectorized environment
custom gym env
Issue related to Custom Gym Env
question
Further information is requested
#831
opened Apr 27, 2020 by
utsavpatel22
Box observation_space high bound wrongly set by wrap_deepmind
openai gym
related to OpenAI Gym interface
question
Further information is requested
#829
opened Apr 25, 2020 by
alexpalms
[question] How to get layer activations in Tensorflow
question
Further information is requested
#825
opened Apr 24, 2020 by
lumelanie
Reverse sign in TD-error of DQN
enhancement
New feature or request
#808
opened Apr 16, 2020 by
juliuskittler
[potential bug] HER Replay buffer observations
question
Further information is requested
#745
opened Mar 17, 2020 by
johannes-dornheim
"Getting Mean Reward in CustomCallBack" Unsupported operand type(s) for /: 'str' and 'int'
custom gym env
Issue related to Custom Gym Env
question
Further information is requested
windows
#738
opened Mar 11, 2020 by
toksis
ProTip!
Exclude everything labeled
bug
with -label:bug.