Issues: thu-ml/tianshou

Issues list

question about DRQN [labels: bug, not reproduced yet, RNN]
#584 opened Apr 3, 2022 by leao1995
[Algorithm request] Suggest for implementing D4PG in future version. [labels: new algorithm]
#220 opened Sep 16, 2020 by GIS-PuppetMaster
3 of 8 tasks
Add reward shaping and policy shaping to DQN [labels: algorithm enhancement]
#279 opened Jan 25, 2021 by zhujl1991
Action mask for more algorithms [labels: algorithm enhancement, refactoring]
#334 opened Apr 11, 2021 by Stuhl
MultiAgentPolicyManager misses rewards [labels: enhancement, MARL]
#399 opened Jul 14, 2021 by benblack769
4 of 8 tasks
Some questions in recurrent-style SAC [labels: question, RNN]
#470 opened Oct 25, 2021 by chocolate616
Episode start signal not used in RNN for on-policy algorithms [labels: bug, RNN]
#486 opened Nov 29, 2021 by araffin
4 of 8 tasks
Does Tianshou support Multi-Agent Reinforcement learning algorithms, such as maddpg? [labels: enhancement, MARL]
#490 opened Dec 3, 2021 by Kai-gege
3 of 8 tasks
Does tianshou support RNN-SAC and how can I find the demo code? [labels: question, RNN]
#491 opened Dec 6, 2021 by caimingxue
8 tasks
A question: LSTM + PPO [labels: question, RNN]
#498 opened Dec 29, 2021 by tesla-cat
RNN for continuous CQL algorithm [labels: enhancement, RNN]
#513 opened Jan 21, 2022 by BFAnas
5 of 8 tasks
What paper or reference is the RNN implementation trying to replicate? [labels: bug, RNN]
#567 opened Mar 11, 2022 by BFAnas
5 of 8 tasks
How to support multi-agent reinforcement learning [labels: discussion, good first issue, MARL]
#121 opened Jul 9, 2020 by youkaichao
3 of 8 tasks
Implementation design issues in SubprocVectorEnv [labels: discussion, enhancement, refactoring]
#573 opened Mar 19, 2022 by duburcqa
Improve discrete control offline RL benchmark [labels: enhancement]
#612 opened Apr 25, 2022 by nuance1979
4 of 8 tasks
Implement Decision Transformer for offline RL [labels: blocked, new algorithm, RNN]
#626 opened May 2, 2022 by nuance1979
4 of 8 tasks
lstm+ppo/sac [labels: question, RNN]
#754 opened Oct 7, 2022 by 1900360
RNN support for TD3 and SAC [labels: question, RNN]
#795 opened Jan 12, 2023 by qtomcatq
Possible Leak of Observations In Multi-Agent Policies [labels: MARL, question]
#806 opened Feb 15, 2023 by uinversion
4 of 8 tasks
Fix handling of torch "device" association [labels: bug, good first issue]
#810 opened Feb 18, 2023 by jamartinh
3 of 8 tasks
Milestone: Release 2.0.0
[question] LSTM for A2C with discrete action space [labels: question, RNN]
#814 opened Feb 27, 2023 by cbschen
3 of 6 tasks
Errors with ParallelEnv and AECEnv [labels: question]
#816 opened Feb 28, 2023 by Franjrz
5 of 8 tasks
Getting started example causes TypeError: object of type 'TimeLimit' has no len() [labels: bug, not reproduced yet]
#819 opened Mar 6, 2023 by SorenSc
4 of 8 tasks
Milestone: Release 2.0.0
Plotting more metrics for PPO [labels: blocked, enhancement]
#842 opened Mar 31, 2023 by arvganesh
2 of 8 tasks
Milestone: Release 2.0.0
MPO Implementation [labels: new algorithm]
#1165 opened Jul 4, 2024 by ziqiao30