-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Issues: thu-ml/tianshou
Clearer separation between the trainer and the algorithm and ...
#1034
opened Jan 24, 2024 by
maxhuettenrauch
Open
1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
question about DRQN
bug
Something isn't working
not reproduced yet
Not yet tested or reproduced by a reviewer
RNN
Temporary label to group all things RNN
#584
opened Apr 3, 2022 by
leao1995
[Algorithm request] Suggest for implementing D4PG in future version.
new algorithm
Adding a new RL algorithm
#220
opened Sep 16, 2020 by
GIS-PuppetMaster
3 of 8 tasks
Add reward shaping and policy shaping to DQN
algorithm enhancement
Not quite a new algorithm, but an enhancement to algo. functionality
#279
opened Jan 25, 2021 by
zhujl1991
Action mask for more algorithms
algorithm enhancement
Not quite a new algorithm, but an enhancement to algo. functionality
refactoring
No change to functionality
#334
opened Apr 11, 2021 by
Stuhl
MultiAgentPolicyManager misses rewards
enhancement
Feature that is not a new algorithm or an algorithm enhancement
MARL
Temporary label to group all things MARL
#399
opened Jul 14, 2021 by
benblack769
4 of 8 tasks
Some questions in recurrent-style SAC
question
Further information is requested
RNN
Temporary label to group all things RNN
#470
opened Oct 25, 2021 by
chocolate616
Episode start signal not used in RNN for on-policy algorithms
bug
Something isn't working
RNN
Temporary label to group all things RNN
#486
opened Nov 29, 2021 by
araffin
4 of 8 tasks
Does Tianshou support Multi-Agent Reinforcement learning algorithms, such as maddpg?
enhancement
Feature that is not a new algorithm or an algorithm enhancement
MARL
Temporary label to group all things MARL
#490
opened Dec 3, 2021 by
Kai-gege
3 of 8 tasks
Does tianshou support RNN-SAC and how can I find the demo code?
question
Further information is requested
RNN
Temporary label to group all things RNN
#491
opened Dec 6, 2021 by
caimingxue
8 tasks
A question: LSTM + PPO
question
Further information is requested
RNN
Temporary label to group all things RNN
#498
opened Dec 29, 2021 by
tesla-cat
RNN for continuous CQL algorithm
enhancement
Feature that is not a new algorithm or an algorithm enhancement
RNN
Temporary label to group all things RNN
#513
opened Jan 21, 2022 by
BFAnas
5 of 8 tasks
What paper or reference is the RNN implementation trying to replicate?
bug
Something isn't working
RNN
Temporary label to group all things RNN
#567
opened Mar 11, 2022 by
BFAnas
5 of 8 tasks
How to support multi-agent reinforcement learning
discussion
Discussion of a typical issue
good first issue
Good for newcomers
MARL
Temporary label to group all things MARL
#121
opened Jul 9, 2020 by
youkaichao
3 of 8 tasks
Implementation design issues in SubprocVectorEnv
discussion
Discussion of a typical issue
enhancement
Feature that is not a new algorithm or an algorithm enhancement
refactoring
No change to functionality
#573
opened Mar 19, 2022 by
duburcqa
Improve discrete control offline RL benchmark
enhancement
Feature that is not a new algorithm or an algorithm enhancement
#612
opened Apr 25, 2022 by
nuance1979
4 of 8 tasks
Implement Decision Transformer for offline RL
blocked
Can't be worked on for now
new algorithm
Adding a new RL algorithm
RNN
Temporary label to group all things RNN
#626
opened May 2, 2022 by
nuance1979
4 of 8 tasks
lstm+ppo/sac
question
Further information is requested
RNN
Temporary label to group all things RNN
#754
opened Oct 7, 2022 by
1900360
RNN support for TD3 and SAC
question
Further information is requested
RNN
Temporary label to group all things RNN
#795
opened Jan 12, 2023 by
qtomcatq
Possible Leak of Observations In Multi-Agent Policies
MARL
Temporary label to group all things MARL
question
Further information is requested
#806
opened Feb 15, 2023 by
uinversion
4 of 8 tasks
Fix handling of torch "device" association
bug
Something isn't working
good first issue
Good for newcomers
[question] LSTM for A2C with discrete action space
question
Further information is requested
RNN
Temporary label to group all things RNN
#814
opened Feb 27, 2023 by
cbschen
3 of 6 tasks
Errors with ParallelEnv and AECEnv
question
Further information is requested
#816
opened Feb 28, 2023 by
Franjrz
5 of 8 tasks
Getting started example causes TypeError: object of type 'TimeLimit' has no len()
bug
Something isn't working
not reproduced yet
Not yet tested or reproduced by a reviewer
Plotting more metrics for PPO
blocked
Can't be worked on for now
enhancement
Feature that is not a new algorithm or an algorithm enhancement
Previous Next
ProTip!
Adding no:label will show everything without a label.