-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Issues: thu-ml/tianshou
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
How to support multi-agent reinforcement learning
discussion
Discussion of a typical issue
good first issue
Good for newcomers
MARL
Temporary label to group all things MARL
#121
opened Jul 9, 2020 by
youkaichao
3 of 8 tasks
RNN for continuous CQL algorithm
enhancement
Feature that is not a new algorithm or an algorithm enhancement
RNN
Temporary label to group all things RNN
#513
opened Jan 21, 2022 by
BFAnas
5 of 8 tasks
Suggestion - Redesign RayEnvWorker for Improved Performance
performance issues
Slow execution or poor-quality results
#1172
opened Jul 10, 2024 by
destin-v
4 tasks done
Implement Decision Transformer for offline RL
blocked
Can't be worked on for now
new algorithm
Adding a new RL algorithm
RNN
Temporary label to group all things RNN
#626
opened May 2, 2022 by
nuance1979
4 of 8 tasks
How to successfully run a demo
build/test
documentation
question
Further information is requested
#1015
opened Dec 28, 2023 by
zhiyu2020
RNN related issues
bug
Something isn't working
help wanted
Extra attention is needed
RNN
Temporary label to group all things RNN
#937
opened Sep 7, 2023 by
MischaPanch
Episode start signal not used in RNN for on-policy algorithms
bug
Something isn't working
RNN
Temporary label to group all things RNN
#486
opened Nov 29, 2021 by
araffin
4 of 8 tasks
Collector sampling with multiple environment does not seem to be unbiased with n_episodes
algorithm enhancement
Not quite a new algorithm, but an enhancement to algo. functionality
question
Further information is requested
#1042
opened Feb 2, 2024 by
utkarshp
5 of 9 tasks
Improve discrete control offline RL benchmark
enhancement
Feature that is not a new algorithm or an algorithm enhancement
#612
opened Apr 25, 2022 by
nuance1979
4 of 8 tasks
Poetry update the torch versioned from cuda (2.0.1+cu118) to cpu (2.1.1) defaultly on Windows
build/test
#1145
opened May 11, 2024 by
coolermzb3
6 of 9 tasks
Batch: don't just set 0 when elements have None entries
Batch and Buffer
Improvements in internal data structures, temporary label
bug
Something isn't working
#1088
opened Apr 3, 2024 by
MischaPanch
compute_episodic_return
bug when v_s=None
performance issues
Buffer: fix discrepancy in slicing order
Batch and Buffer
Improvements in internal data structures, temporary label
breaking changes
Changes in public interfaces. Includes small changes or changes in keys
refactoring
No change to functionality
#1090
opened Apr 3, 2024 by
MischaPanch
Support Dict observation spaces
documentation
enhancement
Feature that is not a new algorithm or an algorithm enhancement
good first issue
Good for newcomers
tentative
Up to discussion, may be dismissed
#1065
opened Feb 26, 2024 by
MischaPanch
Errors with ParallelEnv and AECEnv
question
Further information is requested
#816
opened Feb 28, 2023 by
Franjrz
5 of 8 tasks
Fix handling of torch "device" association
bug
Something isn't working
good first issue
Good for newcomers
MultiAgentPolicyManager misses rewards
enhancement
Feature that is not a new algorithm or an algorithm enhancement
MARL
Temporary label to group all things MARL
#399
opened Jul 14, 2021 by
benblack769
4 of 8 tasks
How can I make action sampling within the range specified by my environment when using onpolicy_trainer?
question
Further information is requested
#1142
opened May 9, 2024 by
lidaken
Regarding the error related to SEED when I train in a homebrew environment
question
Further information is requested
#1039
opened Jan 31, 2024 by
iamysy
Docs and examples on how to report performance issues
documentation
enhancement
Feature that is not a new algorithm or an algorithm enhancement
Possible Leak of Observations In Multi-Agent Policies
MARL
Temporary label to group all things MARL
question
Further information is requested
#806
opened Feb 15, 2023 by
uinversion
4 of 8 tasks
Some questions in recurrent-style SAC
question
Further information is requested
RNN
Temporary label to group all things RNN
#470
opened Oct 25, 2021 by
chocolate616
Batch: don't just strip off empty entries when creating batches
Batch and Buffer
Improvements in internal data structures, temporary label
bug
Something isn't working
#1089
opened Apr 3, 2024 by
MischaPanch
Use RNN in MARL
MARL
Temporary label to group all things MARL
question
Further information is requested
RNN
Temporary label to group all things RNN
#965
opened Oct 13, 2023 by
zhangwenjun1229
1 of 8 tasks
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.