thu-ml / tianshou Public

Notifications You must be signed in to change notification settings
Fork 1.1k
Star 7.6k

Code
Issues 131
Pull requests 4
Discussions
Actions
Projects 1
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Issues: thu-ml/tianshou

Adding Hyperparameter Optimisation (HPO)

#978 opened Oct 25, 2023 by bordeauxred

Open 2

Clearer separation between the trainer and the algorithm and ...

#1034 opened Jan 24, 2024 by maxhuettenrauch

Open 1

Labels 31 Milestones 1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clear current search query, filters, and sorts

131 Open 589 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

question about DRQN bug

Something isn't working

not reproduced yet

Not yet tested or reproduced by a reviewer

RNN

Temporary label to group all things RNN

#584 opened Apr 3, 2022 by leao1995

[Algorithm request] Suggest for implementing D4PG in future version. new algorithm

Adding a new RL algorithm

#220 opened Sep 16, 2020 by GIS-PuppetMaster

3 of 8 tasks

Add reward shaping and policy shaping to DQN algorithm enhancement

Not quite a new algorithm, but an enhancement to algo. functionality

#279 opened Jan 25, 2021 by zhujl1991

Action mask for more algorithms algorithm enhancement

Not quite a new algorithm, but an enhancement to algo. functionality

refactoring

No change to functionality

#334 opened Apr 11, 2021 by Stuhl

MultiAgentPolicyManager misses rewards enhancement

Feature that is not a new algorithm or an algorithm enhancement

MARL

Temporary label to group all things MARL

#399 opened Jul 14, 2021 by benblack769

4 of 8 tasks

Some questions in recurrent-style SAC question

Further information is requested

RNN

Temporary label to group all things RNN

#470 opened Oct 25, 2021 by chocolate616

Episode start signal not used in RNN for on-policy algorithms bug

Something isn't working

RNN

Temporary label to group all things RNN

#486 opened Nov 29, 2021 by araffin

4 of 8 tasks

Does Tianshou support Multi-Agent Reinforcement learning algorithms, such as maddpg? enhancement

Feature that is not a new algorithm or an algorithm enhancement

MARL

Temporary label to group all things MARL

#490 opened Dec 3, 2021 by Kai-gege

3 of 8 tasks

Does tianshou support RNN-SAC and how can I find the demo code? question

Further information is requested

RNN

Temporary label to group all things RNN

#491 opened Dec 6, 2021 by caimingxue

8 tasks

A question: LSTM + PPO question

Further information is requested

RNN

Temporary label to group all things RNN

#498 opened Dec 29, 2021 by tesla-cat

RNN for continuous CQL algorithm enhancement

Feature that is not a new algorithm or an algorithm enhancement

RNN

Temporary label to group all things RNN

#513 opened Jan 21, 2022 by BFAnas

5 of 8 tasks

What paper or reference is the RNN implementation trying to replicate? bug

Something isn't working

RNN

Temporary label to group all things RNN

#567 opened Mar 11, 2022 by BFAnas

5 of 8 tasks

How to support multi-agent reinforcement learning discussion

Discussion of a typical issue

good first issue

Good for newcomers

MARL

Temporary label to group all things MARL

#121 opened Jul 9, 2020 by youkaichao

3 of 8 tasks

Implementation design issues in SubprocVectorEnv discussion

Discussion of a typical issue

enhancement

Feature that is not a new algorithm or an algorithm enhancement

refactoring

No change to functionality

#573 opened Mar 19, 2022 by duburcqa

Improve discrete control offline RL benchmark enhancement

Feature that is not a new algorithm or an algorithm enhancement

#612 opened Apr 25, 2022 by nuance1979

4 of 8 tasks

Implement Decision Transformer for offline RL blocked

Can't be worked on for now

new algorithm

Adding a new RL algorithm

RNN

Temporary label to group all things RNN

#626 opened May 2, 2022 by nuance1979

4 of 8 tasks

lstm+ppo/sac question

Further information is requested

RNN

Temporary label to group all things RNN

#754 opened Oct 7, 2022 by 1900360

RNN support for TD3 and SAC question

Further information is requested

RNN

Temporary label to group all things RNN

#795 opened Jan 12, 2023 by qtomcatq

Possible Leak of Observations In Multi-Agent Policies MARL

Temporary label to group all things MARL

question

Further information is requested

#806 opened Feb 15, 2023 by uinversion

4 of 8 tasks

Fix handling of torch "device" association bug

Something isn't working

good first issue

Good for newcomers

#810 opened Feb 18, 2023 by jamartinh

3 of 8 tasks

Release 2.0.0

[question] LSTM for A2C with discrete action space question

Further information is requested

RNN

Temporary label to group all things RNN

#814 opened Feb 27, 2023 by cbschen

3 of 6 tasks

Errors with ParallelEnv and AECEnv question

Further information is requested

#816 opened Feb 28, 2023 by Franjrz

5 of 8 tasks

Getting started example causes TypeError: object of type 'TimeLimit' has no len() bug

Something isn't working

not reproduced yet

Not yet tested or reproduced by a reviewer

#819 opened Mar 6, 2023 by SorenSc

4 of 8 tasks

Release 2.0.0

Plotting more metrics for PPO blocked

Can't be worked on for now

enhancement

Feature that is not a new algorithm or an algorithm enhancement

#842 opened Mar 31, 2023 by arvganesh

2 of 8 tasks

Release 2.0.0

MPO Implementation new algorithm

Adding a new RL algorithm

#1165 opened Jul 4, 2024 by ziqiao30

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! Adding no:label will show everything without a label.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly