thu-ml / tianshou Public

Notifications You must be signed in to change notification settings
Fork 1.1k
Star 7.8k

Code
Issues 134
Pull requests 3
Discussions
Actions
Projects 1
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Issues: thu-ml/tianshou

Towards Roadmap

#1215 opened Sep 8, 2024 by MischaPanch

Open 11

Labels 31 Milestones 1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clear current search query, filters, and sorts

134 Open 609 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

How to support multi-agent reinforcement learning discussion

Discussion of a typical issue

good first issue

Good for newcomers

MARL

Temporary label to group all things MARL

#121 opened Jul 9, 2020 by youkaichao

3 of 8 tasks

RNN for continuous CQL algorithm enhancement

Feature that is not a new algorithm or an algorithm enhancement

RNN

Temporary label to group all things RNN

#513 opened Jan 21, 2022 by BFAnas

5 of 8 tasks

Suggestion - Redesign RayEnvWorker for Improved Performance performance issues

Slow execution or poor-quality results

#1172 opened Jul 10, 2024 by destin-v

4 tasks done

Implement Decision Transformer for offline RL blocked

Can't be worked on for now

new algorithm

Adding a new RL algorithm

RNN

Temporary label to group all things RNN

#626 opened May 2, 2022 by nuance1979

4 of 8 tasks

How to successfully run a demo build/test documentation question

Further information is requested

#1015 opened Dec 28, 2023 by zhiyu2020

RNN related issues bug

Something isn't working

help wanted

Extra attention is needed

RNN

Temporary label to group all things RNN

#937 opened Sep 7, 2023 by MischaPanch

Towards Roadmap major

Large changes that cannot or should not be broken down into smaller ones

#1215 opened Sep 8, 2024 by MischaPanch Release 2.0.0

Episode start signal not used in RNN for on-policy algorithms bug

Something isn't working

RNN

Temporary label to group all things RNN

#486 opened Nov 29, 2021 by araffin

4 of 8 tasks

Collector sampling with multiple environment does not seem to be unbiased with n_episodes algorithm enhancement

Not quite a new algorithm, but an enhancement to algo. functionality

question

Further information is requested

#1042 opened Feb 2, 2024 by utkarshp

5 of 9 tasks

Improve discrete control offline RL benchmark enhancement

Feature that is not a new algorithm or an algorithm enhancement

#612 opened Apr 25, 2022 by nuance1979

4 of 8 tasks

Poetry update the torch versioned from cuda (2.0.1+cu118) to cpu (2.1.1) defaultly on Windows build/test

#1145 opened May 11, 2024 by coolermzb3

6 of 9 tasks

Batch: don't just set 0 when elements have None entries Batch and Buffer

Improvements in internal data structures, temporary label

bug

Something isn't working

#1088 opened Apr 3, 2024 by MischaPanch

compute_episodic_return bug when v_s=None performance issues

Slow execution or poor-quality results

refactoring

No change to functionality

RNN

Temporary label to group all things RNN

#886 opened Jun 8, 2023 by spacegoing Release 2.0.0

Buffer: fix discrepancy in slicing order Batch and Buffer

Improvements in internal data structures, temporary label

breaking changes

Changes in public interfaces. Includes small changes or changes in keys

refactoring

No change to functionality

#1090 opened Apr 3, 2024 by MischaPanch

Support Dict observation spaces documentation enhancement

Feature that is not a new algorithm or an algorithm enhancement

good first issue

Good for newcomers

tentative

Up to discussion, may be dismissed

#1065 opened Feb 26, 2024 by MischaPanch

Errors with ParallelEnv and AECEnv question

Further information is requested

#816 opened Feb 28, 2023 by Franjrz

5 of 8 tasks

Fix handling of torch "device" association bug

Something isn't working

good first issue

Good for newcomers

#810 opened Feb 18, 2023 by jamartinh

3 of 8 tasks

Release 2.0.0

MultiAgentPolicyManager misses rewards enhancement

Feature that is not a new algorithm or an algorithm enhancement

MARL

Temporary label to group all things MARL

#399 opened Jul 14, 2021 by benblack769

4 of 8 tasks

How can I make action sampling within the range specified by my environment when using onpolicy_trainer? question

Further information is requested

#1142 opened May 9, 2024 by lidaken

Regarding the error related to SEED when I train in a homebrew environment question

Further information is requested

#1039 opened Jan 31, 2024 by iamysy

Docs and examples on how to report performance issues documentation enhancement

Feature that is not a new algorithm or an algorithm enhancement

#936 opened Sep 7, 2023 by MischaPanch Release 2.0.0

Possible Leak of Observations In Multi-Agent Policies MARL

Temporary label to group all things MARL

question

Further information is requested

#806 opened Feb 15, 2023 by uinversion

4 of 8 tasks

Some questions in recurrent-style SAC question

Further information is requested

RNN

Temporary label to group all things RNN

#470 opened Oct 25, 2021 by chocolate616

Batch: don't just strip off empty entries when creating batches Batch and Buffer

Improvements in internal data structures, temporary label

bug

Something isn't working

#1089 opened Apr 3, 2024 by MischaPanch

Use RNN in MARL MARL

Temporary label to group all things MARL

question

Further information is requested

RNN

Temporary label to group all things RNN

#965 opened Oct 13, 2023 by zhangwenjun1229

1 of 8 tasks

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly