HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.

Python 281 42 Updated Apr 26, 2024

helblazer811 / ManimML

ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Library.

Python 2,397 144 Updated Jun 22, 2024

LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,057 3,234 Updated Aug 17, 2024

google-research / rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Jupyter Notebook 768 47 Updated Aug 12, 2024

google-research / reincarnating_rl

[NeurIPS 2022] Open source code for reusing prior computational work in RL.

Python 91 12 Updated Jul 5, 2023

cyanrain7 / TRPO-in-MARL

Python 184 49 Updated Jun 4, 2023

google / latexify_py

A library to generate LaTeX expressions from Python code.

Python 7,244 385 Updated May 13, 2024

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

27,191 2,258 Updated Jun 18, 2024

TingFree / NLPer-Arsenal

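A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current competitions, past competitions, and practice competitions), NLP conference dates, commonly followed self-media accounts, GPU recommendations, and more; continuously updated.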

Python 2,177 251 Updated Aug 29, 2023

qihuazhong / multi-echelon-drl

Solving Multi-Echelon Inventory Management Problems with Heuristic-Guided Deep Reinforcement Learning

Python 5 3 Updated Mar 3, 2024

qihuazhong / kore-2022

This is the 4th place winning solution for the Kore 2022 simulation, a coding competition sponsored by Google.

Python 1 Updated Jan 22, 2023

hugo-toha / toha

A Hugo theme for personal portfolio

HTML 1,044 600 Updated Oct 24, 2024

openchainxyz / ethereum-transaction-viewer-frontend

Frontend for https://tx.eth.samczsun.com/

TypeScript 391 67 Updated Dec 30, 2022

georgemuriithi / investment-portfolio-optim

An investment portfolio of stocks is created using Long Short-Term Memory (LSTM) stock price prediction and optimized weights. The performance of this portfolio is better compared to an equally wei…

Jupyter Notebook 33 3 Updated Jan 18, 2024

seungkee / google_landmark_retrieval_2020_1st_place_solution

Jupyter Notebook 52 7 Updated Oct 5, 2020

jmerle / koreye-2022

Kore 2022 episode visualizer

TypeScript 9 Updated May 29, 2022

ContinuumIO / gtc2019-numba

Numba tutorial for GTC2019

Jupyter Notebook 134 42 Updated Jul 31, 2023

ContinuumIO / gtc2018-numba

Numba tutorial for GTC 2018

Jupyter Notebook 114 34 Updated Jul 31, 2023

srush / GPU-Puzzles

Solve puzzles. Learn CUDA.

Jupyter Notebook 9,872 850 Updated Sep 1, 2024

Farama-Foundation / PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python 2,619 414 Updated Sep 3, 2024

DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,075 514 Updated Nov 5, 2024

Louiii / ValueDecomposition

Value-Decomposition Networks For Cooperative Multi-Agent Learning

Jupyter Notebook 21 5 Updated Apr 14, 2021

Theohhhu / UPDeT

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021 (spotlight)

Python 130 17 Updated Feb 3, 2021

Qihua qihuazhong

Stars

Dralliag / opera-python

neonwatty / machine_learning_refined

YyzHarry / imbalanced-regression

opendilab / DI-engine

shibing624 / text2vec

LarrySnyder / stockpyl

uoe-agents / epymarl

DeNA / HandyRL