-
Trinity College Dublin
- Dublin, Ireland
-
21:35
(UTC -12:00)
Block or Report
Block or report SSubhnil
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
RIA_base Public
RIA base version. With new Walker environment similar to DM Control Suite physics and reward function.
Python UpdatedJul 2, 2024 -
CDL-bench Public
Forked from wangzizhao/CausalDynamicsLearningBenchmarking CDL in confounded MDP and POMDP settings
Python UpdatedJul 2, 2024 -
RIA-bench Public
Forked from CR-Gjx/RIABenchmarking RIA in confounded environments for zero and few-shot generalization. Now compatible with TF2.
Python UpdatedJul 2, 2024 -
sac-bench Public
Forked from denisyarats/pytorch_sacPyTorch implementation of Soft Actor-Critic (SAC) for Unobserved Confounders
Jupyter Notebook MIT License UpdatedJun 22, 2024 -
CoGen_Benchmarking Public
Benchmarking existing RL algorithms including model-free and model-based approaches on confounded versions of popular environments. Tests generalization and sample efficiency.
-
dreamer-new Public
Updated version of DreamerV3 cloned from danijar/dreamerv3
Python MIT License UpdatedJun 19, 2024 -
mamba-test Public
Forked from zoharri/mambaMeta-RL Model-Based Algorithm - Confounding tests
Python Other UpdatedJun 17, 2024 -
dreamerv3-benchmod Public
Forked from danijar/dreamerv3Modifying DreamerV3 for benchmarking in confounded environments
Python MIT License UpdatedJun 7, 2024 -
D4PG-bench Public
Forked from msinto93/D4PGBenchmarking D4PG in confounded environements.
Python MIT License UpdatedJun 7, 2024 -
FCD-bench Public
Forked from iwhwang/Fine-Grained-Causal-RLFine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)
Python MIT License UpdatedJun 5, 2024 -
mpo-bench Public
Forked from daisatojp/mpoBaseline tests on MPO with unobserved confounders
Python GNU General Public License v3.0 UpdatedMay 29, 2024 -
GRADER-bench Public
Forked from GilgameshD/GRADERRepository for benchmarking GRADER in confounded environments for zero and few-shot generalization.
Python MIT License UpdatedMay 22, 2024 -
P2P-bench Public
Forked from ZifanWu/Plan-to-PredictCode accompanying paper "Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning".
Python UpdatedDec 21, 2023 -
Causal-Gridworld Public
Testing the causal implications of the wind in the gridworld environment. The wind is the confounder.
Python UpdatedDec 6, 2023 -
MWM-bench Public
Forked from younggyoseo/MWMBenchmarking MWM in confounded environments
Python Other UpdatedJun 11, 2023 -
rl2-bench Public
Forked from lucaslingle/pytorch_rl2Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
Python UpdatedJan 1, 2022 -
-
BAC-DAC-gym Public
Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.
-
CausalBench Public
Forked from dido1998/CausalMBRLOfficial data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
Python MIT License UpdatedJul 15, 2021 -
Vehicle-Dynamics-Toolkit Public
Some advanced tools for race car design - Steady state and transient dynamics, Tyre Data synthesis
-
RacingCARLA Public
Learning Model Predictive Control (LMPC) for autonomous racing in CARLA 3D environment.
-
TMCL-b Public
Forked from younggyoseo/trajectory_mclTrajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
Python UpdatedOct 27, 2020 -
slac-bench Public
Forked from alexlee-gk/slacStochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Python MIT License UpdatedOct 26, 2020 -
Lane-Lines-Detection-Python-OpenCV Public
Forked from tatsuyah/Lane-Lines-Detection-Python-OpenCVLane Lines Detection using Python and OpenCV for self-driving car
Jupyter Notebook UpdatedOct 12, 2017