Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Resources
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

SSubhnil Follow

Overview Repositories 24 Projects 0 Packages 0 Stars 55

More

Overview
Repositories
Projects
Packages
Stars

SSubhnil

Follow

🎯

Focusing

Shubham Subhnil SSubhnil

🎯

Focusing

Follow

PhD candidate at Trinity College Dublin, Ireland. I work on RL, causality, latent variables and multi-agent systems.

3 followers · 2 following

Trinity College Dublin
Dublin, Ireland
21:35 (UTC -12:00)

Block or Report

Block or report SSubhnil

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Overview Repositories 24 Projects 0 Packages 0 Stars 55

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All Python Jupyter Notebook MATLAB

Sort Last updated

Select order

Last updated Name Stars

RIA_base Public

RIA base version. With new Walker environment similar to DM Control Suite physics and reward function.

Python Updated Jul 2, 2024
CDL-bench Public
Forked from wangzizhao/CausalDynamicsLearning

Benchmarking CDL in confounded MDP and POMDP settings

Python Updated Jul 2, 2024
RIA-bench Public
Forked from CR-Gjx/RIA

Benchmarking RIA in confounded environments for zero and few-shot generalization. Now compatible with TF2.

Python Updated Jul 2, 2024
sac-bench Public
Forked from denisyarats/pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC) for Unobserved Confounders

Jupyter Notebook MIT License Updated Jun 22, 2024
CoGen_Benchmarking Public

Benchmarking existing RL algorithms including model-free and model-based approaches on confounded versions of popular environments. Tests generalization and sample efficiency.

Python 1 Apache License 2.0 Updated Jun 20, 2024
dreamer-new Public

Updated version of DreamerV3 cloned from danijar/dreamerv3

Python MIT License Updated Jun 19, 2024
mamba-test Public
Forked from zoharri/mamba

Meta-RL Model-Based Algorithm - Confounding tests

Python Other Updated Jun 17, 2024
dreamerv3-benchmod Public
Forked from danijar/dreamerv3

Modifying DreamerV3 for benchmarking in confounded environments

Python MIT License Updated Jun 7, 2024
D4PG-bench Public
Forked from msinto93/D4PG

Benchmarking D4PG in confounded environements.

Python MIT License Updated Jun 7, 2024
FCD-bench Public
Forked from iwhwang/Fine-Grained-Causal-RL

Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)

Python MIT License Updated Jun 5, 2024
mpo-bench Public
Forked from daisatojp/mpo

Baseline tests on MPO with unobserved confounders

Python GNU General Public License v3.0 Updated May 29, 2024
GRADER-bench Public
Forked from GilgameshD/GRADER

Repository for benchmarking GRADER in confounded environments for zero and few-shot generalization.

Python MIT License Updated May 22, 2024
P2P-bench Public
Forked from ZifanWu/Plan-to-Predict

Code accompanying paper "Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning".

Python Updated Dec 21, 2023
Causal-Gridworld Public

Testing the causal implications of the wind in the gridworld environment. The wind is the confounder.

Python Updated Dec 6, 2023
MWM-bench Public
Forked from younggyoseo/MWM

Benchmarking MWM in confounded environments

Python Other Updated Jun 11, 2023
rl2-bench Public
Forked from lucaslingle/pytorch_rl2

Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'

Python Updated Jan 1, 2022
RacingLMPC Public

Python 1 1 MIT License Updated Oct 20, 2021
BAC-DAC-gym Public

Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.

reinforcement-learning gym bayesian-optimization bayesian-machine-learning continuous-control actor-critic model-free-rl

Python 7 1 GNU General Public License v3.0 Updated Aug 14, 2021
CausalBench Public
Forked from dido1998/CausalMBRL

Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning

Python MIT License Updated Jul 15, 2021
Vehicle-Dynamics-Toolkit Public

Some advanced tools for race car design - Steady state and transient dynamics, Tyre Data synthesis

MATLAB 1 1 Apache License 2.0 Updated Jun 14, 2021
RacingCARLA Public

Learning Model Predictive Control (LMPC) for autonomous racing in CARLA 3D environment.

reinforcement-learning computer-vision racing radar artificial-intelligence lidar self-driving-car

Python 22 7 MIT License Updated May 25, 2021
TMCL-b Public
Forked from younggyoseo/trajectory_mcl

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)

Python Updated Oct 27, 2020
slac-bench Public
Forked from alexlee-gk/slac

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Python MIT License Updated Oct 26, 2020
Lane-Lines-Detection-Python-OpenCV Public
Forked from tatsuyah/Lane-Lines-Detection-Python-OpenCV

Lane Lines Detection using Python and OpenCV for self-driving car

Jupyter Notebook Updated Oct 12, 2017

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.