cordial-sync is a software package than can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks”

Python 37 1 Updated Jan 13, 2021

MishaLaskin / rad

RAD: Reinforcement Learning with Augmented Data

Jupyter Notebook 400 71 Updated Mar 29, 2021

jason718 / semi-sup

Code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning", Ren et al., NeurIPS'20

Python 25 6 Updated Jan 10, 2021

ryanchankh / mcr2

Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)

Python 190 41 Updated Dec 8, 2022

google-research / pisac

Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)

Python 40 10 Updated Jun 8, 2023

microsoft / jericho

A learning environment for man-made Interactive Fiction games.

C 257 42 Updated Jun 18, 2024

oxwhirl / wqmix

Code for Weighted QMIX

Python 119 34 Updated Nov 12, 2020

IouJenLiu / HTS-RL

Python 20 3 Updated Dec 22, 2020

facebookresearch / LaMCTS

The release codes of LA-MCTS with its application to Neural Architecture Search.

Python 456 70 Updated Nov 28, 2022

gregversteeg / NPEET

Non-parametric Entropy Estimation Toolbox

Python 360 88 Updated Oct 5, 2022

paulbrodersen / entropy_estimators

Estimators for the entropy and other information theoretic quantities of continuous distributions

Python 126 26 Updated May 14, 2024

google-research / seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Python 793 146 Updated Nov 29, 2022

p-christ / Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,542 1,186 Updated Jul 25, 2024

IDSIA / sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 4,205 380 Updated Aug 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Iou-Jen Liu IouJenLiu

Achievements

Achievements

Highlights

Block or report IouJenLiu

Stars

flowersteam / Grounding_LLMs_with_online_RL

karpathy / minGPT

google-deepmind / dqn_zoo

clvoloshin / COBS

IouJenLiu / AFK

google-research / google-research

hanjuku-kaso / awesome-offline-rl

IouJenLiu / PIC

opendilab / awesome-model-based-RL

carbonati / machine-learning

xingdi-eric-yuan / qait_public

chiphuyen / machine-learning-systems-design

allenai / advisor

IouJenLiu / CMAE

Farama-Foundation / Minigrid

mila-iqia / babyai

allenai / cordial-sync