Skip to content
View IouJenLiu's full-sized avatar

Highlights

  • Pro

Block or report IouJenLiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Python 209 24 Updated Aug 23, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,768 2,450 Updated Aug 15, 2024

DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.

Python 448 77 Updated Apr 6, 2024

OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.

Python 61 14 Updated Aug 9, 2022
Python 16 1 Updated May 15, 2022

Google Research

Jupyter Notebook 33,736 7,819 Updated Aug 31, 2024

An index of algorithms for offline reinforcement learning (offline-rl)

898 87 Updated May 23, 2024

PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning

Python 48 19 Updated Jun 7, 2021

A curated list of awesome model based RL resources (continually updated)

857 46 Updated Aug 26, 2024
Jupyter Notebook 5 6 Updated Dec 23, 2017

Question Answering with Interactive Text (QAit), code for EMNLP 2019 paper "Interactive Language Learning by Question Answering"

Python 44 9 Updated Sep 3, 2019

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"

HTML 8,913 1,411 Updated Apr 15, 2023
Python 8 Updated Jan 1, 2022
HTML 44 10 Updated Jul 23, 2021

Simple and easily configurable grid world environments for reinforcement learning

Python 2,071 600 Updated Aug 24, 2024

BabyAI platform. A testbed for training agents to understand and execute language commands.

Python 686 144 Updated Oct 1, 2023

cordial-sync is a software package than can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks”

Python 37 1 Updated Jan 13, 2021

RAD: Reinforcement Learning with Augmented Data

Jupyter Notebook 400 71 Updated Mar 29, 2021

Code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning", Ren et al., NeurIPS'20

Python 25 6 Updated Jan 10, 2021

Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)

Python 190 41 Updated Dec 8, 2022

Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)

Python 40 10 Updated Jun 8, 2023

A learning environment for man-made Interactive Fiction games.

C 257 42 Updated Jun 18, 2024

Code for Weighted QMIX

Python 119 34 Updated Nov 12, 2020
Python 20 3 Updated Dec 22, 2020

The release codes of LA-MCTS with its application to Neural Architecture Search.

Python 456 70 Updated Nov 28, 2022

Non-parametric Entropy Estimation Toolbox

Python 360 88 Updated Oct 5, 2022

Estimators for the entropy and other information theoretic quantities of continuous distributions

Python 126 26 Updated May 14, 2024

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Python 793 146 Updated Nov 29, 2022

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,542 1,186 Updated Jul 25, 2024

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Python 4,205 380 Updated Aug 26, 2024
Next