jens321

Code for the paper "Learning to Assist Humans without Inferring Rewards"

Python 1 Updated Jul 7, 2024
Python 42 6 Updated May 11, 2022
Jupyter Notebook 34 4 Updated Sep 27, 2024
Python 24 3 Updated Nov 13, 2023

Foundation Policies with Hilbert Representations (ICML 2024)

Python 67 5 Updated Apr 14, 2024

Fast and memory-efficient exact attention

Python 13,605 1,246 Updated Oct 1, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

Python 13,384 1,321 Updated Sep 30, 2024
Python 48 1 Updated May 29, 2024

Simplifying reinforcement learning for complex game environments

Python 1,083 43 Updated Oct 1, 2024

RL Environments in JAX 🌍

Python 611 61 Updated Jul 4, 2024

Really Fast End-to-End Jax RL Implementations

Python 679 56 Updated Sep 9, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,398 615 Updated Sep 24, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 785 46 Updated Aug 21, 2024

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Inform 7 42 1 Updated Jun 7, 2024

Official repository of the xLSTM.

Python 1,266 92 Updated Sep 7, 2024

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 296 41 Updated Aug 22, 2024

A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!

Python 50 4 Updated Sep 9, 2024

The NetHack Learning Environment

C 42 8 Updated Sep 18, 2024

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Python 190 18 Updated Sep 3, 2024

[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)

Python 18 2 Updated Aug 20, 2024
Python 26 4 Updated Apr 12, 2024

Official code repo of "Scaling Laws for Imitation Learning in Single-Agent Games"

Python 5 Updated Aug 14, 2024

High throughput synchronous and asynchronous reinforcement learning

Python 4 Updated Sep 30, 2024

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 287 55 Updated Aug 12, 2024

We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

Python 17 3 Updated Feb 10, 2024

HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)

Python 71 6 Updated Nov 21, 2023

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Python 38 2 Updated Aug 22, 2023

🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

Python 267 22 Updated Apr 7, 2023

The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Fo…

Python 55 4 Updated Jul 17, 2023