jens321

Jens Tuyls jens321

CS PhD Student @ Princeton NLP.

16 followers · 45 following

Achievements

Highlights

Stars

vivekmyers / empowerment_successor_representations

Code for the paper "Learning to Assist Humans without Inferring Rewards"

Python 1 Updated Jul 7, 2024

orybkin / lexa-benchmark

Python 42 6 Updated May 11, 2022

MichalBortkiewicz / JaxGCRL

Jupyter Notebook 34 4 Updated Sep 27, 2024

chongyi-zheng / td_infonce

Python 24 3 Updated Nov 13, 2023

seohongpark / HILP

Foundation Policies with Hilbert Representations (ICML 2024)

Python 67 5 Updated Apr 14, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 13,605 1,246 Updated Oct 1, 2024

princeton-nlp / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

Python 13,384 1,321 Updated Sep 30, 2024

epfml / schedules-and-scaling

Python 48 1 Updated May 29, 2024

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

Python 1,083 43 Updated Oct 1, 2024

RobertTLange / gymnax

RL Environments in JAX 🌍

Python 611 61 Updated Jul 4, 2024

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 679 56 Updated Sep 9, 2024

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 5,398 615 Updated Sep 24, 2024

microsoft / Samba

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 785 46 Updated Aug 21, 2024

conglu1997 / intelligent-go-explore

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Inform 7 42 1 Updated Jun 7, 2024

NX-AI / xlstm

Official repository of the xLSTM.

Python 1,266 92 Updated Sep 7, 2024

twni2016 / pomdp-baselines

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 296 41 Updated Aug 22, 2024

klieret / wandb-offline-sync-hook

A convenient way to trigger synchronizations to wandb / Weights & Biases if your compute nodes don't have internet!

Python 50 4 Updated Sep 9, 2024

heiner / nle

Forked from facebookresearch/nle

The NetHack Learning Environment

C 42 8 Updated Sep 18, 2024

MichaelTMatthews / Craftax_Baselines

Python 14 1 Updated Jul 21, 2024

MichaelTMatthews / Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

Python 190 18 Updated Sep 3, 2024

upiterbarg / diff_history

[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)

Python 18 2 Updated Aug 20, 2024

google-deepmind / spectral_ssm

Python 26 4 Updated Apr 12, 2024

princeton-nlp / il-scaling-in-games

Official code repo of "Scaling Laws for Imitation Learning in Single-Agent Games"

Python 5 Updated Aug 14, 2024

BartekCupial / sample-factory

Forked from alex-petrenko/sample-factory

High throughput synchronous and asynchronous reinforcement learning

Python 4 Updated Sep 30, 2024

Dao-AILab / causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 287 55 Updated Aug 12, 2024

princeton-nlp / lwm

We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effectively control these agents through verbal communication.

Python 17 3 Updated Feb 10, 2024

seohongpark / HIQL

HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)

Python 71 6 Updated Nov 21, 2023

corl-team / katakomba

Forked from tinkoff-ai/katakomba

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Python 38 2 Updated Aug 22, 2023

allenai / procthor

🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

Python 267 22 Updated Apr 7, 2023

facebookresearch / controllable_agent

The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Fo…

Python 55 4 Updated Jul 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jens Tuyls jens321

Achievements

Achievements

Highlights

Block or report jens321

Stars

vivekmyers / empowerment_successor_representations

orybkin / lexa-benchmark

MichalBortkiewicz / JaxGCRL

chongyi-zheng / td_infonce

seohongpark / HILP

Dao-AILab / flash-attention

princeton-nlp / SWE-agent

epfml / schedules-and-scaling

PufferAI / PufferLib

RobertTLange / gymnax

luchris429 / purejaxrl

vwxyzjn / cleanrl

microsoft / Samba

conglu1997 / intelligent-go-explore

NX-AI / xlstm

twni2016 / pomdp-baselines

klieret / wandb-offline-sync-hook

heiner / nle

MichaelTMatthews / Craftax_Baselines

MichaelTMatthews / Craftax

upiterbarg / diff_history

google-deepmind / spectral_ssm

princeton-nlp / il-scaling-in-games

BartekCupial / sample-factory

Dao-AILab / causal-conv1d

princeton-nlp / lwm

seohongpark / HIQL

corl-team / katakomba

allenai / procthor

facebookresearch / controllable_agent