Skip to content
View manila95's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@K-DAG
Block or Report

Block or report manila95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Animation engine for explanatory math videos

Python 60,839 5,717 Updated Jun 24, 2024

Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Meta-RL.

Python 227 48 Updated Sep 30, 2022

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,426 1,366 Updated May 6, 2024

Repo to reproduce the First-Explore paper results

Jupyter Notebook 35 2 Updated Jul 24, 2023

TorchOpt is an efficient library for differentiable optimization built upon PyTorch.

Python 515 35 Updated Jul 2, 2024
Python 59 11 Updated Jun 22, 2018

code release for URDFormer

Python 69 4 Updated May 21, 2024

🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Python 4,481 375 Updated Jul 19, 2024

Evolutionary Computation: A Modern Perspective ---> This is a free online book, which is actively updated now (from 2023 to 2027).

41 7 Updated Jul 16, 2024

Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)

Python 10 1 Updated May 22, 2023

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 10,936 975 Updated Jul 21, 2024

PyTorch implementation of Risk-Averse Policy Learning

Python 6 Updated Aug 25, 2023
Python 35 6 Updated Nov 23, 2021

Real-World RL Benchmark Suite

Python 339 29 Updated Aug 11, 2020

ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Manipulation.

Python 15 2 Updated May 2, 2023

Code for Rapid Locomotion via Reinforcement Learning

Python 150 36 Updated Jul 20, 2023

Open-source reinforcement learning environment for autonomous racing — featured as a conference paper at ICCV 2021 and as the official challenge tracks at both SL4AD@ICML2022 and AI4AD@IJCAI2022. T…

Python 139 14 Updated Dec 20, 2023

Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.

Jupyter Notebook 70 14 Updated Oct 25, 2020

Awesome Open-ended AI

161 18 Updated Jul 2, 2024

A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing

Python 133 28 Updated Apr 9, 2024

A simple probabilistic programming language.

Jupyter Notebook 669 74 Updated Jul 18, 2024

Multi-Objective Reinforcement Learning algorithms implementations.

Python 255 37 Updated Jul 16, 2024

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,747 244 Updated May 3, 2024

Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method

Python 63 14 Updated Mar 24, 2023

Source code for our paper "Sim-to-real reinforcement learning applied to end-to-end vehicle control"

Python 21 8 Updated Dec 9, 2021

Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".

Python 25 6 Updated Jul 6, 2023

A goal-driven autonomous exploration through deep reinforcement learning (ICRA 2022) system that combines reactive and planned robot navigation in unknown environments

Python 110 15 Updated Feb 5, 2022

Our version of #Exploration: A Study of Count-Based Explorationfor Deep Reinforcement Learning for a class project

Jupyter Notebook 14 2 Updated Apr 30, 2021

RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven explora…

Jupyter Notebook 335 16 Updated Jun 2, 2024

Implementations of SAILR, PDO, and CSC

Python 29 8 Updated Jul 15, 2024
Next