- Granada
- @manjavacas_
- https://bit.ly/3oWGNzt
Highlights
- Pro
Block or Report
Block or report manjavacas
Contact GitHub support about this userβs behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Temario sobre aprendizaje por refuerzo en espaΓ±ol / Syllabus on reinforcement learning in Spanish.
Collection of domains, learners, strategies, and other tools related to reinforcement learning.
An open source playground energy storage environment to explore reinforcement learning and model predictive control.
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments
A directory and analysis of the open source ecosystem in the areas of climate change, sustainable energy, biodiversity and natural resources.
A package for creating slides in Typst
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
A tokamak (nuclear fusion reactor) simulator with LSTM-based neural network (KSTAR-NN)
Repositorio para almacenar las diapositivas y los materiales de las Charlas
A game theoretic approach to explain the output of any machine learning model.
More powerful and customizable tables in Typst
Meta reinforcement learning benchmark with building control environments. An attempt to help scale greener building controllers to more buildings.
PyTorch Implementation of REINFORCE for both discrete & continuous control
Hands-on tutorial about Meta RL and GP-MPC at the RL4AA'24 workshop.
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Single-file pytorch implementation of hybrid-SAC
Bluemira is an integrated inter-disciplinary design tool for future fusion reactors. It incorporates several modules, some of which rely on other codes, to carry out a range of typical conceptual fβ¦