- Tuebingen, Germany
- sweetice.github.io
Highlights
- Pro
Block or Report
Block or report sweetice
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
Online-RLHF Public
Forked from RLHFlow/Online-RLHFA recipe for online RLHF.
Python UpdatedJun 20, 2024 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedNov 29, 2023 -
-
BEER-ICLR2024 Public
The present anonymous repository serves as a guide for reproducing the results of the "BEER" method proposed in our ICLR submission "Adaptive Regularization of Representation Rank as an Implicit Co…
-
-
-
ColossalAI Public
Forked from hpcaitech/ColossalAIMaking large AI models cheaper, faster and more accessible
Python Apache License 2.0 UpdatedMar 29, 2023 -
dalai_llama Public
Forked from cocktailpeanut/dalaiThe simplest way to run LLaMA on your local machine
CSS UpdatedMar 25, 2023 -
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
-
stanford_alpaca Public
Forked from tatsu-lab/stanford_alpacaCode and documentation to train Stanford's Alpaca models, and generate the data.
Python Apache License 2.0 UpdatedMar 21, 2023 -
llama Public
Forked from meta-llama/llamaInference code for LLaMA models
Python GNU General Public License v3.0 UpdatedMar 15, 2023 -
voltron-robotics Public
Forked from siddk/voltron-roboticsVoltron: Language-Driven Representation Learning for Robotics
Python MIT License UpdatedFeb 27, 2023 -
RWKV-LM Public
Forked from BlinkDL/RWKV-LMRWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, …
Python Apache License 2.0 UpdatedFeb 17, 2023 -
ffn_geyang Public
Forked from geyang/ffnPublic Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"
Python UpdatedJan 11, 2023 -
-
learned-fourier-features Public
Forked from alexlioralexli/learned-fourier-featuresCode for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"
Python UpdatedOct 2, 2022 -
LibMTL Public
Forked from median-research-group/LibMTLA PyTorch Library for Multi-Task Learning
Python MIT License UpdatedSep 3, 2022 -
-
-
reward-surfaces Public
Forked from RyanNavillus/reward-surfacesPython MIT License UpdatedMay 20, 2022 -
drqv2 Public
Forked from facebookresearch/drqv2DrQ-v2: Improved Data-Augmented Reinforcement Learning
Python MIT License UpdatedJul 21, 2021 -
-
TD3_BC Public
Forked from sfujim/TD3_BCAuthor's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
Python MIT License UpdatedJun 16, 2021 -
neural-approx-ss-lfi Public
Forked from cyz-ai/neural-approx-ss-lfiCodes for ICLR 21 paper: Neural Approximate Sufficient Statistics for Implicit Models
Jupyter Notebook UpdatedJun 15, 2021 -
-
mpo Public
Forked from daisatojp/mpoPyTorch Implementation of the Maximum a Posteriori Policy Optimisation
Python GNU General Public License v3.0 UpdatedMay 22, 2021 -
deep-successor-features-for-transfer Public
Forked from mike-gimelfarb/deep-successor-features-for-transferA reusable framework for successor features for transfer in deep reinforcement learning using keras.
Python Other UpdatedMay 11, 2021 -
pderl Public
Forked from crisbodnar/pderlCode for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020
Python UpdatedFeb 24, 2021 -
tqc_pytorch_1epo Public
Forked from SamsungLabs/tqc_pytorchImplementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/
Python MIT License UpdatedFeb 16, 2021 -
gulf Public
Forked from riejohnson/gulfGULF: GUided Learning through successive Functional gradient optimization (author implementation of DPCNN included)
Python MIT License UpdatedJan 29, 2021