Highlights
- Pro
Stars
Python implementations of contextual bandits algorithms
A tensorflow implementation for Deep clustering: Discriminative embeddings for segmentation and separation
deep clustering method for single-channel speech separation
chaodengusc / DeWave
Forked from zhr1201/deep-clusteringSingle-channel blind source separation
Additive factors model and additive factors model with slip implemented in python
Code exploring the use of reward machines in the context of cooperative multi-agent reinforcement learning.
Case-Study on Reinforcement Learning for Intralogistics
Environments from the papers "Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning" and "Induction and Exploitation of Subgoal Automata for Reinforcem…
just to reimplement a paper called " Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning"