Skip to main content

Showing 1–3 of 3 results for author: Christodoulou, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:1910.07207  [pdf, other

    cs.LG cs.AI stat.ML

    Soft Actor-Critic for Discrete Action Settings

    Authors: Petros Christodoulou

    Abstract: Soft Actor-Critic is a state-of-the-art reinforcement learning algorithm for continuous action settings that is not applicable to discrete action settings. Many important settings involve discrete actions, however, and so here we derive an alternative version of the Soft Actor-Critic algorithm that is applicable to discrete action settings. We then show that, even without any hyperparameter tuning… ▽ More

    Submitted 18 October, 2019; v1 submitted 16 October, 2019; originally announced October 2019.

  2. arXiv:1910.02876  [pdf, other

    cs.LG cs.AI stat.ML

    Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions

    Authors: Petros Christodoulou, Robert Tjarko Lange, Ali Shafti, A. Aldo Faisal

    Abstract: From a young age humans learn to use grammatical principles to hierarchically combine words into sentences. Action grammars is the parallel idea, that there is an underlying set of rules (a "grammar") that govern how we hierarchically combine actions to form new, more complex actions. We introduce the Action Grammar Reinforcement Learning (AG-RL) framework which leverages the concept of action gra… ▽ More

    Submitted 23 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

  3. arXiv:1706.04026  [pdf, other

    cs.IR cs.LG stat.ML

    Recurrent Latent Variable Networks for Session-Based Recommendation

    Authors: Sotirios Chatzis, Panayiotis Christodoulou, Andreas S. Andreou

    Abstract: In this work, we attempt to ameliorate the impact of data sparsity in the context of session-based recommendation. Specifically, we seek to devise a machine learning mechanism capable of extracting subtle and complex underlying temporal dynamics in the observed session data, so as to inform the recommendation algorithm. To this end, we improve upon systems that utilize deep learning techniques wit… ▽ More

    Submitted 13 June, 2017; originally announced June 2017.