Showing 1–3 of 3 results for author: Belogolovsky, S

  1. arXiv:2306.14020  [pdf, other]

    cs.LG

    Individualized Dosing Dynamics via Neural Eigen Decomposition

    Authors: Stav Belogolovsky, Ido Greenberg, Danny Eytan, Shie Mannor

    Abstract: Dosing models often use differential equations to model biological dynamics. Neural differential equations in particular can learn to predict the derivative of a process, which permits predictions at irregular points of time. However, this temporal flexibility often comes with a high sensitivity to noise, whereas medical problems often present high noise and limited data. Moreover, medical dosing…

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2202.00117

  2. arXiv:2202.00117  [pdf, other]

    cs.LG eess.SY

    Continuous Forecasting via Neural Eigen Decomposition

    Authors: Stav Belogolovsky, Ido Greenberg, Danny Eitan, Shie Mannor

    Abstract: Neural differential equations predict the derivative of a stochastic process. This allows irregular forecasting with arbitrary time-steps. However, the expressive temporal flexibility often comes with a high sensitivity to noise. In addition, current methods model measurements and control together, limiting generalization to different control policies. These properties severely limit applicability…

    Submitted 4 February, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

  3. arXiv:1905.09710  [pdf, other]

    cs.LG stat.ML

    Inverse Reinforcement Learning in Contextual MDPs

    Authors: Stav Belogolovsky, Philip Korsunsky, Shie Mannor, Chen Tessler, Tom Zahavy

    Abstract: We consider the task of Inverse Reinforcement Learning in Contextual Markov Decision Processes (MDPs). In this setting, contexts, which define the reward and transition kernel, are sampled from a distribution. In addition, although the reward is a function of the context, it is not provided to the agent. Instead, the agent observes demonstrations from an optimal policy. The goal is to learn the re…

    Submitted 30 December, 2020; v1 submitted 23 May, 2019; originally announced May 2019.