Skip to main content

Showing 1–29 of 29 results for author: Novikov, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.14396  [pdf, other

    quant-ph cs.LG

    Quantum Circuit Optimization with AlphaTensor

    Authors: Francisco J. R. Ruiz, Tuomas Laakkonen, Johannes Bausch, Matej Balog, Mohammadamin Barekatain, Francisco J. H. Heras, Alexander Novikov, Nathan Fitzpatrick, Bernardino Romera-Paredes, John van de Wetering, Alhussein Fawzi, Konstantinos Meichanetzidis, Pushmeet Kohli

    Abstract: A key challenge in realizing fault-tolerant quantum computers is circuit optimization. Focusing on the most expensive gates in fault-tolerant quantum computation (namely, the T gates), we address the problem of T-count optimization, i.e., minimizing the number of T gates that are needed to implement a given circuit. To achieve this, we develop AlphaTensor-Quantum, a method based on deep reinforcem… ▽ More

    Submitted 5 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 25 pages main paper + 19 pages appendix

  2. arXiv:2310.12990  [pdf, other

    cs.CV cs.LG eess.SP math.OC

    Wave-informed dictionary learning for high-resolution imaging in complex media

    Authors: Miguel Moscoso, Alexei Novikov, George Papanicolaou, Chrysoula Tsogka

    Abstract: We propose an approach for imaging in scattering media when large and diverse data sets are available. It has two steps. Using a dictionary learning algorithm the first step estimates the true Green's function vectors as columns in an unordered sensing matrix. The array data comes from many sparse sets of sources whose location and strength are not known to us. In the second step, the columns of t… ▽ More

    Submitted 21 September, 2023; originally announced October 2023.

  3. arXiv:2210.10855  [pdf, other

    cs.LG eess.SP math.PR stat.ML

    Dictionary Learning for the Almost-Linear Sparsity Regime

    Authors: Alexei Novikov, Stephen White

    Abstract: Dictionary learning, the problem of recovering a sparsely used matrix $\mathbf{D} \in \mathbb{R}^{M \times K}$ and $N$ $s$-sparse vectors $\mathbf{x}_i \in \mathbb{R}^{K}$ from samples of the form $\mathbf{y}_i = \mathbf{D}\mathbf{x}_i$, is of increasing importance to applications in signal processing and data science. When the dictionary is known, recovery of $\mathbf{x}_i$ is possible even for s… ▽ More

    Submitted 27 March, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  4. arXiv:2205.06175  [pdf, other

    cs.AI cs.CL cs.LG cs.RO

    A Generalist Agent

    Authors: Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, Tom Eccles, Jake Bruce, Ali Razavi, Ashley Edwards, Nicolas Heess, Yutian Chen, Raia Hadsell, Oriol Vinyals, Mahyar Bordbar, Nando de Freitas

    Abstract: Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, dec… ▽ More

    Submitted 11 November, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: Published at TMLR, 42 pages

    Journal ref: Transactions on Machine Learning Research, 11/2022, https://openreview.net/forum?id=1ikK0kHjvj

  5. arXiv:2103.16596  [pdf, other

    cs.LG stat.ML

    Benchmarks for Deep Off-Policy Evaluation

    Authors: Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine

    Abstract: Off-policy evaluation (OPE) holds the promise of being able to leverage large, offline datasets for both evaluating and selecting complex policies for decision making. The ability to learn offline is particularly important in many real-world domains, such as in healthcare, recommender systems, or robotics, where online data collection is an expensive and potentially dangerous process. Being able t… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: ICLR 2021 paper. Policies and evaluation code are available at https://github.com/google-research/deep_ope

  6. arXiv:2103.14974  [pdf, other

    math.OC cs.LG cs.MS math.NA

    Automatic differentiation for Riemannian optimization on low-rank matrix and tensor-train manifolds

    Authors: Alexander Novikov, Maxim Rakhuba, Ivan Oseledets

    Abstract: In scientific computing and machine learning applications, matrices and more general multidimensional arrays (tensors) can often be approximated with the help of low-rank decompositions. Since matrices and tensors of fixed rank form smooth Riemannian manifolds, one of the popular tools for finding low-rank approximations is to use Riemannian optimization. Nevertheless, efficient implementation of… ▽ More

    Submitted 23 October, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

  7. arXiv:2103.11893  [pdf, ps, other

    eess.SP cs.IT math.PR

    Thresholding Greedy Pursuit for Sparse Recovery Problems

    Authors: Hai Le, Alexei Novikov

    Abstract: We study here sparse recovery problems in the presence of additive noise. We analyze a thresholding version of the CoSaMP algorithm, named Thresholding Greedy Pursuit (TGP). We demonstrate that an appropriate choice of thresholding parameter, even without the knowledge of sparsity level of the signal and strength of the noise, can result in exact recovery with no false discoveries as the dimension… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

    Comments: First version

  8. arXiv:2012.06899  [pdf, other

    cs.LG cs.AI cs.RO

    Semi-supervised reward learning for offline reinforcement learning

    Authors: Ksenia Konyushkova, Konrad Zolna, Yusuf Aytar, Alexander Novikov, Scott Reed, Serkan Cabi, Nando de Freitas

    Abstract: In offline reinforcement learning (RL) agents are trained using a logged dataset. It appears to be the most natural route to attack real-life applications because in domains such as healthcare and robotics interactions with the environment are either expensive or unethical. Training agents usually requires reward functions, but unfortunately, rewards are seldom available in practice and their engi… ▽ More

    Submitted 12 December, 2020; originally announced December 2020.

    Comments: Accepted to Offline Reinforcement Learning Workshop at Neural Information Processing Systems (2020)

  9. arXiv:2011.13885  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Offline Learning from Demonstrations and Unlabeled Experience

    Authors: Konrad Zolna, Alexander Novikov, Ksenia Konyushkova, Caglar Gulcehre, Ziyu Wang, Yusuf Aytar, Misha Denil, Nando de Freitas, Scott Reed

    Abstract: Behavior cloning (BC) is often practical for robot learning because it allows a policy to be trained offline without rewards, by supervised learning on expert demonstrations. However, BC does not effectively leverage what we will refer to as unlabeled experience: data of mixed and unknown quality without reward annotations. This unlabeled data can be generated by a variety of sources such as human… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

    Comments: Accepted to Offline Reinforcement Learning Workshop at Neural Information Processing Systems (2020)

  10. Sydr: Cutting Edge Dynamic Symbolic Execution

    Authors: Alexey Vishnyakov, Andrey Fedotov, Daniil Kuts, Alexander Novikov, Darya Parygina, Eli Kobrin, Vlada Logunova, Pavel Belecky, Shamil Kurmangaleev

    Abstract: The security development lifecycle (SDL) is becoming an industry standard. Dynamic symbolic execution (DSE) has enormous amount of applications in computer security (fuzzing, vulnerability discovery, reverse-engineering, etc.). We propose several performance and accuracy improvements for dynamic symbolic execution. Skipping non-symbolic instructions allows to build a path predicate 1.2--3.5 times… ▽ More

    Submitted 26 January, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: 9 pages

    Journal ref: 2020 Ivannikov ISPRAS Open Conference (ISPRAS), IEEE, 2020, pp. 46-54

  11. arXiv:2010.07012  [pdf, other

    eess.SP cs.LG math.NA math.PR

    Fast signal recovery from quadratic measurements

    Authors: Miguel Moscoso, Alexei Novikov, George Papanicolaou, Chrysoula Tsogka

    Abstract: We present a novel approach for recovering a sparse signal from cross-correlated data. Cross-correlations naturally arise in many fields of imaging, such as optics, holography and seismic interferometry. Compared to the sparse signal recovery problem that uses linear measurements, the unknown is now a matrix formed by the cross correlation of the unknown signal. Hence, the bottleneck for inversion… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

  12. arXiv:2007.09055  [pdf, other

    cs.LG cs.AI stat.ML

    Hyperparameter Selection for Offline Reinforcement Learning

    Authors: Tom Le Paine, Cosmin Paduraru, Andrea Michi, Caglar Gulcehre, Konrad Zolna, Alexander Novikov, Ziyu Wang, Nando de Freitas

    Abstract: Offline reinforcement learning (RL purely from logged data) is an important avenue for deploying RL techniques in real-world scenarios. However, existing hyperparameter selection methods for offline RL break the offline assumption by evaluating policies corresponding to each hyperparameter setting in the environment. This online execution is often infeasible and hence undermines the main aim of of… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

  13. arXiv:2006.15134  [pdf, other

    cs.LG cs.AI stat.ML

    Critic Regularized Regression

    Authors: Ziyu Wang, Alexander Novikov, Konrad Zolna, Jost Tobias Springenberg, Scott Reed, Bobak Shahriari, Noah Siegel, Josh Merel, Caglar Gulcehre, Nicolas Heess, Nando de Freitas

    Abstract: Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy optimization from large pre-recorded datasets without online environment interaction. It addresses challenges with regard to the cost of data collection and safety, both of which are particularly pertinent to real-world applications of RL. Unfortunately, most off-policy algorithms perform poorly when learnin… ▽ More

    Submitted 22 September, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: 24 pages; presented at NeurIPS 2020

  14. arXiv:2006.13888  [pdf, other

    cs.LG stat.ML

    RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning

    Authors: Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

    Abstract: Offline methods for reinforcement learning have a potential to help bridge the gap between reinforcement learning research and real-world applications. They make it possible to learn policies from offline datasets, thus overcoming concerns associated with online data collection in the real-world, including cost, safety, or ethical concerns. In this paper, we propose a benchmark called RL Unplugged… ▽ More

    Submitted 12 February, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: NeurIPS paper. 21 pages including supplementary material, the github link for the datasets: https://github.com/deepmind/deepmind-research/rl_unplugged

  15. arXiv:2006.00979  [pdf, other

    cs.LG cs.AI

    Acme: A Research Framework for Distributed Reinforcement Learning

    Authors: Matthew W. Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Nikola Momchev, Danila Sinopalnikov, Piotr Stańczyk, Sabela Ramos, Anton Raichuk, Damien Vincent, Léonard Hussenot, Robert Dadashi, Gabriel Dulac-Arnold, Manu Orsini, Alexis Jacq, Johan Ferret, Nino Vieillard, Seyed Kamyar Seyed Ghasemipour, Sertan Girgin, Olivier Pietquin, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang , et al. (14 additional authors not shown)

    Abstract: Deep reinforcement learning (RL) has led to many recent and groundbreaking advances. However, these advances have often come at the cost of both increased scale in the underlying architectures being trained as well as increased complexity of the RL algorithms used to train them. These increases have in turn made it more difficult for researchers to rapidly prototype new ideas or reproduce publishe… ▽ More

    Submitted 20 September, 2022; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: This work presents a second version of the paper which coincides with an increase in modularity, additional emphasis on offline, imitation and learning from demonstrations algorithms, as well as various new agents implemented as part of Acme

  16. arXiv:1910.01077  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Task-Relevant Adversarial Imitation Learning

    Authors: Konrad Zolna, Scott Reed, Alexander Novikov, Sergio Gomez Colmenarejo, David Budden, Serkan Cabi, Misha Denil, Nando de Freitas, Ziyu Wang

    Abstract: We show that a critical vulnerability in adversarial imitation is the tendency of discriminator networks to learn spurious associations between visual features and expert labels. When the discriminator focuses on task-irrelevant features, it does not provide an informative reward signal, leading to poor task performance. We analyze this problem in detail and propose a solution that outperforms sta… ▽ More

    Submitted 12 November, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted to CoRL 2020 (see presentation here: https://youtu.be/ZgQvFGuEgFU )

  17. arXiv:1909.12200  [pdf, other

    cs.RO cs.LG

    Scaling data-driven robotics with reward sketching and batch reinforcement learning

    Authors: Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott Reed, Rae Jeong, Konrad Zolna, Yusuf Aytar, David Budden, Mel Vecerik, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang

    Abstract: We present a framework for data-driven robotics that makes use of a large dataset of recorded robot experience and scales to several tasks using learned reward functions. We show how to apply this framework to accomplish three different object manipulation tasks on a real robot platform. Given demonstrations of a task together with task-agnostic recorded experience, we use a special form of human… ▽ More

    Submitted 4 June, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Project website: https://sites.google.com/view/data-driven-robotics/

    Journal ref: Robotics: Science and Systems Conference 2020

  18. arXiv:1908.04412  [pdf, other

    eess.SP cs.LG stat.ML

    The Noise Collector for sparse recovery in high dimensions

    Authors: Miguel Moscoso, Alexei Novikov, George Papanicolaou, Chrysoula Tsogka

    Abstract: The ability to detect sparse signals from noisy high-dimensional data is a top priority in modern science and engineering. A sparse solution of the linear system $A ρ= b_0$ can be found efficiently with an $l_1$-norm minimization approach if the data is noiseless. Detection of the signal's support from data corrupted by noise is still a challenging problem, especially if the level of noise must be… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

  19. arXiv:1908.01479  [pdf, other

    eess.IV cs.LG math.NA physics.comp-ph

    Imaging with highly incomplete and corrupted data

    Authors: Miguel Moscoso, Alexei Novikov, George Papanicolaou, Chrysoula Tsogka

    Abstract: We consider the problem of imaging sparse scenes from a few noisy data using an $l_1$-minimization approach. This problem can be cast as a linear system of the form $A \, ρ=b$, where $A$ is an $N\times K$ measurement matrix. We assume that the dimension of the unknown sparse vector $ρ\in {\mathbb{C}}^K$ is much larger than the dimension of the data vector $b \in {\mathbb{C}}^N$, i.e, $K \gg N$. We… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

  20. arXiv:1905.05308  [pdf, other

    physics.comp-ph cs.CE math.NA

    Synthetic aperture imaging with intensity-only data

    Authors: Miguel Moscoso, Alexei Novikov, George Papanicolaou, Chrysoula Tsogka

    Abstract: We consider imaging the reflectivity of scatterers from intensity-only data recorded by a single moving transducer that both emits and receives signals, forming a synthetic aperture. By exploiting frequency illumination diversity, we obtain multiple intensity measurements at each location, from which we determine field cross-correlations using an appropriate phase controlled illumination strategy… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

  21. Deep Sequential Segmentation of Organs in Volumetric Medical Scans

    Authors: Alexey Novikov, David Major, Maria Wimmer, Dimitrios Lenis, Katja Bühler

    Abstract: Segmentation in 3D scans is playing an increasingly important role in current clinical practice supporting diagnosis, tissue quantification, or treatment planning. The current 3D approaches based on convolutional neural networks usually suffer from at least three main issues caused predominantly by implementation constraints - first, they require resizing the volume to the lower-resolutional refer… ▽ More

    Submitted 11 March, 2019; v1 submitted 6 July, 2018; originally announced July 2018.

    Journal ref: Published in IEEE Transactions on Medical Imaging on 16 November 2018, URL: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8537944&isnumber=4359023

  22. arXiv:1801.01928  [pdf, ps, other

    cs.MS math.NA

    Tensor Train decomposition on TensorFlow (T3F)

    Authors: Alexander Novikov, Pavel Izmailov, Valentin Khrulkov, Michael Figurnov, Ivan Oseledets

    Abstract: Tensor Train decomposition is used across many branches of machine learning. We present T3F -- a library for Tensor Train decomposition based on TensorFlow. T3F supports GPU execution, batch processing, automatic differentiation, and versatile functionality for the Riemannian optimization framework, which takes into account the underlying manifold structure to construct efficient optimization meth… ▽ More

    Submitted 2 March, 2020; v1 submitted 5 January, 2018; originally announced January 2018.

  23. arXiv:1711.00811  [pdf, other

    cs.LG

    Expressive power of recurrent neural networks

    Authors: Valentin Khrulkov, Alexander Novikov, Ivan Oseledets

    Abstract: Deep neural networks are surprisingly efficient at solving practical tasks, but the theory behind this phenomenon is only starting to catch up with the practice. Numerous works show that depth is the key to this efficiency. A certain class of deep convolutional networks -- namely those that correspond to the Hierarchical Tucker (HT) tensor decomposition -- has been proven to have exponentially hig… ▽ More

    Submitted 7 February, 2018; v1 submitted 2 November, 2017; originally announced November 2017.

    Comments: Accepted as a conference paper at ICLR 2018

  24. arXiv:1710.07324  [pdf, other

    cs.LG stat.ML

    Scalable Gaussian Processes with Billions of Inducing Inputs via Tensor Train Decomposition

    Authors: Pavel Izmailov, Alexander Novikov, Dmitry Kropotov

    Abstract: We propose a method (TT-GP) for approximate inference in Gaussian Process (GP) models. We build on previous scalable GP research including stochastic variational inference based on inducing inputs, kernel interpolation, and structure exploiting algebra. The key idea of our method is to use Tensor Train decomposition for variational parameters, which allows us to train GPs with billions of inducing… ▽ More

    Submitted 17 January, 2018; v1 submitted 19 October, 2017; originally announced October 2017.

  25. arXiv:1701.08816  [pdf, other

    cs.CV cs.LG

    Fully Convolutional Architectures for Multi-Class Segmentation in Chest Radiographs

    Authors: Alexey A. Novikov, Dimitrios Lenis, David Major, Jiri Hladůvka, Maria Wimmer, Katja Bühler

    Abstract: The success of deep convolutional neural networks on image classification and recognition tasks has led to new applications in very diversified contexts, including the field of medical imaging. In this paper we investigate and propose neural network architectures for automated multi-class segmentation of anatomical organs in chest radiographs, namely for lungs, clavicles and heart. We address seve… ▽ More

    Submitted 13 February, 2018; v1 submitted 30 January, 2017; originally announced January 2017.

    Comments: Final pre-print version accepted for publication in TMI Added new content: * additional evaluations * additional figures * improving the old content

  26. arXiv:1611.03214  [pdf, other

    cs.LG

    Ultimate tensorization: compressing convolutional and FC layers alike

    Authors: Timur Garipov, Dmitry Podoprikhin, Alexander Novikov, Dmitry Vetrov

    Abstract: Convolutional neural networks excel in image recognition tasks, but this comes at the cost of high computational and memory complexity. To tackle this problem, [1] developed a tensor factorization framework to compress fully-connected layers. In this paper, we focus on compressing convolutional layers. We show that while the direct application of the tensor framework [1] to the 4-dimensional kerne… ▽ More

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: NIPS 2016 workshop: Learning with Tensors: Why Now and How?

  27. arXiv:1605.03795  [pdf, other

    stat.ML cs.LG

    Exponential Machines

    Authors: Alexander Novikov, Mikhail Trofimov, Ivan Oseledets

    Abstract: Modeling interactions between features improves the performance of machine learning solutions in many domains (e.g. recommender systems or sentiment analysis). In this paper, we introduce Exponential Machines (ExM), a predictor that models all interactions of every order. The key idea is to represent an exponentially large tensor of parameters in a factorized format called Tensor Train (TT). The T… ▽ More

    Submitted 8 December, 2017; v1 submitted 12 May, 2016; originally announced May 2016.

    Comments: ICLR-2017 workshop track paper

  28. arXiv:1509.06569  [pdf, ps, other

    cs.LG cs.NE

    Tensorizing Neural Networks

    Authors: Alexander Novikov, Dmitry Podoprikhin, Anton Osokin, Dmitry Vetrov

    Abstract: Deep neural networks currently demonstrate state-of-the-art performance in several domains. At the same time, models of this class are very demanding in terms of computational resources. In particular, a large amount of memory is required by commonly used fully-connected layers, making it hard to use the models on low-end devices and stopping the further increase of the model size. In this paper w… ▽ More

    Submitted 20 December, 2015; v1 submitted 22 September, 2015; originally announced September 2015.

  29. arXiv:cs/0205058  [pdf, ps, other

    cs.NI

    Content Distribution in Unicast Replica Meshes

    Authors: Alexei Novikov

    Abstract: We propose centralized algorithm of data distribution in the unicast p2p network. Good example of such networks are meshes of WWW and FTP mirrors. Simulation of data propogation for different network topologies is performed and it is shown that proposed method performs up to 200% better then common apporaches

    Submitted 21 May, 2002; originally announced May 2002.

    Comments: 14 pages, 8 figures, submitted to ACM Transactions on Internet Technology

    ACM Class: H.3.3; H.3.4; H.3.5