Skip to main content

Showing 1–17 of 17 results for author: Bauer, M

Searching in archive stat. Search in all archives.
.
  1. Extension of the Dip-test Repertoire -- Efficient and Differentiable p-value Calculation for Clustering

    Authors: Lena G. M. Bauer, Collin Leiber, Christian Böhm, Claudia Plant

    Abstract: Over the last decade, the Dip-test of unimodality has gained increasing interest in the data mining community as it is a parameter-free statistical test that reliably rates the modality in one-dimensional samples. It returns a so called Dip-value and a corresponding probability for the sample's unimodality (Dip-p-value). These two values share a sigmoidal relationship. However, the specific transf… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Journal ref: Proceedings of the 2023 SIAM International Conference on Data Mining (SDM) (pp. 109-117). Society for Industrial and Applied Mathematics

  2. arXiv:2312.02753  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    C3: High-performance and low-complexity neural compression from a single image or video

    Authors: Hyunjik Kim, Matthias Bauer, Lucas Theis, Jonathan Richard Schwarz, Emilien Dupont

    Abstract: Most neural compression models are trained on large datasets of images or videos in order to generalize to unseen data. Such generalization typically requires large and expressive architectures with a high decoding complexity. Here we introduce C3, a neural compression method with strong rate-distortion (RD) performance that instead overfits a small model to each image or video separately. The res… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  3. arXiv:2208.02670  [pdf

    stat.ML cs.LG

    Development and Validation of ML-DQA -- a Machine Learning Data Quality Assurance Framework for Healthcare

    Authors: Mark Sendak, Gaurav Sirdeshmukh, Timothy Ochoa, Hayley Premo, Linda Tang, Kira Niederhoffer, Sarah Reed, Kaivalya Deshpande, Emily Sterrett, Melissa Bauer, Laurie Snyder, Afreen Shariff, David Whellan, Jeffrey Riggio, David Gaieski, Kristin Corey, Megan Richards, Michael Gao, Marshall Nichols, Bradley Heintze, William Knechtle, William Ratliff, Suresh Balu

    Abstract: The approaches by which the machine learning and clinical research communities utilize real world data (RWD), including data captured in the electronic health record (EHR), vary dramatically. While clinical researchers cautiously use RWD for clinical investigations, ML for healthcare teams consume public datasets with minimal scrutiny to develop new algorithms. This study bridges this gap by devel… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: Presented at 2022 Machine Learning in Health Care Conference

  4. arXiv:2203.03304  [pdf, other

    cs.LG stat.ML

    Regularising for invariance to data augmentation improves supervised learning

    Authors: Aleksander Botev, Matthias Bauer, Soham De

    Abstract: Data augmentation is used in machine learning to make the classifier invariant to label-preserving transformations. Usually this invariance is only encouraged implicitly by including a single augmented input during training. However, several works have recently shown that using multiple augmentations per input can improve generalisation or can be used to incorporate invariances more explicitly. In… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  5. arXiv:2202.03531  [pdf, ps, other

    stat.ME stat.AP

    Estimands and their Estimators for Clinical Trials Impacted by the COVID-19 Pandemic: A Report from the NISS Ingram Olkin Forum Series on Unplanned Clinical Trial Disruptions

    Authors: Kelly Van Lancker, Sergey Tarima, Jonathan Bartlett, Madeline Bauer, Bharani Bharani-Dharan, Frank Bretz, Nancy Flournoy, Hege Michiels, Camila Olarte Parra, James L Rosenberger, Suzie Cro

    Abstract: The COVID-19 pandemic continues to affect the conduct of clinical trials globally. Complications may arise from pandemic-related operational challenges such as site closures, travel limitations and interruptions to the supply chain for the investigational product, or from health-related challenges such as COVID-19 infections. Some of these complications lead to unforeseen intercurrent events in th… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  6. arXiv:2106.14806  [pdf, other

    cs.LG stat.ML

    Laplace Redux -- Effortless Bayesian Deep Learning

    Authors: Erik Daxberger, Agustinus Kristiadi, Alexander Immer, Runa Eschenhagen, Matthias Bauer, Philipp Hennig

    Abstract: Bayesian formulations of deep learning have been shown to have compelling theoretical properties and offer practical functional benefits, such as improved predictive uncertainty quantification and model selection. The Laplace approximation (LA) is a classic, and arguably the simplest family of approximations for the intractable posteriors of deep neural networks. Yet, despite its simplicity, the L… ▽ More

    Submitted 14 March, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera-ready version; source code: https://github.com/AlexImmer/Laplace

  7. arXiv:2104.04975  [pdf, other

    stat.ML cs.LG

    Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning

    Authors: Alexander Immer, Matthias Bauer, Vincent Fortuin, Gunnar Rätsch, Mohammad Emtiyaz Khan

    Abstract: Marginal-likelihood based model-selection, even though promising, is rarely used in deep learning due to estimation difficulties. Instead, most approaches rely on validation data, which may not be readily available. In this work, we present a scalable marginal-likelihood estimation method to select both hyperparameters and network architectures, based on the training data alone. Some hyperparamete… ▽ More

    Submitted 15 June, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: ICML 2021

  8. arXiv:2101.11046  [pdf, other

    stat.ML cs.LG

    Generalized Doubly Reparameterized Gradient Estimators

    Authors: Matthias Bauer, Andriy Mnih

    Abstract: Efficient low-variance gradient estimation enabled by the reparameterization trick (RT) has been essential to the success of variational autoencoders. Doubly-reparameterized gradients (DReGs) improve on the RT for multi-sample variational bounds by applying reparameterization a second time for an additional reduction in variance. Here, we develop two generalizations of the DReGs estimator and show… ▽ More

    Submitted 13 July, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Journal ref: 38th International Conference on Machine Learning (ICML 2021)

  9. arXiv:2008.08400  [pdf, other

    stat.ML cs.LG

    Improving predictions of Bayesian neural nets via local linearization

    Authors: Alexander Immer, Maciej Korzepa, Matthias Bauer

    Abstract: The generalized Gauss-Newton (GGN) approximation is often used to make practical Bayesian deep learning approaches scalable by replacing a second order derivative with a product of first order derivatives. In this paper we argue that the GGN approximation should be understood as a local linearization of the underlying Bayesian neural network (BNN), which turns the BNN into a generalized linear mod… ▽ More

    Submitted 25 February, 2021; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: AISTATS 2021

  10. arXiv:1906.02004  [pdf, other

    cs.LG stat.ML

    Interpretable and Differentially Private Predictions

    Authors: Frederik Harder, Matthias Bauer, Mijung Park

    Abstract: Interpretable predictions, where it is clear why a machine learning model has made a particular decision, can compromise privacy by revealing the characteristics of individual data points. This raises the central question addressed in this paper: Can models be interpretable without compromising privacy? For complex big data fit by correspondingly rich models, balancing privacy and explainability i… ▽ More

    Submitted 5 April, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

  11. arXiv:1810.11428  [pdf, other

    stat.ML cs.LG

    Resampled Priors for Variational Autoencoders

    Authors: Matthias Bauer, Andriy Mnih

    Abstract: We propose Learned Accept/Reject Sampling (LARS), a method for constructing richer priors using rejection sampling with a learned acceptance function. This work is motivated by recent analyses of the VAE objective, which pointed out that commonly used simple priors can lead to underfitting. As the distribution induced by LARS involves an intractable normalizing constant, we show how to estimate it… ▽ More

    Submitted 26 April, 2019; v1 submitted 26 October, 2018; originally announced October 2018.

    Journal ref: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019

  12. arXiv:1808.05563  [pdf, other

    cs.LG stat.ML

    Learning Invariances using the Marginal Likelihood

    Authors: Mark van der Wilk, Matthias Bauer, ST John, James Hensman

    Abstract: Generalising well in supervised learning tasks relies on correctly extrapolating the training data to a large region of the input space. One way to achieve this is to constrain the predictions to be invariant to transformations on the input that are known to be irrelevant (e.g. translation). Commonly, this is done through data augmentation, where the training set is enlarged by applying hand-craft… ▽ More

    Submitted 16 August, 2018; originally announced August 2018.

  13. arXiv:1805.09921  [pdf, other

    stat.ML cs.LG

    Meta-Learning Probabilistic Inference For Prediction

    Authors: Jonathan Gordon, John Bronskill, Matthias Bauer, Sebastian Nowozin, Richard E. Turner

    Abstract: This paper introduces a new framework for data efficient and versatile learning. Specifically: 1) We develop ML-PIP, a general framework for Meta-Learning approximate Probabilistic Inference for Prediction. ML-PIP extends existing probabilistic interpretations of meta-learning to cover a broad class of methods. 2) We introduce VERSA, an instance of the framework employing a flexible and versatile… ▽ More

    Submitted 6 August, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: International Conference on Learning Representations (ICLR) 2019

    Journal ref: International Conference on Learning Representations (2019)

  14. arXiv:1805.01872  [pdf, other

    cs.CV stat.ML

    Automatic Estimation of Modulation Transfer Functions

    Authors: Matthias Bauer, Valentin Volchkov, Michael Hirsch, Bernhard Schölkopf

    Abstract: The modulation transfer function (MTF) is widely used to characterise the performance of optical systems. Measuring it is costly and it is thus rarely available for a given lens specimen. Instead, MTFs based on simulations or, at best, MTFs measured on other specimens of the same lens are used. Fortunately, images recorded through an optical system contain ample information about its MTF, only tha… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

  15. arXiv:1706.00326  [pdf, other

    stat.ML cs.LG

    Discriminative k-shot learning using probabilistic models

    Authors: Matthias Bauer, Mateo Rojas-Carulla, Jakub Bartłomiej Świątkowski, Bernhard Schölkopf, Richard E. Turner

    Abstract: This paper introduces a probabilistic framework for k-shot image classification. The goal is to generalise from an initial large-scale classification task to a separate task comprising new classes and small numbers of examples. The new approach not only leverages the feature-based representation learned by a neural network from the initial task (representational transfer), but also information abo… ▽ More

    Submitted 8 December, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

  16. arXiv:1606.04820  [pdf, other

    stat.ML

    Understanding Probabilistic Sparse Gaussian Process Approximations

    Authors: Matthias Bauer, Mark van der Wilk, Carl Edward Rasmussen

    Abstract: Good sparse approximations are essential for practical inference in Gaussian Processes as the computational cost of exact methods is prohibitive for large datasets. The Fully Independent Training Conditional (FITC) and the Variational Free Energy (VFE) approximations are two recent popular methods. Despite superficial similarities, these approximations have surprisingly different theoretical prope… ▽ More

    Submitted 30 May, 2017; v1 submitted 15 June, 2016; originally announced June 2016.

    Comments: published in Advances in Neural Information Processing Systems 29 (NIPS 2016)

  17. arXiv:1203.0937  [pdf, other

    q-bio.GN stat.AP

    CLEVER: Clique-Enumerating Variant Finder

    Authors: Tobias Marschall, Ivan Costa, Stefan Canzar, Markus Bauer, Gunnar Klau, Alexander Schliep, Alexander Schönhuth

    Abstract: Next-generation sequencing techniques have facilitated a large scale analysis of human genetic variation. Despite the advances in sequencing speeds, the computational discovery of structural variants is not yet standard. It is likely that many variants have remained undiscovered in most sequenced individuals. Here we present a novel internal segment size based approach, which organizes all, includ… ▽ More

    Submitted 15 July, 2012; v1 submitted 5 March, 2012; originally announced March 2012.

    Comments: 30 pages, 8 figures

    MSC Class: 92B99

    Journal ref: Bioinformatics, 28(22), 2875-2882, 2012