Skip to main content

Showing 1–29 of 29 results for author: Gordon, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.09387  [pdf

    stat.ML cs.LG stat.ME

    Meta-Analysis with Untrusted Data

    Authors: Shiva Kaul, Geoffrey J. Gordon

    Abstract: [See paper for full abstract] Meta-analysis is a crucial tool for answering scientific questions. It is usually conducted on a relatively small amount of ``trusted'' data -- ideally from randomized, controlled trials -- which allow causal effects to be reliably estimated with minimal assumptions. We show how to answer causal questions much more precisely by making two changes. First, we incorporat… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Full-length version of conference submission

  2. arXiv:2102.12013  [pdf, other

    cs.LG cs.CY stat.ML

    Understanding and Mitigating Accuracy Disparity in Regression

    Authors: Jianfeng Chi, Yuan Tian, Geoffrey J. Gordon, Han Zhao

    Abstract: With the widespread deployment of large-scale prediction systems in high-stakes domains, e.g., face recognition, criminal justice, etc., disparity in prediction accuracy between different demographic subgroups has called for fundamental understanding on the source of such disparity and algorithmic intervention to mitigate it. In this paper, we study the accuracy disparity problem in regression. To… ▽ More

    Submitted 12 June, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: ICML 2021

  3. arXiv:2101.03606  [pdf, other

    stat.ML cs.LG

    The Gaussian Neural Process

    Authors: Wessel P. Bruinsma, James Requeima, Andrew Y. K. Foong, Jonathan Gordon, Richard E. Turner

    Abstract: Neural Processes (NPs; Garnelo et al., 2018a,b) are a rich class of models for meta-learning that map data sets directly to predictive stochastic processes. We provide a rigorous analysis of the standard maximum-likelihood objective used to train conditional NPs. Moreover, we propose a new member to the Neural Process family called the Gaussian Neural Process (GNP), which models predictive correla… ▽ More

    Submitted 10 January, 2021; originally announced January 2021.

    Comments: 34 pages; includes supplementary material; to appear in AABI 2020

  4. arXiv:2012.10713  [pdf, other

    cs.LG cs.AI stat.ML

    Fundamental Limits and Tradeoffs in Invariant Representation Learning

    Authors: Han Zhao, Chen Dan, Bryon Aragam, Tommi S. Jaakkola, Geoffrey J. Gordon, Pradeep Ravikumar

    Abstract: A wide range of machine learning applications such as privacy-preserving learning, algorithmic fairness, and domain adaptation/generalization among others, involve learning invariant representations of the data that aim to achieve two competing goals: (a) maximize information or accuracy with respect to a target response, and (b) maximize invariance or independence with respect to a set of protect… ▽ More

    Submitted 23 November, 2022; v1 submitted 19 December, 2020; originally announced December 2020.

    Comments: JMLR camera-ready version

  5. arXiv:2007.01332  [pdf, other

    stat.ML cs.LG

    Meta-Learning Stationary Stochastic Process Prediction with Convolutional Neural Processes

    Authors: Andrew Y. K. Foong, Wessel P. Bruinsma, Jonathan Gordon, Yann Dubois, James Requeima, Richard E. Turner

    Abstract: Stationary stochastic processes (SPs) are a key component of many probabilistic models, such as those for off-the-grid spatio-temporal data. They enable the statistical symmetry of underlying physical phenomena to be leveraged, thereby aiding generalization. Prediction in such models can be viewed as a translation equivariant map from observed data sets to predictive SPs, emphasizing the intimate… ▽ More

    Submitted 20 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020

  6. arXiv:2006.10801  [pdf, other

    stat.ML cs.LG

    Predictive Complexity Priors

    Authors: Eric Nalisnick, Jonathan Gordon, José Miguel Hernández-Lobato

    Abstract: Specifying a Bayesian prior is notoriously difficult for complex models such as neural networks. Reasoning about parameters is made challenging by the high-dimensionality and over-parameterization of the space. Priors that seem benign and uninformative can have unintuitive and detrimental effects on a model's predictions. For this reason, we propose predictive complexity priors: a functional prior… ▽ More

    Submitted 21 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 23 pages

  7. arXiv:2003.03284  [pdf, other

    stat.ML cs.LG

    TaskNorm: Rethinking Batch Normalization for Meta-Learning

    Authors: John Bronskill, Jonathan Gordon, James Requeima, Sebastian Nowozin, Richard E. Turner

    Abstract: Modern meta-learning approaches for image classification rely on increasingly deep networks to achieve state-of-the-art performance, making batch normalization an essential component of meta-learning pipelines. However, the hierarchical nature of the meta-learning setting presents several challenges that can render conventional batch normalization ineffective, giving rise to the need to rethink no… ▽ More

    Submitted 28 June, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

    Journal ref: Proceedings of Machine Learning and Systems 2020, 4683-4694

  8. arXiv:1912.01116  [pdf, other

    cs.LG cs.CL cs.NE stat.ML

    Long Distance Relationships without Time Travel: Boosting the Performance of a Sparse Predictive Autoencoder in Sequence Modeling

    Authors: Jeremy Gordon, David Rawlinson, Subutai Ahmad

    Abstract: In sequence learning tasks such as language modelling, Recurrent Neural Networks must learn relationships between input features separated by time. State of the art models such as LSTM and Transformer are trained by backpropagation of losses into prior hidden states and inputs held in memory. This allows gradients to flow from present to past and effectively learn with perfect hindsight, but at a… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: 9 pages, 6 figures, 4 tables

    MSC Class: 68T30; 68T05 ACM Class: I.2.7; I.2.4

  9. arXiv:1910.13556  [pdf, other

    stat.ML cs.LG

    Convolutional Conditional Neural Processes

    Authors: Jonathan Gordon, Wessel P. Bruinsma, Andrew Y. K. Foong, James Requeima, Yann Dubois, Richard E. Turner

    Abstract: We introduce the Convolutional Conditional Neural Process (ConvCNP), a new member of the Neural Process family that models translation equivariance in the data. Translation equivariance is an important inductive bias for many learning problems including time series modelling, spatial data, and images. The model embeds data sets into an infinite-dimensional function space as opposed to a finite-dim… ▽ More

    Submitted 25 June, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted at International Conference on Learning Representations 2020

  10. arXiv:1910.07162  [pdf, other

    cs.LG cs.AI stat.ML

    Conditional Learning of Fair Representations

    Authors: Han Zhao, Amanda Coston, Tameem Adel, Geoffrey J. Gordon

    Abstract: We propose a novel algorithm for learning fair representations that can simultaneously mitigate two notions of disparity among different demographic subgroups in the classification setting. Two key components underpinning the design of our algorithm are balanced error rate and conditional alignment of representations. We show how these two components contribute to ensuring accuracy parity and equa… ▽ More

    Submitted 14 February, 2020; v1 submitted 16 October, 2019; originally announced October 2019.

  11. arXiv:1908.02144  [pdf, other

    stat.ML cs.LG

    Bayesian Batch Active Learning as Sparse Subset Approximation

    Authors: Robert Pinsler, Jonathan Gordon, Eric Nalisnick, José Miguel Hernández-Lobato

    Abstract: Leveraging the wealth of unlabeled data produced in recent years provides great potential for improving supervised models. When the cost of acquiring labels is high, probabilistic active learning methods can be used to greedily select the most informative data points to be labeled. However, for many large-scale problems standard greedy procedures become computationally infeasible and suffer from n… ▽ More

    Submitted 8 February, 2021; v1 submitted 6 August, 2019; originally announced August 2019.

    Comments: NeurIPS 2019

  12. arXiv:1907.06288  [pdf, other

    cs.LG stat.ML

    Learning Neural Networks with Adaptive Regularization

    Authors: Han Zhao, Yao-Hung Hubert Tsai, Ruslan Salakhutdinov, Geoffrey J. Gordon

    Abstract: Feed-forward neural networks can be understood as a combination of an intermediate representation and a linear hypothesis. While most previous works aim to diversify the representations, we explore the complementary direction by performing an adaptive and data-dependent regularization motivated by the empirical Bayes method. Specifically, we propose to construct a matrix-variate normal prior (on w… ▽ More

    Submitted 23 October, 2019; v1 submitted 14 July, 2019; originally announced July 2019.

    Comments: Camera ready version

  13. arXiv:1906.08386  [pdf, other

    cs.LG cs.AI stat.ML

    Inherent Tradeoffs in Learning Fair Representations

    Authors: Han Zhao, Geoffrey J. Gordon

    Abstract: Real-world applications of machine learning tools in high-stakes domains are often regulated to be fair, in the sense that the predicted target should satisfy some quantitative notion of parity with respect to a protected attribute. However, the exact tradeoff between fairness and accuracy is not entirely clear, even for the basic paradigm of classification problems. In this paper, we characterize… ▽ More

    Submitted 3 February, 2022; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: Update to the JMLR version: A new constructive algorithm for the optimal fair classifier; Extension of the previous lower bounds to a more general setting

  14. arXiv:1906.07902  [pdf, other

    cs.LG cs.CR stat.ML

    Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation

    Authors: Han Zhao, Jianfeng Chi, Yuan Tian, Geoffrey J. Gordon

    Abstract: Crowdsourced data used in machine learning services might carry sensitive information about attributes that users do not want to share. Various methods have been proposed to minimize the potential information leakage of sensitive attributes while maximizing the task accuracy. However, little is known about the theory behind these methods. In light of this gap, we develop a novel theoretical framew… ▽ More

    Submitted 25 October, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2020

  15. arXiv:1906.07697  [pdf, other

    stat.ML cs.LG

    Fast and Flexible Multi-Task Classification Using Conditional Neural Adaptive Processes

    Authors: James Requeima, Jonathan Gordon, John Bronskill, Sebastian Nowozin, Richard E. Turner

    Abstract: The goal of this paper is to design image classification systems that, after an initial multi-task training phase, can automatically adapt to new tasks encountered at test time. We introduce a conditional neural process based approach to the multi-task classification setting for this purpose, and establish connections to the meta-learning and few-shot learning literature. The resulting approach, c… ▽ More

    Submitted 7 January, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: Published in NeurIPS 2019

    Journal ref: Advances in Neural Information Processing Systems 32 (2019) 7957-7968

  16. arXiv:1902.05116  [pdf, other

    stat.ML cs.LG

    Probabilistic Neural Architecture Search

    Authors: Francesco Paolo Casale, Jonathan Gordon, Nicolo Fusi

    Abstract: In neural architecture search (NAS), the space of neural network architectures is automatically explored to maximize predictive accuracy for a given task. Despite the success of recent approaches, most existing methods cannot be directly applied to large scale problems because of their prohibitive computational complexity or high memory usage. In this work, we propose a Probabilistic approach to n… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

  17. arXiv:1901.09453  [pdf, other

    cs.LG cs.AI stat.ML

    On Learning Invariant Representation for Domain Adaptation

    Authors: Han Zhao, Remi Tachet des Combes, Kun Zhang, Geoffrey J. Gordon

    Abstract: Due to the ability of deep neural nets to learn rich representations, recent advances in unsupervised domain adaptation have focused on learning domain-invariant features that achieve a small error on the source domain. The hope is that the learnt representation, together with the hypothesis learnt from the source domain, can generalize to the target domain. In this paper, we first construct a sim… ▽ More

    Submitted 30 May, 2019; v1 submitted 27 January, 2019; originally announced January 2019.

    Comments: Compared with the last version, the current one adds a new corollary for the case of different feature transformations (encoders) on source/target domains. Fix a typo in Fig. 1

  18. arXiv:1812.05159  [pdf, other

    cs.LG stat.ML

    An Empirical Study of Example Forgetting during Deep Neural Network Learning

    Authors: Mariya Toneva, Alessandro Sordoni, Remi Tachet des Combes, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon

    Abstract: Inspired by the phenomenon of catastrophic forgetting, we investigate the learning dynamics of neural networks as they train on single classification tasks. Our goal is to understand whether a related phenomenon occurs when data does not undergo a clear distributional shift. We define a `forgetting event' to have occurred when an individual training example transitions from being classified correc… ▽ More

    Submitted 15 November, 2019; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: ICLR 2019

  19. arXiv:1811.00512  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Learning Beam Search Policies via Imitation Learning

    Authors: Renato Negrinho, Matthew R. Gormley, Geoffrey J. Gordon

    Abstract: Beam search is widely used for approximate decoding in structured prediction problems. Models often use a beam at test time but ignore its existence at train time, and therefore do not explicitly learn how to use the beam. We develop an unifying meta-algorithm for learning beam search policies using imitation learning. In our setting, the beam is part of the model, and not just an artifact of appr… ▽ More

    Submitted 25 June, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Published in NIPS 2018

  20. arXiv:1806.09866  [pdf, other

    stat.AP

    Long-term stock index forecasting based on text mining of regulatory disclosures

    Authors: Stefan Feuerriegel, Julius Gordon

    Abstract: Share valuations are known to adjust to new information entering the market, such as regulatory disclosures. We study whether the language of such news items can improve short-term and especially long-term (24 months) forecasts of stock indices. For this purpose, this work utilizes predictive models suited to high-dimensional data and specifically compares techniques for data-driven and knowledge-… ▽ More

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: Accepted at Decision Support Systems journal

  21. arXiv:1805.10755  [pdf, other

    cs.LG stat.ML

    Dual Policy Iteration

    Authors: Wen Sun, Geoffrey J. Gordon, Byron Boots, J. Andrew Bagnell

    Abstract: Recently, a novel class of Approximate Policy Iteration (API) algorithms have demonstrated impressive practical performance (e.g., ExIt from [2], AlphaGo-Zero from [27]). This new family of algorithms maintains, and alternately optimizes, two policies: a fast, reactive policy (e.g., a deep neural network) deployed at test time, and a slow, non-reactive policy (e.g., Tree Search), that can plan mul… ▽ More

    Submitted 5 April, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2018; Additional related works

  22. arXiv:1805.09921  [pdf, other

    stat.ML cs.LG

    Meta-Learning Probabilistic Inference For Prediction

    Authors: Jonathan Gordon, John Bronskill, Matthias Bauer, Sebastian Nowozin, Richard E. Turner

    Abstract: This paper introduces a new framework for data efficient and versatile learning. Specifically: 1) We develop ML-PIP, a general framework for Meta-Learning approximate Probabilistic Inference for Prediction. ML-PIP extends existing probabilistic interpretations of meta-learning to cover a broad class of methods. 2) We introduce VERSA, an instance of the framework employing a flexible and versatile… ▽ More

    Submitted 6 August, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: International Conference on Learning Representations (ICLR) 2019

    Journal ref: International Conference on Learning Representations (2019)

  23. arXiv:1801.07047  [pdf, other

    stat.ML stat.AP

    News-based forecasts of macroeconomic indicators: A semantic path model for interpretable predictions

    Authors: Stefan Feuerriegel, Julius Gordon

    Abstract: The macroeconomic climate influences operations with regard to, e.g., raw material prices, financing, supply chain utilization and demand quotas. In order to adapt to the economic environment, decision-makers across the public and private sectors require accurate forecasts of the economic outlook. Existing predictive frameworks base their forecasts primarily on time series analysis, as well as the… ▽ More

    Submitted 9 March, 2018; v1 submitted 22 January, 2018; originally announced January 2018.

  24. arXiv:1706.09751  [pdf, ps, other

    stat.ML

    Bayesian Semisupervised Learning with Deep Generative Models

    Authors: Jonathan Gordon, José Miguel Hernández-Lobato

    Abstract: Neural network based generative models with discriminative components are a powerful approach for semi-supervised learning. However, these techniques a) cannot account for model uncertainty in the estimation of the model's discriminative component and b) lack flexibility to capture complex stochastic patterns in the label generation process. To avoid these problems, we first propose to use a discr… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

  25. arXiv:1705.09684  [pdf, other

    cs.LG cs.AI stat.ML

    Multiple Source Domain Adaptation with Adversarial Training of Neural Networks

    Authors: Han Zhao, Shanghang Zhang, Guanhang Wu, João P. Costeira, José M. F. Moura, Geoffrey J. Gordon

    Abstract: While domain adaptation has been actively researched in recent years, most theoretical results and algorithms focus on the single-source-single-target adaptation setting. Naive application of such algorithms on multiple source domain adaptation problem may lead to suboptimal solutions. As a step toward bridging the gap, we propose a new generalization bound for domain adaptation when there are mul… ▽ More

    Submitted 27 October, 2017; v1 submitted 26 May, 2017; originally announced May 2017.

  26. arXiv:1702.03537  [pdf, other

    stat.ML

    An Efficient, Expressive and Local Minima-free Method for Learning Controlled Dynamical Systems

    Authors: Ahmed Hefny, Carlton Downey, Geoffrey J. Gordon

    Abstract: We propose a framework for modeling and estimating the state of controlled dynamical systems, where an agent can affect the system through actions and receives partial observations. Based on this framework, we propose the Predictive State Representation with Random Fourier Features (RFFPSR). A key property in RFF-PSRs is that the state estimate is represented by a conditional distribution of futur… ▽ More

    Submitted 28 February, 2018; v1 submitted 12 February, 2017; originally announced February 2017.

  27. arXiv:1701.06649  [pdf, other

    physics.chem-ph stat.ML

    Constant Size Molecular Descriptors For Use With Machine Learning

    Authors: Christopher R. Collins, Geoffrey J. Gordon, O. Anatole von Lilienfeld, David J. Yaron

    Abstract: A set of molecular descriptors whose length is independent of molecular size is developed for machine learning models that target thermodynamic and electronic properties of molecules. These features are evaluated by monitoring performance of kernel ridge regression models on well-studied data sets of small organic molecules. The features include connectivity counts, which require only the bonding… ▽ More

    Submitted 23 January, 2017; originally announced January 2017.

    Comments: 18 pages, 5 figures

  28. arXiv:1207.2491  [pdf, other

    cs.LG cs.RO stat.ML

    A Spectral Learning Approach to Range-Only SLAM

    Authors: Byron Boots, Geoffrey J. Gordon

    Abstract: We present a novel spectral learning algorithm for simultaneous localization and mapping (SLAM) from range data with known correspondences. This algorithm is an instance of a general spectral system identification framework, from which it inherits several desirable properties, including statistical consistency and no local optima. Compared with popular batch optimization or multiple-hypothesis tra… ▽ More

    Submitted 10 July, 2012; originally announced July 2012.

  29. arXiv:1011.0686  [pdf, other

    cs.LG cs.AI stat.ML

    A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

    Authors: Stephane Ross, Geoffrey J. Gordon, J. Andrew Bagnell

    Abstract: Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. This leads to poor performance in theory and often in practice. Some recent approaches provide stronger guarantees in this setting, but remain somewhat unsatisfactory as they train either non-stationary or… ▽ More

    Submitted 16 March, 2011; v1 submitted 2 November, 2010; originally announced November 2010.

    Comments: Appearing in the 14th International Conference on Artificial Intelligence and Statistics (AISTATS 2011)