Showing 1–33 of 33 results for author: Stuart, A

  1. arXiv:2406.18066  [pdf, other]

    cs.LG math.DS

    Learning Optimal Filters Using Variational Inference

    Authors: Enoch Luk, Eviatar Bach, Ricardo Baptista, Andrew Stuart

    Abstract: Filtering (the task of estimating the conditional distribution of states of a dynamical system given partial, noisy observations) is important in many areas of science and engineering, including weather and climate prediction. However, the filtering distribution is generally intractable to obtain for high-dimensional, nonlinear systems. Filters used in practice, such as the ensemble Kalman filter (…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.17263  [pdf, other]

    cs.LG math.DS math.NA

    Efficient, Multimodal, and Derivative-Free Bayesian Inference With Fisher-Rao Gradient Flows

    Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M. Stuart

    Abstract: In this paper, we study efficient approximate sampling for probability distributions known up to normalization constants. We specifically focus on a problem class arising in Bayesian inference for large-scale inverse problems in science and engineering applications. The computational challenges we address with the proposed methodology are: (i) the need for repeated evaluations of expensive forward…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 42 pages, 9 figures

  3. arXiv:2406.06486  [pdf, other]

    cs.LG math.NA

    Continuum Attention for Neural Operators

    Authors: Edoardo Calvello, Nikola B. Kovachki, Matthew E. Levine, Andrew M. Stuart

    Abstract: Transformers, and the attention mechanism in particular, have become ubiquitous in machine learning. Their success in modeling nonlocal, long-range correlations has led to their widespread adoption in natural language processing, computer vision, and time-series problems. Neural operators, which map spaces of functions into spaces of functions, are necessarily both nonlinear and nonlocal if they a…

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2405.17955  [pdf, other]

    stat.ML cs.LG stat.CO

    Efficient Prior Calibration From Indirect Data

    Authors: O. Deniz Akyildiz, Mark Girolami, Andrew M. Stuart, Arnaud Vadeboncoeur

    Abstract: Bayesian inversion is central to the quantification of uncertainty within problems arising from numerous applications in science and engineering. To formulate the approach, four ingredients are required: a forward model mapping the unknown parameter to an element of a solution space, often the solution space for a differential equation; an observation operator mapping an element of the solution sp…

    Submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2405.13149  [pdf, other]

    stat.ML cs.LG math.NA math.PR stat.CO

    Gaussian Measures Conditioned on Nonlinear Observations: Consistency, MAP Estimators, and Simulation

    Authors: Yifan Chen, Bamdad Hosseini, Houman Owhadi, Andrew M Stuart

    Abstract: The article presents a systematic study of the problem of conditioning a Gaussian random variable $ξ$ on nonlinear observations of the form $F \circ φ(ξ)$ where $φ: \mathcal{X} \to \mathbb{R}^N$ is a bounded linear operator and $F$ is nonlinear. Such problems arise in the context of Bayesian inference and recent machine learning-inspired PDE solvers. We give a representer theorem for the condition…

    Submitted 21 May, 2024; originally announced May 2024.

  6. arXiv:2405.02221  [pdf, other]

    math.NA cs.LG

    Discretization Error of Fourier Neural Operators

    Authors: Samuel Lanthaler, Andrew M. Stuart, Margaret Trautner

    Abstract: Operator learning is a variant of machine learning that is designed to approximate maps between function spaces from data. The Fourier Neural Operator (FNO) is a common model architecture used for operator learning. The FNO combines pointwise linear and nonlinear operations in physical space with pointwise linear operations in Fourier space, leading to a parameterized map acting between function s…

    Submitted 3 May, 2024; originally announced May 2024.

    MSC Class: 41A35 (Primary) 65T50; 68T07 (Secondary)

  7. arXiv:2403.10642  [pdf, other]

    cs.LG math.NA

    Using Uncertainty Quantification to Characterize and Improve Out-of-Domain Learning for PDEs

    Authors: S. Chandra Mouli, Danielle C. Maddix, Shima Alizadeh, Gaurav Gupta, Andrew Stuart, Michael W. Mahoney, Yuyang Wang

    Abstract: Existing work in scientific machine learning (SciML) has shown that data-driven learning of solution operators can provide a fast approximate alternative to classical numerical partial differential equation (PDE) solvers. Of these, Neural Operators (NOs) have emerged as particularly promising. We observe that several uncertainty quantification (UQ) methods for NOs fail for test inputs that are eve…

    Submitted 12 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  8. arXiv:2402.15715  [pdf, other]

    cs.LG math.NA

    Operator Learning: Algorithms and Analysis

    Authors: Nikola B. Kovachki, Samuel Lanthaler, Andrew M. Stuart

    Abstract: Operator learning refers to the application of ideas from machine learning to approximate (typically nonlinear) operators mapping between Banach spaces of functions. Such operators often arise from physical models expressed in terms of partial differential equations (PDEs). In this context, such approximate operators hold great potential as efficient surrogate models to complement traditional nume…

    Submitted 23 February, 2024; originally announced February 2024.

  9. arXiv:2401.00035  [pdf, other]

    physics.comp-ph cs.LG math.DS

    Learning About Structural Errors in Models of Complex Dynamical Systems

    Authors: Jin-Long Wu, Matthew E. Levine, Tapio Schneider, Andrew Stuart

    Abstract: Complex dynamical systems are notoriously difficult to model because some degrees of freedom (e.g., small scales) may be computationally unresolvable or are incompletely understood, yet they are dynamically important. For example, the small scales of cloud dynamics and droplet formation are crucial for controlling climate, yet are unresolvable in global climate models. Semi-empirical closure model…

    Submitted 28 May, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: 40 pages, 13 figures

    MSC Class: 68T01

  10. arXiv:2310.14555  [pdf, other]

    physics.geo-ph cs.LG

    Modeling groundwater levels in California's Central Valley by hierarchical Gaussian process and neural network regression

    Authors: Anshuman Pradhan, Kyra H. Adams, Venkat Chandrasekaran, Zhen Liu, John T. Reager, Andrew M. Stuart, Michael J. Turmon

    Abstract: Modeling groundwater levels continuously across California's Central Valley (CV) hydrological system is challenging due to low-quality well data which is sparsely and noisily sampled across time and space. The lack of consistent well data makes it difficult to evaluate the impact of the 2017 and 2019 wet years on CV groundwater following a severe drought during 2012-2015. A novel machine learning meth…

    Submitted 16 June, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  11. arXiv:2310.03597  [pdf, other]

    stat.ML cs.LG math.DS math.NA

    Sampling via Gradient Flows in the Space of Probability Measures

    Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M Stuart

    Abstract: Sampling a target probability distribution with an unknown normalization constant is a fundamental challenge in computational science and engineering. Recent work shows that algorithms derived by considering gradient flows in the space of probability measures open up new avenues for algorithm development. This paper makes three contributions to this sampling approach by scrutinizing the design com…

    Submitted 9 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Related and text overlap with arXiv:2302.11024

  12. arXiv:2306.15924  [pdf, ps, other]

    cs.LG math.NA

    The Parametric Complexity of Operator Learning

    Authors: Samuel Lanthaler, Andrew M. Stuart

    Abstract: Neural operator architectures employ neural networks to approximate operators mapping between Banach spaces of functions; they may be used to accelerate model evaluations via emulation, or to discover models from data. Consequently, the methodology has received increasing attention over recent years, giving rise to the rapidly growing field of operator learning. The first contribution of this pape…

    Submitted 1 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  13. arXiv:2306.12006  [pdf, other]

    math.NA cs.LG

    Learning Homogenization for Elliptic Operators

    Authors: Kaushik Bhattacharya, Nikola Kovachki, Aakila Rajan, Andrew M. Stuart, Margaret Trautner

    Abstract: Multiscale partial differential equations (PDEs) arise in various applications, and several schemes have been developed to solve them efficiently. Homogenization theory is a powerful methodology that eliminates the small-scale dependence, resulting in simplified equations that are computationally tractable while accurately predicting the macroscopic response. In the field of continuum mechanics, h…

    Submitted 4 January, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    MSC Class: 35B27; 35J47; 74H15

  14. arXiv:2304.13221  [pdf, other]

    math.NA cs.LG

    Nonlocality and Nonlinearity Implies Universality in Operator Learning

    Authors: Samuel Lanthaler, Zongyi Li, Andrew M. Stuart

    Abstract: Neural operator architectures approximate operators between infinite-dimensional Banach spaces of functions. They are gaining increased attention in computational science and engineering, due to their potential both to accelerate traditional numerical methods and to enable data-driven discovery. As the field is in its infancy, basic questions about minimal requirements for universal approximation r…

    Submitted 14 June, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

  15. arXiv:2208.04506  [pdf, other]

    math.DS cs.LG math.NA stat.ME

    Second Order Ensemble Langevin Method for Sampling and Inverse Problems

    Authors: Ziming Liu, Andrew M. Stuart, Yixuan Wang

    Abstract: We propose a sampling method based on an ensemble approximation of second order Langevin dynamics. The log target density is appended with a quadratic term in an auxiliary momentum variable and damped-driven Hamiltonian dynamics introduced; the resulting stochastic differential equation is invariant to the Gibbs measure, with marginal on the position coordinates given by the target. A precondition…

    Submitted 24 October, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

  16. arXiv:2108.12515  [pdf, other]

    math.ST cs.LG stat.ME stat.ML

    Convergence Rates for Learning Linear Operators from Noisy Data

    Authors: Maarten V. de Hoop, Nikola B. Kovachki, Nicholas H. Nelsen, Andrew M. Stuart

    Abstract: This paper studies the learning of linear operators between infinite-dimensional Hilbert spaces. The training data comprises pairs of random input vectors in a Hilbert space and their noisy images under an unknown self-adjoint linear operator. Assuming that the operator is diagonalizable in a known basis, this work solves the equivalent inverse problem of estimating the operator's eigenvalues give…

    Submitted 2 November, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: To appear in SIAM/ASA Journal on Uncertainty Quantification (JUQ); 34 pages, 5 figures, 2 tables

    MSC Class: 62G20; 62C10; 68T05; 47A62

    Journal ref: SIAM/ASA J. Uncertainty Quantification Vol. 11 No. 2 (2023) pp. 480-513

  17. Neural Operator: Learning Maps Between Function Spaces

    Authors: Nikola Kovachki, Zongyi Li, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has primarily focused on learning mappings between finite dimensional Euclidean spaces or finite sets. We propose a generalization of neural networks to learn operators, termed neural operators, that map between infinite dimensional function spaces. We formulate the neural operator as a composition of linear integral operators and nonlinear activation f…

    Submitted 2 May, 2024; v1 submitted 18 August, 2021; originally announced August 2021.

    Journal ref: The Journal of Machine Learning Research (2023), Volume 24, Issue 1, Article No 89, pp 4061-4157

  18. arXiv:2107.06658  [pdf, other]

    math.DS cs.LG stat.ML

    A Framework for Machine Learning of Model Error in Dynamical Systems

    Authors: Matthew E. Levine, Andrew M. Stuart

    Abstract: The development of data-informed predictive models for dynamical systems is of widespread interest in many disciplines. We present a unifying framework for blending mechanistic and machine-learning approaches to identify dynamical systems from noisily and partially observed data. We compare pure data-driven learning with hybrid models which incorporate imperfect domain knowledge. Our formulation i…

    Submitted 17 August, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

  19. arXiv:2106.06898  [pdf, other]

    cs.LG math.DS

    Learning Dissipative Dynamics in Chaotic Systems

    Authors: Zongyi Li, Miguel Liu-Schiaffini, Nikola Kovachki, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: Chaotic systems are notoriously challenging to predict because of their sensitivity to perturbations and errors due to time stepping. Despite this unpredictable behavior, for many dissipative systems the statistics of the long term trajectories are governed by an invariant measure supported on a set, known as the global attractor; for many problems this set is finite dimensional, even if the state…

    Submitted 27 September, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

  20. arXiv:2010.08895  [pdf, other]

    cs.LG math.NA

    Fourier Neural Operator for Parametric Partial Differential Equations

    Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has primarily focused on learning mappings between finite-dimensional Euclidean spaces. Recently, this has been generalized to neural operators that learn mappings between function spaces. For partial differential equations (PDEs), neural operators directly learn the mapping from any functional parametric dependence to the solution. Thus, they learn an…

    Submitted 16 May, 2021; v1 submitted 17 October, 2020; originally announced October 2020.

  21. Posterior Consistency of Semi-Supervised Regression on Graphs

    Authors: Andrea L. Bertozzi, Bamdad Hosseini, Hao Li, Kevin Miller, Andrew M. Stuart

    Abstract: Graph-based semi-supervised regression (SSR) is the problem of estimating the value of a function on a weighted graph from its values (labels) on a small subset of the vertices. This paper is concerned with the consistency of SSR in the context of classification, in the setting where the labels have small noise and the underlying graph weighting is consistent with well-clustered nodes. We present…

    Submitted 24 March, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

  22. arXiv:2006.09535  [pdf, other]

    cs.LG math.NA stat.ML

    Multipole Graph Neural Operator for Parametric Partial Differential Equations

    Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: One of the main challenges in using deep learning-based methods for simulating physical systems and solving partial differential equations (PDEs) is formulating physics-based data in the desired structure for neural networks. Graph neural networks (GNNs) have gained popularity in this area since graphs offer a natural way of modeling particle interactions and provide a clear way of discretizing th…

    Submitted 19 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

  23. arXiv:2006.07153  [pdf]

    cs.CY cs.SI

    Altruism and anxiety: Engagement with online community support initiatives (OCSIs) during Covid-19 lockdown in the UK and Ireland

    Authors: Camilla Elphick, Avelie Stuart, Richard Philpot, Zoe Walkington, Lara Frumkin, Min Zhang, Mark Levine, Blaine Price, Graham Pike, Bashar Nuseibeh, Arosha Bandara

    Abstract: Given concerns about mental health during periods of Covid-19 lockdown, it is important to understand how engagement with online Covid-19 related material can affect mood. In the UK and Ireland, online community support initiatives (OCSIs) have emerged to help people manage their lives. Yet, little is known about how people engaged with these or whether they influenced subsequent mood. We conducted s…

    Submitted 12 June, 2020; originally announced June 2020.

  24. Building trust in digital policing: A scoping review of community policing apps

    Authors: Camilla Elphick, Richard Philpot, Min Zhang, Avelie Stuart, Zoe Walkington, Lara Frumkin, Graham Pike, Kelly Gardner, Mark Lacey, Mark Levine, Blaine Price, Arosha Bandara, Bashar Nuseibeh

    Abstract: Perceptions of police trustworthiness are linked to citizens' willingness to cooperate with police. Trust can be fostered by introducing accountability mechanisms, or by increasing a shared police/citizen identity, both of which can be achieved digitally. Digital mechanisms can also be designed to safeguard, engage, reassure, inform, and empower diverse communities. We systematically scoped 240 exist…

    Submitted 28 December, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Police Practice and Research (Taylor & Francis) 2020

    ACM Class: H.5.2; H.4.3; J.4

  25. arXiv:2005.10224  [pdf, other]

    math.NA cs.LG physics.comp-ph stat.ML

    The Random Feature Model for Input-Output Maps between Banach Spaces

    Authors: Nicholas H. Nelsen, Andrew M. Stuart

    Abstract: Well known to the machine learning community, the random feature model is a parametric approximation to kernel interpolation or regression methods. It is typically used to approximate functions mapping a finite-dimensional input space to the real line. In this paper, we instead propose a methodology for use of the random feature model as a data-driven surrogate for operators that map an input Bana…

    Submitted 5 June, 2021; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: To appear in SIAM Journal on Scientific Computing; 32 pages, 9 figures

    MSC Class: 65D15; 65D40; 62M45; 35R60

    Journal ref: SIAM J. Sci. Comput. Vol. 43 No. 5 (2021) pp. A3212-A3243

  26. arXiv:2005.03180  [pdf, other]

    math.NA cs.LG stat.ML

    Model Reduction and Neural Networks for Parametric PDEs

    Authors: Kaushik Bhattacharya, Bamdad Hosseini, Nikola B. Kovachki, Andrew M. Stuart

    Abstract: We develop a general framework for data-driven approximation of input-output maps between infinite-dimensional spaces. The proposed approach is motivated by the recent successes of neural networks and deep learning, in combination with ideas from model reduction. This combination results in a neural network approximation which, in principle, is defined on infinite-dimensional spaces and, in practi…

    Submitted 17 June, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 39 pages, 13 figures

    MSC Class: 65N75; 62M45; 68T05; 60H30; 60H15

  27. arXiv:2003.03485  [pdf, other]

    cs.LG math.NA stat.ML

    Neural Operator: Graph Kernel Network for Partial Differential Equations

    Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has been primarily for mappings between a finite-dimensional Euclidean space and a set of classes, or between two finite-dimensional Euclidean spaces. The purpose of this work is to generalize neural networks so that they can learn mappings between infinite-dimensional spaces (operators). The key innovation in our work is that a single set of network pa…

    Submitted 6 March, 2020; originally announced March 2020.

  28. arXiv:1906.07658  [pdf, other]

    stat.ML cs.LG math.NA math.OC

    Consistency of semi-supervised learning algorithms on graphs: Probit and one-hot methods

    Authors: Franca Hoffmann, Bamdad Hosseini, Zhi Ren, Andrew M. Stuart

    Abstract: Graph-based semi-supervised learning is the problem of propagating labels from a small number of labelled data points to a larger set of unlabelled data. This paper is concerned with the consistency of optimization-based techniques for such problems, in the limit where the labels have small noise and the underlying unlabelled data is well clustered. We study graph-based probit for binary classific…

    Submitted 9 March, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    MSC Class: 62H30; 68T10; 68Q87; 91C20

  29. arXiv:1906.04285  [pdf, other]

    cs.LG math.NA stat.ML

    Continuous Time Analysis of Momentum Methods

    Authors: Nikola B. Kovachki, Andrew M. Stuart

    Abstract: Gradient descent-based optimization methods underpin the parameter training of neural networks, and hence comprise a significant component in the impressive test results found in a number of applications. Introducing stochasticity is key to their success in practical problems, and there is some understanding of the role of stochastic gradient descent in this context. Momentum modifications of grad…

    Submitted 28 May, 2021; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: 40 pages, 7 figures

    Journal ref: Journal of Machine Learning Research 21 (2020) 1-40

  30. arXiv:1808.03620  [pdf, other]

    cs.LG math.OC stat.ML

    Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks

    Authors: Nikola B. Kovachki, Andrew M. Stuart

    Abstract: The standard probabilistic perspective on machine learning gives rise to empirical risk-minimization tasks that are frequently solved by stochastic gradient descent (SGD) and variants thereof. We present a formulation of these tasks as classical inverse or filtering problems and, furthermore, we propose an efficient, gradient-free algorithm for finding a solution to these problems using ensemble K…

    Submitted 10 August, 2018; originally announced August 2018.

    Comments: 41 pages, 14 figures

    MSC Class: 68T20; 65L09; 65K10; 49M15

    ACM Class: I.2.6

  31. arXiv:1805.09450  [pdf, other]

    stat.ML cs.LG math.AP

    Large Data and Zero Noise Limits of Graph-Based Semi-Supervised Learning Algorithms

    Authors: Matthew M. Dunlop, Dejan Slepčev, Andrew M. Stuart, Matthew Thorpe

    Abstract: Scalings in which the graph Laplacian approaches a differential operator in the large graph limit are used to develop understanding of a number of algorithms for semi-supervised learning; in particular the extension, to this graph setting, of the probit algorithm, level set and kriging methods, are studied. Both optimization and Bayesian approaches are considered, based around a regularizing quadr…

    Submitted 28 December, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    MSC Class: 62G20; 62C10; 62F15; 49J55

  32. arXiv:1703.08816  [pdf, other]

    cs.LG stat.ML

    Uncertainty quantification in graph-based classification of high dimensional data

    Authors: Andrea L. Bertozzi, Xiyang Luo, Andrew M. Stuart, Konstantinos C. Zygalakis

    Abstract: Classification of high dimensional data finds wide-ranging applications. In many of these applications equipping the resulting classification with a measure of uncertainty may be as important as the classification itself. In this paper we introduce, develop algorithms for, and investigate the properties of, a variety of Bayesian models for the task of binary classification; via the posterior distr…

    Submitted 8 February, 2018; v1 submitted 26 March, 2017; originally announced March 2017.

    Comments: 33 pages, 14 figures

  33. arXiv:1110.4623  [pdf, other]

    cs.OS cs.DC cs.DS cs.GR

    Efficient Synchronization Primitives for GPUs

    Authors: Jeff A. Stuart, John D. Owens

    Abstract: In this paper, we revisit the design of synchronization primitives (specifically barriers, mutexes, and semaphores) and how they apply to the GPU. Previous implementations are insufficient due to the discrepancies in hardware and programming model of the GPU and CPU. We create new implementations in CUDA and analyze the performance of spinning on the GPU, as well as a method of sleeping on the G…

    Submitted 20 October, 2011; originally announced October 2011.

    Comments: 13 pages with appendix, several figures, plans to submit to CompSci conference in early 2012

    ACM Class: D.4.1; I.3.2