Skip to main content

Showing 1–22 of 22 results for author: Kovachki, N

  1. arXiv:2406.06486  [pdf, other

    cs.LG math.NA

    Continuum Attention for Neural Operators

    Authors: Edoardo Calvello, Nikola B. Kovachki, Matthew E. Levine, Andrew M. Stuart

    Abstract: Transformers, and the attention mechanism in particular, have become ubiquitous in machine learning. Their success in modeling nonlocal, long-range correlations has led to their widespread adoption in natural language processing, computer vision, and time-series problems. Neural operators, which map spaces of functions into spaces of functions, are necessarily both nonlinear and nonlocal if they a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2405.15992  [pdf, ps, other

    cs.LG math.NA

    Data Complexity Estimates for Operator Learning

    Authors: Nikola B. Kovachki, Samuel Lanthaler, Hrushikesh Mhaskar

    Abstract: Operator learning has emerged as a new paradigm for the data-driven approximation of nonlinear operators. Despite its empirical success, the theoretical underpinnings governing the conditions for efficient operator learning remain incomplete. The present work develops theory to study the data complexity of operator learning, complementing existing research on the parametric complexity. We investig… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2402.15715  [pdf, other

    cs.LG math.NA

    Operator Learning: Algorithms and Analysis

    Authors: Nikola B. Kovachki, Samuel Lanthaler, Andrew M. Stuart

    Abstract: Operator learning refers to the application of ideas from machine learning to approximate (typically nonlinear) operators mapping between Banach spaces of functions. Such operators often arise from physical models expressed in terms of partial differential equations (PDEs). In this context, such approximate operators hold great potential as efficient surrogate models to complement traditional nume… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  4. arXiv:2310.00120  [pdf, other

    cs.LG

    Multi-Grid Tensorized Fourier Neural Operator for High-Resolution PDEs

    Authors: Jean Kossaifi, Nikola Kovachki, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Memory complexity and data scarcity have so far prohibited learning solution operators of partial differential equations (PDEs) at high resolutions. We address these limitations by introducing a new data efficient and highly parallelizable operator learning approach with reduced memory requirement and better generalization, called multi-grid tensorized neural operator (MG-TFNO). MG-TFNO scales to… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  5. arXiv:2309.15325  [pdf, other

    cs.LG physics.comp-ph

    Neural Operators for Accelerating Scientific Simulations and Design

    Authors: Kamyar Azizzadenesheli, Nikola Kovachki, Zongyi Li, Miguel Liu-Schiaffini, Jean Kossaifi, Anima Anandkumar

    Abstract: Scientific discovery and engineering design are currently limited by the time and cost of physical experiments, selected mostly through trial-and-error and intuition that require deep domain expertise. Numerical simulations present an alternative to physical experiments but are usually infeasible for complex real-world domains due to the computational requirements of existing numerical methods. Ar… ▽ More

    Submitted 4 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  6. arXiv:2309.00583  [pdf, other

    cs.LG math.NA

    Geometry-Informed Neural Operator for Large-Scale 3D PDEs

    Authors: Zongyi Li, Nikola Borislavov Kovachki, Chris Choy, Boyi Li, Jean Kossaifi, Shourya Prakash Otta, Mohammad Amin Nabian, Maximilian Stadler, Christian Hundt, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: We propose the geometry-informed neural operator (GINO), a highly efficient approach to learning the solution operator of large-scale partial differential equations with varying geometries. GINO uses a signed distance function and point-cloud representations of the input shape and neural operators based on graph and Fourier architectures to learn the solution operator. The graph neural operator ha… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  7. arXiv:2308.08794  [pdf, other

    cs.LG math.DS

    Tipping Point Forecasting in Non-Stationary Dynamics on Function Spaces

    Authors: Miguel Liu-Schiaffini, Clare E. Singer, Nikola Kovachki, Tapio Schneider, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Tipping points are abrupt, drastic, and often irreversible changes in the evolution of non-stationary and chaotic dynamical systems. For instance, increased greenhouse gas concentrations are predicted to lead to drastic decreases in low cloud cover, referred to as a climatological tipping point. In this paper, we learn the evolution of such non-stationary dynamical systems using a novel recurrent… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 29 pages, 15 figures

  8. arXiv:2307.15034  [pdf, other

    cs.LG math.NA

    Guaranteed Approximation Bounds for Mixed-Precision Neural Operators

    Authors: Renbo Tu, Colin White, Jean Kossaifi, Boris Bonev, Nikola Kovachki, Gennady Pekhimenko, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: Neural operators, such as Fourier Neural Operators (FNO), form a principled approach for learning solution operators for PDEs and other mappings between function spaces. However, many real-world problems require high-resolution training data, and the training time and limited GPU memory pose big barriers. One solution is to train neural operators in mixed precision to reduce the memory requirement… ▽ More

    Submitted 5 May, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: ICLR 2024

  9. arXiv:2306.12006  [pdf, other

    math.NA cs.LG

    Learning Homogenization for Elliptic Operators

    Authors: Kaushik Bhattacharya, Nikola Kovachki, Aakila Rajan, Andrew M. Stuart, Margaret Trautner

    Abstract: Multiscale partial differential equations (PDEs) arise in various applications, and several schemes have been developed to solve them efficiently. Homogenization theory is a powerful methodology that eliminates the small-scale dependence, resulting in simplified equations that are computationally tractable while accurately predicting the macroscopic response. In the field of continuum mechanics, h… ▽ More

    Submitted 4 January, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    MSC Class: 35B27; 35J47; 74H15

  10. arXiv:2302.07400  [pdf, other

    cs.LG math.FA stat.ML

    Score-based Diffusion Models in Function Space

    Authors: Jae Hyun Lim, Nikola B. Kovachki, Ricardo Baptista, Christopher Beckham, Kamyar Azizzadenesheli, Jean Kossaifi, Vikram Voleti, Jiaming Song, Karsten Kreis, Jan Kautz, Christopher Pal, Arash Vahdat, Anima Anandkumar

    Abstract: Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by denoising. Despite their tremendous success, they are mostly formulated on finite-dimensional spaces, e.g. Euclidean, limiting their applications to many… ▽ More

    Submitted 22 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 52 pages

    MSC Class: 46B09 (Primary); 60J22 (Secondary) ACM Class: I.2.6; J.2

  11. arXiv:2111.03794  [pdf, other

    cs.LG math.NA

    Physics-Informed Neural Operator for Learning Partial Differential Equations

    Authors: Zongyi Li, Hongkai Zheng, Nikola Kovachki, David Jin, Haoxuan Chen, Burigede Liu, Kamyar Azizzadenesheli, Anima Anandkumar

    Abstract: In this paper, we propose physics-informed neural operators (PINO) that combine training data and physics constraints to learn the solution operator of a given family of parametric Partial Differential Equations (PDE). PINO is the first hybrid approach incorporating data and PDE constraints at different resolutions to learn the operator. Specifically, in PINO, we combine coarse-resolution training… ▽ More

    Submitted 29 July, 2023; v1 submitted 5 November, 2021; originally announced November 2021.

  12. arXiv:2108.12515  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Convergence Rates for Learning Linear Operators from Noisy Data

    Authors: Maarten V. de Hoop, Nikola B. Kovachki, Nicholas H. Nelsen, Andrew M. Stuart

    Abstract: This paper studies the learning of linear operators between infinite-dimensional Hilbert spaces. The training data comprises pairs of random input vectors in a Hilbert space and their noisy images under an unknown self-adjoint linear operator. Assuming that the operator is diagonalizable in a known basis, this work solves the equivalent inverse problem of estimating the operator's eigenvalues give… ▽ More

    Submitted 2 November, 2022; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: To appear in SIAM/ASA Journal on Uncertainty Quantification (JUQ); 34 pages, 5 figures, 2 tables

    MSC Class: 62G20; 62C10; 68T05; 47A62

    Journal ref: SIAM/ASA J. Uncertainty Quantification Vol. 11 No. 2 (2023) pp. 480-513

  13. Neural Operator: Learning Maps Between Function Spaces

    Authors: Nikola Kovachki, Zongyi Li, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has primarily focused on learning mappings between finite dimensional Euclidean spaces or finite sets. We propose a generalization of neural networks to learn operators, termed neural operators, that map between infinite dimensional function spaces. We formulate the neural operator as a composition of linear integral operators and nonlinear activation f… ▽ More

    Submitted 2 May, 2024; v1 submitted 18 August, 2021; originally announced August 2021.

    Journal ref: The Journal of Machine Learning Research (2023), Volume 24, Issue 1, Article No 89, pp 4061-4157

  14. arXiv:2106.06898  [pdf, other

    cs.LG math.DS

    Learning Dissipative Dynamics in Chaotic Systems

    Authors: Zongyi Li, Miguel Liu-Schiaffini, Nikola Kovachki, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: Chaotic systems are notoriously challenging to predict because of their sensitivity to perturbations and errors due to time stepping. Despite this unpredictable behavior, for many dissipative systems the statistics of the long term trajectories are governed by an invariant measure supported on a set, known as the global attractor; for many problems this set is finite dimensional, even if the state… ▽ More

    Submitted 27 September, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

  15. arXiv:2010.08895  [pdf, other

    cs.LG math.NA

    Fourier Neural Operator for Parametric Partial Differential Equations

    Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has primarily focused on learning mappings between finite-dimensional Euclidean spaces. Recently, this has been generalized to neural operators that learn mappings between function spaces. For partial differential equations (PDEs), neural operators directly learn the mapping from any functional parametric dependence to the solution. Thus, they learn an… ▽ More

    Submitted 16 May, 2021; v1 submitted 17 October, 2020; originally announced October 2020.

  16. arXiv:2006.09535  [pdf, other

    cs.LG math.NA stat.ML

    Multipole Graph Neural Operator for Parametric Partial Differential Equations

    Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: One of the main challenges in using deep learning-based methods for simulating physical systems and solving partial differential equations (PDEs) is formulating physics-based data in the desired structure for neural networks. Graph neural networks (GNNs) have gained popularity in this area since graphs offer a natural way of modeling particle interactions and provide a clear way of discretizing th… ▽ More

    Submitted 19 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

  17. arXiv:2006.06755  [pdf, other

    stat.ML cs.LG stat.CO

    Conditional Sampling with Monotone GANs: from Generative Models to Likelihood-Free Inference

    Authors: Ricardo Baptista, Bamdad Hosseini, Nikola B. Kovachki, Youssef Marzouk

    Abstract: We present a novel framework for conditional sampling of probability measures, using block triangular transport maps. We develop the theoretical foundations of block triangular transport in a Banach space setting, establishing general conditions under which conditional sampling can be achieved and drawing connections between monotone block triangular maps and optimal transport. Based on this theor… ▽ More

    Submitted 5 June, 2023; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Major expansion of earlier version, with new theoretical results. 33 pages, 8 figures, 1 table

  18. arXiv:2005.03180  [pdf, other

    math.NA cs.LG stat.ML

    Model Reduction and Neural Networks for Parametric PDEs

    Authors: Kaushik Bhattacharya, Bamdad Hosseini, Nikola B. Kovachki, Andrew M. Stuart

    Abstract: We develop a general framework for data-driven approximation of input-output maps between infinite-dimensional spaces. The proposed approach is motivated by the recent successes of neural networks and deep learning, in combination with ideas from model reduction. This combination results in a neural network approximation which, in principle, is defined on infinite-dimensional spaces and, in practi… ▽ More

    Submitted 17 June, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 39 pages, 13 figures

    MSC Class: 65N75; 62M45; 68T05; 60H30; 60H15

  19. arXiv:2003.03485  [pdf, other

    cs.LG math.NA stat.ML

    Neural Operator: Graph Kernel Network for Partial Differential Equations

    Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, Anima Anandkumar

    Abstract: The classical development of neural networks has been primarily for mappings between a finite-dimensional Euclidean space and a set of classes, or between two finite-dimensional Euclidean spaces. The purpose of this work is to generalize neural networks so that they can learn mappings between infinite-dimensional spaces (operators). The key innovation in our work is that a single set of network pa… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

  20. arXiv:1909.02041  [pdf, other

    physics.chem-ph cs.LG

    Regression-clustering for Improved Accuracy and Training Cost with Molecular-Orbital-Based Machine Learning

    Authors: Lixue Cheng, Nikola B. Kovachki, Matthew Welborn, Thomas F. Miller III

    Abstract: Machine learning (ML) in the representation of molecular-orbital-based (MOB) features has been shown to be an accurate and transferable approach to the prediction of post-Hartree-Fock correlation energies. Previous applications of MOB-ML employed Gaussian Process Regression (GPR), which provides good prediction accuracy with small training sets; however, the cost of GPR training scales cubically w… ▽ More

    Submitted 23 October, 2019; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: 31 pages, 10 figures, with an SI

  21. arXiv:1906.04285  [pdf, other

    cs.LG math.NA stat.ML

    Continuous Time Analysis of Momentum Methods

    Authors: Nikola B. Kovachki, Andrew M. Stuart

    Abstract: Gradient descent-based optimization methods underpin the parameter training of neural networks, and hence comprise a significant component in the impressive test results found in a number of applications. Introducing stochasticity is key to their success in practical problems, and there is some understanding of the role of stochastic gradient descent in this context. Momentum modifications of grad… ▽ More

    Submitted 28 May, 2021; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: 40 pages, 7 figures

    Journal ref: Journal of Machine Learning Research 21 (2020) 1-40

  22. arXiv:1808.03620  [pdf, other

    cs.LG math.OC stat.ML

    Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks

    Authors: Nikola B. Kovachki, Andrew M. Stuart

    Abstract: The standard probabilistic perspective on machine learning gives rise to empirical risk-minimization tasks that are frequently solved by stochastic gradient descent (SGD) and variants thereof. We present a formulation of these tasks as classical inverse or filtering problems and, furthermore, we propose an efficient, gradient-free algorithm for finding a solution to these problems using ensemble K… ▽ More

    Submitted 10 August, 2018; originally announced August 2018.

    Comments: 41 pages, 14 figures

    MSC Class: 68T20; 65L09; 65K10; 49M15 ACM Class: I.2.6