-
Caustics: A Python Package for Accelerated Strong Gravitational Lensing Simulations
Authors:
Connor Stone,
Alexandre Adam,
Adam Coogan,
M. J. Yantovski-Barth,
Andreas Filipp,
Landung Setiawan,
Cordero Core,
Ronan Legin,
Charles Wilson,
Gabriel Missael Barco,
Yashar Hezaveh,
Laurence Perreault-Levasseur
Abstract:
Gravitational lensing is the deflection of light rays due to the gravity of intervening masses. This phenomenon is observed in a variety of scales and configurations, involving any non-uniform mass such as planets, stars, galaxies, clusters of galaxies, and even the large scale structure of the universe. Strong lensing occurs when the distortions are significant and multiple images of the backgrou…
▽ More
Gravitational lensing is the deflection of light rays due to the gravity of intervening masses. This phenomenon is observed in a variety of scales and configurations, involving any non-uniform mass such as planets, stars, galaxies, clusters of galaxies, and even the large scale structure of the universe. Strong lensing occurs when the distortions are significant and multiple images of the background source are observed. The lens objects must align on the sky of order ~1 arcsecond for galaxy-galaxy lensing, or 10's of arcseonds for cluster-galaxy lensing. As the discovery of lens systems has grown to the low thousands, these systems have become pivotal for precision measurements and addressing critical questions in astrophysics. Notably, they facilitate the measurement of the Universe's expansion rate, dark matter, supernovae, quasars, and the first stars among other topics. With future surveys expected to discover hundreds of thousands of lensing systems, the modelling and simulation of such systems must occur at orders of magnitude larger scale then ever before. Here we present `caustics`, a Python package designed to handle the extensive computational demands of modeling such a vast number of lensing systems.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Multi-phase black-hole feedback and a bright [CII] halo in a Lo-BAL quasar at $z\sim6.6$
Authors:
Manuela Bischetti,
Hyunseop Choi,
Fabrizio Fiore,
Chiara Feruglio,
Stefano Carniani,
Valentina D'Odorico,
Eduardo Bañados,
Huanqing Chen,
Roberto Decarli,
Simona Gallerani,
Julie Hlavacek-Larrondo,
Samuel Lai,
Karen M. Leighly,
Chiara Mazzucchelli,
Laurence Perreault-Levasseur,
Roberta Tripodi,
Fabian Walter,
Feige Wang,
Jinyi Yang,
Maria Vittoria Zanchettin,
Yongda Zhu
Abstract:
Although the mass growth of supermassive black holes during the Epoch of Reionisation is expected to play a role in shaping the concurrent growth of their host-galaxies, observational evidence of feedback at z$\gtrsim$6 is still sparse. We perform the first multi-scale and multi-phase characterisation of black-hole driven outflows in the $z\sim6.6$ quasar J0923+0402 and assess how these winds impa…
▽ More
Although the mass growth of supermassive black holes during the Epoch of Reionisation is expected to play a role in shaping the concurrent growth of their host-galaxies, observational evidence of feedback at z$\gtrsim$6 is still sparse. We perform the first multi-scale and multi-phase characterisation of black-hole driven outflows in the $z\sim6.6$ quasar J0923+0402 and assess how these winds impact the cold gas reservoir. We employ the SimBAL spectral synthesis to fit broad absorption line (BAL) features and find a powerful ionized outflow on $\lesssim210$ pc scale, with a kinetic power $\sim2-100$\% of the quasar luminosity. ALMA observations of [CII] emission allow us to study the morphology and kinematics of the cold gas. We detect high-velocity [CII] emission, likely associated with a cold neutral outflow at $\sim0.5-2$ kpc scale in the host-galaxy, and a bright extended [CII] halo with a size of $\sim15$ kpc. For the first time at such an early epoch, we accurately constrain the outflow energetics in both the ionized and the atomic neutral gas phases. We find such energetics to be consistent with expectations for an efficient feedback mechanism, and both ejective and preventative feedback modes are likely at play. The scales and energetics of the ionized and atomic outflows suggest that they might be associated with different quasar accretion episodes. The results of this work indicate that strong black hole feedback is occurring in quasars at $z\gtrsim6$ and is likely responsible for shaping the properties of the cold gas reservoir up to circum-galactic scales.
△ Less
Submitted 16 May, 2024; v1 submitted 18 April, 2024;
originally announced April 2024.
-
Learning an Effective Evolution Equation for Particle-Mesh Simulations Across Cosmologies
Authors:
Nicolas Payot,
Pablo Lemos,
Laurence Perreault-Levasseur,
Carolina Cuesta-Lazaro,
Chirag Modi,
Yashar Hezaveh
Abstract:
Particle-mesh simulations trade small-scale accuracy for speed compared to traditional, computationally expensive N-body codes in cosmological simulations. In this work, we show how a data-driven model could be used to learn an effective evolution equation for the particles, by correcting the errors of the particle-mesh potential incurred on small scales during simulations. We find that our learnt…
▽ More
Particle-mesh simulations trade small-scale accuracy for speed compared to traditional, computationally expensive N-body codes in cosmological simulations. In this work, we show how a data-driven model could be used to learn an effective evolution equation for the particles, by correcting the errors of the particle-mesh potential incurred on small scales during simulations. We find that our learnt correction yields evolution equations that generalize well to new, unseen initial conditions and cosmologies. We further demonstrate that the resulting corrected maps can be used in a simulation-based inference framework to yield an unbiased inference of cosmological parameters. The model, a network implemented in Fourier space, is exclusively trained on the particle positions and velocities.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Unraveling the Mysteries of Galaxy Clusters: Recurrent Inference Deconvolution of X-ray Spectra
Authors:
Carter Rhea,
Julie Hlavacek-Larrondo,
Ralph Kraft,
Akos Bogdan,
Alexandre Adam,
Laurence Perreault-Levasseur
Abstract:
In the realm of X-ray spectral analysis, the true nature of spectra has remained elusive, as observed spectra have long been the outcome of convolution between instrumental response functions and intrinsic spectra. In this study, we employ a recurrent neural network framework, the Recurrent Inference Machine (RIM), to achieve the high-precision deconvolution of intrinsic spectra from instrumental…
▽ More
In the realm of X-ray spectral analysis, the true nature of spectra has remained elusive, as observed spectra have long been the outcome of convolution between instrumental response functions and intrinsic spectra. In this study, we employ a recurrent neural network framework, the Recurrent Inference Machine (RIM), to achieve the high-precision deconvolution of intrinsic spectra from instrumental response functions. Our RIM model is meticulously trained on cutting-edge thermodynamic models and authentic response matrices sourced from the Chandra X-ray Observatory archive. Demonstrating remarkable accuracy, our model successfully reconstructs intrinsic spectra well below the 1-sigma error level. We showcase the practical application of this novel approach through real Chandra observations of the galaxy cluster Abell 1550 - a vital calibration target for the recently launched X-ray telescope, XRISM. This work marks a significant stride in the domain of X-ray spectral analysis, offering a promising avenue for unlocking hitherto concealed insights into spectra.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Bayesian Imaging for Radio Interferometry with Score-Based Priors
Authors:
Noe Dia,
M. J. Yantovski-Barth,
Alexandre Adam,
Micah Bowles,
Pablo Lemos,
Anna M. M. Scaife,
Yashar Hezaveh,
Laurence Perreault-Levasseur
Abstract:
The inverse imaging task in radio interferometry is a key limiting factor to retrieving Bayesian uncertainties in radio astronomy in a computationally effective manner. We use a score-based prior derived from optical images of galaxies to recover images of protoplanetary disks from the DSHARP survey. We demonstrate that our method produces plausible posterior samples despite the misspecified galax…
▽ More
The inverse imaging task in radio interferometry is a key limiting factor to retrieving Bayesian uncertainties in radio astronomy in a computationally effective manner. We use a score-based prior derived from optical images of galaxies to recover images of protoplanetary disks from the DSHARP survey. We demonstrate that our method produces plausible posterior samples despite the misspecified galaxy prior. We show that our approach produces results which are competitive with existing radio interferometry imaging algorithms.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Active learning meets fractal decision boundaries: a cautionary tale from the Sitnikov three-body problem
Authors:
Nicolas Payot,
Mario Pasquato,
Alessandro Alberto Trani,
Yashar Hezaveh,
Laurence Perreault-Levasseur
Abstract:
Chaotic systems such as the gravitational N-body problem are ubiquitous in astronomy. Machine learning (ML) is increasingly deployed to predict the evolution of such systems, e.g. with the goal of speeding up simulations. Strategies such as active Learning (AL) are a natural choice to optimize ML training. Here we showcase an AL failure when predicting the stability of the Sitnikov three-body prob…
▽ More
Chaotic systems such as the gravitational N-body problem are ubiquitous in astronomy. Machine learning (ML) is increasingly deployed to predict the evolution of such systems, e.g. with the goal of speeding up simulations. Strategies such as active Learning (AL) are a natural choice to optimize ML training. Here we showcase an AL failure when predicting the stability of the Sitnikov three-body problem, the simplest case of N-body problem displaying chaotic behavior. We link this failure to the fractal nature of our classification problem's decision boundary. This is a potential pitfall in optimizing large sets of N-body simulations via AL in the context of star cluster physics, galactic dynamics, or cosmology.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Echoes in the Noise: Posterior Samples of Faint Galaxy Surface Brightness Profiles with Score-Based Likelihoods and Priors
Authors:
Alexandre Adam,
Connor Stone,
Connor Bottrell,
Ronan Legin,
Yashar Hezaveh,
Laurence Perreault-Levasseur
Abstract:
Examining the detailed structure of galaxy populations provides valuable insights into their formation and evolution mechanisms. Significant barriers to such analysis are the non-trivial noise properties of real astronomical images and the point spread function (PSF) which blurs structure. Here we present a framework which combines recent advances in score-based likelihood characterization and dif…
▽ More
Examining the detailed structure of galaxy populations provides valuable insights into their formation and evolution mechanisms. Significant barriers to such analysis are the non-trivial noise properties of real astronomical images and the point spread function (PSF) which blurs structure. Here we present a framework which combines recent advances in score-based likelihood characterization and diffusion model priors to perform a Bayesian analysis of image deconvolution. The method, when applied to minimally processed \emph{Hubble Space Telescope} (\emph{HST}) data, recovers structures which have otherwise only become visible in next-generation \emph{James Webb Space Telescope} (\emph{JWST}) imaging.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
The search for the lost attractor
Authors:
Mario Pasquato,
Syphax Haddad,
Pierfrancesco Di Cintio,
Alexandre Adam,
Pablo Lemos,
Noé Dia,
Mircea Petrache,
Ugo Niccolò Di Carlo,
Alessandro Alberto Trani,
Laurence Perreault-Levasseur,
Yashar Hezaveh
Abstract:
N-body systems characterized by inverse square attractive forces may display a self similar collapse known as the gravo-thermal catastrophe. In star clusters, collapse is halted by binary stars, and a large fraction of Milky Way clusters may have already reached this phase. It has been speculated -- with guidance from simulations -- that macroscopic variables such as central density and velocity d…
▽ More
N-body systems characterized by inverse square attractive forces may display a self similar collapse known as the gravo-thermal catastrophe. In star clusters, collapse is halted by binary stars, and a large fraction of Milky Way clusters may have already reached this phase. It has been speculated -- with guidance from simulations -- that macroscopic variables such as central density and velocity dispersion are governed post-collapse by an effective, low-dimensional system of ODEs. It is still hard to distinguish chaotic, low dimensional motion, from high dimensional stochastic noise. Here we apply three machine learning tools to state-of-the-art dynamical simulations to constrain the post collapse dynamics: topological data analysis (TDA) on a lag embedding of the relevant time series, Sparse Identification of Nonlinear Dynamics (SINDY), and Tests of Accuracy with Random Points (TARP).
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Time Delay Cosmography with a Neural Ratio Estimator
Authors:
Ève Campeau-Poirier,
Laurence Perreault-Levasseur,
Adam Coogan,
Yashar Hezaveh
Abstract:
We explore the use of a Neural Ratio Estimator (NRE) to determine the Hubble constant ($H_0$) in the context of time delay cosmography. Assuming a Singular Isothermal Ellipsoid (SIE) mass profile for the deflector, we simulate time delay measurements, image position measurements, and modeled lensing parameters. We train the NRE to output the posterior distribution of $H_0$ given the time delay mea…
▽ More
We explore the use of a Neural Ratio Estimator (NRE) to determine the Hubble constant ($H_0$) in the context of time delay cosmography. Assuming a Singular Isothermal Ellipsoid (SIE) mass profile for the deflector, we simulate time delay measurements, image position measurements, and modeled lensing parameters. We train the NRE to output the posterior distribution of $H_0$ given the time delay measurements, the relative Fermat potentials (calculated from the modeled parameters and the measured image positions), the deflector redshift, and the source redshift. We compare the accuracy and precision of the NRE with traditional explicit likelihood methods in the limit where the latter is tractable and reliable, using Gaussian noise to emulate measurement uncertainties in the input parameters. The NRE posteriors track the ones from the conventional method and, while they show a slight tendency to overestimate uncertainties, they can be combined in a population inference without bias.
△ Less
Submitted 27 September, 2023;
originally announced September 2023.
-
AstroPhot: Fitting Everything Everywhere All at Once in Astronomical Images
Authors:
Connor Stone,
Stephane Courteau,
Jean-Charles Cuillandre,
Yashar Hezaveh,
Laurence Perreault-Levasseur,
Nikhil Arora
Abstract:
We present AstroPhot, a fast, powerful, and user-friendly Python based astronomical image photometry solver. AstroPhot incorporates automatic differentiation and GPU (or parallel CPU) acceleration, powered by the machine learning library PyTorch. Everything: AstroPhot can fit models for sky, stars, galaxies, PSFs, and more in a principled Chi^2 forward optimization, recovering Bayesian posterior i…
▽ More
We present AstroPhot, a fast, powerful, and user-friendly Python based astronomical image photometry solver. AstroPhot incorporates automatic differentiation and GPU (or parallel CPU) acceleration, powered by the machine learning library PyTorch. Everything: AstroPhot can fit models for sky, stars, galaxies, PSFs, and more in a principled Chi^2 forward optimization, recovering Bayesian posterior information and covariance of all parameters. Everywhere: AstroPhot can optimize forward models on CPU or GPU; across images that are large, multi-band, multi-epoch, rotated, dithered, and more. All at once: The models are optimized together, thus handling overlapping objects and including the covariance between parameters (including PSF and galaxy parameters). A number of optimization algorithms are available including Levenberg-Marquardt, Gradient descent, and No-U-Turn MCMC sampling. With an object-oriented user interface, AstroPhot makes it easy to quickly extract detailed information from complex astronomical data for individual images or large survey programs. This paper outlines novel features of the AstroPhot code and compares it to other popular astronomical image modeling software. AstroPhot is open-source, fully Python based, and freely accessible here: https://github.com/Autostronomy/AstroPhot
△ Less
Submitted 6 September, 2023; v1 submitted 3 August, 2023;
originally announced August 2023.
-
Posterior Sampling of the Initial Conditions of the Universe from Non-linear Large Scale Structures using Score-Based Generative Models
Authors:
Ronan Legin,
Matthew Ho,
Pablo Lemos,
Laurence Perreault-Levasseur,
Shirley Ho,
Yashar Hezaveh,
Benjamin Wandelt
Abstract:
Reconstructing the initial conditions of the universe is a key problem in cosmology. Methods based on simulating the forward evolution of the universe have provided a way to infer initial conditions consistent with present-day observations. However, due to the high complexity of the inference problem, these methods either fail to sample a distribution of possible initial density fields or require…
▽ More
Reconstructing the initial conditions of the universe is a key problem in cosmology. Methods based on simulating the forward evolution of the universe have provided a way to infer initial conditions consistent with present-day observations. However, due to the high complexity of the inference problem, these methods either fail to sample a distribution of possible initial density fields or require significant approximations in the simulation model to be tractable, potentially leading to biased results. In this work, we propose the use of score-based generative models to sample realizations of the early universe given present-day observations. We infer the initial density field of full high-resolution dark matter N-body simulations from the present-day density field and verify the quality of produced samples compared to the ground truth based on summary statistics. The proposed method is capable of providing plausible realizations of the early universe density field from the initial conditions posterior distribution marginalized over cosmological parameters and can sample orders of magnitude faster than current state-of-the-art methods.
△ Less
Submitted 7 April, 2023;
originally announced April 2023.
-
Sampling-Based Accuracy Testing of Posterior Estimators for General Inference
Authors:
Pablo Lemos,
Adam Coogan,
Yashar Hezaveh,
Laurence Perreault-Levasseur
Abstract:
Parameter inference, i.e. inferring the posterior distribution of the parameters of a statistical model given some data, is a central problem to many scientific disciplines. Generative models can be used as an alternative to Markov Chain Monte Carlo methods for conducting posterior inference, both in likelihood-based and simulation-based problems. However, assessing the accuracy of posteriors enco…
▽ More
Parameter inference, i.e. inferring the posterior distribution of the parameters of a statistical model given some data, is a central problem to many scientific disciplines. Generative models can be used as an alternative to Markov Chain Monte Carlo methods for conducting posterior inference, both in likelihood-based and simulation-based problems. However, assessing the accuracy of posteriors encoded in generative models is not straightforward. In this paper, we introduce `Tests of Accuracy with Random Points' (TARP) coverage testing as a method to estimate coverage probabilities of generative posterior estimators. Our method differs from previously-existing coverage-based methods, which require posterior evaluations. We prove that our approach is necessary and sufficient to show that a posterior estimator is accurate. We demonstrate the method on a variety of synthetic examples, and show that TARP can be used to test the results of posterior inference analyses in high-dimensional spaces. We also show that our method can detect inaccurate inferences in cases where existing methods fail.
△ Less
Submitted 2 June, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Pixelated Reconstruction of Foreground Density and Background Surface Brightness in Gravitational Lensing Systems using Recurrent Inference Machines
Authors:
Alexandre Adam,
Laurence Perreault-Levasseur,
Yashar Hezaveh,
Max Welling
Abstract:
Modeling strong gravitational lenses in order to quantify the distortions in the images of background sources and to reconstruct the mass density in the foreground lenses has been a difficult computational challenge. As the quality of gravitational lens images increases, the task of fully exploiting the information they contain becomes computationally and algorithmically more difficult. In this wo…
▽ More
Modeling strong gravitational lenses in order to quantify the distortions in the images of background sources and to reconstruct the mass density in the foreground lenses has been a difficult computational challenge. As the quality of gravitational lens images increases, the task of fully exploiting the information they contain becomes computationally and algorithmically more difficult. In this work, we use a neural network based on the Recurrent Inference Machine (RIM) to simultaneously reconstruct an undistorted image of the background source and the lens mass density distribution as pixelated maps. The method iteratively reconstructs the model parameters (the image of the source and a pixelated density map) by learning the process of optimizing the likelihood given the data using the physical model (a ray-tracing simulation), regularized by a prior implicitly learned by the neural network through its training data. When compared to more traditional parametric models, the proposed method is significantly more expressive and can reconstruct complex mass distributions, which we demonstrate by using realistic lensing galaxies taken from the IllustrisTNG cosmological hydrodynamic simulation.
△ Less
Submitted 24 April, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Morphological Parameters and Associated Uncertainties for 8 Million Galaxies in the Hyper Suprime-Cam Wide Survey
Authors:
Aritra Ghosh,
C. Megan Urry,
Aayush Mishra,
Laurence Perreault-Levasseur,
Priyamvada Natarajan,
David B. Sanders,
Daisuke Nagai,
Chuan Tian,
Nico Cappelluti,
Jeyhan S. Kartaltepe,
Meredith C. Powell,
Amrit Rau,
Ezequiel Treister
Abstract:
We use the Galaxy Morphology Posterior Estimation Network (GaMPEN) to estimate morphological parameters and associated uncertainties for $\sim 8$ million galaxies in the Hyper Suprime-Cam (HSC) Wide survey with $z \leq 0.75$ and $m \leq 23$. GaMPEN is a machine learning framework that estimates Bayesian posteriors for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and…
▽ More
We use the Galaxy Morphology Posterior Estimation Network (GaMPEN) to estimate morphological parameters and associated uncertainties for $\sim 8$ million galaxies in the Hyper Suprime-Cam (HSC) Wide survey with $z \leq 0.75$ and $m \leq 23$. GaMPEN is a machine learning framework that estimates Bayesian posteriors for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and flux ($F$). By first training on simulations of galaxies and then applying transfer learning using real data, we trained GaMPEN with $<1\%$ of our dataset. This two-step process will be critical for applying machine learning algorithms to future large imaging surveys, such as the Rubin-Legacy Survey of Space and Time (LSST), the Nancy Grace Roman Space Telescope (NGRST), and Euclid. By comparing our results to those obtained using light-profile fitting, we demonstrate that GaMPEN's predicted posterior distributions are well-calibrated ($\lesssim 5\%$ deviation) and accurate. This represents a significant improvement over light profile fitting algorithms which underestimate uncertainties by as much as $\sim60\%$. For an overlapping sub-sample, we also compare the derived morphological parameters with values in two external catalogs and find that the results agree within the limits of uncertainties predicted by GaMPEN. This step also permits us to define an empirical relationship between the Sérsic index and $L_B/L_T$ that can be used to convert between these two parameters. The catalog presented here represents a significant improvement in size ($\sim10 \times $), depth ($\sim4$ magnitudes), and uncertainty quantification over previous state-of-the-art bulge+disk decomposition catalogs. With this work, we also release GaMPEN's source code and trained models, which can be adapted to other datasets.
△ Less
Submitted 1 March, 2024; v1 submitted 30 November, 2022;
originally announced December 2022.
-
A Framework for Obtaining Accurate Posteriors of Strong Gravitational Lensing Parameters with Flexible Priors and Implicit Likelihoods using Density Estimation
Authors:
Ronan Legin,
Yashar Hezaveh,
Laurence Perreault-Levasseur,
Benjamin Wandelt
Abstract:
We report the application of implicit likelihood inference to the prediction of the macro-parameters of strong lensing systems with neural networks. This allows us to perform deep learning analysis of lensing systems within a well-defined Bayesian statistical framework to explicitly impose desired priors on lensing variables, to obtain accurate posteriors, and to guarantee convergence to the optim…
▽ More
We report the application of implicit likelihood inference to the prediction of the macro-parameters of strong lensing systems with neural networks. This allows us to perform deep learning analysis of lensing systems within a well-defined Bayesian statistical framework to explicitly impose desired priors on lensing variables, to obtain accurate posteriors, and to guarantee convergence to the optimal posterior in the limit of perfect performance. We train neural networks to perform a regression task to produce point estimates of lensing parameters. We then interpret these estimates as compressed statistics in our inference setup and model their likelihood function using mixture density networks. We compare our results with those of approximate Bayesian neural networks, discuss their significance, and point to future directions. Based on a test set of 100,000 strong lensing simulations, our amortized model produces accurate posteriors for any arbitrary confidence interval, with a maximum percentage deviation of $1.4\%$ at $21.8\%$ confidence level, without the need for any added calibration procedure. In total, inferring 100,000 different posteriors takes a day on a single GPU, showing that the method scales well to the thousands of lenses expected to be discovered by upcoming sky surveys.
△ Less
Submitted 30 November, 2022;
originally announced December 2022.
-
Posterior samples of source galaxies in strong gravitational lenses with score-based priors
Authors:
Alexandre Adam,
Adam Coogan,
Nikolay Malkin,
Ronan Legin,
Laurence Perreault-Levasseur,
Yashar Hezaveh,
Yoshua Bengio
Abstract:
Inferring accurate posteriors for high-dimensional representations of the brightness of gravitationally-lensed sources is a major challenge, in part due to the difficulties of accurately quantifying the priors. Here, we report the use of a score-based model to encode the prior for the inference of undistorted images of background galaxies. This model is trained on a set of high-resolution images o…
▽ More
Inferring accurate posteriors for high-dimensional representations of the brightness of gravitationally-lensed sources is a major challenge, in part due to the difficulties of accurately quantifying the priors. Here, we report the use of a score-based model to encode the prior for the inference of undistorted images of background galaxies. This model is trained on a set of high-resolution images of undistorted galaxies. By adding the likelihood score to the prior score and using a reverse-time stochastic differential equation solver, we obtain samples from the posterior. Our method produces independent posterior samples and models the data almost down to the noise level. We show how the balance between the likelihood and the prior meet our expectations in an experiment with out-of-distribution data.
△ Less
Submitted 29 November, 2022; v1 submitted 7 November, 2022;
originally announced November 2022.
-
GaMPEN: A Machine Learning Framework for Estimating Bayesian Posteriors of Galaxy Morphological Parameters
Authors:
Aritra Ghosh,
C. Megan Urry,
Amrit Rau,
Laurence Perreault-Levasseur,
Miles Cranmer,
Kevin Schawinski,
Dominic Stark,
Chuan Tian,
Ryan Ofman,
Tonima Tasnim Ananna,
Connor Auge,
Nico Cappelluti,
David B. Sanders,
Ezequiel Treister
Abstract:
We introduce a novel machine learning framework for estimating the Bayesian posteriors of morphological parameters for arbitrarily large numbers of galaxies. The Galaxy Morphology Posterior Estimation Network (GaMPEN) estimates values and uncertainties for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and flux ($F$). To estimate posteriors, GaMPEN uses the Monte Carl…
▽ More
We introduce a novel machine learning framework for estimating the Bayesian posteriors of morphological parameters for arbitrarily large numbers of galaxies. The Galaxy Morphology Posterior Estimation Network (GaMPEN) estimates values and uncertainties for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and flux ($F$). To estimate posteriors, GaMPEN uses the Monte Carlo Dropout technique and incorporates the full covariance matrix between the output parameters in its loss function. GaMPEN also uses a Spatial Transformer Network (STN) to automatically crop input galaxy frames to an optimal size before determining their morphology. This will allow it to be applied to new data without prior knowledge of galaxy size. Training and testing GaMPEN on galaxies simulated to match $z < 0.25$ galaxies in Hyper Suprime-Cam Wide $g$-band images, we demonstrate that GaMPEN achieves typical errors of $0.1$ in $L_B/L_T$, $0.17$ arcsec ($\sim 7\%$) in $R_e$, and $6.3\times10^4$ nJy ($\sim 1\%$) in $F$. GaMPEN's predicted uncertainties are well-calibrated and accurate ($<5\%$ deviation) -- for regions of the parameter space with high residuals, GaMPEN correctly predicts correspondingly large uncertainties. We also demonstrate that we can apply categorical labels (i.e., classifications such as "highly bulge-dominated") to predictions in regions with high residuals and verify that those labels are $\gtrsim 97\%$ accurate. To the best of our knowledge, GaMPEN is the first machine learning framework for determining joint posterior distributions of multiple morphological parameters and is also the first application of an STN to optical imaging in astronomy.
△ Less
Submitted 11 July, 2022;
originally announced July 2022.
-
Population-Level Inference of Strong Gravitational Lenses with Neural Network-Based Selection Correction
Authors:
Ronan Legin,
Connor Stone,
Yashar Hezaveh,
Laurence Perreault-Levasseur
Abstract:
A new generation of sky surveys is poised to provide unprecedented volumes of data containing hundreds of thousands of new strong lensing systems in the coming years. Convolutional neural networks are currently the only state-of-the-art method that can handle the onslaught of data to discover and infer the parameters of individual systems. However, many important measurements that involve strong l…
▽ More
A new generation of sky surveys is poised to provide unprecedented volumes of data containing hundreds of thousands of new strong lensing systems in the coming years. Convolutional neural networks are currently the only state-of-the-art method that can handle the onslaught of data to discover and infer the parameters of individual systems. However, many important measurements that involve strong lensing require population-level inference of these systems. In this work, we propose a hierarchical inference framework that uses the inference of individual lensing systems in combination with the selection function to estimate population-level parameters. In particular, we show that it is possible to model the selection function of a CNN-based lens finder with a neural network classifier, enabling fast inference of population-level parameters without the need for expensive Monte Carlo simulations.
△ Less
Submitted 8 July, 2022;
originally announced July 2022.
-
Pixelated Reconstruction of Gravitational Lenses using Recurrent Inference Machines
Authors:
Alexandre Adam,
Laurence Perreault-Levasseur,
Yashar Hezaveh
Abstract:
Modeling strong gravitational lenses in order to quantify the distortions in the images of background sources and to reconstruct the mass density in the foreground lenses has traditionally been a difficult computational challenge. As the quality of gravitational lens images increases, the task of fully exploiting the information they contain becomes computationally and algorithmically more difficu…
▽ More
Modeling strong gravitational lenses in order to quantify the distortions in the images of background sources and to reconstruct the mass density in the foreground lenses has traditionally been a difficult computational challenge. As the quality of gravitational lens images increases, the task of fully exploiting the information they contain becomes computationally and algorithmically more difficult. In this work, we use a neural network based on the Recurrent Inference Machine (RIM) to simultaneously reconstruct an undistorted image of the background source and the lens mass density distribution as pixelated maps. The method we present iteratively reconstructs the model parameters (the source and density map pixels) by learning the process of optimization of their likelihood given the data using the physical model (a ray-tracing simulation), regularized by a prior implicitly learned by the neural network through its training data. When compared to more traditional parametric models, the proposed method is significantly more expressive and can reconstruct complex mass distributions, which we demonstrate by using realistic lensing galaxies taken from the cosmological hydrodynamic simulation IllustrisTNG.
△ Less
Submitted 3 July, 2022;
originally announced July 2022.
-
Correlated Read Noise Reduction in Infrared Arrays Using Deep Learning
Authors:
Guillaume Payeur,
Étienne Artigau,
Laurence Perreault-Levasseur,
René Doyon
Abstract:
We present a new procedure rooted in deep learning to construct science images from data cubes collected by astronomical instruments using HxRG detectors in low-flux regimes. It improves on the drawbacks of the conventional algorithms to construct 2D images from multiple readouts by using the readout scheme of the detectors to reduce the impact of correlated readout noise. We train a convolutional…
▽ More
We present a new procedure rooted in deep learning to construct science images from data cubes collected by astronomical instruments using HxRG detectors in low-flux regimes. It improves on the drawbacks of the conventional algorithms to construct 2D images from multiple readouts by using the readout scheme of the detectors to reduce the impact of correlated readout noise. We train a convolutional recurrent neural network on simulated astrophysical scenes added to laboratory darks to estimate the flux on each pixel of science images. This method achieves a reduction of the noise on constructed science images when compared to standard flux-measurement schemes (correlated double sampling, up-the-ramp sampling), which results in a reduction of the error on the spectrum extracted from these science images. Over simulated data cubes created in a low signal-to-noise ratio regime where this method could have the largest impact, we find that the error on our constructed science images falls faster than a $1/\sqrt{N}$ decay, and that the spectrum extracted from the images has, averaged over a test set of three images, a standard error reduced by a factor of 1.85 in comparison to the standard up-the-ramp pixel sampling scheme. The code used in this project is publicly available on GitHub
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
CosmicRIM : Reconstructing Early Universe by Combining Differentiable Simulations with Recurrent Inference Machines
Authors:
Chirag Modi,
François Lanusse,
Uroš Seljak,
David N. Spergel,
Laurence Perreault-Levasseur
Abstract:
Reconstructing the Gaussian initial conditions at the beginning of the Universe from the survey data in a forward modeling framework is a major challenge in cosmology. This requires solving a high dimensional inverse problem with an expensive, non-linear forward model: a cosmological N-body simulation. While intractable until recently, we propose to solve this inference problem using an automatica…
▽ More
Reconstructing the Gaussian initial conditions at the beginning of the Universe from the survey data in a forward modeling framework is a major challenge in cosmology. This requires solving a high dimensional inverse problem with an expensive, non-linear forward model: a cosmological N-body simulation. While intractable until recently, we propose to solve this inference problem using an automatically differentiable N-body solver, combined with a recurrent networks to learn the inference scheme and obtain the maximum-a-posteriori (MAP) estimate of the initial conditions of the Universe. We demonstrate using realistic cosmological observables that learnt inference is 40 times faster than traditional algorithms such as ADAM and LBFGS, which require specialized annealing schemes, and obtains solution of higher quality.
△ Less
Submitted 26 April, 2021;
originally announced April 2021.
-
A Machine Learning Approach to Integral Field Unit Spectroscopy Observations: II. HII Region LineRatios
Authors:
Carter Rhea,
Laurie Rousseau-Nepton,
Simon Prunet,
Myriam Prasow-Emond,
Julie Hlavacek-Larrondo,
Natalia Vale Asari,
Kathryn Grasha,
Laurence Perreault-Levasseur
Abstract:
In the first paper of this series (Rhea et al. 2020), we demonstrated that neural networks can robustly and efficiently estimate kinematic parameters for optical emission-line spectra taken by SITELLE at the Canada-France-Hawaii Telescope. This paper expands upon this notion by developing an artificial neural network to estimate the line ratios of strong emission-lines present in the SN1, SN2, and…
▽ More
In the first paper of this series (Rhea et al. 2020), we demonstrated that neural networks can robustly and efficiently estimate kinematic parameters for optical emission-line spectra taken by SITELLE at the Canada-France-Hawaii Telescope. This paper expands upon this notion by developing an artificial neural network to estimate the line ratios of strong emission-lines present in the SN1, SN2, and SN3 filters of SITELLE. We construct a set of 50,000 synthetic spectra using line ratios taken from the Mexican Million Model database replicating Hii regions. Residual analysis of the network on the test set reveals the network's ability to apply tight constraints to the line ratios. We verified the network's efficacy by constructing an activation map, checking the [N ii] doublet fixed ratio, and applying a standard k-fold cross-correlation. Additionally, we apply the network to SITELLE observation of M33; the residuals between the algorithm's estimates and values calculated using standard fitting methods show general agreement. Moreover, the neural network reduces the computational costs by two orders of magnitude. Although standard fitting routines do consistently well depending on the signal-to-noise ratio of the spectral features, the neural network can also excel at predictions in the low signal-to-noise regime within the controlled environment of the training set as well as on observed data when the source spectral properties are well constrained by models. These results reinforce the power of machine learning in spectral analysis.
△ Less
Submitted 11 February, 2021;
originally announced February 2021.
-
Modeling assembly bias with machine learning and symbolic regression
Authors:
Digvijay Wadekar,
Francisco Villaescusa-Navarro,
Shirley Ho,
Laurence Perreault-Levasseur
Abstract:
Upcoming 21cm surveys will map the spatial distribution of cosmic neutral hydrogen (HI) over unprecedented volumes. Mock catalogues are needed to fully exploit the potential of these surveys. Standard techniques employed to create these mock catalogs, like Halo Occupation Distribution (HOD), rely on assumptions such as the baryonic properties of dark matter halos only depend on their masses. In th…
▽ More
Upcoming 21cm surveys will map the spatial distribution of cosmic neutral hydrogen (HI) over unprecedented volumes. Mock catalogues are needed to fully exploit the potential of these surveys. Standard techniques employed to create these mock catalogs, like Halo Occupation Distribution (HOD), rely on assumptions such as the baryonic properties of dark matter halos only depend on their masses. In this work, we use the state-of-the-art magneto-hydrodynamic simulation IllustrisTNG to show that the HI content of halos exhibits a strong dependence on their local environment. We then use machine learning techniques to show that this effect can be 1) modeled by these algorithms and 2) parametrized in the form of novel analytic equations. We provide physical explanations for this environmental effect and show that ignoring it leads to underprediction of the real-space 21-cm power spectrum at $k\gtrsim 0.05$ h/Mpc by $\gtrsim$10\%, which is larger than the expected precision from upcoming surveys on such large scales. Our methodology of combining numerical simulations with machine learning techniques is general, and opens a new direction at modeling and parametrizing the complex physics of assembly bias needed to generate accurate mocks for galaxy and line intensity mapping surveys.
△ Less
Submitted 30 November, 2020;
originally announced December 2020.
-
deep21: a Deep Learning Method for 21cm Foreground Removal
Authors:
T. Lucas Makinen,
Lachlan Lancaster,
Francisco Villaescusa-Navarro,
Peter Melchior,
Shirley Ho,
Laurence Perreault-Levasseur,
David N. Spergel
Abstract:
We seek to remove foreground contaminants from 21cm intensity mapping observations. We demonstrate that a deep convolutional neural network (CNN) with a UNet architecture and three-dimensional convolutions, trained on simulated observations, can effectively separate frequency and spatial patterns of the cosmic neutral hydrogen (HI) signal from foregrounds in the presence of noise. Cleaned maps rec…
▽ More
We seek to remove foreground contaminants from 21cm intensity mapping observations. We demonstrate that a deep convolutional neural network (CNN) with a UNet architecture and three-dimensional convolutions, trained on simulated observations, can effectively separate frequency and spatial patterns of the cosmic neutral hydrogen (HI) signal from foregrounds in the presence of noise. Cleaned maps recover cosmological clustering statistics within 10% at all relevant angular scales and frequencies. This amounts to a reduction in prediction variance of over an order of magnitude on small angular scales ($\ell > 300$), and improved accuracy for small radial scales ($k_{\parallel} > 0.17\ \rm h\ Mpc^{-1})$ compared to standard Principal Component Analysis (PCA) methods. We estimate posterior confidence intervals for the network's prediction by training an ensemble of UNets. Our approach demonstrates the feasibility of analyzing 21cm intensity maps, as opposed to derived summary statistics, for upcoming radio experiments, as long as the simulated foreground model is sufficiently realistic. We provide the code used for this analysis on Github https://github.com/tlmakinen/deep21 as well as a browser-based tutorial for the experiment and UNet model via the accompanying http://bit.ly/deep21-colab Colab notebook.
△ Less
Submitted 1 June, 2021; v1 submitted 29 October, 2020;
originally announced October 2020.
-
A Novel Machine Learning Approach to Disentangle Multi-Temperature Regions in Galaxy Clusters
Authors:
Carter L. Rhea,
Julie Hlavacek-Larrondo,
Laurence Perreault-Levasseur,
Marie-Lou Gendron-Marsolais,
Ralph Kraft
Abstract:
The hot intra-cluster medium (ICM) surrounding the heart of galaxy clusters is a complex medium comprised of various emitting components. Although previous studies of nearby galaxy clusters, such as the Perseus, the Coma, or the Virgo cluster, have demonstrated the need for multiple thermal components when spectroscopically fitting the ICM's X-ray emission, no systematic methodology for calculatin…
▽ More
The hot intra-cluster medium (ICM) surrounding the heart of galaxy clusters is a complex medium comprised of various emitting components. Although previous studies of nearby galaxy clusters, such as the Perseus, the Coma, or the Virgo cluster, have demonstrated the need for multiple thermal components when spectroscopically fitting the ICM's X-ray emission, no systematic methodology for calculating the number of underlying components currently exists. In turn, underestimating or overestimating the number of components can cause systematic errors in the emission parameter estimations. In this paper, we present a novel approach to determining the number of components using an amalgam of machine learning techniques. Synthetic spectra containing a various number of underlying thermal components were created using well-established tools available from the \textit{Chandra} X-ray Observatory. The dimensions of the training set was initially reduced using the Principal Component Analysis and then categorized based on the number of underlying components using a Random Forest Classifier. Our trained and tested algorithm was subsequently applied to \textit{Chandra} X-ray observations of the Perseus cluster. Our results demonstrate that machine learning techniques can efficiently and reliably estimate the number of underlying thermal components in the spectra of galaxy clusters, regardless of the thermal model (MEKAL versus APEC). %and signal-to-noise ratio used. We also confirm that the core of the Perseus cluster contains a mix of differing underlying thermal components. We emphasize that although this methodology was trained and applied on \textit{Chandra} X-ray observations, it is readily portable to other current (e.g. XMM-Newton, eROSITA) and upcoming (e.g. Athena, Lynx, XRISM) X-ray telescopes. The code is publicly available at \url{https://github.com/XtraAstronomy/Pumpkin}.
△ Less
Submitted 1 September, 2020;
originally announced September 2020.
-
HInet: Generating neutral hydrogen from dark matter with neural networks
Authors:
Digvijay Wadekar,
Francisco Villaescusa-Navarro,
Shirley Ho,
Laurence Perreault-Levasseur
Abstract:
Upcoming 21cm surveys will map the spatial distribution of cosmic neutral hydrogen (HI) over very large cosmological volumes. In order to maximize the scientific return of these surveys, accurate theoretical predictions are needed. Hydrodynamic simulations currently are the most accurate tool to provide those predictions in the mildly to non-linear regime. Unfortunately, their computational cost i…
▽ More
Upcoming 21cm surveys will map the spatial distribution of cosmic neutral hydrogen (HI) over very large cosmological volumes. In order to maximize the scientific return of these surveys, accurate theoretical predictions are needed. Hydrodynamic simulations currently are the most accurate tool to provide those predictions in the mildly to non-linear regime. Unfortunately, their computational cost is very high: tens of millions of CPU hours. We use convolutional neural networks to find the mapping between the spatial distribution of matter from N-body simulations and HI from the state-of-the-art hydrodynamic simulation IllustrisTNG. Our model performs better than the widely used theoretical model: Halo Occupation Distribution (HOD) for all statistical properties up to the non-linear scales $k\lesssim1$ h/Mpc. Our method allows the generation of 21cm mocks over very big cosmological volumes with similar properties as hydrodynamic simulations.
△ Less
Submitted 27 July, 2021; v1 submitted 20 July, 2020;
originally announced July 2020.
-
Bayesian Neural Networks
Authors:
Tom Charnock,
Laurence Perreault-Levasseur,
François Lanusse
Abstract:
In recent times, neural networks have become a powerful tool for the analysis of complex and abstract data models. However, their introduction intrinsically increases our uncertainty about which features of the analysis are model-related and which are due to the neural network. This means that predictions by neural networks have biases which cannot be trivially distinguished from being due to the…
▽ More
In recent times, neural networks have become a powerful tool for the analysis of complex and abstract data models. However, their introduction intrinsically increases our uncertainty about which features of the analysis are model-related and which are due to the neural network. This means that predictions by neural networks have biases which cannot be trivially distinguished from being due to the true nature of the creation and observation of data or not. In order to attempt to address such issues we discuss Bayesian neural networks: neural networks where the uncertainty due to the network can be characterised. In particular, we present the Bayesian statistical framework which allows us to categorise uncertainty in terms of the ingrained randomness of observing certain data and the uncertainty from our lack of knowledge about how data can be created and observed. In presenting such techniques we show how errors in prediction by neural networks can be obtained in principle, and provide the two favoured methods for characterising these errors. We will also describe how both of these methods have substantial pitfalls when put into practice, highlighting the need for other statistical techniques to truly be able to do inference when using neural networks.
△ Less
Submitted 6 November, 2020; v1 submitted 2 June, 2020;
originally announced June 2020.
-
LRP2020: Probing Diverse Phenomena through Data-Intensive Astronomy
Authors:
Mubdi Rahman,
Dustin Lang,
Renée Hložek,
Jo Bovy,
Laurence Perreault-Levasseur
Abstract:
The era of data-intensive astronomy is being ushered in with the increasing size and complexity of observational data across wavelength and time domains, the development of algorithms to extract information from this complexity, and the computational power to apply these algorithms to the growing repositories of data. Data-intensive approaches are pushing the boundaries of nearly all fields of ast…
▽ More
The era of data-intensive astronomy is being ushered in with the increasing size and complexity of observational data across wavelength and time domains, the development of algorithms to extract information from this complexity, and the computational power to apply these algorithms to the growing repositories of data. Data-intensive approaches are pushing the boundaries of nearly all fields of astronomy, from exoplanet science to cosmology, and they are becoming a critical modality for how we understand the universe. The success of these approaches range from the discovery of rare or unexpected phenomena, to characterizing processes that are now accessible with precision astrophysics and a deep statistical understanding of the datasets, to developing algorithms that maximize the science that can be extracted from any set of observations.
In this white paper, we propose a number of initiatives to maximize Canada's ability to compete in this data-intensive era. We propose joining international collaborations and leveraging Canadian facilities for legacy data potential. We propose continuing to build a more agile computing infrastructure that's responsive to the needs of tackling larger and more complex data, as well as enabling quick prototyping and scaling of algorithms. We recognize that developing the fundamental skills of the field will be critical for Canadian astronomers, and discuss avenues through with the appropriate computational and statistical training could occur. Finally, we note that the transition to data-intensive techniques is not limited to astronomy, and we should coordinate with other disciplines to develop and make use of best practises in methods, infrastructure, and education.
△ Less
Submitted 4 October, 2019;
originally announced October 2019.
-
LRP2020: Machine Learning Advantages in Canadian Astrophysics
Authors:
K. A. Venn,
S. Fabbro,
A Liu,
Y. Hezaveh,
L. Perreault-Levasseur,
G. Eadie,
S. Ellison,
J. Woo,
JJ. Kavelaars,
K. M. Yi,
R. Hlozek,
J. Bovy,
H. Teimoorinia,
S. Ravanbakhsh,
L. Spencer
Abstract:
The application of machine learning (ML) methods to the analysis of astrophysical datasets is on the rise, particularly as the computing power and complex algorithms become more powerful and accessible. As the field of ML enjoys a continuous stream of breakthroughs, its applications demonstrate the great potential of ML, ranging from achieving tens of millions of times increase in analysis speed (…
▽ More
The application of machine learning (ML) methods to the analysis of astrophysical datasets is on the rise, particularly as the computing power and complex algorithms become more powerful and accessible. As the field of ML enjoys a continuous stream of breakthroughs, its applications demonstrate the great potential of ML, ranging from achieving tens of millions of times increase in analysis speed (e.g., modeling of gravitational lenses or analysing spectroscopic surveys) to solutions of previously unsolved problems (e.g., foreground subtraction or efficient telescope operations). The number of astronomical publications that include ML has been steadily increasing since 2010.
With the advent of extremely large datasets from a new generation of surveys in the 2020s, ML methods will become an indispensable tool in astrophysics. Canada is an unambiguous world leader in the development of the field of machine learning, attracting large investments and skilled researchers to its prestigious AI Research Institutions. This provides a unique opportunity for Canada to also be a world leader in the application of machine learning in the field of astrophysics, and foster the training of a new generation of highly skilled researchers.
△ Less
Submitted 15 October, 2019; v1 submitted 2 October, 2019;
originally announced October 2019.
-
Cleaning our own Dust: Simulating and Separating Galactic Dust Foregrounds with Neural Networks
Authors:
K. Aylor,
M. Haq,
L. Knox,
Y. Hezaveh,
L. Perreault-Levasseur
Abstract:
Separating galactic foreground emission from maps of the cosmic microwave background (CMB), and quantifying the uncertainty in the CMB maps due to errors in foreground separation are important for avoiding biases in scientific conclusions. Our ability to quantify such uncertainty is limited by our lack of a model for the statistical distribution of the foreground emission. Here we use a Deep Convo…
▽ More
Separating galactic foreground emission from maps of the cosmic microwave background (CMB), and quantifying the uncertainty in the CMB maps due to errors in foreground separation are important for avoiding biases in scientific conclusions. Our ability to quantify such uncertainty is limited by our lack of a model for the statistical distribution of the foreground emission. Here we use a Deep Convolutional Generative Adversarial Network (DCGAN) to create an effective non-Gaussian statistical model for intensity of emission by interstellar dust. For training data we use a set of dust maps inferred from observations by the Planck satellite. A DCGAN is uniquely suited for such unsupervised learning tasks as it can learn to model a complex non-Gaussian distribution directly from examples. We then use these simulations to train a second neural network to estimate the underlying CMB signal from dust-contaminated maps. We discuss other potential uses for the trained DCGAN, and the generalization to polarized emission from both dust and synchrotron.
△ Less
Submitted 13 September, 2019;
originally announced September 2019.