-
A Strong Gravitational Lens Is Worth a Thousand Dark Matter Halos: Inference on Small-Scale Structure Using Sequential Methods
Authors:
Sebastian Wagner-Carena,
Jaehoon Lee,
Jeffrey Pennington,
Jelle Aalbers,
Simon Birrer,
Risa H. Wechsler
Abstract:
Strong gravitational lenses are a singular probe of the universe's small-scale structure $\unicode{x2013}$ they are sensitive to the gravitational effects of low-mass $(<10^{10} M_\odot)$ halos even without a luminous counterpart. Recent strong-lensing analyses of dark matter structure rely on simulation-based inference (SBI). Modern SBI methods, which leverage neural networks as density estimator…
▽ More
Strong gravitational lenses are a singular probe of the universe's small-scale structure $\unicode{x2013}$ they are sensitive to the gravitational effects of low-mass $(<10^{10} M_\odot)$ halos even without a luminous counterpart. Recent strong-lensing analyses of dark matter structure rely on simulation-based inference (SBI). Modern SBI methods, which leverage neural networks as density estimators, have shown promise in extracting the halo-population signal. However, it is unclear whether the constraining power of these models has been limited by the methodology or the information content of the data. In this study, we introduce an accelerator-optimized simulation pipeline that can generate lens images with realistic subhalo populations in a matter of milliseconds. Leveraging this simulator, we identify the main methodological limitation of our fiducial SBI analysis: training set size. We then adopt a sequential neural posterior estimation (SNPE) approach, allowing us to iteratively refine the distribution of simulated training images to better align with the observed data. Using only one-fifth as many mock Hubble Space Telescope (HST) images, SNPE matches the constraints on the low-mass halo population produced by our best non-sequential model. Our experiments suggest that an over three order-of-magnitude increase in training set size and GPU hours would be required to achieve an equivalent result without sequential methods. While the full potential of the existing strong lens sample remains to be explored, the notable improvement in constraining power enabled by our sequential approach highlights that the current constraints are limited primarily by methodology and not the data itself. Moreover, our results emphasize the need to treat training set generation and model optimization as interconnected stages of any cosmological analysis using simulation-based inference techniques.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Hierarchical Inference of the Lensing Convergence from Photometric Catalogs with Bayesian Graph Neural Networks
Authors:
Ji Won Park,
Simon Birrer,
Madison Ueland,
Miles Cranmer,
Adriano Agnello,
Sebastian Wagner-Carena,
Philip J. Marshall,
Aaron Roodman,
the LSST Dark Energy Science Collaboration
Abstract:
We present a Bayesian graph neural network (BGNN) that can estimate the weak lensing convergence ($κ$) from photometric measurements of galaxies along a given line of sight. The method is of particular interest in strong gravitational time delay cosmography (TDC), where characterizing the "external convergence" ($κ_{\rm ext}$) from the lens environment and line of sight is necessary for precise in…
▽ More
We present a Bayesian graph neural network (BGNN) that can estimate the weak lensing convergence ($κ$) from photometric measurements of galaxies along a given line of sight. The method is of particular interest in strong gravitational time delay cosmography (TDC), where characterizing the "external convergence" ($κ_{\rm ext}$) from the lens environment and line of sight is necessary for precise inference of the Hubble constant ($H_0$). Starting from a large-scale simulation with a $κ$ resolution of $\sim$1$'$, we introduce fluctuations on galaxy-galaxy lensing scales of $\sim$1$''$ and extract random sightlines to train our BGNN. We then evaluate the model on test sets with varying degrees of overlap with the training distribution. For each test set of 1,000 sightlines, the BGNN infers the individual $κ$ posteriors, which we combine in a hierarchical Bayesian model to yield constraints on the hyperparameters governing the population. For a test field well sampled by the training set, the BGNN recovers the population mean of $κ$ precisely and without bias, resulting in a contribution to the $H_0$ error budget well under 1\%. In the tails of the training set with sparse samples, the BGNN, which can ingest all available information about each sightline, extracts more $κ$ signal compared to a simplified version of the traditional method based on matching galaxy number counts, which is limited by sample variance. Our hierarchical inference pipeline using BGNNs promises to improve the $κ_{\rm ext}$ characterization for precision TDC. The implementation of our pipeline is available as a public Python package, Node to Joy.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.
-
Symphony: Cosmological Zoom-in Simulation Suites over Four Decades of Host Halo Mass
Authors:
Ethan O. Nadler,
Philip Mansfield,
Yunchong Wang,
Xiaolong Du,
Susmita Adhikari,
Arka Banerjee,
Andrew Benson,
Elise Darragh-Ford,
Yao-Yuan Mao,
Sebastian Wagner-Carena,
Risa H. Wechsler,
Hao-Yi Wu
Abstract:
We present Symphony, a compilation of $262$ cosmological, cold-dark-matter-only zoom-in simulations spanning four decades of host halo mass, from $10^{11}$$-$$10^{15}~M_{\mathrm{\odot}}$. This compilation includes three existing simulation suites at the cluster and Milky Way$-$mass scales, and two new suites: $39$ Large Magellanic Cloud-mass ($10^{11}~M_{\mathrm{\odot}}$) and $49$ strong-lens-anal…
▽ More
We present Symphony, a compilation of $262$ cosmological, cold-dark-matter-only zoom-in simulations spanning four decades of host halo mass, from $10^{11}$$-$$10^{15}~M_{\mathrm{\odot}}$. This compilation includes three existing simulation suites at the cluster and Milky Way$-$mass scales, and two new suites: $39$ Large Magellanic Cloud-mass ($10^{11}~M_{\mathrm{\odot}}$) and $49$ strong-lens-analog ($10^{13}~M_{\mathrm{\odot}}$) group-mass hosts. Across the entire host halo mass range, the highest-resolution regions in these simulations are resolved with a dark matter particle mass of $\approx 3\times 10^{-7}$ times the host virial mass and a Plummer-equivalent gravitational softening length of $\approx 9\times 10^{-4}$ times the host virial radius, on average. We measure correlations between subhalo abundance and host concentration, formation time, and maximum subhalo mass, all of which peak at the Milky Way host halo mass scale. Subhalo abundances are $\approx 50\%$ higher in clusters than in lower-mass hosts at fixed sub-to-host halo mass ratios. Subhalo radial distributions are approximately self-similar as a function of host mass and are less concentrated than hosts' underlying dark matter distributions. We compare our results to the semianalytic model $\mathrm{\texttt{Galacticus}}$, which predicts subhalo mass functions with a higher normalization at the low-mass end and radial distributions that are slightly more concentrated than Symphony. We use $\mathrm{\texttt{UniverseMachine}}$ to model halo and subhalo star formation histories in Symphony, and we demonstrate that these predictions resolve the formation histories of the halos that host nearly all currently observable satellite galaxies in the universe. To promote open use of Symphony, data products are publicly available at http://web.stanford.edu/group/gfc/symphony.
△ Less
Submitted 16 March, 2023; v1 submitted 6 September, 2022;
originally announced September 2022.
-
From Images to Dark Matter: End-To-End Inference of Substructure From Hundreds of Strong Gravitational Lenses
Authors:
Sebastian Wagner-Carena,
Jelle Aalbers,
Simon Birrer,
Ethan O. Nadler,
Elise Darragh-Ford,
Philip J. Marshall,
Risa H. Wechsler
Abstract:
Constraining the distribution of small-scale structure in our universe allows us to probe alternatives to the cold dark matter paradigm. Strong gravitational lensing offers a unique window into small dark matter halos ($<10^{10} M_\odot$) because these halos impart a gravitational lensing signal even if they do not host luminous galaxies. We create large datasets of strong lensing images with real…
▽ More
Constraining the distribution of small-scale structure in our universe allows us to probe alternatives to the cold dark matter paradigm. Strong gravitational lensing offers a unique window into small dark matter halos ($<10^{10} M_\odot$) because these halos impart a gravitational lensing signal even if they do not host luminous galaxies. We create large datasets of strong lensing images with realistic low-mass halos, Hubble Space Telescope (HST) observational effects, and galaxy light from HST's COSMOS field. Using a simulation-based inference pipeline, we train a neural posterior estimator of the subhalo mass function (SHMF) and place constraints on populations of lenses generated using a separate set of galaxy sources. We find that by combining our network with a hierarchical inference framework, we can both reliably infer the SHMF across a variety of configurations and scale efficiently to populations with hundreds of lenses. By conducting precise inference on large and complex simulated datasets, our method lays a foundation for extracting dark matter constraints from the next generation of wide-field optical imaging surveys.
△ Less
Submitted 6 March, 2023; v1 submitted 1 March, 2022;
originally announced March 2022.
-
lenstronomy II: A gravitational lensing software ecosystem
Authors:
Simon Birrer,
Anowar J. Shajib,
Daniel Gilman,
Aymeric Galan,
Jelle Aalbers,
Martin Millon,
Robert Morgan,
Giulia Pagano,
Ji Won Park,
Luca Teodori,
Nicolas Tessore,
Madison Ueland,
Lyne Van de Vyvere,
Sebastian Wagner-Carena,
Ewoud Wempe,
Lilan Yang,
Xuheng Ding,
Thomas Schmidt,
Dominique Sluse,
Ming Zhang,
Adam Amara
Abstract:
lenstronomy is an Astropy-affiliated Python package for gravitational lensing simulations and analyses. lenstronomy was introduced by Birrer and Amara (2018) and is based on the linear basis set approach by Birrer et a. (2015). The user and developer base of lenstronomy has substantially grown since then, and the software has become an integral part of a wide range of recent analyses, such as meas…
▽ More
lenstronomy is an Astropy-affiliated Python package for gravitational lensing simulations and analyses. lenstronomy was introduced by Birrer and Amara (2018) and is based on the linear basis set approach by Birrer et a. (2015). The user and developer base of lenstronomy has substantially grown since then, and the software has become an integral part of a wide range of recent analyses, such as measuring the Hubble constant with time-delay strong lensing or constraining the nature of dark matter from resolved and unresolved small scale lensing distortion statistics. The modular design has allowed the community to incorporate innovative new methods, as well as to develop enhanced software and wrappers with more specific aims on top of the lenstronomy API. Through community engagement and involvement, lenstronomy has become a foundation of an ecosystem of affiliated packages extending the original scope of the software and proving its robustness and applicability at the forefront of the strong gravitational lensing community in an open source and reproducible manner.
△ Less
Submitted 10 June, 2021;
originally announced June 2021.
-
Large-Scale Gravitational Lens Modeling with Bayesian Neural Networks for Accurate and Precise Inference of the Hubble Constant
Authors:
Ji Won Park,
Sebastian Wagner-Carena,
Simon Birrer,
Philip J. Marshall,
Joshua Yao-Yu Lin,
Aaron Roodman
Abstract:
We investigate the use of approximate Bayesian neural networks (BNNs) in modeling hundreds of time-delay gravitational lenses for Hubble constant ($H_0$) determination. Our BNN was trained on synthetic HST-quality images of strongly lensed active galactic nuclei (AGN) with lens galaxy light included. The BNN can accurately characterize the posterior PDFs of model parameters governing the elliptica…
▽ More
We investigate the use of approximate Bayesian neural networks (BNNs) in modeling hundreds of time-delay gravitational lenses for Hubble constant ($H_0$) determination. Our BNN was trained on synthetic HST-quality images of strongly lensed active galactic nuclei (AGN) with lens galaxy light included. The BNN can accurately characterize the posterior PDFs of model parameters governing the elliptical power-law mass profile in an external shear field. We then propagate the BNN-inferred posterior PDFs into ensemble $H_0$ inference, using simulated time delay measurements from a plausible dedicated monitoring campaign. Assuming well-measured time delays and a reasonable set of priors on the environment of the lens, we achieve a median precision of $9.3$\% per lens in the inferred $H_0$. A simple combination of 200 test-set lenses results in a precision of 0.5 $\textrm{km s}^{-1} \textrm{ Mpc}^{-1}$ ($0.7\%$), with no detectable bias in this $H_0$ recovery test. The computation time for the entire pipeline -- including the training set generation, BNN training, and $H_0$ inference -- translates to 9 minutes per lens on average for 200 lenses and converges to 6 minutes per lens as the sample size is increased. Being fully automated and efficient, our pipeline is a promising tool for exploring ensemble-level systematics in lens modeling for $H_0$ inference.
△ Less
Submitted 11 April, 2021; v1 submitted 30 November, 2020;
originally announced December 2020.
-
Hierarchical Inference With Bayesian Neural Networks: An Application to Strong Gravitational Lensing
Authors:
Sebastian Wagner-Carena,
Ji Won Park,
Simon Birrer,
Philip J. Marshall,
Aaron Roodman,
Risa H. Wechsler
Abstract:
In the past few years, approximate Bayesian Neural Networks (BNNs) have demonstrated the ability to produce statistically consistent posteriors on a wide range of inference problems at unprecedented speed and scale. However, any disconnect between training sets and the distribution of real-world objects can introduce bias when BNNs are applied to data. This is a common challenge in astrophysics an…
▽ More
In the past few years, approximate Bayesian Neural Networks (BNNs) have demonstrated the ability to produce statistically consistent posteriors on a wide range of inference problems at unprecedented speed and scale. However, any disconnect between training sets and the distribution of real-world objects can introduce bias when BNNs are applied to data. This is a common challenge in astrophysics and cosmology, where the unknown distribution of objects in our Universe is often the science goal. In this work, we incorporate BNNs with flexible posterior parameterizations into a hierarchical inference framework that allows for the reconstruction of population hyperparameters and removes the bias introduced by the training distribution. We focus on the challenge of producing posterior PDFs for strong gravitational lens mass model parameters given Hubble Space Telescope (HST) quality single-filter, lens-subtracted, synthetic imaging data. We show that the posterior PDFs are sufficiently accurate (i.e., statistically consistent with the truth) across a wide variety of power-law elliptical lens mass distributions. We then apply our approach to test data sets whose lens parameters are drawn from distributions that are drastically different from the training set. We show that our hierarchical inference framework mitigates the bias introduced by an unrepresentative training set's interim prior. Simultaneously, given a sufficiently broad training set, we can precisely reconstruct the population hyperparameters governing our test distributions. Our full pipeline, from training to hierarchical inference on thousands of lenses, can be run in a day. The framework presented here will allow us to efficiently exploit the full constraining power of future ground- and space-based surveys.
△ Less
Submitted 22 March, 2021; v1 submitted 26 October, 2020;
originally announced October 2020.
-
TDCOSMO IV: Hierarchical time-delay cosmography -- joint inference of the Hubble constant and galaxy density profiles
Authors:
S. Birrer,
A. J. Shajib,
A. Galan,
M. Millon,
T. Treu,
A. Agnello,
M. Auger,
G. C. -F. Chen,
L. Christensen,
T. Collett,
F. Courbin,
C. D. Fassnacht,
L. V. E. Koopmans,
P. J. Marshall,
J. -W. Park,
C. E. Rusu,
D. Sluse,
C. Spiniello,
S. H. Suyu,
S. Wagner-Carena,
K. C. Wong,
M. Barnabè,
A. S. Bolton,
O. Czoske,
X. Ding
, et al. (2 additional authors not shown)
Abstract:
The H0LiCOW collaboration inferred via gravitational lensing time delays a Hubble constant $H_0=73.3^{+1.7}_{-1.8}$ km s$^{-1}{\rm Mpc}^{-1}$, describing deflector mass density profiles by either a power-law or stars plus standard dark matter halos. The mass-sheet transform (MST) that leaves the lensing observables unchanged is considered the dominant source of residual uncertainty in $H_0$. We qu…
▽ More
The H0LiCOW collaboration inferred via gravitational lensing time delays a Hubble constant $H_0=73.3^{+1.7}_{-1.8}$ km s$^{-1}{\rm Mpc}^{-1}$, describing deflector mass density profiles by either a power-law or stars plus standard dark matter halos. The mass-sheet transform (MST) that leaves the lensing observables unchanged is considered the dominant source of residual uncertainty in $H_0$. We quantify any potential effect of the MST with flexible mass models that are maximally degenerate with H0. Our calculation is based on a new hierarchical approach in which the MST is only constrained by stellar kinematics. The approach is validated on hydrodynamically simulated lenses. We apply the method to the TDCOSMO sample of 7 lenses (6 from H0LiCOW) and measure $H_0=74.5^{+5.6}_{-6.1}$ km s$^{-1}{\rm Mpc}^{-1}$. In order to further constrain the deflector mass profiles, we then add imaging and spectroscopy for 33 strong gravitational lenses from the SLACS sample. For 9 of the SLAC lenses we use resolved kinematics to constrain the stellar anisotropy. From the joint analysis of the TDCOSMO+SLACS sample, we measure $H_0=67.4^{+4.1}_{-3.2}$ km s$^{-1}{\rm Mpc}^{-1}$, assuming that the TDCOSMO and SLACS galaxies are drawn from the same parent population. The blind H0LiCOW, TDCOSMO-only and TDCOSMO+SLACS analyses are in mutual statistical agreement. The TDCOSMO+SLACS analysis prefers marginally shallower mass profiles than H0LiCOW or TDCOSMO-only. While our new analysis does not statistically invalidate the mass profile assumptions by H0LiCOW, and thus their $H_0$ measurement relying on those, it demonstrates the importance of understanding the mass density profile of elliptical galaxies. The uncertainties on $H_0$ derived in this paper can be reduced by physical or observational priors on the form of the mass profile, or by additional data, chiefly spatially resolved kinematics of lens galaxies.
△ Less
Submitted 19 December, 2020; v1 submitted 6 July, 2020;
originally announced July 2020.
-
A Novel CMB Component Separation Method: Hierarchical Generalized Morphological Component Analysis
Authors:
Sebastian Wagner-Carena,
Max Hopkins,
Ana Diaz Rivero,
Cora Dvorkin
Abstract:
We present a novel technique for Cosmic Microwave Background (CMB) foreground subtraction based on the framework of blind source separation. Inspired by previous work incorporating local variation to Generalized Morphological Component Analysis (GMCA), we introduce Hierarchical GMCA (HGMCA), a Bayesian hierarchical graphical model for source separation. We test our method on $N_{\rm side}=256$ sim…
▽ More
We present a novel technique for Cosmic Microwave Background (CMB) foreground subtraction based on the framework of blind source separation. Inspired by previous work incorporating local variation to Generalized Morphological Component Analysis (GMCA), we introduce Hierarchical GMCA (HGMCA), a Bayesian hierarchical graphical model for source separation. We test our method on $N_{\rm side}=256$ simulated sky maps that include dust, synchrotron, free-free and anomalous microwave emission, and show that HGMCA reduces foreground contamination by $25\%$ over GMCA in both the regions included and excluded by the Planck UT78 mask, decreases the error in the measurement of the CMB temperature power spectrum to the $0.02-0.03\%$ level at $\ell>200$ (and $<0.26\%$ for all $\ell$), and reduces correlation to all the foregrounds. We find equivalent or improved performance when compared to state-of-the-art Internal Linear Combination (ILC)-type algorithms on these simulations, suggesting that HGMCA may be a competitive alternative to foreground separation techniques previously applied to observed CMB data. Additionally, we show that our performance does not suffer when we perturb model parameters or alter the CMB realization, which suggests that our algorithm generalizes well beyond our simplified simulations. Our results open a new avenue for constructing CMB maps through Bayesian hierarchical analysis.
△ Less
Submitted 26 April, 2020; v1 submitted 17 October, 2019;
originally announced October 2019.