-
Physics-informed neural networks in the recreation of hydrodynamic simulations from dark matter
Authors:
Zhenyu Dai,
Ben Moews,
Ricardo Vilalta,
Romeel Dave
Abstract:
Physics-informed neural networks have emerged as a coherent framework for building predictive models that combine statistical patterns with domain knowledge. The underlying notion is to enrich the optimization loss function with known relationships to constrain the space of possible solutions. Hydrodynamic simulations are a core constituent of modern cosmology, while the required computations are both expensive and time-consuming. At the same time, the comparatively fast simulation of dark matter requires fewer resources, which has led to the emergence of machine learning algorithms for baryon inpainting as an active area of research; here, recreating the scatter found in hydrodynamic simulations is an ongoing challenge. This paper presents the first application of physics-informed neural networks to baryon inpainting by combining advances in neural network architectures with physical constraints, injecting theory on baryon conversion efficiency into the model loss function. We also introduce a punitive prediction comparison based on the Kullback-Leibler divergence, which enforces scatter reproduction. By simultaneously extracting the complete set of baryonic properties for the Simba suite of cosmological simulations, our results demonstrate improved accuracy of baryonic predictions based on dark matter halo properties, successful recovery of the fundamental metallicity relation, and retrieval of scatter that traces the target simulation's distribution.
Submitted 19 October, 2023; v1 submitted 24 March, 2023;
originally announced March 2023.
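The Kullback-Leibler comparison mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' loss term: the function name `kl_divergence`, the histogram binning, and the smoothing constant `eps` are assumptions made for illustration. It estimates the divergence between the distribution of target values and the distribution of predicted values, which grows when the predictions fail to reproduce the target's scatter.

```python
import numpy as np

def kl_divergence(p_samples, q_samples, bins=20, eps=1e-10):
    """Histogram estimate of D_KL(P || Q) from two 1-D samples.

    Hypothetical helper: the binning and the eps smoothing are illustrative choices.
    """
    p_samples = np.asarray(p_samples, dtype=float)
    q_samples = np.asarray(q_samples, dtype=float)
    # Shared bin edges so both histograms are defined on the same support.
    lo = min(p_samples.min(), q_samples.min())
    hi = max(p_samples.max(), q_samples.max())
    edges = np.linspace(lo, hi, bins + 1)
    p, _ = np.histogram(p_samples, bins=edges)
    q, _ = np.histogram(q_samples, bins=edges)
    # Smooth empty bins, then normalise to probability vectors.
    p = (p + eps) / (p + eps).sum()
    q = (q + eps) / (q + eps).sum()
    return float(np.sum(p * np.log(p / q)))

rng = np.random.default_rng(0)
target = rng.normal(0.0, 1.0, 5000)   # stand-in for simulation values with scatter
narrow = rng.normal(0.0, 0.3, 5000)   # stand-in for predictions with too little scatter
same = kl_divergence(target, target)
worse = kl_divergence(target, narrow)
```

A prediction set that collapses toward the mean leaves the tails of the target distribution nearly empty, so the divergence is large; a term of this kind added to a regression loss would penalise that collapse.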
-
A New Constraint on the Nuclear Equation of State from Statistical Distributions of Compact Remnants of Supernovae
Authors:
Mikhail M. Meskhi,
Noah E. Wolfe,
Zhenyu Dai,
Carla Frohlich,
Jonah M. Miller,
Raymond K. W. Wong,
Ricardo Vilalta
Abstract:
Understanding how matter behaves at the highest densities and temperatures is a major open problem in both nuclear physics and relativistic astrophysics. This physics is often encapsulated in the so-called high-temperature nuclear equation of state, which influences compact binary mergers, core-collapse supernovae, and many more phenomena. One such case is the type (either black hole or neutron star) and mass of the remnant of the core collapse of a massive star. For each of six candidate equations of state, we use a very large suite of spherically symmetric supernova models to generate a suite of synthetic populations of such remnants. We then compare these synthetic populations to the observed remnant population. We thus provide a novel constraint on the high-temperature nuclear equation of state and describe which EOS candidates are more or less favored by this metric.
Submitted 27 March, 2022; v1 submitted 2 November, 2021;
originally announced November 2021.
-
Active learning with RESSPECT: Resource allocation for extragalactic astronomical transients
Authors:
Noble Kennamer,
Emille E. O. Ishida,
Santiago Gonzalez-Gaitan,
Rafael S. de Souza,
Alexander Ihler,
Kara Ponder,
Ricardo Vilalta,
Anais Moller,
David O. Jones,
Mi Dai,
Alberto Krone-Martins,
Bruno Quint,
Sreevarsha Sreejith,
Alex I. Malz,
Lluis Galbany
Abstract:
The recent increase in volume and complexity of available astronomical data has led to a wide use of supervised machine learning techniques. Active learning strategies have been proposed as an alternative to optimize the distribution of scarce labeling resources. However, due to the specific conditions in which labels can be acquired, fundamental assumptions, such as sample representativeness and labeling cost stability, cannot be fulfilled. The Recommendation System for Spectroscopic follow-up (RESSPECT) project aims to enable the construction of optimized training samples for the Rubin Observatory Legacy Survey of Space and Time (LSST), taking into account a realistic description of the astronomical data environment. In this work, we test the robustness of active learning techniques in a realistic simulated astronomical data scenario. Our experiment takes into account the evolution of training and pool samples, different costs per object, and two different sources of budget. Results show that traditional active learning strategies significantly outperform random sampling. Nevertheless, more complex batch strategies are not able to significantly outperform simple uncertainty sampling techniques. Our findings illustrate three important points: 1) active learning strategies are a powerful tool to optimize the label-acquisition task in astronomy, 2) for upcoming large surveys like LSST, such techniques allow us to tailor the construction of the training sample for the first day of the survey, and 3) the peculiar data environment related to the detection of astronomical transients is a fertile ground that calls for the development of tailored machine learning algorithms.
Submitted 26 October, 2020; v1 submitted 12 October, 2020;
originally announced October 2020.
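The uncertainty sampling baseline referred to in the abstract above can be sketched in a few lines. This is an illustrative version only, not the RESSPECT implementation: the function name `uncertainty_sampling` and the use of predictive entropy as the uncertainty score are assumptions. Given class probabilities from any classifier for the unlabeled pool, it returns the objects the classifier is least sure about, which would then be sent for labeling.

```python
import numpy as np

def uncertainty_sampling(proba, batch_size=1):
    """Return indices of the pool objects with the highest predictive entropy.

    proba: array of shape (n_objects, n_classes) with class probabilities.
    Hypothetical helper; entropy is one of several common uncertainty scores.
    """
    proba = np.clip(np.asarray(proba, dtype=float), 1e-12, 1.0)
    entropy = -np.sum(proba * np.log(proba), axis=1)
    # Sort by decreasing entropy and keep the first batch_size objects.
    return np.argsort(-entropy, kind="stable")[:batch_size]

# Three pool objects: confident, maximally uncertain, moderately uncertain.
pool_proba = [[0.99, 0.01], [0.50, 0.50], [0.80, 0.20]]
queried = uncertainty_sampling(pool_proba, batch_size=2)
```

In a full loop, the queried objects would be labeled, moved from the pool to the training sample, and the classifier retrained before the next query.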
-
Ridges in the Dark Energy Survey for cosmic trough identification
Authors:
Ben Moews,
Morgan A. Schmitz,
Andrew J. Lawler,
Joe Zuntz,
Alex I. Malz,
Rafael S. de Souza,
Ricardo Vilalta,
Alberto Krone-Martins,
Emille E. O. Ishida
Abstract:
Cosmic voids and their corresponding redshift-projected mass densities, known as troughs, play an important role in our attempt to model the large-scale structure of the Universe. Understanding these structures enables us to compare the standard model with alternative cosmologies, constrain the dark energy equation of state, and distinguish between different gravitational theories. In this paper, we extend the subspace-constrained mean shift algorithm, a recently introduced method to estimate density ridges, and apply it to 2D weak lensing mass density maps from the Dark Energy Survey Y1 data release to identify curvilinear filamentary structures. We compare the obtained ridges with previous approaches to extract trough structure in the same data, and apply curvelets as an alternative wavelet-based method to constrain densities. We then invoke the Wasserstein distance between noisy and noiseless simulations to validate the denoising capabilities of our method. Our results demonstrate the viability of ridge estimation as a precursor for denoising weak lensing observables to recover the large-scale structure, paving the way for a more versatile and effective search for troughs.
Submitted 14 November, 2022; v1 submitted 18 May, 2020;
originally announced May 2020.
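The Wasserstein distance used for validation in the abstract above has a simple closed form in one dimension, sketched here. This is a simplification and not the paper's procedure: the function name `wasserstein_1d` is hypothetical, and flattening a map to compare pixel-value distributions is an assumption made for illustration. For two equal-size samples, the first-order distance is the mean absolute difference between their sorted values.

```python
import numpy as np

def wasserstein_1d(u, v):
    """First-order Wasserstein distance between two equal-size 1-D samples.

    Hypothetical helper: inputs (e.g. 2-D maps) are flattened, so only the
    distributions of values are compared, not their spatial arrangement.
    """
    u = np.sort(np.ravel(np.asarray(u, dtype=float)))
    v = np.sort(np.ravel(np.asarray(v, dtype=float)))
    if u.size != v.size:
        raise ValueError("this sketch assumes samples of equal size")
    # Optimal 1-D transport pairs the i-th smallest values of each sample.
    return float(np.mean(np.abs(u - v)))

identical = wasserstein_1d([0.0, 1.0, 2.0], [2.0, 0.0, 1.0])
shifted = wasserstein_1d([0.0, 1.0, 2.0], [1.0, 2.0, 3.0])
```

A denoising method could be scored under this metric by checking whether the distance between a denoised noisy map and its noiseless counterpart is smaller than the distance before denoising.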
-
Algorithms and Statistical Models for Scientific Discovery in the Petabyte Era
Authors:
Brian Nord,
Andrew J. Connolly,
Jamie Kinney,
Jeremy Kubica,
Gautaum Narayan,
Joshua E. G. Peek,
Chad Schafer,
Erik J. Tollerud,
Camille Avestruz,
G. Jogesh Babu,
Simon Birrer,
Douglas Burke,
João Caldeira,
Douglas A. Caldwell,
Joleen K. Carlberg,
Yen-Chi Chen,
Chuanfei Dong,
Eric D. Feigelson,
V. Zach Golkhou,
Vinay Kashyap,
T. S. Li,
Thomas Loredo,
Luisa Lucie-Smith,
Kaisey S. Mandel,
J. R. Martínez-Galarza, et al. (13 additional authors not shown)
Abstract:
The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our technical and collaborative frameworks to promote efficient algorithmic development and take advantage of opportunities for scientific discovery in the petabyte era. We discuss challenges for discovery in large and complex data sets; challenges and requirements for the next stage of development of statistical methodologies and algorithmic tool sets; how we might change our paradigms of collaboration and education; and the ethical implications of scientists' contributions to widely applicable algorithms and computational modeling. We start with six distinct recommendations that are supported by the commentary following them. This white paper is related to a larger corpus of effort that has taken place within and around the Petabytes to Science Workshops (https://petabytestoscience.github.io/).
Submitted 4 November, 2019;
originally announced November 2019.
-
Dark Matter Science in the Era of LSST
Authors:
Keith Bechtol,
Alex Drlica-Wagner,
Kevork N. Abazajian,
Muntazir Abidi,
Susmita Adhikari,
Yacine Ali-Haïmoud,
James Annis,
Behzad Ansarinejad,
Robert Armstrong,
Jacobo Asorey,
Carlo Baccigalupi,
Arka Banerjee,
Nilanjan Banik,
Charles Bennett,
Florian Beutler,
Simeon Bird,
Simon Birrer,
Rahul Biswas,
Andrea Biviano,
Jonathan Blazek,
Kimberly K. Boddy,
Ana Bonaca,
Julian Borrill,
Sownak Bose,
Jo Bovy, et al. (155 additional authors not shown)
Abstract:
Astrophysical observations currently provide the only robust, empirical measurements of dark matter. In the coming decade, astrophysical observations will guide other experimental efforts, while simultaneously probing unique regions of dark matter parameter space. This white paper summarizes astrophysical observations that can constrain the fundamental physics of dark matter in the era of LSST. We describe how astrophysical observations will inform our understanding of the fundamental properties of dark matter, such as particle mass, self-interaction strength, non-gravitational interactions with the Standard Model, and compact object abundances. Additionally, we highlight theoretical work and experimental/observational facilities that will complement LSST to strengthen our understanding of the fundamental characteristics of dark matter.
Submitted 11 March, 2019;
originally announced March 2019.
-
Probing the Fundamental Nature of Dark Matter with the Large Synoptic Survey Telescope
Authors:
Alex Drlica-Wagner,
Yao-Yuan Mao,
Susmita Adhikari,
Robert Armstrong,
Arka Banerjee,
Nilanjan Banik,
Keith Bechtol,
Simeon Bird,
Kimberly K. Boddy,
Ana Bonaca,
Jo Bovy,
Matthew R. Buckley,
Esra Bulbul,
Chihway Chang,
George Chapline,
Johann Cohen-Tanugi,
Alessandro Cuoco,
Francis-Yan Cyr-Racine,
William A. Dawson,
Ana Díaz Rivero,
Cora Dvorkin,
Denis Erkal,
Christopher D. Fassnacht,
Juan García-Bellido,
Maurizio Giannotti, et al. (75 additional authors not shown)
Abstract:
Astrophysical and cosmological observations currently provide the only robust, empirical measurements of dark matter. Future observations with the Large Synoptic Survey Telescope (LSST) will provide necessary guidance for the experimental dark matter program. This white paper represents a community effort to summarize the science case for studying the fundamental physics of dark matter with LSST. We discuss how LSST will inform our understanding of the fundamental properties of dark matter, such as particle mass, self-interaction strength, non-gravitational couplings to the Standard Model, and compact object abundances. Additionally, we discuss the ways that LSST will complement other experiments to strengthen our understanding of the fundamental characteristics of dark matter. More information on the LSST dark matter effort can be found at https://lsstdarkmatter.github.io/ .
Submitted 24 April, 2019; v1 submitted 4 February, 2019;
originally announced February 2019.
-
Transfer Learning in Astronomy: A New Machine-Learning Paradigm
Authors:
Ricardo Vilalta
Abstract:
The widespread dissemination of machine learning tools in science, particularly in astronomy, has revealed the limitation of working with simple single-task scenarios in which any task in need of a predictive model is looked at in isolation, ignoring the existence of other similar tasks. In contrast, a new generation of techniques is emerging where predictive models can take advantage of previous experience to leverage information from similar tasks. This emerging area is referred to as transfer learning. In this paper, I briefly describe the motivation behind the use of transfer learning techniques, and explain how such techniques can be used to solve popular problems in astronomy. As an example, a prevalent problem in astronomy is to estimate the class of an object (e.g., Supernova Ia) using a generation of photometric light-curve datasets where data abounds, but class labels are scarce; such analysis can benefit from spectroscopic data where class labels are known with high confidence, but the data sample is small. Transfer learning provides a robust and practical solution to leverage information from one domain to improve the accuracy of a model built on a different domain. In the example above, transfer learning would look to overcome the difficulty in the compatibility of models between spectroscopic data and photometric data, since data properties such as size, class priors, and underlying distributions are all expected to be significantly different.
Submitted 20 December, 2018;
originally announced December 2018.
-
Stress testing the dark energy equation of state imprint on supernova data
Authors:
Ben Moews,
Rafael S. de Souza,
Emille E. O. Ishida,
Alex I. Malz,
Caroline Heneka,
Ricardo Vilalta,
Joe Zuntz
Abstract:
This work determines the degree to which a standard Lambda-CDM analysis based on type Ia supernovae can identify deviations from a cosmological constant in the form of a redshift-dependent dark energy equation of state w(z). We introduce and apply a novel random curve generator to simulate instances of w(z) from constraint families with increasing distinction from a cosmological constant. After producing a series of mock catalogs of binned type Ia supernovae corresponding to each w(z) curve, we perform a standard Lambda-CDM analysis to estimate the corresponding posterior densities of the absolute magnitude of type Ia supernovae, the present-day matter density, and the equation of state parameter. Using the Kullback-Leibler divergence between posterior densities as a difference measure, we demonstrate that a standard type Ia supernova cosmology analysis has limited sensitivity to extensive redshift dependencies of the dark energy equation of state. In addition, we report that larger redshift-dependent departures from a cosmological constant do not necessarily manifest as more easily detectable incompatibilities with the Lambda-CDM model. Our results suggest that physics beyond the standard model may simply be hidden in plain sight.
Submitted 5 July, 2019; v1 submitted 23 December, 2018;
originally announced December 2018.
-
Optimizing spectroscopic follow-up strategies for supernova photometric classification with active learning
Authors:
E. E. O. Ishida,
R. Beck,
S. Gonzalez-Gaitan,
R. S. de Souza,
A. Krone-Martins,
J. W. Barrett,
N. Kennamer,
R. Vilalta,
J. M. Burgess,
B. Quint,
A. Z. Vitorelli,
A. Mahabal,
E. Gangler
Abstract:
We report a framework for spectroscopic follow-up design for optimizing supernova photometric classification. The strategy accounts for the unavoidable mismatch between spectroscopic and photometric samples, and can be used even at the beginning of a new survey -- without any initial training set. The framework falls under the umbrella of active learning (AL), a class of algorithms that aims to minimize labelling costs by identifying a few, carefully chosen, objects which have high potential in improving the classifier predictions. As a proof of concept, we use the simulated data released after the Supernova Photometric Classification Challenge (SNPCC) and a random forest classifier. Our results show that, using only 12% of the number of training objects in the SNPCC spectroscopic sample, this approach is able to double purity results. Moreover, in order to take into account multiple spectroscopic observations in the same night, we propose a semi-supervised batch-mode AL algorithm which selects a set of $N=5$ most informative objects at each night. In comparison with the initial state using the traditional approach, our method achieves 2.3 times higher purity and comparable figure of merit results after only 180 days of observation, or 800 queries (73% of the SNPCC spectroscopic sample size). Such results were obtained using the same amount of spectroscopic time necessary to observe the original SNPCC spectroscopic sample, showing that this type of strategy is feasible with currently available spectroscopic resources. The code used in this work is available in the COINtoolbox: https://github.com/COINtoolbox/ActSNClass .
Submitted 3 January, 2019; v1 submitted 10 April, 2018;
originally announced April 2018.
-
A probabilistic approach to emission-line galaxy classification
Authors:
R. S. de Souza,
M. L. L. Dantas,
M. V. Costa-Duarte,
E. D. Feigelson,
M. Killedar,
P. -Y. Lablanche,
R. Vilalta,
A. Krone-Martins,
R. Beck,
F. Gieseke
Abstract:
We invoke a Gaussian mixture model (GMM) to jointly analyse two traditional emission-line classification schemes of galaxy ionization sources: the Baldwin-Phillips-Terlevich (BPT) and $\rm W_{Hα}$ vs. [NII]/H$α$ (WHAN) diagrams, using spectroscopic data from the Sloan Digital Sky Survey Data Release 7 and SEAGal/STARLIGHT datasets. We apply a GMM to empirically define classes of galaxies in a three-dimensional space spanned by the optical parameters $\log$ [OIII]/H$β$, $\log$ [NII]/H$α$, and $\log$ EW(H$α$). The best-fit GMM based on several statistical criteria suggests a solution around four Gaussian components (GCs), which are capable of explaining up to 97 per cent of the data variance. Using elements of information theory, we compare each GC to its respective astronomical counterpart. GC1 and GC4 are associated with star-forming galaxies, suggesting the need to define a new starburst subgroup. GC2 is associated with BPT's Active Galactic Nuclei (AGN) class and WHAN's weak AGN class. GC3 is associated with BPT's composite class and WHAN's strong AGN class. Conversely, there is no statistical evidence -- based on four GCs -- for the existence of a Seyfert/LINER dichotomy in our sample. Notwithstanding, the inclusion of an additional GC5 unravels it. The GC5 appears associated with the LINER and Passive galaxies on the BPT and WHAN diagrams respectively. Subtleties aside, we demonstrate the potential of our methodology to recover/unravel different objects inside the wilderness of astronomical datasets, without lacking the ability to convey physically interpretable results. The probabilistic classifications from the GMM analysis are publicly available within the COINtoolbox (https://cointoolbox.github.io/GMM_Catalogue/).
Submitted 18 August, 2017; v1 submitted 22 March, 2017;
originally announced March 2017.
-
Exploring the spectroscopic diversity of type Ia supernovae with DRACULA: a machine learning approach
Authors:
Michele Sasdelli,
E. E. O. Ishida,
R. Vilalta,
M. Aguena,
V. C. Busti,
H. Camacho,
A. M. M. Trindade,
F. Gieseke,
R. S. de Souza,
Y. T. Fantaye,
P. A. Mazzali
Abstract:
The existence of multiple subclasses of type Ia supernovae (SNeIa) has been the subject of great debate in the last decade. One major challenge inevitably met when trying to infer the existence of one or more subclasses is the time-consuming and subjective process of subclass definition. In this work, we show how machine learning tools facilitate identification of subtypes of SNeIa through the establishment of a hierarchical group structure in the continuous space of spectral diversity formed by these objects. Using Deep Learning, we were capable of performing such identification in a 4-dimensional feature space (+1 for time evolution), while the standard Principal Component Analysis barely achieves similar results using 15 principal components. This is evidence that the progenitor system and the explosion mechanism can be described by a small number of initial physical parameters. As a proof of concept, we show that our results are in close agreement with a previously suggested classification scheme and that our proposed method can grasp the main spectral features behind the definition of such subtypes. This allows the confirmation of the velocity of lines as a first order effect in the determination of SNIa subtypes, followed by 91bg-like events. Given the expected data deluge in the forthcoming years, our proposed approach is essential to allow a quick and statistically coherent identification of SNeIa subtypes (and outliers). All tools used in this work were made publicly available in the Python package Dimensionality Reduction And Clustering for Unsupervised Learning in Astronomy (DRACULA) and can be found within COINtoolbox (https://github.com/COINtoolbox/DRACULA).
Submitted 30 June, 2016; v1 submitted 21 December, 2015;
originally announced December 2015.
-
The Overlooked Potential of Generalized Linear Models in Astronomy - I: Binomial Regression
Authors:
R. S. de Souza,
E. Cameron,
M. Killedar,
J. Hilbe,
R. Vilalta,
U. Maio,
V. Biffi,
B. Ciardi,
J. D. Riggs
Abstract:
Revealing hidden patterns in astronomical data is often the path to fundamental scientific breakthroughs; meanwhile the complexity of scientific inquiry increases as more subtle relationships are sought. Contemporary data analysis problems often elude the capabilities of classical statistical techniques, suggesting the use of cutting-edge statistical methods. In this light, astronomers have overlooked a whole family of statistical techniques for exploratory data analysis and robust regression, the so-called Generalized Linear Models (GLMs). In this paper -- the first in a series aimed at illustrating the power of these methods in astronomical applications -- we elucidate the potential of a particular class of GLMs for handling binary/binomial data, the so-called logit and probit regression techniques, from both a maximum likelihood and a Bayesian perspective. As a case in point, we present the use of these GLMs to explore the conditions of star formation activity and metal enrichment in primordial minihaloes from cosmological hydro-simulations including detailed chemistry, gas physics, and stellar feedback. We predict that for a dark mini-halo with metallicity $\approx 1.3 \times 10^{-4} Z_{\odot}$, an increase of $1.2 \times 10^{-2}$ in the gas molecular fraction increases the probability of star formation occurrence by a factor of 75%. Finally, we highlight the use of receiver operating characteristic curves as a diagnostic for binary classifiers, and ultimately we use these to demonstrate the competitive predictive performance of GLMs against the popular technique of artificial neural networks.
Submitted 4 April, 2015; v1 submitted 26 September, 2014;
originally announced September 2014.
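The maximum likelihood side of the logit regression described in the abstract above can be sketched as follows. This is a generic illustration, not the paper's analysis: the function name `fit_logit`, the Newton-Raphson (iteratively reweighted least squares) solver, the small `ridge` stabiliser, and the synthetic data are all assumptions. The model is $P(y=1 \mid x) = 1 / (1 + e^{-(\beta_0 + \beta_1 x)})$, so $e^{\beta_1}$ is the multiplicative change in the odds per unit increase in $x$.

```python
import numpy as np

def fit_logit(x, y, n_iter=25, ridge=1e-8):
    """Maximum likelihood logit regression via Newton-Raphson (IRLS).

    Hypothetical helper; returns [intercept, slope(s)].
    """
    X = np.column_stack([np.ones(len(x)), x])     # prepend an intercept column
    y = np.asarray(y, dtype=float)
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))       # predicted probabilities
        w = p * (1.0 - p)                         # binomial variance weights
        hessian = X.T @ (X * w[:, None]) + ridge * np.eye(X.shape[1])
        beta = beta + np.linalg.solve(hessian, X.T @ (y - p))
    return beta

# Synthetic binary outcomes with known coefficients (intercept -1, slope 2).
rng = np.random.default_rng(0)
x = rng.normal(size=5000)
true_p = 1.0 / (1.0 + np.exp(-(-1.0 + 2.0 * x)))
y = rng.random(5000) < true_p
beta_hat = fit_logit(x, y)
odds_ratio = float(np.exp(beta_hat[1]))           # change in odds per unit of x
```

A probit fit would replace the logistic function with the standard normal cumulative distribution, and a Bayesian treatment would place priors on the coefficients instead of maximising the likelihood alone.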