-
The importance of stochasticity in determining galaxy emissivities and UV LFs during cosmic dawn and reionization
Authors:
Ivan Nikolić,
Andrei Mesinger,
James E. Davies,
David Prelogović
Abstract:
The stochastic nature of star formation and photon propagation in high-redshift galaxies can result in sizable galaxy-to-galaxy scatter in their properties. Ignoring this scatter by assuming mean quantities can bias estimates of their emissivity and corresponding observables. We construct a flexible, semi-empirical model, sampling scatter around the following mean relations: (i) the conditional ha…
▽ More
The stochastic nature of star formation and photon propagation in high-redshift galaxies can result in sizable galaxy-to-galaxy scatter in their properties. Ignoring this scatter by assuming mean quantities can bias estimates of their emissivity and corresponding observables. We construct a flexible, semi-empirical model, sampling scatter around the following mean relations: (i) the conditional halo mass function (CHMF); (ii) the stellar-to-halo mass relation (SHMR); (iii) galaxy star formation main sequence (SFMS); (iv) fundamental metallicity relation (FMR); (v) conditional intrinsic luminosity; and (vi) photon escape fraction. In our fiducial model, ignoring scatter in these galaxy properties overestimates the duration of the EoR, delaying its completion by up to $Δz$ ~ 2. We quantify the relative importance of each of the above sources of scatter in determining the ionizing, soft-band X-ray and Lyman Werner (LW) emissivities as a function of scale and redshift. We find that scatter around the SFMS is important for all bands, especially at the highest redshifts where the emissivity is dominated by the faintest, most "bursty" galaxies. Ignoring this scatter would underestimate the mean emissivity and its standard deviation computed over 5 cMpc regions by factors of up to $\sim$2-10 at $5< z < 15$. Scatter around the X-ray luminosity to star formation rate relation is important for determining X-ray emissivity, accounting for roughly half of its mean and standard deviation. The importance of scatter in the ionizing escape fraction depends on its functional form, while scatter around the SHMR contributes at the level of ~10-20%. Although scatter does flatten the UV luminosity functions, shifting the bright end by 1-2 magnitudes, the level of scatter in our fiducial model is insufficient to fully explain recent estimates from JWST photometry (consistent with previous studies).
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Exploring the role of the halo mass function for inferring astrophysical parameters during reionisation
Authors:
Bradley Greig,
David Prelogović,
Jordan Mirocha,
Yuxiang Qin,
Yuan-Sen Ting,
Andrei Mesinger
Abstract:
The detection of the 21-cm signal at $z\gtrsim6$ will reveal insights into the properties of the first galaxies responsible for driving reionisation. To extract this information, we perform parameter inference which requires embedding 3D simulations of the 21-cm signal within a Bayesian inference pipeline. Presently, when performing inference we must choose which sources of uncertainty to sample a…
▽ More
The detection of the 21-cm signal at $z\gtrsim6$ will reveal insights into the properties of the first galaxies responsible for driving reionisation. To extract this information, we perform parameter inference which requires embedding 3D simulations of the 21-cm signal within a Bayesian inference pipeline. Presently, when performing inference we must choose which sources of uncertainty to sample and which to hold fixed. Since the astrophysics of galaxies are much more uncertain than those of the underlying halo-mass function (HMF), we usually parameterise and model the former while fixing the latter. However, in doing so we may bias our inference of the properties of these first galaxies. In this work, we explore the consequences of assuming an incorrect choice of HMF and quantify the relative biases in our inferred astrophysical model parameters when considering the wrong HMF. We then relax this assumption by constructing a generalised five parameter model for the HMF and simultaneously recover these parameters along with our underlying astrophysical model. For this analysis, we use 21cmFAST and perform Simulation-Based Inference by applying marginal neural ratio estimation to learn the likelihood-to-evidence ratio using Swyft. Using a mock 1000 hour observation of the 21-cm power spectrum from the forthcoming Square Kilometre Array, conservatively assuming foreground wedge avoidance, we find assuming the incorrect HMF can bias the recovered astrophysical parameters by up to $\sim3-4σ$ even when including independent information from observed luminosity functions. When considering our generalised HMF model, we recover constraints on our astrophysical parameters with a factor of $\sim2-4$ larger marginalised uncertainties. Importantly, these constraints are unbiased, agnostic to the underlying HMF and therefore more conservative.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Inferring astrophysical parameters using the 2D cylindrical power spectrum from reionisation
Authors:
Bradley Greig,
David Prelogović,
Yuxiang Qin,
Yuan-Sen Ting,
Andrei Mesinger
Abstract:
Enlightening our understanding of the first galaxies responsible for driving reionisation requires detecting the 21-cm signal from neutral hydrogen. Interpreting the wealth of information embedded in this signal requires Bayesian inference. Parameter inference from the 21-cm signal is primarily restricted to the spherically averaged power spectrum (1D PS) owing to its relatively straightforward de…
▽ More
Enlightening our understanding of the first galaxies responsible for driving reionisation requires detecting the 21-cm signal from neutral hydrogen. Interpreting the wealth of information embedded in this signal requires Bayesian inference. Parameter inference from the 21-cm signal is primarily restricted to the spherically averaged power spectrum (1D PS) owing to its relatively straightforward derivation of an analytic likelihood function enabling traditional Monte-Carlo Markov-Chain (MCMC) approaches. However, in recent years, simulation-based inference (SBI) has become feasible which removes the necessity of having an analytic likelihood, enabling more complex summary statistics of the 21-cm signal to be used for Bayesian inference. In this work, we use SBI, specifically marginal neural ratio estimation to learn the likelihood-to-evidence ratio with Swyft, to explore parameter inference using the cylindrically averaged 2D PS. Since the 21-cm signal is anisotropic, the 2D PS should yield more constraining information compared to the 1D PS which isotropically averages the signal. For this, we consider a mock 1000 hr observation of the 21-cm signal using the SKA and compare the performance of the 2D PS relative to the 1D PS. Additionally, we explore two separate foreground mitigation strategies, perfect foreground removal and wedge avoidance. We find the 2D PS outperforms the 1D PS by improving the marginalised uncertainties on individual astrophysical parameters by up to $\sim30-40$ per cent irrespective of the foreground mitigation strategy. Primarily, these improvements stem from how the 2D PS distinguishes between the transverse, $k_{\perp}$, and redshift dependent, $k_{\parallel}$ information which enables greater sensitivity to the complex reionisation morphology.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
How informative are summaries of the cosmic 21-cm signal?
Authors:
David Prelogović,
Andrei Mesinger
Abstract:
The cosmic 21-cm signal will bring data-driven advances to studies of the Cosmic Dawn (CD) and Epoch of Reionization (EoR). Radio telescopes such as the SKA will eventually map the HI fluctuations over the first billion years - the majority of our observable Universe. With such large data volumes, it becomes increasingly important to develop "optimal" summary statistics, allowing us to learn as mu…
▽ More
The cosmic 21-cm signal will bring data-driven advances to studies of the Cosmic Dawn (CD) and Epoch of Reionization (EoR). Radio telescopes such as the SKA will eventually map the HI fluctuations over the first billion years - the majority of our observable Universe. With such large data volumes, it becomes increasingly important to develop "optimal" summary statistics, allowing us to learn as much as possible about the CD and EoR. In this work we compare the constraining power of several 21-cm summary statistics, using the determinant of the Fisher information matrix, $\det F$. Since we do not have an established "fiducial" model for the astrophysics of the first galaxies, we compute the distribution of $\det F$ across the prior volume. Using a large database of cosmic 21-cm lightcones that include realizations of telescope noise, we compare the following summaries: (i) the spherically-averaged power spectrum (1DPS), (ii) the cylindrically-averaged power spectrum (2DPS), (iii) the 2D Wavelet scattering transform (WST), (iv) a recurrent neural network (RNN), (v) an information-maximizing neural network (IMNN), and (vi) the combination of 2DPS and IMNN. Our best performing individual summary is the 2DPS, having relatively high Fisher information throughout parameter space. Although capable of achieving the highest Fisher information for some parameter choices, the IMNN does not generalize well, resulting in a broad distribution. Our best results are achieved with the concatenation of the 2DPS and IMNN. The combination of only these two complimentary summaries reduces the recovered parameter variances on average by factors of $\sim$6.5 - 9.5, compared with using each summary independently. Finally, we point out that that the common assumption of a constant covariance matrix when doing Fisher forecasts using 21-cm summaries can significantly underestimate parameter constraints.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
21cmEMU: an emulator of 21cmFAST summary observables
Authors:
Daniela Breitman,
Andrei Mesinger,
Steven Murray,
David Prelogovic,
Yuxiang Qin,
Roberto Trotta
Abstract:
Recent years have witnessed rapid progress in observations of the Epoch of Reionization (EoR). These have enabled high-dimensional inference of galaxy and intergalactic medium (IGM) properties during the first billion years of our Universe. However, even using efficient, semi-numerical simulations, traditional inference approaches that compute 3D lightcones on-the-fly can take $10^5$ core hours. H…
▽ More
Recent years have witnessed rapid progress in observations of the Epoch of Reionization (EoR). These have enabled high-dimensional inference of galaxy and intergalactic medium (IGM) properties during the first billion years of our Universe. However, even using efficient, semi-numerical simulations, traditional inference approaches that compute 3D lightcones on-the-fly can take $10^5$ core hours. Here we present 21cmEMU: an emulator of several summary observables from the popular 21cmFAST simulation code. 21cmEMU takes as input nine parameters characterizing EoR galaxies, and outputs the following summary statistics: (i) the IGM mean neutral fraction; (ii) the 21-cm power spectrum; (iii) the mean 21-cm spin temperature; (iv) the sky-averaged (global) 21-cm signal; (vi) the ultraviolet (UV) luminosity functions (LFs); and (vii) the Thomson scattering optical depth to the cosmic microwave background (CMB). All observables are predicted with sub-percent median accuracy, with a reduction of the computational cost by a factor of over 10$^4$. After validating inference results, we showcase a few applications, including: (i) quantifying the relative constraining power of different observational datasets; (ii) seeing how recent claims of a late EoR impact previous inferences; and (iii) forecasting upcoming constraints from the sixth observing season of the Hydrogen Epoch of Reionization Array (HERA) telescope. 21cmEMU is publicly-available, and is included as an alternative simulator in the public 21CMMC sampler.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Are there more galaxies than we see around high-$z$ quasars?
Authors:
Tommaso Zana,
Stefano Carniani,
David Prelogović,
Fabio Vito,
Viola Allevato,
Andrea Ferrara,
Simona Gallerani,
Eleonora Parlanti
Abstract:
Whether or not $z \gtrsim 6$ quasars lie in the most massive dark-matter halos of the Universe is still a subject of dispute. While most theoretical studies support this scenario, current observations yield discordant results when they probe the halo mass through the detection rate of quasar companion galaxies. Feedback processes from supermassive black holes and dust obscuration have been blamed…
▽ More
Whether or not $z \gtrsim 6$ quasars lie in the most massive dark-matter halos of the Universe is still a subject of dispute. While most theoretical studies support this scenario, current observations yield discordant results when they probe the halo mass through the detection rate of quasar companion galaxies. Feedback processes from supermassive black holes and dust obscuration have been blamed for this discrepancy, but the impact of these effects is complex and far from being clearly understood. This paper aims to improve the interpretation of current far-infrared observations by taking into account the cosmological volume probed by the Atacama Large Millimeter/submillimeter Array Telescope and to explain the observational discrepancies. We statistically investigate the detection rate of quasar companions in current observations and verify if they match the expected distribution from various theoretical models, once convolved with the ALMA field-of-view, through the use of Monte Carlo simulations. We demonstrate that the telescope geometrical bias is fundamental and can alone explain the scatter in the number of detected satellite galaxies in different observations. We conclude that the resulting companion densities depend on the chosen galaxy distributions. According to our fiducial models, current data favour a density scenario where quasars lie in dark-matter halos of viral mass $M_{\rm vir} \gtrsim 10^{12}~{\rm M_{\odot}}$, in agreement with most theoretical studies. According to our analysis, each quasar has about 2 companion galaxies, with a [CII] luminosity $L_{\rm [CII]} \gtrsim 10^8~{\rm L}_{\odot}$, within a distance of about 1~Mpc from the quasar.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
Exploring the likelihood of the 21-cm power spectrum with simulation-based inference
Authors:
David Prelogović,
Andrei Mesinger
Abstract:
Observations of the cosmic 21-cm power spectrum (PS) are starting to enable precision Bayesian inference of galaxy properties and physical cosmology, during the first billion years of our Universe. Here we investigate the impact of common approximations about the likelihood used in such inferences, including: (i) assuming a Gaussian functional form; (ii) estimating the mean from a single realizati…
▽ More
Observations of the cosmic 21-cm power spectrum (PS) are starting to enable precision Bayesian inference of galaxy properties and physical cosmology, during the first billion years of our Universe. Here we investigate the impact of common approximations about the likelihood used in such inferences, including: (i) assuming a Gaussian functional form; (ii) estimating the mean from a single realization; and (iii) estimating the (co)variance at a single point in parameter space. We compare "classical" inference that uses an explicit likelihood with simulation based inference (SBI) that estimates the likelihood from a training set. Our forward-models include: (i) realizations of the cosmic 21-cm signal computed with 21cmFAST by varying UV and X-ray galaxy parameters together with the initial conditions; (ii) realizations of the telescope noise corresponding to a 1000 h integration with SKA1-Low; (iii) the excision of Fourier modes corresponding to a foreground-dominated, horizon "wedge". We find that the 1D PS likelihood is well described by a Gaussian accounting for covariances between wavemodes and redshift bins (higher order correlations are small). However, common approaches of estimating the forward-modeled mean and (co)variance from a random realization or at a single point in parameter space result in biased and over-constrained posteriors. Our best results come from using SBI to fit a non-Gaussian likelihood with a Gaussian mixture neural density estimator. Such SBI can be performed with up to an order of magnitude fewer simulations than classical, explicit likelihood inference. Thus SBI provides accurate posteriors at a comparably low computational cost.
△ Less
Submitted 4 July, 2023; v1 submitted 4 May, 2023;
originally announced May 2023.
-
Deep learning approach for identification of HII regions during reionization in 21-cm observations -- II. foreground contamination
Authors:
Michele Bianco,
Sambit. K. Giri,
David Prelogović,
Tianyue Chen,
Florent G. Mertens,
Emma Tolley,
Andrei Mesinger,
Jean-Paul Kneib
Abstract:
The upcoming Square Kilometre Array Observatory (SKAO) will produce images of neutral hydrogen distribution during the epoch of reionization by observing the corresponding 21-cm signal. However, the 21-cm signal will be subject to instrumental limitations such as noise and galactic foreground contamination which pose a challenge for accurate detection. In this study, we present the SegU-Net v2 fra…
▽ More
The upcoming Square Kilometre Array Observatory (SKAO) will produce images of neutral hydrogen distribution during the epoch of reionization by observing the corresponding 21-cm signal. However, the 21-cm signal will be subject to instrumental limitations such as noise and galactic foreground contamination which pose a challenge for accurate detection. In this study, we present the SegU-Net v2 framework, an enhanced version of our convolutional neural network, built to identify neutral and ionized regions in the 21-cm signal contaminated with foreground emission. We trained our neural network on 21-cm image data processed by a foreground removal method based on Principal Component Analysis achieving an average classification accuracy of 71 per cent between redshift $z=7$ to $11$. We tested SegU-Net v2 against various foreground removal methods, including Gaussian Process Regression, Polynomial Fitting, and Foreground-Wedge Removal. Results show comparable performance, highlighting SegU-Net v2's independence on these pre-processing methods. Statistical analysis shows that a perfect classification score with $AUC=95\%$ is possible for $8<z<10$. While the network prediction lacks the ability to correctly identify ionized regions at higher redshift and differentiate well the few remaining neutral regions at lower redshift due to low contrast between 21-cm signal, noise and foreground residual in images. Moreover, as the photon sources driving reionization are expected to be located inside ionised regions, we show that SegU-Net v2 can be used to correctly identify and measure the volume of isolated bubbles with $V_{\rm ion}>(10\, {\rm cMpc})^3$ at $z>9$, for follow-up studies with infrared/optical telescopes to detect these sources.
△ Less
Submitted 28 February, 2024; v1 submitted 5 April, 2023;
originally announced April 2023.
-
Characterizing Beam Errors for Radio Interferometric Observations of Reionization
Authors:
Ainulnabilah Nasirudin,
David Prelogovic,
Steven G. Murray,
Andrei Mesinger,
Gianni Bernardi
Abstract:
A limiting systematic effect in 21-cm interferometric experiments is the chromaticity due to the coupling between the sky and the instrument. This coupling is sourced by the instrument primary beam; therefore it is important to know the beam to extremely high precision. Here we demonstrate how known beam uncertainties can be characterized using databases of beam models. In this introductory work,…
▽ More
A limiting systematic effect in 21-cm interferometric experiments is the chromaticity due to the coupling between the sky and the instrument. This coupling is sourced by the instrument primary beam; therefore it is important to know the beam to extremely high precision. Here we demonstrate how known beam uncertainties can be characterized using databases of beam models. In this introductory work, we focus on beam errors arising from physically offset and/or broken antennas within a station. We use the public code OSKAR to generate an "ideal" SKA beam formed from 256 antennas regularly-spaced in a 35-m circle, as well as a large database of "perturbed" beams sampling distributions of broken/offset antennas. We decompose the beam errors ("ideal" minus "perturbed") using Principal Component Analysis (PCA) and Kernel PCA (KPCA). Using 20 components, we find that PCA/KPCA can reduce the residual of the beam in our datasets by 60-90% compared with the assumption of an ideal beam. Using a simulated observation of the cosmic signal plus foregrounds, we find that assuming the ideal beam can result in 1% error in the EoR window and 10% in the wedge of the 2D power spectrum. When PCA/KPCA is used to characterize the beam uncertainties, the error in the power spectrum shrinks to below 0.01% in the EoR window and <1% in the wedge. Our framework can be used to characterize and then marginalize over uncertainties in the beam for robust next-generation 21-cm parameter estimation.
△ Less
Submitted 26 January, 2022;
originally announced January 2022.
-
Machine learning astrophysics from 21 cm lightcones: impact of network architectures and signal contamination
Authors:
David Prelogović,
Andrei Mesinger,
Steven Murray,
Giuseppe Fiameni,
Nicolas Gillet
Abstract:
Imaging the cosmic 21 cm signal will map out the first billion years of our Universe. The resulting 3D lightcone (LC) will encode the properties of the unseen first galaxies and physical cosmology. Here, we build on previous work using neural networks (NNs) to infer astrophysical parameters directly from 21 cm LC images. We introduce recurrent neural networks (RNNs), capable of efficiently charact…
▽ More
Imaging the cosmic 21 cm signal will map out the first billion years of our Universe. The resulting 3D lightcone (LC) will encode the properties of the unseen first galaxies and physical cosmology. Here, we build on previous work using neural networks (NNs) to infer astrophysical parameters directly from 21 cm LC images. We introduce recurrent neural networks (RNNs), capable of efficiently characterizing the evolution along the redshift axis of 21 cm LC images. Using a large database of simulated cosmic 21 cm LCs, we compare the relative performance in parameter estimation of different network architectures. These including two types of RNNs, which differ in their complexity, as well as a more traditional convolutional neural network (CNN). For the ideal case of no instrumental effects, our simplest and easiest to train RNN performs the best, with a mean squared parameter estimation error (MSE) that is lower by a factor of $\ge 2$ compared with the other architectures studied here, and a factor of $\ge 8$ lower than the previously-studied CNN. We also corrupt the cosmic signal by adding noise expected from a 1000 h integration with the Square Kilometre Array, as well as excising a foreground-contaminated 'horizon wedge'. Parameter prediction errors increase when the NNs are trained on these contaminated LC images, though recovery is still good even in the most pessimistic case (with $R^2 \ge 0.5-0.95$). However, we find no notable differences in performance between network architectures on the contaminated images. We argue this is due to the size of our data set, highlighting the need for larger data sets and/or better data augmentation in order to maximize the potential of NNs in 21 cm parameter estimation.
△ Less
Submitted 16 February, 2022; v1 submitted 30 June, 2021;
originally announced July 2021.
-
Modulation instability in the nonlinear Schrödinger equation with a synthetic magnetic field: gauge matters
Authors:
Karlo Lelas,
Ozana Čelan,
David Prelogović,
Hrvoje Buljan,
Dario Jukić
Abstract:
We theoretically investigate the phenomenon of modulation instability for systems obeying nonlinear Schrödinger equation, which are under the influence of an external homogeneous synthetic magnetic field. For an initial condition, the instability is detected numerically by comparing dynamics with and without a small initial perturbation; the perturbations are characterized in a standard fashion by…
▽ More
We theoretically investigate the phenomenon of modulation instability for systems obeying nonlinear Schrödinger equation, which are under the influence of an external homogeneous synthetic magnetic field. For an initial condition, the instability is detected numerically by comparing dynamics with and without a small initial perturbation; the perturbations are characterized in a standard fashion by wavevectors in momentum space. We demonstrate that the region of (in)stability in momentum space, as well as time-evolution in real space, for identical initial conditions, depend on the choice of the gauge (i.e., vector potential) used to describe the homogeneous synthetic magnetic field. This superficially appears as if the gauge invariance is broken, but this is not true. When the system is evolved from an identical initial condition in two different gauges, it is equivalent to suddenly turning on the synthetic magnetic field at $t=0$. This gives rise, via Faraday's law, to an initial instantaneous kick of a synthetic electric field to the wavepacket, which can differ for gauges yielding an identical uniform magnetic field at $t>0$.
△ Less
Submitted 27 March, 2020;
originally announced March 2020.
-
Magnetically aligned straight depolarisation canals and the rolling Hough transform
Authors:
Vibor Jelić,
David Prelogović,
Marijke Haverkorn,
Jur Remeijn,
Dora Klindžić
Abstract:
Aims. We aim to characterize the properties of the straight depolarization canals detected in the Low Frequency Array (LOFAR) polarimetric observations of a field centered on the extragalactic source 3C 196. We also compare the canal orientations with magnet- ically aligned Hi filaments and the magnetic field probed by polarized dust emission. Methods. We used the rolling Hough transform (RHT) to…
▽ More
Aims. We aim to characterize the properties of the straight depolarization canals detected in the Low Frequency Array (LOFAR) polarimetric observations of a field centered on the extragalactic source 3C 196. We also compare the canal orientations with magnet- ically aligned Hi filaments and the magnetic field probed by polarized dust emission. Methods. We used the rolling Hough transform (RHT) to identify and characterize the orientation of the straight depolarization canals in radio polarimetric data and the filaments in Hi data. Results. The majority of the straight depolarization canals and the Hi filaments are inclined by 10deg with respect to the Galactic plane and are aligned with the plane-of-sky magnetic field orientation probed by the Planck dust polarization data. The other distinct orientation, of 65deg with respect to the Galactic plane, is associated with the orientation of a bar-like structure observed in the 3C 196 field at 350 MHz. Conclusions. An alignment between three distinct tracers of the (local) interstellar medium (ISM) suggests that an ordered magnetic field plays a crucial role in confining different ISM phases. The majority of the straight depolarization canals are a result of a projection of the complicated 3D distribution of the ISM. The RHT analysis is a robust method for identifying and characterizing the straight depolarization canals observed in radio-polarimetric data.
△ Less
Submitted 18 June, 2018;
originally announced June 2018.