Improved Weak Lensing Photometric Redshift Calibration via StratLearn and Hierarchical Modeling
Authors:
Maximilian Autenrieth,
Angus H. Wright,
Roberto Trotta,
David A. van Dyk,
David C. Stenning,
Benjamin Joachimi
Abstract:
Discrepancies between cosmological parameter estimates from cosmic shear surveys and from recent Planck cosmic microwave background measurements challenge the ability of the highly successful $Λ$CDM model to describe the nature of the Universe. To rule out systematic biases in cosmic shear survey analyses, accurate redshift calibration within tomographic bins is key. In this paper, we improve phot…
▽ More
Discrepancies between cosmological parameter estimates from cosmic shear surveys and from recent Planck cosmic microwave background measurements challenge the ability of the highly successful $Λ$CDM model to describe the nature of the Universe. To rule out systematic biases in cosmic shear survey analyses, accurate redshift calibration within tomographic bins is key. In this paper, we improve photo-$z$ calibration via Bayesian hierarchical modeling of full galaxy photo-$z$ conditional densities, by employing $\textit{StratLearn}$, a recently developed statistical methodology, which accounts for systematic differences in the distribution of the spectroscopic training/source set and the photometric target set. Using realistic simulations that were designed to resemble the KiDS+VIKING-450 dataset, we show that $\textit{StratLearn}$-estimated conditional densities improve the galaxy tomographic bin assignment, and that our $\textit{StratLearn}$-Bayesian framework leads to nearly unbiased estimates of the target population means. This leads to a factor of $\sim 2$ improvement upon the previously best photo-$z$ calibration method. Our approach delivers a maximum bias per tomographic bin of $Δ\langle z \rangle = 0.0095 \pm 0.0089$, with an average absolute bias of $0.0052 \pm 0.0067$ across the five tomographic bins.
△ Less
Submitted 12 March, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
Stratified Learning: A General-Purpose Statistical Method for Improved Learning under Covariate Shift
Authors:
Maximilian Autenrieth,
David A. van Dyk,
Roberto Trotta,
David C. Stenning
Abstract:
We propose a simple, statistically principled, and theoretically justified method to improve supervised learning when the training set is not representative, a situation known as covariate shift. We build upon a well-established methodology in causal inference, and show that the effects of covariate shift can be reduced or eliminated by conditioning on propensity scores. In practice, this is achie…
▽ More
We propose a simple, statistically principled, and theoretically justified method to improve supervised learning when the training set is not representative, a situation known as covariate shift. We build upon a well-established methodology in causal inference, and show that the effects of covariate shift can be reduced or eliminated by conditioning on propensity scores. In practice, this is achieved by fitting learners within strata constructed by partitioning the data based on the estimated propensity scores, leading to approximately balanced covariates and much-improved target prediction. We demonstrate the effectiveness of our general-purpose method on two contemporary research questions in cosmology, outperforming state-of-the-art importance weighting methods. We obtain the best reported AUC (0.958) on the updated "Supernovae photometric classification challenge", and we improve upon existing conditional density estimation of galaxy redshift from Sloan Data Sky Survey (SDSS) data.
△ Less
Submitted 17 May, 2023; v1 submitted 21 June, 2021;
originally announced June 2021.