-
Dark Energy Survey Year 3 Results: Cosmology from galaxy clustering and galaxy-galaxy lensing in harmonic space
Authors:
L. Faga,
F. Andrade-Oliveira,
H. Camacho,
R. Rosenfeld,
M. Lima,
C. Doux,
X. Fang,
J. Prat,
A. Porredon,
M. Aguena,
A. Alarcon,
S. Allam,
O. Alves,
A. Amon,
S. Avila,
D. Bacon,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
S. Bocquet,
D. Brooks,
E. Buckley-Geer,
A. Campos,
A. Carnero Rosell,
M. Carrasco Kind
, et al. (78 additional authors not shown)
Abstract:
We present the joint tomographic analysis of galaxy-galaxy lensing and galaxy clustering in harmonic space, using galaxy catalogues from the first three years of observations by the Dark Energy Survey (DES Y3). We utilise the redMaGiC and MagLim catalogues as lens galaxies and the METACALIBRATION catalogue as source galaxies. The measurements of angular power spectra are performed using the pseudo…
▽ More
We present the joint tomographic analysis of galaxy-galaxy lensing and galaxy clustering in harmonic space, using galaxy catalogues from the first three years of observations by the Dark Energy Survey (DES Y3). We utilise the redMaGiC and MagLim catalogues as lens galaxies and the METACALIBRATION catalogue as source galaxies. The measurements of angular power spectra are performed using the pseudo-$C_\ell$ method, and our theoretical modelling follows the fiducial analyses performed by DES Y3 in configuration space, accounting for galaxy bias, intrinsic alignments, magnification bias, shear magnification bias and photometric redshift uncertainties. We explore different approaches for scale cuts based on non-linear galaxy bias and baryonic effects contamination. Our fiducial covariance matrix is computed analytically, accounting for mask geometry in the Gaussian term, and including non-Gaussian contributions and super-sample covariance terms. To validate our harmonic space pipelines and covariance matrix, we used a suite of 1800 log-normal simulations. We also perform a series of stress tests to gauge the robustness of our harmonic space analysis. In the $Λ$CDM model, the clustering amplitude $S_8 =σ_8(Ω_m/0.3)^{0.5}$ is constrained to $S_8 = 0.704\pm 0.029$ and $S_8 = 0.753\pm 0.024$ ($68\%$ C.L.) for the redMaGiC and MagLim catalogues, respectively. For the $w$CDM, the dark energy equation of state is constrained to $w = -1.28 \pm 0.29$ and $w = -1.26^{+0.34}_{-0.27}$, for redMaGiC and MagLim catalogues, respectively. These results are compatible with the corresponding DES Y3 results in configuration space and pave the way for harmonic space analyses using the DES Y6 data.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
The Dark Energy Survey Supernova Program: An updated measurement of the Hubble constant using the Inverse Distance Ladder
Authors:
R. Camilleri,
T. M. Davis,
S. R. Hinton,
P. Armstrong,
D. Brout,
L. Galbany,
K. Glazebrook,
J. Lee,
C. Lidman,
R. C. Nichol,
M. Sako,
D. Scolnic,
P. Shah,
M. Smith,
M. Sullivan,
B. O. Sánchez,
M. Vincenzi,
P. Wiseman,
S. Allam,
T. M. C. Abbott,
M. Aguena,
F. Andrade-Oliveira,
J. Asorey,
S. Avila,
D. Bacon
, et al. (55 additional authors not shown)
Abstract:
We measure the current expansion rate of the Universe, Hubble's constant $H_0$, by calibrating the absolute magnitudes of supernovae to distances measured by Baryon Acoustic Oscillations. This `inverse distance ladder' technique provides an alternative to calibrating supernovae using nearby absolute distance measurements, replacing the calibration with a high-redshift anchor. We use the recent rel…
▽ More
We measure the current expansion rate of the Universe, Hubble's constant $H_0$, by calibrating the absolute magnitudes of supernovae to distances measured by Baryon Acoustic Oscillations. This `inverse distance ladder' technique provides an alternative to calibrating supernovae using nearby absolute distance measurements, replacing the calibration with a high-redshift anchor. We use the recent release of 1829 supernovae from the Dark Energy Survey spanning $0.01\lt z \lt1.13$ anchored to the recent Baryon Acoustic Oscillation measurements from DESI spanning $0.30 \lt z_{\mathrm{eff}} \lt 2.33$. To trace cosmology to $z=0$, we use the third-, fourth- and fifth-order cosmographic models, which, by design, are agnostic about the energy content and expansion history of the universe. With the inclusion of the higher-redshift DESI-BAO data, the third-order model is a poor fit to both data sets, with the fourth-order model being preferred by the Akaike Information Criterion. Using the fourth-order cosmographic model, we find $H_0=67.19^{+0.66}_{-0.64}\mathrm{~km} \mathrm{~s}^{-1} \mathrm{~Mpc}^{-1}$, in agreement with the value found by Planck without the need to assume Flat-$Λ$CDM. However the best-fitting expansion history differs from that of Planck, providing continued motivation to investigate these tensions.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
The Dark Energy Survey Supernova Program: Investigating Beyond-$Λ$CDM
Authors:
R. Camilleri,
T. M. Davis,
M. Vincenzi,
P. Shah,
J. Frieman,
R. Kessler,
P. Armstrong,
D. Brout,
A. Carr,
R. Chen,
L. Galbany,
K. Glazebrook,
S. R. Hinton,
J. Lee,
C. Lidman,
A. Möller,
B. Popovic,
H. Qu,
M. Sako,
D. Scolnic,
M. Smith,
M. Sullivan,
B. O. Sánchez,
G. Taylor,
M. Toy
, et al. (55 additional authors not shown)
Abstract:
We report constraints on a variety of non-standard cosmological models using the full 5-year photometrically-classified type Ia supernova sample from the Dark Energy Survey (DES-SN5YR). Both Akaike Information Criterion (AIC) and Suspiciousness calculations find no strong evidence for or against any of the non-standard models we explore. When combined with external probes, the AIC and Suspiciousne…
▽ More
We report constraints on a variety of non-standard cosmological models using the full 5-year photometrically-classified type Ia supernova sample from the Dark Energy Survey (DES-SN5YR). Both Akaike Information Criterion (AIC) and Suspiciousness calculations find no strong evidence for or against any of the non-standard models we explore. When combined with external probes, the AIC and Suspiciousness agree that 11 of the 15 models are moderately preferred over Flat-$Λ$CDM suggesting additional flexibility in our cosmological models may be required beyond the cosmological constant. We also provide a detailed discussion of all cosmological assumptions that appear in the DES supernova cosmology analyses, evaluate their impact, and provide guidance on using the DES Hubble diagram to test non-standard models. An approximate cosmological model, used to perform bias corrections to the data holds the biggest potential for harbouring cosmological assumptions. We show that even if the approximate cosmological model is constructed with a matter density shifted by $ΔΩ_m\sim0.2$ from the true matter density of a simulated data set the bias that arises is sub-dominant to statistical uncertainties. Nevertheless, we present and validate a methodology to reduce this bias.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
The Dark Energy Survey Supernova Program: Light curves and 5-Year data release
Authors:
B. O. Sánchez,
D. Brout,
M. Vincenzi,
M. Sako,
K. Herner,
R. Kessler,
T. M. Davis,
D. Scolnic,
M. Acevedo,
J. Lee,
A. Möller,
H. Qu,
L. Kelsey,
P. Wiseman,
P. Armstrong,
B. Rose,
R. Camilleri,
R. Chen,
L. Galbany,
E. Kovacs,
C. Lidman,
B. Popovic,
M. Smith,
M. Sullivan,
M. Toy
, et al. (60 additional authors not shown)
Abstract:
We present $griz$ photometric light curves for the full 5 years of the Dark Energy Survey Supernova program (DES-SN), obtained with both forced Point Spread Function (PSF) photometry on Difference Images (DIFFIMG) performed during survey operations, and Scene Modelling Photometry (SMP) on search images processed after the survey. This release contains $31,636$ DIFFIMG and $19,706$ high-quality SMP…
▽ More
We present $griz$ photometric light curves for the full 5 years of the Dark Energy Survey Supernova program (DES-SN), obtained with both forced Point Spread Function (PSF) photometry on Difference Images (DIFFIMG) performed during survey operations, and Scene Modelling Photometry (SMP) on search images processed after the survey. This release contains $31,636$ DIFFIMG and $19,706$ high-quality SMP light curves, the latter of which contains $1635$ photometrically-classified supernovae that pass cosmology quality cuts. This sample spans the largest redshift ($z$) range ever covered by a single SN survey ($0.1<z<1.13$) and is the largest single sample from a single instrument of SNe ever used for cosmological constraints. We describe in detail the improvements made to obtain the final DES-SN photometry and provide a comparison to what was used in the DES-SN3YR spectroscopically-confirmed SN Ia sample. We also include a comparative analysis of the performance of the SMP photometry with respect to the real-time DIFFIMG forced photometry and find that SMP photometry is more precise, more accurate, and less sensitive to the host-galaxy surface brightness anomaly. The public release of the light curves and ancillary data can be found at https://github.com/des-science/DES-SN5YR. Finally, we discuss implications for future transient surveys, such as the forthcoming Vera Rubin Observatory Legacy Survey of Space and Time (LSST).
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Gender Bias Detection in Court Decisions: A Brazilian Case Study
Authors:
Raysa Benatti,
Fabiana Severi,
Sandra Avila,
Esther Luna Colombini
Abstract:
Data derived from the realm of the social sciences is often produced in digital text form, which motivates its use as a source for natural language processing methods. Researchers and practitioners have developed and relied on artificial intelligence techniques to collect, process, and analyze documents in the legal field, especially for tasks such as text summarization and classification. While i…
▽ More
Data derived from the realm of the social sciences is often produced in digital text form, which motivates its use as a source for natural language processing methods. Researchers and practitioners have developed and relied on artificial intelligence techniques to collect, process, and analyze documents in the legal field, especially for tasks such as text summarization and classification. While increasing procedural efficiency is often the primary motivation behind natural language processing in the field, several works have proposed solutions for human rights-related issues, such as assessment of public policy and institutional social settings. One such issue is the presence of gender biases in court decisions, which has been largely studied in social sciences fields; biased institutional responses to gender-based violence are a violation of international human rights dispositions since they prevent gender minorities from accessing rights and hamper their dignity. Natural language processing-based approaches can help detect these biases on a larger scale. Still, the development and use of such tools require researchers and practitioners to be mindful of legal and ethical aspects concerning data sharing and use, reproducibility, domain expertise, and value-charged choices. In this work, we (a) present an experimental framework developed to automatically detect gender biases in court decisions issued in Brazilian Portuguese and (b) describe and elaborate on features we identify to be critical in such a technology, given its proposed use as a support tool for research and assessment of court~activity.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
Back to the Basics on Predicting Transfer Performance
Authors:
Levy Chaves,
Eduardo Valle,
Alceu Bissoto,
Sandra Avila
Abstract:
In the evolving landscape of deep learning, selecting the best pre-trained models from a growing number of choices is a challenge. Transferability scorers propose alleviating this scenario, but their recent proliferation, ironically, poses the challenge of their own assessment. In this work, we propose both robust benchmark guidelines for transferability scorers, and a well-founded technique to co…
▽ More
In the evolving landscape of deep learning, selecting the best pre-trained models from a growing number of choices is a challenge. Transferability scorers propose alleviating this scenario, but their recent proliferation, ironically, poses the challenge of their own assessment. In this work, we propose both robust benchmark guidelines for transferability scorers, and a well-founded technique to combine multiple scorers, which we show consistently improves their results. We extensively evaluate 13 scorers from literature across 11 datasets, comprising generalist, fine-grained, and medical imaging datasets. We show that few scorers match the predictive performance of the simple raw metric of models on ImageNet, and that all predictors suffer on medical datasets. Our results highlight the potential of combining different information sources for reliably predicting transferability across varied domains.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Euclid. V. The Flagship galaxy mock catalogue: a comprehensive simulation for the Euclid mission
Authors:
Euclid Collaboration,
F. J. Castander,
P. Fosalba,
J. Stadel,
D. Potter,
J. Carretero,
P. Tallada-Crespí,
L. Pozzetti,
M. Bolzonella,
G. A. Mamon,
L. Blot,
K. Hoffmann,
M. Huertas-Company,
P. Monaco,
E. J. Gonzalez,
G. De Lucia,
C. Scarlata,
M. -A. Breton,
L. Linke,
C. Viglione,
S. -S. Li,
Z. Zhai,
Z. Baghkhani,
K. Pardede,
C. Neissner
, et al. (344 additional authors not shown)
Abstract:
We present the Flagship galaxy mock, a simulated catalogue of billions of galaxies designed to support the scientific exploitation of the Euclid mission. Euclid is a medium-class mission of the European Space Agency optimised to determine the properties of dark matter and dark energy on the largest scales of the Universe. It probes structure formation over more than 10 billion years primarily from…
▽ More
We present the Flagship galaxy mock, a simulated catalogue of billions of galaxies designed to support the scientific exploitation of the Euclid mission. Euclid is a medium-class mission of the European Space Agency optimised to determine the properties of dark matter and dark energy on the largest scales of the Universe. It probes structure formation over more than 10 billion years primarily from the combination of weak gravitational lensing and galaxy clustering data. The breath of Euclid's data will also foster a wide variety of scientific analyses. The Flagship simulation was developed to provide a realistic approximation to the galaxies that will be observed by Euclid and used in its scientific analyses. We ran a state-of-the-art N-body simulation with four trillion particles, producing a lightcone on the fly. From the dark matter particles, we produced a catalogue of 16 billion haloes in one octant of the sky in the lightcone up to redshift z=3. We then populated these haloes with mock galaxies using a halo occupation distribution and abundance matching approach, calibrating the free parameters of the galaxy mock against observed correlations and other basic galaxy properties. Modelled galaxy properties include luminosity and flux in several bands, redshifts, positions and velocities, spectral energy distributions, shapes and sizes, stellar masses, star formation rates, metallicities, emission line fluxes, and lensing properties. We selected a final sample of 3.4 billion galaxies with a magnitude cut of H_E<26, where we are complete. We have performed a comprehensive set of validation tests to check the similarity to observational data and theoretical models. In particular, our catalogue is able to closely reproduce the main characteristics of the weak lensing and galaxy clustering samples to be used in the mission's main cosmological analysis. (abridged)
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Euclid. I. Overview of the Euclid mission
Authors:
Euclid Collaboration,
Y. Mellier,
Abdurro'uf,
J. A. Acevedo Barroso,
A. Achúcarro,
J. Adamek,
R. Adam,
G. E. Addison,
N. Aghanim,
M. Aguena,
V. Ajani,
Y. Akrami,
A. Al-Bahlawan,
A. Alavi,
I. S. Albuquerque,
G. Alestas,
G. Alguero,
A. Allaoui,
S. W. Allen,
V. Allevato,
A. V. Alonso-Tetilla,
B. Altieri,
A. Alvarez-Candal,
A. Amara,
L. Amendola
, et al. (1086 additional authors not shown)
Abstract:
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14…
▽ More
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14,000 deg^2 of extragalactic sky. In addition to accurate weak lensing and clustering measurements that probe structure formation over half of the age of the Universe, its primary probes for cosmology, these exquisite data will enable a wide range of science. This paper provides a high-level overview of the mission, summarising the survey characteristics, the various data-processing steps, and data products. We also highlight the main science objectives and expected performance.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
DESI 2024 VI: Cosmological Constraints from the Measurements of Baryon Acoustic Oscillations
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
D. M. Alexander,
M. Alvarez,
O. Alves,
A. Anand,
U. Andrade,
E. Armengaud,
S. Avila,
A. Aviles,
H. Awan,
B. Bahr-Kalus,
S. Bailey,
C. Baltay,
A. Bault,
J. Behera,
S. BenZvi,
A. Bera,
F. Beutler,
D. Bianchi,
C. Blake,
R. Blum
, et al. (178 additional authors not shown)
Abstract:
We present cosmological results from the measurement of baryon acoustic oscillations (BAO) in galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations from the Dark Energy Spectroscopic Instrument (DESI), to be released in the DESI Data Release 1. DESI BAO provide robust measurements of the transverse comoving distance and Hubble rate, or their combination, relative to the s…
▽ More
We present cosmological results from the measurement of baryon acoustic oscillations (BAO) in galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations from the Dark Energy Spectroscopic Instrument (DESI), to be released in the DESI Data Release 1. DESI BAO provide robust measurements of the transverse comoving distance and Hubble rate, or their combination, relative to the sound horizon, in seven redshift bins from over 6 million extragalactic objects in the redshift range $0.1<z<4.2$. DESI BAO data alone are consistent with the standard flat $Λ$CDM cosmological model with a matter density $Ω_\mathrm{m}=0.295\pm 0.015$. Paired with a BBN prior and the robustly measured acoustic angular scale from the CMB, DESI requires $H_0=(68.52\pm0.62)$ km/s/Mpc. In conjunction with CMB anisotropies from Planck and CMB lensing data from Planck and ACT, we find $Ω_\mathrm{m}=0.307\pm 0.005$ and $H_0=(67.97\pm0.38)$ km/s/Mpc. Extending the baseline model with a constant dark energy equation of state parameter $w$, DESI BAO alone require $w=-0.99^{+0.15}_{-0.13}$. In models with a time-varying dark energy equation of state parametrized by $w_0$ and $w_a$, combinations of DESI with CMB or with SN~Ia individually prefer $w_0>-1$ and $w_a<0$. This preference is 2.6$σ$ for the DESI+CMB combination, and persists or grows when SN~Ia are added in, giving results discrepant with the $Λ$CDM model at the $2.5σ$, $3.5σ$ or $3.9σ$ levels for the addition of Pantheon+, Union3, or DES-SN5YR datasets respectively. For the flat $Λ$CDM model with the sum of neutrino mass $\sum m_ν$ free, combining the DESI and CMB data yields an upper limit $\sum m_ν< 0.072$ $(0.113)$ eV at 95% confidence for a $\sum m_ν>0$ $(\sum m_ν>0.059)$ eV prior. These neutrino-mass constraints are substantially relaxed in models beyond $Λ$CDM. [Abridged.]
△ Less
Submitted 24 April, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
DESI 2024 IV: Baryon Acoustic Oscillations from the Lyman Alpha Forest
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
D. M. Alexander,
M. Alvarez,
O. Alves,
A. Anand,
U. Andrade,
E. Armengaud,
S. Avila,
A. Aviles,
H. Awan,
S. Bailey,
C. Baltay,
A. Bault,
J. Bautista,
J. Behera,
S. BenZvi,
F. Beutler,
D. Bianchi,
C. Blake,
R. Blum,
S. Brieden
, et al. (174 additional authors not shown)
Abstract:
We present the measurement of Baryon Acoustic Oscillations (BAO) from the Lyman-$α$ (Ly$α$) forest of high-redshift quasars with the first-year dataset of the Dark Energy Spectroscopic Instrument (DESI). Our analysis uses over $420\,000$ Ly$α$ forest spectra and their correlation with the spatial distribution of more than $700\,000$ quasars. An essential facet of this work is the development of a…
▽ More
We present the measurement of Baryon Acoustic Oscillations (BAO) from the Lyman-$α$ (Ly$α$) forest of high-redshift quasars with the first-year dataset of the Dark Energy Spectroscopic Instrument (DESI). Our analysis uses over $420\,000$ Ly$α$ forest spectra and their correlation with the spatial distribution of more than $700\,000$ quasars. An essential facet of this work is the development of a new analysis methodology on a blinded dataset. We conducted rigorous tests using synthetic data to ensure the reliability of our methodology and findings before unblinding. Additionally, we conducted multiple data splits to assess the consistency of the results and scrutinized various analysis approaches to confirm their robustness. For a given value of the sound horizon ($r_d$), we measure the expansion at $z_{\rm eff}=2.33$ with 2\% precision, $H(z_{\rm eff}) = (239.2 \pm 4.8) (147.09~{\rm Mpc} /r_d)$ km/s/Mpc. Similarly, we present a 2.4\% measurement of the transverse comoving distance to the same redshift, $D_M(z_{\rm eff}) = (5.84 \pm 0.14) (r_d/147.09~{\rm Mpc})$ Gpc. Together with other DESI BAO measurements at lower redshifts, these results are used in a companion paper to constrain cosmological parameters.
△ Less
Submitted 12 April, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
DESI 2024 III: Baryon Acoustic Oscillations from Galaxies and Quasars
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
D. M. Alexander,
M. Alvarez,
O. Alves,
A. Anand,
U. Andrade,
E. Armengaud,
S. Avila,
A. Aviles,
H. Awan,
S. Bailey,
C. Baltay,
A. Bault,
J. Behera,
S. BenZvi,
F. Beutler,
D. Bianchi,
C. Blake,
R. Blum,
S. Brieden,
A. Brodzeller
, et al. (171 additional authors not shown)
Abstract:
We present the DESI 2024 galaxy and quasar baryon acoustic oscillations (BAO) measurements using over 5.7 million unique galaxy and quasar redshifts in the range 0.1<z<2.1. Divided by tracer type, we utilize 300,017 galaxies from the magnitude-limited Bright Galaxy Survey with 0.1<z<0.4, 2,138,600 Luminous Red Galaxies with 0.4<z<1.1, 2,432,022 Emission Line Galaxies with 0.8<z<1.6, and 856,652 qu…
▽ More
We present the DESI 2024 galaxy and quasar baryon acoustic oscillations (BAO) measurements using over 5.7 million unique galaxy and quasar redshifts in the range 0.1<z<2.1. Divided by tracer type, we utilize 300,017 galaxies from the magnitude-limited Bright Galaxy Survey with 0.1<z<0.4, 2,138,600 Luminous Red Galaxies with 0.4<z<1.1, 2,432,022 Emission Line Galaxies with 0.8<z<1.6, and 856,652 quasars with 0.8<z<2.1, over a ~7,500 square degree footprint. The analysis was blinded at the catalog-level to avoid confirmation bias. All fiducial choices of the BAO fitting and reconstruction methodology, as well as the size of the systematic errors, were determined on the basis of the tests with mock catalogs and the blinded data catalogs. We present several improvements to the BAO analysis pipeline, including enhancing the BAO fitting and reconstruction methods in a more physically-motivated direction, and also present results using combinations of tracers. We present a re-analysis of SDSS BOSS and eBOSS results applying the improved DESI methodology and find scatter consistent with the level of the quoted SDSS theoretical systematic uncertainties. With the total effective survey volume of ~ 18 Gpc$^3$, the combined precision of the BAO measurements across the six different redshift bins is ~0.52%, marking a 1.2-fold improvement over the previous state-of-the-art results using only first-year data. We detect the BAO in all of these six redshift bins. The highest significance of BAO detection is $9.1σ$ at the effective redshift of 0.93, with a constraint of 0.86% placed on the BAO scale. We find our measurements are systematically larger than the prediction of Planck-2018 LCDM model at z<0.8. We translate the results into transverse comoving distance and radial Hubble distance measurements, which are used to constrain cosmological models in our companion paper [abridged].
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Constraining primordial non-Gaussianity from the large scale structure two-point and three-point correlation functions
Authors:
Z. Brown,
R. Demina,
A. G. Adame,
S. Avila,
E. Chaussidon,
S. Yuan,
V. Gonzalez-Perez,
J. García-Bellido,
J. Aguilar,
S. Ahlen,
R. Blum,
D. Brooks,
T. Claybaugh,
S. Cole,
A. de la Macorra,
B. Dey,
P. Doel,
K. Fanning,
J. E. Forero-Romero,
E. Gaztañaga,
S. Gontcho A Gontcho,
K. Honscheid,
C. Howlett,
S. Juneau,
R. Kehoe
, et al. (25 additional authors not shown)
Abstract:
Surveys of cosmological large-scale structure (LSS) are sensitive to the presence of local primordial non-Gaussianity (PNG), and may be used to constrain models of inflation. Local PNG, characterized by fNL, the amplitude of the quadratic correction to the potential of a Gaussian random field, is traditionally measured from LSS two-point and three-point clustering via the power spectrum and bi-spe…
▽ More
Surveys of cosmological large-scale structure (LSS) are sensitive to the presence of local primordial non-Gaussianity (PNG), and may be used to constrain models of inflation. Local PNG, characterized by fNL, the amplitude of the quadratic correction to the potential of a Gaussian random field, is traditionally measured from LSS two-point and three-point clustering via the power spectrum and bi-spectrum. We propose a framework to measure fNL using the configuration space two-point correlation function (2pcf) monopole and three-point correlation function (3pcf) monopole of survey tracers. Our model estimates the effect of the scale-dependent bias induced by the presence of PNG on the 2pcf and 3pcf from the clustering of simulated dark matter halos. We describe how this effect may be scaled to an arbitrary tracer of the cosmological matter density. The 2pcf and 3pcf of this tracer are measured to constrain the value of fNL. Using simulations of luminous red galaxies observed by the Dark Energy Spectroscopic Instrument (DESI), we demonstrate the accuracy and constraining power of our model, and forecast the ability to constrainfNL to a precision of sigma(fNL) = 22 with one year of DESI survey data.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Leveraging Self-Supervised Learning for Scene Recognition in Child Sexual Abuse Imagery
Authors:
Pedro H. V. Valois,
João Macedo,
Leo S. F. Ribeiro,
Jefersson A. dos Santos,
Sandra Avila
Abstract:
Crime in the 21st century is split into a virtual and real world. However, the former has become a global menace to people's well-being and security in the latter. The challenges it presents must be faced with unified global cooperation, and we must rely more than ever on automated yet trustworthy tools to combat the ever-growing nature of online offenses. Over 10 million child sexual abuse report…
▽ More
Crime in the 21st century is split into a virtual and real world. However, the former has become a global menace to people's well-being and security in the latter. The challenges it presents must be faced with unified global cooperation, and we must rely more than ever on automated yet trustworthy tools to combat the ever-growing nature of online offenses. Over 10 million child sexual abuse reports are submitted to the US National Center for Missing & Exploited Children every year, and over 80% originated from online sources. Therefore, investigation centers and clearinghouses cannot manually process and correctly investigate all imagery. In light of that, reliable automated tools that can securely and efficiently deal with this data are paramount. In this sense, the scene recognition task looks for contextual cues in the environment, being able to group and classify child sexual abuse data without requiring to be trained on sensitive material. The scarcity and limitations of working with child sexual abuse images lead to self-supervised learning, a machine-learning methodology that leverages unlabeled data to produce powerful representations that can be more easily transferred to target tasks. This work shows that self-supervised deep learning models pre-trained on scene-centric data can reach 71.6% balanced accuracy on our indoor scene classification task and, on average, 2.2 percentage points better performance than a fully supervised version. We cooperate with Brazilian Federal Police experts to evaluate our indoor classification model on actual child abuse material. The results demonstrate a notable discrepancy between the features observed in widely used scene datasets and those depicted on sensitive materials.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
Dark Energy Survey: Galaxy Sample for the Baryonic Acoustic Oscillation Measurement from the Final Dataset
Authors:
J. Mena-Fernández,
M. Rodríguez-Monroy,
S. Avila,
A. Porredon,
K. C. Chan,
H. Camacho,
N. Weaverdyck,
I. Sevilla-Noarbe,
E. Sanchez,
L. Toribio San Cipriano,
J. De Vicente,
I. Ferrero,
R. Cawthon,
A. Carnero Rosell,
J. Elvin-Poole,
G. Giannini,
M. Adamow,
K. Bechtol,
A. Drlica-Wagner,
R. A. Gruendl,
W. G. Hartley,
A. Pieres,
A. J. Ross,
E. S. Rykoff,
E. Sheldon
, et al. (63 additional authors not shown)
Abstract:
In this paper we present and validate the galaxy sample used for the analysis of the baryon acoustic oscillation (BAO) signal in the Dark Energy Survey (DES) Y6 data. The definition is based on a color and redshift-dependent magnitude cut optimized to select galaxies at redshifts higher than 0.6, while ensuring a high-quality photo-$z$ determination. The optimization is performed using a Fisher fo…
▽ More
In this paper we present and validate the galaxy sample used for the analysis of the baryon acoustic oscillation (BAO) signal in the Dark Energy Survey (DES) Y6 data. The definition is based on a color and redshift-dependent magnitude cut optimized to select galaxies at redshifts higher than 0.6, while ensuring a high-quality photo-$z$ determination. The optimization is performed using a Fisher forecast algorithm, finding the optimal $i$-magnitude cut to be given by $i$<19.64+2.894$z_{\rm ph}$. For the optimal sample, we forecast an increase in precision in the BAO measurement of $\sim$25% with respect to the Y3 analysis. Our BAO sample has a total of 15,937,556 galaxies in the redshift range 0.6<$z_{\rm ph}$<1.2, and its angular mask covers 4,273.42 deg${}^2$ to a depth of $i$=22.5. We validate its redshift distributions with three different methods: directional neighborhood fitting algorithm (DNF), which is our primary photo-$z$ estimation; direct calibration with spectroscopic redshifts from VIPERS; and clustering redshift using SDSS galaxies. The fiducial redshift distribution is a combination of these three techniques performed by modifying the mean and width of the DNF distributions to match those of VIPERS and clustering redshift. In this paper we also describe the methodology used to mitigate the effect of observational systematics, which is analogous to the one used in the Y3 analysis. This paper is one of the two dedicated to the analysis of the BAO signal in DES Y6. In its companion paper, we present the angular diameter distance constraints obtained through the fitting to the BAO scale.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Dark Energy Survey: A 2.1% measurement of the angular Baryonic Acoustic Oscillation scale at redshift $z_{\rm eff}$=0.85 from the final dataset
Authors:
DES Collaboration,
T. M. C. Abbott,
M. Adamow,
M. Aguena,
S. Allam,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
J. Asorey,
S. Avila,
D. Bacon,
K. Bechtol,
G. M. Bernstein,
E. Bertin,
J. Blazek,
S. Bocquet,
D. Brooks,
D. L. Burke,
H. Camacho,
A. Carnero Rosell,
D. Carollo,
J. Carretero,
F. J. Castander,
R. Cawthon,
K. C. Chan
, et al. (83 additional authors not shown)
Abstract:
We present the angular diameter distance measurement obtained with the Baryonic Acoustic Oscillation feature from galaxy clustering in the completed Dark Energy Survey, consisting of six years (Y6) of observations. We use the Y6 BAO galaxy sample, optimized for BAO science in the redshift range 0.6<$z$<1.2, with an effective redshift at $z_{\rm eff}$=0.85 and split into six tomographic bins. The s…
▽ More
We present the angular diameter distance measurement obtained with the Baryonic Acoustic Oscillation feature from galaxy clustering in the completed Dark Energy Survey, consisting of six years (Y6) of observations. We use the Y6 BAO galaxy sample, optimized for BAO science in the redshift range 0.6<$z$<1.2, with an effective redshift at $z_{\rm eff}$=0.85 and split into six tomographic bins. The sample has nearly 16 million galaxies over 4,273 square degrees. Our consensus measurement constrains the ratio of the angular distance to sound horizon scale to $D_M(z_{\rm eff})/r_d$ = 19.51$\pm$0.41 (at 68.3% confidence interval), resulting from comparing the BAO position in our data to that predicted by Planck $Λ$CDM via the BAO shift parameter $α=(D_M/r_d)/(D_M/r_d)_{\rm Planck}$. To achieve this, the BAO shift is measured with three different methods, Angular Correlation Function (ACF), Angular Power Spectrum (APS), and Projected Correlation Function (PCF) obtaining $α=$ 0.952$\pm$0.023, 0.962$\pm$0.022, and 0.955$\pm$0.020, respectively, which we combine to $α=$ 0.957$\pm$0.020, including systematic errors. When compared with the $Λ$CDM model that best fits Planck data, this measurement is found to be 4.3% and 2.1$σ$ below the angular BAO scale predicted. To date, it represents the most precise angular BAO measurement at $z$>0.75 from any survey and the most precise measurement at any redshift from photometric surveys. The analysis was performed blinded to the BAO position and it is shown to be robust against analysis choices, data removal, redshift calibrations and observational systematics.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Tweet Influence on Market Trends: Analyzing the Impact of Social Media Sentiment on Biotech Stocks
Authors:
C. Sarai R. Avila
Abstract:
This study investigates the relationship between tweet sentiment across diverse categories: news, company opinions, CEO opinions, competitor opinions, and stock market behavior in the biotechnology sector, with a focus on understanding the impact of social media discourse on investor sentiment and decision-making processes. We analyzed historical stock market data for ten of the largest and most i…
▽ More
This study investigates the relationship between tweet sentiment across diverse categories: news, company opinions, CEO opinions, competitor opinions, and stock market behavior in the biotechnology sector, with a focus on understanding the impact of social media discourse on investor sentiment and decision-making processes. We analyzed historical stock market data for ten of the largest and most influential pharmaceutical companies alongside Twitter data related to COVID-19, vaccines, the companies, and their respective CEOs. Using VADER sentiment analysis, we examined the sentiment scores of tweets and assessed their relationships with stock market performance. We employed ARIMA (AutoRegressive Integrated Moving Average) and VAR (Vector AutoRegression) models to forecast stock market performance, incorporating sentiment covariates to improve predictions. Our findings revealed a complex interplay between tweet sentiment, news, biotech companies, their CEOs, and stock market performance, emphasizing the importance of considering diverse factors when modeling and predicting stock prices. This study provides valuable insights into the influence of social media on the financial sector and lays a foundation for future research aimed at refining stock price prediction models.
△ Less
Submitted 26 January, 2024;
originally announced February 2024.
-
The Dark Energy Survey: Cosmology Results With ~1500 New High-redshift Type Ia Supernovae Using The Full 5-year Dataset
Authors:
DES Collaboration,
T. M. C. Abbott,
M. Acevedo,
M. Aguena,
A. Alarcon,
S. Allam,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
J. Annis,
P. Armstrong,
J. Asorey,
S. Avila,
D. Bacon,
B. A. Bassett,
K. Bechtol,
P. H. Bernardinelli,
G. M. Bernstein,
E. Bertin,
J. Blazek,
S. Bocquet,
D. Brooks,
D. Brout,
E. Buckley-Geer,
D. L. Burke
, et al. (134 additional authors not shown)
Abstract:
We present cosmological constraints from the sample of Type Ia supernovae (SN Ia) discovered during the full five years of the Dark Energy Survey (DES) Supernova Program. In contrast to most previous cosmological samples, in which SN are classified based on their spectra, we classify the DES SNe using a machine learning algorithm applied to their light curves in four photometric bands. Spectroscop…
▽ More
We present cosmological constraints from the sample of Type Ia supernovae (SN Ia) discovered during the full five years of the Dark Energy Survey (DES) Supernova Program. In contrast to most previous cosmological samples, in which SN are classified based on their spectra, we classify the DES SNe using a machine learning algorithm applied to their light curves in four photometric bands. Spectroscopic redshifts are acquired from a dedicated follow-up survey of the host galaxies. After accounting for the likelihood of each SN being a SN Ia, we find 1635 DES SNe in the redshift range $0.10<z<1.13$ that pass quality selection criteria sufficient to constrain cosmological parameters. This quintuples the number of high-quality $z>0.5$ SNe compared to the previous leading compilation of Pantheon+, and results in the tightest cosmological constraints achieved by any SN data set to date. To derive cosmological constraints we combine the DES supernova data with a high-quality external low-redshift sample consisting of 194 SNe Ia spanning $0.025<z<0.10$. Using SN data alone and including systematic uncertainties we find $Ω_{\rm M}=0.352\pm 0.017$ in flat $Λ$CDM. Supernova data alone now require acceleration ($q_0<0$ in $Λ$CDM) with over $5σ$ confidence. We find $(Ω_{\rm M},w)=(0.264^{+0.074}_{-0.096},-0.80^{+0.14}_{-0.16})$ in flat $w$CDM. For flat $w_0w_a$CDM, we find $(Ω_{\rm M},w_0,w_a)=(0.495^{+0.033}_{-0.043},-0.36^{+0.36}_{-0.30},-8.8^{+3.7}_{-4.5})$. Including Planck CMB data, SDSS BAO data, and DES $3\times2$-point data gives $(Ω_{\rm M},w)=(0.321\pm0.007,-0.941\pm0.026)$. In all cases dark energy is consistent with a cosmological constant to within $\sim2σ$. In our analysis, systematic errors on cosmological parameters are subdominant compared to statistical errors; paving the way for future photometrically classified supernova analyses.
△ Less
Submitted 6 June, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
SPT Clusters with DES and HST Weak Lensing. II. Cosmological Constraints from the Abundance of Massive Halos
Authors:
S. Bocquet,
S. Grandis,
L. E. Bleem,
M. Klein,
J. J. Mohr,
T. Schrabback,
T. M. C. Abbott,
P. A. R. Ade,
M. Aguena,
A. Alarcon,
S. Allam,
S. W. Allen,
O. Alves,
A. Amon,
A. J. Anderson,
J. Annis,
B. Ansarinejad,
J. E. Austermann,
S. Avila,
D. Bacon,
M. Bayliss,
J. A. Beall,
K. Bechtol,
M. R. Becker,
A. N. Bender
, et al. (171 additional authors not shown)
Abstract:
We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d…
▽ More
We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d surveys, and comprises 1,005 confirmed clusters in the redshift range $0.25-1.78$ over a total sky area of 5,200 deg$^2$. We use DES Year 3 weak-lensing data for 688 clusters with redshifts $z<0.95$ and HST weak-lensing data for 39 clusters with $0.6<z<1.7$. The weak-lensing measurements enable robust mass measurements of sample clusters and allow us to empirically constrain the SZ observable--mass relation. For a flat $Λ$CDM cosmology, and marginalizing over the sum of massive neutrinos, we measure $Ω_\mathrm{m}=0.286\pm0.032$, $σ_8=0.817\pm0.026$, and the parameter combination $σ_8\,(Ω_\mathrm{m}/0.3)^{0.25}=0.805\pm0.016$. Our measurement of $S_8\equivσ_8\,\sqrt{Ω_\mathrm{m}/0.3}=0.795\pm0.029$ and the constraint from Planck CMB anisotropies (2018 TT,TE,EE+lowE) differ by $1.1σ$. In combination with that Planck dataset, we place a 95% upper limit on the sum of neutrino masses $\sum m_ν<0.18$ eV. When additionally allowing the dark energy equation of state parameter $w$ to vary, we obtain $w=-1.45\pm0.31$ from our cluster-based analysis. In combination with Planck data, we measure $w=-1.34^{+0.22}_{-0.15}$, or a $2.2σ$ difference with a cosmological constant. We use the cluster abundance to measure $σ_8$ in five redshift bins between 0.25 and 1.8, and we find the results to be consistent with structure growth as predicted by the $Λ$CDM model fit to Planck primary CMB data.
△ Less
Submitted 21 June, 2024; v1 submitted 4 January, 2024;
originally announced January 2024.
-
An improved Halo Occupation Distribution prescription from UNITsim H_alpha Emission Line Galaxies: conformity and modified radial profile
Authors:
Guillermo Reyes-Peraza,
Santiago Avila,
Violeta Gonzalez-Perez,
Daniel Lopez-Cano,
Alexander Knebe,
Sujatha Ramakrishnan,
Gustavo Yepes
Abstract:
Emission line galaxies (ELGs) are targeted by the new generation of spectroscopic surveys to make unprecedented measurements in cosmology from their distribution. Accurately interpreting this data requires understanding the imprints imposed by the physics of galaxy formation and evolution on galaxy clustering. In this work we utilize a semi-analytical model of galaxy formation (SAGE) to explore th…
▽ More
Emission line galaxies (ELGs) are targeted by the new generation of spectroscopic surveys to make unprecedented measurements in cosmology from their distribution. Accurately interpreting this data requires understanding the imprints imposed by the physics of galaxy formation and evolution on galaxy clustering. In this work we utilize a semi-analytical model of galaxy formation (SAGE) to explore the necessary components for accurately reproducing the clustering of ELGs. We focus on developing a Halo Occupation Distribution (HOD) prescription able to reproduce the clustering of SAGE galaxies. Typically, HOD models assume that satellite and central galaxies of a given type are independent events. We investigate the need for conformity, i.e. whether the average satellite occupation depends on the existence of a central galaxy of a given type. Incorporating conformity into HOD models is crucial for reproducing the clustering in the reference galaxy sample. Another aspect we investigate is the radial distribution of satellite galaxies within haloes. The traditional density profile models, NFW and Einasto profiles, fail to accurately replicate the small-scale clustering measured for SAGE satellite galaxies. To overcome this limitation, we propose a generalization of the NFW profile, thereby enhancing our understanding of galaxy clustering.
△ Less
Submitted 13 March, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
PNG-UNITsims: Halo clustering response to primordial non-Gaussianities as a function of mass
Authors:
Adrian G. Adame,
Santiago Avila,
Violeta Gonzalez-Perez,
Gustavo Yepes,
Marcos Pellejero,
Mike S. Wang,
Chia-Hsun Chuang,
Yu Feng,
Juan Garcia-Bellido,
Alexander Knebe
Abstract:
We present the largest full N-body simulation to date with local primordial non-Gaussianities (L-PNG), the \textsc{PNG-UNITsim}. It tracks the evolution of $4096^3$ particles within a periodic box with $L_{\rm box} = 1 \; h^{-1}\,{\rm Gpc}$, leading to a mass resolution of $m_{p} = 1.24\times 10^{9}\; h^{-1}\,M_\odot$. This is enough to resolve galaxies targeted by stage-IV spectroscopic surveys.…
▽ More
We present the largest full N-body simulation to date with local primordial non-Gaussianities (L-PNG), the \textsc{PNG-UNITsim}. It tracks the evolution of $4096^3$ particles within a periodic box with $L_{\rm box} = 1 \; h^{-1}\,{\rm Gpc}$, leading to a mass resolution of $m_{p} = 1.24\times 10^{9}\; h^{-1}\,M_\odot$. This is enough to resolve galaxies targeted by stage-IV spectroscopic surveys. The \textsc{PNG-UNIT} has \textit{Fixed} initial conditions whose phases are also \textit{Matched} to the pre-existing \textsc{UNIT} simulation. These two features in the simulations reduce our uncertainty significantly so we use 100 \textsc{FastPM} mocks to estimate this reduction. The amplitude of the non-Gaussianities used to set the initial conditions of this new simulation is $f_{\rm NL}^{\rm local} = 100$. In this first study, we use mass selected dark matter haloes from the \textsc{PNG-UNIT} simulation to constrain the local PNG parameters. PNG induce a scale dependent bias, parameterised through \bp or $p$, which might depend on the type of cosmological tracer. Those cases when $p=1$ are referred to as the {\it universality relation}. We measure $p$ as a function of the halo mass. Haloes with masses between $1\times 10^{12}$ and $2\times 10^{13} \, h^{-1} M_\odot$ are well described by the {\it universality relation}. For haloes with masses between $2\times 10^{10}$ and $1\times 10^{12} \, h^{-1} M_\odot$ we find that $p<1$ at $3σ$. Combining all the mass bins, we find $p$ consistent with a value of $0.955\pm0.013$, which is $3σ$ away from \textit{universality}, as low mass haloes are more numerous. We also study the effect of using priors on $p$ when constraining $f_{\rm NL}$. Using the values we obtain for $b_φ$ as priors, we forecast that a DESI-like (stage-IV) survey will be able to constrain $f_{\rm NL}$ better than if the universality relation is assumed.
△ Less
Submitted 30 May, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Improving and extending non-Poissonian distributions for satellite galaxies sampling in HOD: applications to eBOSS ELGs
Authors:
Bernhard Vos-Ginés,
Santiago Avila,
Violeta Gonzalez-Perez,
Gustavo Yepes
Abstract:
Halo Occupation Distribution (HOD) models help us to connect observations and theory, by assigning galaxies to dark matter haloes. In this work we study one of the components of HOD models: the probability distribution function (PDF), which is used to assign a discrete number of galaxies to a halo, given a mean number of galaxies. For satellite galaxies, the most commonly used PDF is a Poisson Dis…
▽ More
Halo Occupation Distribution (HOD) models help us to connect observations and theory, by assigning galaxies to dark matter haloes. In this work we study one of the components of HOD models: the probability distribution function (PDF), which is used to assign a discrete number of galaxies to a halo, given a mean number of galaxies. For satellite galaxies, the most commonly used PDF is a Poisson Distribution. PDFs with super-Poisson variances have also been studied, allowing for continuous values of variances. This has not been the case for sub-Poisson variances, for which only the Nearest Integer distribution, with a single variance, has been used in the past. In this work we propose a distribution based on the binomial one, which provides continuous sub-Poisson variances. We have generated mock galaxy catalogues from two dark-matter only simulations, UNIT and OUTERIM, with HOD models assuming different PDFs. We show that the variance of the PDF for satellite galaxies affects the one-halo term of the projected correlation function, and the Count-In-Cells (CIC) one point statistics. We fit the clustering of eBOSS Emission Line Galaxies, finding a preference for a sub-poissonian PDF, when we only vary the parameter controlling the PDF variance and the fraction of satellites. Using a mock catalogue as a reference, we have also included both the clustering and CIC to constrain the parameters of the HOD model. CIC can provide strong constraints to the PDF variance of satellite galaxies.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages
Authors:
Gabriel Oliveira dos Santos,
Diego A. B. Moreira,
Alef Iury Ferreira,
Jhessica Silva,
Luiz Pereira,
Pedro Bueno,
Thiago Sousa,
Helena Maia,
Nádia Da Silva,
Esther Colombini,
Helio Pedrini,
Sandra Avila
Abstract:
This work introduces CAPIVARA, a cost-efficient framework designed to enhance the performance of multilingual CLIP models in low-resource languages. While CLIP has excelled in zero-shot vision-language tasks, the resource-intensive nature of model training remains challenging. Many datasets lack linguistic diversity, featuring solely English descriptions for images. CAPIVARA addresses this by augm…
▽ More
This work introduces CAPIVARA, a cost-efficient framework designed to enhance the performance of multilingual CLIP models in low-resource languages. While CLIP has excelled in zero-shot vision-language tasks, the resource-intensive nature of model training remains challenging. Many datasets lack linguistic diversity, featuring solely English descriptions for images. CAPIVARA addresses this by augmenting text data using image captioning and machine translation to generate multiple synthetic captions in low-resource languages. We optimize the training pipeline with LiT, LoRA, and gradient checkpointing to alleviate the computational cost. Through extensive experiments, CAPIVARA emerges as state of the art in zero-shot tasks involving images and Portuguese texts. We show the potential for significant improvements in other low-resource languages, achieved by fine-tuning the pre-trained multilingual CLIP using CAPIVARA on a single GPU for 2 hours. Our model and code is available at https://github.com/hiaac-nlp/CAPIVARA.
△ Less
Submitted 23 October, 2023; v1 submitted 20 October, 2023;
originally announced October 2023.
-
Assessing the Generalizability of Deep Neural Networks-Based Models for Black Skin Lesions
Authors:
Luana Barros,
Levy Chaves,
Sandra Avila
Abstract:
Melanoma is the most severe type of skin cancer due to its ability to cause metastasis. It is more common in black people, often affecting acral regions: palms, soles, and nails. Deep neural networks have shown tremendous potential for improving clinical care and skin cancer diagnosis. Nevertheless, prevailing studies predominantly rely on datasets of white skin tones, neglecting to report diagnos…
▽ More
Melanoma is the most severe type of skin cancer due to its ability to cause metastasis. It is more common in black people, often affecting acral regions: palms, soles, and nails. Deep neural networks have shown tremendous potential for improving clinical care and skin cancer diagnosis. Nevertheless, prevailing studies predominantly rely on datasets of white skin tones, neglecting to report diagnostic outcomes for diverse patient skin tones. In this work, we evaluate supervised and self-supervised models in skin lesion images extracted from acral regions commonly observed in black individuals. Also, we carefully curate a dataset containing skin lesions in acral regions and assess the datasets concerning the Fitzpatrick scale to verify performance on black skin. Our results expose the poor generalizability of these models, revealing their favorable performance for lesions on white skin. Neglecting to create diverse datasets, which necessitates the development of specialized models, is unacceptable. Deep neural networks have great potential to improve diagnosis, particularly for populations with limited access to dermatology. However, including black skin lesions is necessary to ensure these populations can access the benefits of inclusive technology.
△ Less
Submitted 25 January, 2024; v1 submitted 30 September, 2023;
originally announced October 2023.
-
The Performance of Transferability Metrics does not Translate to Medical Tasks
Authors:
Levy Chaves,
Alceu Bissoto,
Eduardo Valle,
Sandra Avila
Abstract:
Transfer learning boosts the performance of medical image analysis by enabling deep learning (DL) on small datasets through the knowledge acquired from large ones. As the number of DL architectures explodes, exhaustively attempting all candidates becomes unfeasible, motivating cheaper alternatives for choosing them. Transferability scoring methods emerge as an enticing solution, allowing to effici…
▽ More
Transfer learning boosts the performance of medical image analysis by enabling deep learning (DL) on small datasets through the knowledge acquired from large ones. As the number of DL architectures explodes, exhaustively attempting all candidates becomes unfeasible, motivating cheaper alternatives for choosing them. Transferability scoring methods emerge as an enticing solution, allowing to efficiently calculate a score that correlates with the architecture accuracy on any target dataset. However, since transferability scores have not been evaluated on medical datasets, their use in this context remains uncertain, preventing them from benefiting practitioners. We fill that gap in this work, thoroughly evaluating seven transferability scores in three medical applications, including out-of-distribution scenarios. Despite promising results in general-purpose datasets, our results show that no transferability score can reliably and consistently estimate target performance in medical contexts, inviting further work in that direction.
△ Less
Submitted 14 August, 2023;
originally announced August 2023.
-
Test-Time Selection for Robust Skin Lesion Analysis
Authors:
Alceu Bissoto,
Catarina Barata,
Eduardo Valle,
Sandra Avila
Abstract:
Skin lesion analysis models are biased by artifacts placed during image acquisition, which influence model predictions despite carrying no clinical information. Solutions that address this problem by regularizing models to prevent learning those spurious features achieve only partial success, and existing test-time debiasing techniques are inappropriate for skin lesion analysis due to either makin…
▽ More
Skin lesion analysis models are biased by artifacts placed during image acquisition, which influence model predictions despite carrying no clinical information. Solutions that address this problem by regularizing models to prevent learning those spurious features achieve only partial success, and existing test-time debiasing techniques are inappropriate for skin lesion analysis due to either making unrealistic assumptions on the distribution of test data or requiring laborious annotation from medical practitioners. We propose TTS (Test-Time Selection), a human-in-the-loop method that leverages positive (e.g., lesion area) and negative (e.g., artifacts) keypoints in test samples. TTS effectively steers models away from exploiting spurious artifact-related correlations without retraining, and with less annotation requirements. Our solution is robust to a varying availability of annotations, and different levels of bias. We showcase on the ISIC2019 dataset (for which we release a subset of annotated images) how our model could be deployed in the real-world for mitigating bias.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
Local primordial non-Gaussianity from the large-scale clustering of photometric DESI luminous red galaxies
Authors:
Mehdi Rezaie,
Ashley J. Ross,
Hee-Jong Seo,
Hui Kong,
Anna Porredon,
Lado Samushia,
Edmond Chaussidon,
Alex Krolewski,
Arnaud de Mattia,
Florian Beutler,
Jessica Nicole Aguilar,
Steven Ahlen,
Shadab Alam,
Santiago Avila,
Benedict Bahr-Kalus,
Jose Bermejo-Climent,
David Brooks,
Todd Claybaugh,
Shaun Cole,
Kyle Dawson,
Axel de la Macorra,
Peter Doel,
Andreu Font-Ribera,
Jaime E. Forero-Romero,
Satya Gontcho A Gontcho
, et al. (24 additional authors not shown)
Abstract:
We use angular clustering of luminous red galaxies from the Dark Energy Spectroscopic Instrument (DESI) imaging surveys to constrain the local primordial non-Gaussianity parameter $\fnl$. Our sample comprises over 12 million targets, covering 14,000 square degrees of the sky, with redshifts in the range $0.2< z < 1.35$. We identify Galactic extinction, survey depth, and astronomical seeing as the…
▽ More
We use angular clustering of luminous red galaxies from the Dark Energy Spectroscopic Instrument (DESI) imaging surveys to constrain the local primordial non-Gaussianity parameter $\fnl$. Our sample comprises over 12 million targets, covering 14,000 square degrees of the sky, with redshifts in the range $0.2< z < 1.35$. We identify Galactic extinction, survey depth, and astronomical seeing as the primary sources of systematic error, and employ linear regression and artificial neural networks to alleviate non-cosmological excess clustering on large scales. Our methods are tested against simulations with and without $\fnl$ and systematics, showing superior performance of the neural network treatment. The neural network with a set of nine imaging property maps passes our systematic null test criteria, and is chosen as the fiducial treatment. Assuming the universality relation, we find $\fnl = 34^{+24(+50)}_{-44(-73)}$ at 68\%(95\%) confidence. We apply a series of robustness tests (e.g., cuts on imaging, declination, or scales used) that show consistency in the obtained constraints. We study how the regression method biases the measured angular power-spectrum and degrades the $\fnl$ constraining power. The use of the nine maps more than doubles the uncertainty compared to using only the three primary maps in the regression. Our results thus motivate the development of more efficient methods that avoid over-correction, protect large-scale clustering information, and preserve constraining power. Additionally, our results encourage further studies of $\fnl$ with DESI spectroscopic samples, where the inclusion of 3D clustering modes should help separate imaging systematics and lessen the degradation in the $\fnl$ uncertainty.
△ Less
Submitted 25 June, 2024; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Cosmological constraints from the tomography of DES-Y3 galaxies with CMB lensing from ACT DR4
Authors:
G. A. Marques,
M. S. Madhavacheril,
O. Darwish,
S. Shaikh,
M. Aguena,
O. Alves,
S. Avila,
D. Bacon,
E. J. Baxter,
K. Bechtol,
M. R. Becker,
E. Bertin,
J. Blazek,
J. Richard Bond,
D. Brooks,
H. Cai,
E. Calabrese,
A. Carnero Rosell,
M. Carrasco Kind J. Carretero,
R. Cawthon,
M. Crocce,
L. N. da Costa,
M. E. S. Pereira,
J. De Vicente,
S. Desai
, et al. (70 additional authors not shown)
Abstract:
We present a measurement of the cross-correlation between the MagLim galaxies selected from the Dark Energy Survey (DES) first three years of observations (Y3) and cosmic microwave background (CMB) lensing from the Atacama Cosmology Telescope (ACT) Data Release 4 (DR4), reconstructed over $\sim 436$ sq.deg. of the sky. Our galaxy sample, which covers $\sim 4143$ sq.deg., is divided into six redshi…
▽ More
We present a measurement of the cross-correlation between the MagLim galaxies selected from the Dark Energy Survey (DES) first three years of observations (Y3) and cosmic microwave background (CMB) lensing from the Atacama Cosmology Telescope (ACT) Data Release 4 (DR4), reconstructed over $\sim 436$ sq.deg. of the sky. Our galaxy sample, which covers $\sim 4143$ sq.deg., is divided into six redshift bins spanning the redshift range of $0.20<z<1.05$. We adopt a blinding procedure until passing all consistency and systematics tests. After imposing scale cuts for the cross-power spectrum measurement, we reject the null hypothesis of no correlation at 9.1σ. We constrain cosmological parameters from a joint analysis of galaxy and CMB lensing-galaxy power spectra considering a flat \LCDM model, marginalized over 23 astrophysical and systematic nuisance parameters. We find the clustering amplitude $S_8\equiv σ_8 (Ω_m/0.3)^{0.5} = 0.75^{+0.04}_{-0.05}$. In addition, we constrain the linear growth of cosmic structure as a function of redshift. Our results are consistent with recent DES Y3 analyses and suggest a preference for a lower $S_8$ compared to results from measurements of CMB anisotropies by the Planck satellite, although at a mild level ($< 2 σ$) of statistical significance.
△ Less
Submitted 11 October, 2023; v1 submitted 29 June, 2023;
originally announced June 2023.
-
The Early Data Release of the Dark Energy Spectroscopic Instrument
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
G. Aldering,
D. M. Alexander,
R. Alfarsy,
C. Allende Prieto,
M. Alvarez,
O. Alves,
A. Anand,
F. Andrade-Oliveira,
E. Armengaud,
J. Asorey,
S. Avila,
A. Aviles,
S. Bailey,
A. Balaguera-Antolínez,
O. Ballester,
C. Baltay,
A. Bault,
J. Bautista,
J. Behera,
S. F. Beltran
, et al. (240 additional authors not shown)
Abstract:
The Dark Energy Spectroscopic Instrument (DESI) completed its five-month Survey Validation in May 2021. Spectra of stellar and extragalactic targets from Survey Validation constitute the first major data sample from the DESI survey. This paper describes the public release of those spectra, the catalogs of derived properties, and the intermediate data products. In total, the public release includes…
▽ More
The Dark Energy Spectroscopic Instrument (DESI) completed its five-month Survey Validation in May 2021. Spectra of stellar and extragalactic targets from Survey Validation constitute the first major data sample from the DESI survey. This paper describes the public release of those spectra, the catalogs of derived properties, and the intermediate data products. In total, the public release includes good-quality spectral information from 466,447 objects targeted as part of the Milky Way Survey, 428,758 as part of the Bright Galaxy Survey, 227,318 as part of the Luminous Red Galaxy sample, 437,664 as part of the Emission Line Galaxy sample, and 76,079 as part of the Quasar sample. In addition, the release includes spectral information from 137,148 objects that expand the scope beyond the primary samples as part of a series of secondary programs. Here, we describe the spectral data, data quality, data products, Large-Scale Structure science catalogs, access to the data, and references that provide relevant background to using these spectra.
△ Less
Submitted 15 June, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
Validation of the Scientific Program for the Dark Energy Spectroscopic Instrument
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
G. Aldering,
D. M. Alexander,
R. Alfarsy,
C. Allende Prieto,
M. Alvarez,
O. Alves,
A. Anand,
F. Andrade-Oliveira,
E. Armengaud,
J. Asorey,
S. Avila,
A. Aviles,
S. Bailey,
A. Balaguera-Antolínez,
O. Ballester,
C. Baltay,
A. Bault,
J. Bautista,
J. Behera,
S. F. Beltran
, et al. (239 additional authors not shown)
Abstract:
The Dark Energy Spectroscopic Instrument (DESI) was designed to conduct a survey covering 14,000 deg$^2$ over five years to constrain the cosmic expansion history through precise measurements of Baryon Acoustic Oscillations (BAO). The scientific program for DESI was evaluated during a five month Survey Validation (SV) campaign before beginning full operations. This program produced deep spectra of…
▽ More
The Dark Energy Spectroscopic Instrument (DESI) was designed to conduct a survey covering 14,000 deg$^2$ over five years to constrain the cosmic expansion history through precise measurements of Baryon Acoustic Oscillations (BAO). The scientific program for DESI was evaluated during a five month Survey Validation (SV) campaign before beginning full operations. This program produced deep spectra of tens of thousands of objects from each of the stellar (MWS), bright galaxy (BGS), luminous red galaxy (LRG), emission line galaxy (ELG), and quasar target classes. These SV spectra were used to optimize redshift distributions, characterize exposure times, determine calibration procedures, and assess observational overheads for the five-year program. In this paper, we present the final target selection algorithms, redshift distributions, and projected cosmology constraints resulting from those studies. We also present a `One-Percent survey' conducted at the conclusion of Survey Validation covering 140 deg$^2$ using the final target selection algorithms with exposures of a depth typical of the main survey. The Survey Validation indicates that DESI will be able to complete the full 14,000 deg$^2$ program with spectroscopically-confirmed targets from the MWS, BGS, LRG, ELG, and quasar programs with total sample sizes of 7.2, 13.8, 7.46, 15.7, and 2.87 million, respectively. These samples will allow exploration of the Milky Way halo, clustering on all scales, and BAO measurements with a statistical precision of 0.28% over the redshift interval $z<1.1$, 0.39% over the redshift interval $1.1<z<1.9$, and 0.46% over the redshift interval $1.9<z<3.5$.
△ Less
Submitted 12 January, 2024; v1 submitted 9 June, 2023;
originally announced June 2023.
-
DES Y3 + KiDS-1000: Consistent cosmology combining cosmic shear surveys
Authors:
Dark Energy Survey,
Kilo-Degree Survey Collaboration,
:,
T. M. C. Abbott,
M. Aguena,
A. Alarcon,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
M. Asgari,
S. Avila,
D. Bacon,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
E. Bertin,
M. Bilicki,
J. Blazek,
S. Bocquet,
D. Brooks,
P. Burger,
D. L. Burke,
H. Camacho,
A. Campos,
A. Carnero Rosell
, et al. (138 additional authors not shown)
Abstract:
We present a joint cosmic shear analysis of the Dark Energy Survey (DES Y3) and the Kilo-Degree Survey (KiDS-1000) in a collaborative effort between the two survey teams. We find consistent cosmological parameter constraints between DES Y3 and KiDS-1000 which, when combined in a joint-survey analysis, constrain the parameter $S_8 = σ_8 \sqrt{Ω_{\rm m}/0.3}$ with a mean value of…
▽ More
We present a joint cosmic shear analysis of the Dark Energy Survey (DES Y3) and the Kilo-Degree Survey (KiDS-1000) in a collaborative effort between the two survey teams. We find consistent cosmological parameter constraints between DES Y3 and KiDS-1000 which, when combined in a joint-survey analysis, constrain the parameter $S_8 = σ_8 \sqrt{Ω_{\rm m}/0.3}$ with a mean value of $0.790^{+0.018}_{-0.014}$. The mean marginal is lower than the maximum a posteriori estimate, $S_8=0.801$, owing to skewness in the marginal distribution and projection effects in the multi-dimensional parameter space. Our results are consistent with $S_8$ constraints from observations of the cosmic microwave background by Planck, with agreement at the $1.7σ$ level. We use a Hybrid analysis pipeline, defined from a mock survey study quantifying the impact of the different analysis choices originally adopted by each survey team. We review intrinsic alignment models, baryon feedback mitigation strategies, priors, samplers and models of the non-linear matter power spectrum.
△ Less
Submitted 19 October, 2023; v1 submitted 26 May, 2023;
originally announced May 2023.
-
Even Small Correlation and Diversity Shifts Pose Dataset-Bias Issues
Authors:
Alceu Bissoto,
Catarina Barata,
Eduardo Valle,
Sandra Avila
Abstract:
Distribution shifts are common in real-world datasets and can affect the performance and reliability of deep learning models. In this paper, we study two types of distribution shifts: diversity shifts, which occur when test samples exhibit patterns unseen during training, and correlation shifts, which occur when test data present a different correlation between seen invariant and spurious features…
▽ More
Distribution shifts are common in real-world datasets and can affect the performance and reliability of deep learning models. In this paper, we study two types of distribution shifts: diversity shifts, which occur when test samples exhibit patterns unseen during training, and correlation shifts, which occur when test data present a different correlation between seen invariant and spurious features. We propose an integrated protocol to analyze both types of shifts using datasets where they co-exist in a controllable manner. Finally, we apply our approach to a real-world classification problem of skin cancer analysis, using out-of-distribution datasets and specialized bias annotations. Our protocol reveals three findings: 1) Models learn and propagate correlation shifts even with low-bias training; this poses a risk of accumulating and combining unaccountable weak biases; 2) Models learn robust features in high- and low-bias scenarios but use spurious ones if test samples have them; this suggests that spurious correlations do not impair the learning of robust features; 3) Diversity shift can reduce the reliance on spurious correlations; this is counter intuitive since we expect biased models to depend more on biases when invariant features are missing. Our work has implications for distribution shift research and practice, providing new insights into how models learn and rely on spurious correlations under different types of shifts.
△ Less
Submitted 21 December, 2023; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Should I disclose my dataset? Caveats between reproducibility and individual data rights
Authors:
Raysa M. Benatti,
Camila M. L. Villarroel,
Sandra Avila,
Esther L. Colombini,
Fabiana C. Severi
Abstract:
Natural language processing techniques have helped domain experts solve legal problems. Digital availability of court documents increases possibilities for researchers, who can access them as a source for building datasets -- whose disclosure is aligned with good reproducibility practices in computational research. Large and digitized court systems, such as the Brazilian one, are prone to be explo…
▽ More
Natural language processing techniques have helped domain experts solve legal problems. Digital availability of court documents increases possibilities for researchers, who can access them as a source for building datasets -- whose disclosure is aligned with good reproducibility practices in computational research. Large and digitized court systems, such as the Brazilian one, are prone to be explored in that sense. However, personal data protection laws impose restrictions on data exposure and state principles about which researchers should be mindful. Special caution must be taken in cases with human rights violations, such as gender discrimination, over which we elaborate as an example of interest. We present legal and ethical considerations on the issue, as well as guidelines for researchers dealing with this kind of data and deciding whether to disclose it.
△ Less
Submitted 1 November, 2022;
originally announced November 2022.
-
Mapping gas around massive galaxies: cross-correlation of DES Y3 galaxies and Compton-$y$-maps from SPT and Planck
Authors:
J. Sánchez,
Y. Omori,
C. Chang,
L. E. Bleem,
T. Crawford,
A. Drlica-Wagner,
S. Raghunathan,
G. Zacharegkas,
T. M. C. Abbott,
M. Aguena,
A. Alarcon,
S. Allam,
O. Alves,
A. Amon,
S. Avila,
E. Baxter,
K. Bechtol,
B. A. Benson,
G. M. Bernstein,
E. Bertin,
S. Bocquet,
D. Brooks,
D. L. Burke,
A. Campos,
J. E. Carlstrom
, et al. (102 additional authors not shown)
Abstract:
We cross-correlate positions of galaxies measured in data from the first three years of the Dark Energy Survey with Compton-$y$-maps generated using data from the South Pole Telescope (SPT) and the {\it Planck} mission. We model this cross-correlation measurement together with the galaxy auto-correlation to constrain the distribution of gas in the Universe. We measure the hydrostatic mass bias or,…
▽ More
We cross-correlate positions of galaxies measured in data from the first three years of the Dark Energy Survey with Compton-$y$-maps generated using data from the South Pole Telescope (SPT) and the {\it Planck} mission. We model this cross-correlation measurement together with the galaxy auto-correlation to constrain the distribution of gas in the Universe. We measure the hydrostatic mass bias or, equivalently, the mean halo bias-weighted electron pressure $\langle b_{h}P_{e}\rangle$, using large-scale information. We find $\langle b_{h}P_{e}\rangle$ to be $[0.16^{+0.03}_{-0.04},0.28^{+0.04}_{-0.05},0.45^{+0.06}_{-0.10},0.54^{+0.08}_{-0.07},0.61^{+0.08}_{-0.06},0.63^{+0.07}_{-0.08}]$ meV cm$^{-3}$ at redshifts $z \sim [0.30, 0.46, 0.62,0.77, 0.89, 0.97]$. These values are consistent with previous work where measurements exist in the redshift range. We also constrain the mean gas profile using small-scale information, enabled by the high-resolution of the SPT data. We compare our measurements to different parametrized profiles based on the cosmo-OWLS hydrodynamical simulations. We find that our data are consistent with the simulation that assumes an AGN heating temperature of $10^{8.5}$K but are incompatible with the model that assumes an AGN heating temperature of $10^{8.0}$K. These comparisons indicate that the data prefer a higher value of electron pressure than the simulations within $r_{500c}$ of the galaxies' halos.
△ Less
Submitted 18 October, 2022; v1 submitted 16 October, 2022;
originally announced October 2022.
-
Dark Energy Survey Year 3 Results: Measurement of the Baryon Acoustic Oscillations with Three-dimensional Clustering
Authors:
K. C. Chan,
S. Avila,
A. Carnero Rosell,
I. Ferrero,
J. Elvin-Poole,
E. Sanchez,
H. Camacho,
A. Porredon,
M. Crocce,
T. M. C. Abbott,
M. Aguena,
S. Allam,
F. Andrade-Oliveira,
E. Bertin,
S. Bocquet,
D. Brooks,
D. L. Burke,
M. Carrasco Kind,
J. Carretero,
F. J. Castander,
R. Cawthon,
C. Conselice,
M. Costanzi,
M. E. S. Pereira,
J. De Vicente
, et al. (44 additional authors not shown)
Abstract:
The three-dimensional correlation function offers an effective way to summarize the correlation of the large-scale structure even for imaging galaxy surveys. We have applied the projected three-dimensional correlation function, $ξ_{\rm p}$ to measure the Baryonic Acoustic Oscillations (BAO) scale on the first-three years Dark Energy Survey data. The sample consists of about 7 million galaxies in t…
▽ More
The three-dimensional correlation function offers an effective way to summarize the correlation of the large-scale structure even for imaging galaxy surveys. We have applied the projected three-dimensional correlation function, $ξ_{\rm p}$ to measure the Baryonic Acoustic Oscillations (BAO) scale on the first-three years Dark Energy Survey data. The sample consists of about 7 million galaxies in the redshift range $ 0.6 < z_{\rm p } < 1.1 $ over a footprint of $4108 \, \mathrm{deg}^2 $. Our theory modeling includes the impact of realistic true redshift distributions beyond Gaussian photo-$z$ approximation. To increase the signal-to-noise of the measurements, a Gaussian stacking window function is adopted in place of the commonly used top-hat. Using the full sample, $ D_{\rm M}(z_{\rm eff} ) / r_{\rm s} $, the ratio between the comoving angular diameter distance and the sound horizon, is constrained to be $ 19.00 \pm 0.67 $ (top-hat) and $ 19.15 \pm 0.58 $ (Gaussian) at $z_{\rm eff} = 0.835$. The constraint is weaker than the angular correlation $w$ constraint ($18.84 \pm 0.50$) because the BAO signals are heterogeneous across redshift. When a homogeneous BAO-signal sub-sample in the range $ 0.7 < z_{\rm p } < 1.0 $ ($z_{\rm eff} = 0.845$) is considered, $ξ_{\rm p} $ yields $ 19.80 \pm 0.67 $ (top-hat) and $ 19.84 \pm 0.53 $ (Gaussian). The latter is mildly stronger than the $w$ constraint ($19.86 \pm 0.55 $). We find that the $ξ_{\rm p} $ results are more sensitive to photo-$z$ errors than $w$ because $ξ_{\rm p}$ keeps the three-dimensional clustering information causing it to be more prone to photo-$z$ noise. The Gaussian window gives more robust results than the top-hat as the former is designed to suppress the low signal modes. $ξ_{\rm p}$ and the angular statistics such as $w$ have their own pros and cons, and they serve an important crosscheck with each other.
△ Less
Submitted 12 December, 2022; v1 submitted 10 October, 2022;
originally announced October 2022.
-
Dark Energy Survey Year 3 results: Magnification modeling and impact on cosmological constraints from galaxy clustering and galaxy-galaxy lensing
Authors:
J. Elvin-Poole,
N. MacCrann,
S. Everett,
J. Prat,
E. S. Rykoff,
J. De Vicente,
B. Yanny,
K. Herner,
A. Ferté,
E. Di Valentino,
A. Choi,
D. L. Burke,
I. Sevilla-Noarbe,
A. Alarcon,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
E. Baxter,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
J. Blazek,
H. Camacho,
A. Campos,
A. Carnero Rosell
, et al. (71 additional authors not shown)
Abstract:
We study the effect of magnification in the Dark Energy Survey Year 3 analysis of galaxy clustering and galaxy-galaxy lensing, using two different lens samples: a sample of Luminous red galaxies, redMaGiC, and a sample with a redshift-dependent magnitude limit, MagLim. We account for the effect of magnification on both the flux and size selection of galaxies, accounting for systematic effects usin…
▽ More
We study the effect of magnification in the Dark Energy Survey Year 3 analysis of galaxy clustering and galaxy-galaxy lensing, using two different lens samples: a sample of Luminous red galaxies, redMaGiC, and a sample with a redshift-dependent magnitude limit, MagLim. We account for the effect of magnification on both the flux and size selection of galaxies, accounting for systematic effects using the Balrog image simulations. We estimate the impact of magnification on the galaxy clustering and galaxy-galaxy lensing cosmology analysis, finding it to be a significant systematic for the MagLim sample. We show cosmological constraints from the galaxy clustering auto-correlation and galaxy-galaxy lensing signal with different magnifications priors, finding broad consistency in cosmological parameters in $Λ$CDM and $w$CDM. However, when magnification bias amplitude is allowed to be free, we find the two-point correlations functions prefer a different amplitude to the fiducial input derived from the image simulations. We validate the magnification analysis by comparing the cross-clustering between lens bins with the prediction from the baseline analysis, which uses only the auto-correlation of the lens bins, indicating systematics other than magnification may be the cause of the discrepancy. We show adding the cross-clustering between lens redshift bins to the fit significantly improves the constraints on lens magnification parameters and allows uninformative priors to be used on magnification coefficients, without any loss of constraining power or prior volume concerns.
△ Less
Submitted 26 May, 2023; v1 submitted 20 September, 2022;
originally announced September 2022.
-
Primordial non-Gaussianity with Angular correlation function: Integral constraint and validation for DES
Authors:
Walter Riquelme,
Santiago Avila,
Juan Garcia-Bellido,
Anna Porredon,
Ismael Ferrero,
Kwan Chuen Chan,
Rogerio Rosenfeld,
Hugo Camacho,
Adrian G. Adame,
Aurelio Carnero Rosell,
Martin Crocce,
Juan De Vicente,
Tim Eifler,
Jack Elvin-Poole,
Xiao Fang,
Elisabeth Krause,
Martin Rodriguez Monroy,
Ashley J. Ross,
Eusebio Sanchez,
Ignacio Sevilla
Abstract:
Local primordial non-Gaussianity (PNG) is a promising observable of the underlying physics of inflation, characterised by $f_{\rm NL}^{\rm loc}$. We present the methodology to measure $f_{\rm NL}^{\rm loc}$ from the Dark Energy Survey (DES) data using the 2-point angular correlation function (ACF) with scale-dependent bias. One of the focuses of the work is the integral constraint. This condition…
▽ More
Local primordial non-Gaussianity (PNG) is a promising observable of the underlying physics of inflation, characterised by $f_{\rm NL}^{\rm loc}$. We present the methodology to measure $f_{\rm NL}^{\rm loc}$ from the Dark Energy Survey (DES) data using the 2-point angular correlation function (ACF) with scale-dependent bias. One of the focuses of the work is the integral constraint. This condition appears when estimating the mean number density of galaxies from the data and is key in obtaining unbiased $f_{\rm NL}^{\rm loc}$ constraints. The methods are analysed for two types of simulations: $\sim 246$ GOLIAT-PNG N-body small area simulations with $f_{\rm NL}$ equal to -100 and 100, and 1952 Gaussian ICE-COLA mocks with $f_{\rm NL}=0$ that follow the DES angular and redshift distribution. We use the ensemble of GOLIAT-PNG mocks to show the importance of the integral constraint when measuring PNG, where we recover the fiducial values of $f_{\rm NL}$ within the $1σ$ when including the integral constraint. In contrast, we found a bias of $Δf_{\rm NL}\sim 100$ when not including it. For a DES-like scenario, we forecast a bias of $Δf_{\rm NL} \sim 23$, equivalent to $1.8σ$, when not using the IC for a fiducial value of $f_{\rm NL}=100$. We use the ICE-COLA mocks to validate our analysis in a realistic DES-like setup finding it robust to different analysis choices: best-fit estimator, the effect of IC, BAO damping, covariance, and scale choices. We forecast a measurement of $f_{\rm NL}$ within $σ(f_{\rm NL})=31$ when using the DES-Y3 BAO sample, with the ACF in the $1\ {\rm deg}<θ<20\ {\rm deg}$ range.
△ Less
Submitted 18 April, 2023; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Artifact-Based Domain Generalization of Skin Lesion Models
Authors:
Alceu Bissoto,
Catarina Barata,
Eduardo Valle,
Sandra Avila
Abstract:
Deep Learning failure cases are abundant, particularly in the medical area. Recent studies in out-of-distribution generalization have advanced considerably on well-controlled synthetic datasets, but they do not represent medical imaging contexts. We propose a pipeline that relies on artifacts annotation to enable generalization evaluation and debiasing for the challenging skin lesion analysis cont…
▽ More
Deep Learning failure cases are abundant, particularly in the medical area. Recent studies in out-of-distribution generalization have advanced considerably on well-controlled synthetic datasets, but they do not represent medical imaging contexts. We propose a pipeline that relies on artifacts annotation to enable generalization evaluation and debiasing for the challenging skin lesion analysis context. First, we partition the data into levels of increasingly higher biased training and test sets for better generalization assessment. Then, we create environments based on skin lesion artifacts to enable domain generalization methods. Finally, after robust training, we perform a test-time debiasing procedure, reducing spurious features in inference images. Our experiments show our pipeline improves performance metrics in biased cases, and avoids artifacts when using explanation methods. Still, when evaluating such models in out-of-distribution data, they did not prefer clinically-meaningful features. Instead, performance only improved in test sets that present similar artifacts from training, suggesting models learned to ignore the known set of artifacts. Our results raise a concern that debiasing models towards a single aspect may not be enough for fair skin lesion analysis.
△ Less
Submitted 20 August, 2022;
originally announced August 2022.
-
Dark Energy Survey Year 3 Results: Constraints on extensions to $Λ$CDM with weak lensing and galaxy clustering
Authors:
DES Collaboration,
T. M. C. Abbott,
M. Aguena,
A. Alarcon,
O. Alves,
A. Amon,
J. Annis,
S. Avila,
D. Bacon,
E. Baxter,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
S. Birrer,
J. Blazek,
S. Bocquet,
A. Brandao-Souza,
S. L. Bridle,
D. Brooks,
D. L. Burke,
H. Camacho,
A. Campos,
A. Carnero Rosell,
M. Carrasco Kind,
J. Carretero
, et al. (137 additional authors not shown)
Abstract:
We constrain extensions to the $Λ$CDM model using measurements from the Dark Energy Survey's first three years of observations and external data. The DES data are the two-point correlation functions of weak gravitational lensing, galaxy clustering, and their cross-correlation. We use simulated data and blind analyses of real data to validate the robustness of our results. In many cases, constraini…
▽ More
We constrain extensions to the $Λ$CDM model using measurements from the Dark Energy Survey's first three years of observations and external data. The DES data are the two-point correlation functions of weak gravitational lensing, galaxy clustering, and their cross-correlation. We use simulated data and blind analyses of real data to validate the robustness of our results. In many cases, constraining power is limited by the absence of nonlinear predictions that are reliable at our required precision. The models are: dark energy with a time-dependent equation of state, non-zero spatial curvature, sterile neutrinos, modifications of gravitational physics, and a binned $σ_8(z)$ model which serves as a probe of structure growth. For the time-varying dark energy equation of state evaluated at the pivot redshift we find $(w_{\rm p}, w_a)= (-0.99^{+0.28}_{-0.17},-0.9\pm 1.2)$ at 68% confidence with $z_{\rm p}=0.24$ from the DES measurements alone, and $(w_{\rm p}, w_a)= (-1.03^{+0.04}_{-0.03},-0.4^{+0.4}_{-0.3})$ with $z_{\rm p}=0.21$ for the combination of all data considered. Curvature constraints of $Ω_k=0.0009\pm 0.0017$ and effective relativistic species $N_{\rm eff}=3.10^{+0.15}_{-0.16}$ are dominated by external data. For massive sterile neutrinos, we improve the upper bound on the mass $m_{\rm eff}$ by a factor of three compared to previous analyses, giving 95% limits of $(ΔN_{\rm eff},m_{\rm eff})\leq (0.28, 0.20\, {\rm eV})$. We also constrain changes to the lensing and Poisson equations controlled by functions $Σ(k,z) = Σ_0 Ω_Λ(z)/Ω_{Λ,0}$ and $μ(k,z)=μ_0 Ω_Λ(z)/Ω_{Λ,0}$ respectively to $Σ_0=0.6^{+0.4}_{-0.5}$ from DES alone and $(Σ_0,μ_0)=(0.04\pm 0.05,0.08^{+0.21}_{-0.19})$ for the combination of all data. Overall, we find no significant evidence for physics beyond $Λ$CDM.
△ Less
Submitted 29 October, 2023; v1 submitted 12 July, 2022;
originally announced July 2022.
-
Joint analysis of DES Year 3 data and CMB lensing from SPT and Planck III: Combined cosmological constraints
Authors:
T. M. C. Abbott,
M. Aguena,
A. Alarcon,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
J. Annis,
B. Ansarinejad,
S. Avila,
D. Bacon,
E. J. Baxter,
K. Bechtol,
M. R. Becker,
B. A. Benson,
G. M. Bernstein,
E. Bertin,
J. Blazek,
L. E. Bleem,
S. Bocquet,
D. Brooks,
E. Buckley-Geer,
D. L. Burke,
H. Camacho,
A. Campos,
J. E. Carlstrom
, et al. (146 additional authors not shown)
Abstract:
We present cosmological constraints from the analysis of two-point correlation functions between galaxy positions and galaxy lensing measured in Dark Energy Survey (DES) Year 3 data and measurements of cosmic microwave background (CMB) lensing from the South Pole Telescope (SPT) and Planck. When jointly analyzing the DES-only two-point functions and the DES cross-correlations with SPT+Planck CMB l…
▽ More
We present cosmological constraints from the analysis of two-point correlation functions between galaxy positions and galaxy lensing measured in Dark Energy Survey (DES) Year 3 data and measurements of cosmic microwave background (CMB) lensing from the South Pole Telescope (SPT) and Planck. When jointly analyzing the DES-only two-point functions and the DES cross-correlations with SPT+Planck CMB lensing, we find $Ω_{\rm m} = 0.344\pm 0.030$ and $S_8 \equiv σ_8 (Ω_{\rm m}/0.3)^{0.5} = 0.773\pm 0.016$, assuming $Λ$CDM. When additionally combining with measurements of the CMB lensing autospectrum, we find $Ω_{\rm m} = 0.306^{+0.018}_{-0.021}$ and $S_8 = 0.792\pm 0.012$. The high signal-to-noise of the CMB lensing cross-correlations enables several powerful consistency tests of these results, including comparisons with constraints derived from cross-correlations only, and comparisons designed to test the robustness of the galaxy lensing and clustering measurements from DES. Applying these tests to our measurements, we find no evidence of significant biases in the baseline cosmological constraints from the DES-only analyses or from the joint analyses with CMB lensing cross-correlations. However, the CMB lensing cross-correlations suggest possible problems with the correlation function measurements using alternative lens galaxy samples, in particular the redMaGiC galaxies and high-redshift MagLim galaxies, consistent with the findings of previous studies. We use the CMB lensing cross-correlations to identify directions for further investigating these problems.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
A Survey on Deep Learning for Skin Lesion Segmentation
Authors:
Zahra Mirikharaji,
Kumar Abhishek,
Alceu Bissoto,
Catarina Barata,
Sandra Avila,
Eduardo Valle,
M. Emre Celebi,
Ghassan Hamarneh
Abstract:
Skin cancer is a major public health problem that could benefit from computer-aided diagnosis to reduce the burden of this common disease. Skin lesion segmentation from images is an important step toward achieving this goal. However, the presence of natural and artificial artifacts (e.g., hair and air bubbles), intrinsic factors (e.g., lesion shape and contrast), and variations in image acquisitio…
▽ More
Skin cancer is a major public health problem that could benefit from computer-aided diagnosis to reduce the burden of this common disease. Skin lesion segmentation from images is an important step toward achieving this goal. However, the presence of natural and artificial artifacts (e.g., hair and air bubbles), intrinsic factors (e.g., lesion shape and contrast), and variations in image acquisition conditions make skin lesion segmentation a challenging task. Recently, various researchers have explored the applicability of deep learning models to skin lesion segmentation. In this survey, we cross-examine 177 research papers that deal with deep learning-based segmentation of skin lesions. We analyze these works along several dimensions, including input data (datasets, preprocessing, and synthetic data generation), model design (architecture, modules, and losses), and evaluation aspects (data annotation requirements and segmentation performance). We discuss these dimensions both from the viewpoint of select seminal works, and from a systematic viewpoint, examining how those choices have influenced current trends, and how their limitations should be addressed. To facilitate comparisons, we summarize all examined works in a comprehensive table as well as an interactive table available online at https://github.com/sfu-mial/skin-lesion-segmentation-survey.
△ Less
Submitted 20 June, 2023; v1 submitted 1 June, 2022;
originally announced June 2022.
-
Seeing without Looking: Analysis Pipeline for Child Sexual Abuse Datasets
Authors:
Camila Laranjeira,
João Macedo,
Sandra Avila,
Jefersson A. dos Santos
Abstract:
The online sharing and viewing of Child Sexual Abuse Material (CSAM) are growing fast, such that human experts can no longer handle the manual inspection. However, the automatic classification of CSAM is a challenging field of research, largely due to the inaccessibility of target data that is - and should forever be - private and in sole possession of law enforcement agencies. To aid researchers…
▽ More
The online sharing and viewing of Child Sexual Abuse Material (CSAM) are growing fast, such that human experts can no longer handle the manual inspection. However, the automatic classification of CSAM is a challenging field of research, largely due to the inaccessibility of target data that is - and should forever be - private and in sole possession of law enforcement agencies. To aid researchers in drawing insights from unseen data and safely providing further understanding of CSAM images, we propose an analysis template that goes beyond the statistics of the dataset and respective labels. It focuses on the extraction of automatic signals, provided both by pre-trained machine learning models, e.g., object categories and pornography detection, as well as image metrics such as luminance and sharpness. Only aggregated statistics of sparse signals are provided to guarantee the anonymity of children and adolescents victimized. The pipeline allows filtering the data by applying thresholds to each specified signal and provides the distribution of such signals within the subset, correlations between signals, as well as a bias evaluation. We demonstrated our proposal on the Region-based annotated Child Pornography Dataset (RCPD), one of the few CSAM benchmarks in the literature, composed of over 2000 samples among regular and CSAM images, produced in partnership with Brazil's Federal Police. Although noisy and limited in several senses, we argue that automatic signals can highlight important aspects of the overall distribution of data, which is valuable for databases that can not be disclosed. Our goal is to safely publicize the characteristics of CSAM datasets, encouraging researchers to join the field and perhaps other institutions to provide similar reports on their benchmarks.
△ Less
Submitted 29 April, 2022;
originally announced April 2022.
-
Validating galaxy clustering models with Fixed & Paired and Matched-ICs simulations: application to Primordial Non-Gaussianities
Authors:
Santiago Avila,
Adrian G. Adame
Abstract:
The Fix and Pair techniques were designed to generate simulations with reduced variance in the 2-point statistics by modifying the Initial Conditions (ICs). In this paper we show that this technique is also valid when the initial conditions have local Primordial non-Gaussianities (PNG), parametrised by $f_{\rm NL}$, without biasing the 2-point statistics but reducing significantly their variance.…
▽ More
The Fix and Pair techniques were designed to generate simulations with reduced variance in the 2-point statistics by modifying the Initial Conditions (ICs). In this paper we show that this technique is also valid when the initial conditions have local Primordial non-Gaussianities (PNG), parametrised by $f_{\rm NL}$, without biasing the 2-point statistics but reducing significantly their variance. We show how to quantitatively use these techniques to test the accuracy of galaxy/halo clustering models down to a much reduced uncertainty and we apply them to test the standard model for halo clustering in the presence of PNG. Additionally, we show that by Matching the stochastic part of the ICs for two different cosmologies (Gaussian and non-Gaussian) we obtain a large correlation between the (2-point) statistics that can explicitly be used to further reduce the uncertainty of the model testing. For our reference analysis ($f_{\rm NL}=100$, $V=1 [h^{-1}{\rm Gpc}]^3$, $n= 2.5\times 10^{-4}[h^{-1}{\rm Mpc}]^{-3}$, $b=2.32$), we obtain an uncertainty of $σ(f_{\rm NL})=60$ with a standard simulation, whereas using Fixed [Fixed-Paired] initial conditions it reduces to $σ(f_{\rm NL})=12$ [$σ(f_{\rm NL})=12$]. When also Matching the ICs we obtain $σ(f_{\rm NL})=18$ for the standard case, and $σ(f_{\rm NL})=8$ [$σ(f_{\rm NL})=7$] for Fixed [Fixed-Paired]. The combination of the Fix, Pair and Match techniques can be used in the context of PNG to create simulations with an effective volume incremented by a factor $\sim 70$ at given computational resources.
△ Less
Submitted 16 December, 2022; v1 submitted 23 April, 2022;
originally announced April 2022.
-
Towards Optimized Charge Transport in Multilayer Reduced Graphene Oxides
Authors:
Mustafa Neset Cinar,
Aleandro Antidormi,
Viet-Hung Nguyen,
Alessandro Kovtun,
Samuel Lara Avila,
Andrea Liscio,
Jean-Christophe Charlier,
Stephan Roche,
Haldun Sevincli
Abstract:
In the context of graphene-based composite applications, a complete understanding of charge conduction in multilayer reduced graphene oxides (rGO) is highly desirable. However, these rGO compounds are characterized by multiple and different sources of disorder depending on the chemical method used for their synthesis. Most importantly the precise role of interlayer interaction in promoting or jeop…
▽ More
In the context of graphene-based composite applications, a complete understanding of charge conduction in multilayer reduced graphene oxides (rGO) is highly desirable. However, these rGO compounds are characterized by multiple and different sources of disorder depending on the chemical method used for their synthesis. Most importantly the precise role of interlayer interaction in promoting or jeopardizing electronic flow remains unclear. Here, thanks to the development of a multiscale computational approach combining first-principles calculations with large scale transport simulations, the transport scaling laws in multilayer rGO are unraveled, explaining why diffusion worsens with increasing film thickness. In contrast, contacted films are found to exhibit an opposite trend when the mean free path becomes shorter than the channel length, since conduction becomes predominantly driven by interlayer hopping. These predictions are favourably compared with experimental data and open a road towards the optimization of graphene-based composites with improved electrical conduction.
△ Less
Submitted 31 January, 2022;
originally announced January 2022.
-
Clustering with general photo-$z$ uncertainties: Application to Baryon Acoustic Oscillations
Authors:
Kwan Chuen Chan,
Ismael Ferrero,
Santiago Avila,
Ashley J. Ross,
Martin Crocce,
Enrique Gaztanaga
Abstract:
Photometric data can be analyzed using the three-dimensional correlation function $ξ_{\rm p}$ to extract cosmological information via e.g., measurement of the Baryon Acoustic Oscillations (BAO). Previous studies modeled $ξ_{\rm p} $ assuming a Gaussian photo-$z$ approximation. In this work we improve the modeling by incorporating realistic photo-$z$ distribution. We show that the position of the B…
▽ More
Photometric data can be analyzed using the three-dimensional correlation function $ξ_{\rm p}$ to extract cosmological information via e.g., measurement of the Baryon Acoustic Oscillations (BAO). Previous studies modeled $ξ_{\rm p} $ assuming a Gaussian photo-$z$ approximation. In this work we improve the modeling by incorporating realistic photo-$z$ distribution. We show that the position of the BAO scale in $ξ_{\rm p}$ is determined by the photo-$z$ distribution and the Jacobian of the transformation. The latter diverges at the transverse scale of the separation $s_\perp $, and it explains why $ξ_{\rm p } $ traces the underlying correlation function at $s_\perp $, rather than $s$, when the photo-$z$ uncertainty $ σ_z / (1+ z) \gtrsim 0.02$. We also obtain the Gaussian covariance for $ξ_{\rm p}$. Due to photo-$z$ mixing, the covariance of $ξ_{\mathrm{p}}$ shows strong off-diagonal elements. The high correlation of the data causes some issues to the data fitting. Nonetheless, we find that either it can be solved by suppressing the largest eigenvalues of the covariance or it is not directly related to the BAO. We test our BAO fitting pipeline using a set of mock catalogs. The data set is dedicated for Dark Energy Survey Year 3 (DES Y3) BAO analyses and includes realistic photo-$z$ distributions. The theory template is in good agreement with mock measurement. Based on the DES Y3 mocks, $ξ_{\rm p}$ statistic is forecast to constrain the BAO shift parameter $α$ to be $1.001 \pm 0.023$, which is well consistent with the corresponding constraint derived from the angular correlation function measurements. Thus $ξ_{\rm p}$ offers a competitive alternative for the photometric data analyses.
△ Less
Submitted 7 February, 2022; v1 submitted 25 October, 2021;
originally announced October 2021.
-
Weakly Supervised Attention-based Models Using Activation Maps for Citrus Mite and Insect Pest Classification
Authors:
Edson Bollis,
Helena Maia,
Helio Pedrini,
Sandra Avila
Abstract:
Citrus juices and fruits are commodities with great economic potential in the international market, but productivity losses caused by mites and other pests are still far from being a good mark. Despite the integrated pest mechanical aspect, only a few works on automatic classification have handled images with orange mite characteristics, which means tiny and noisy regions of interest. On the compu…
▽ More
Citrus juices and fruits are commodities with great economic potential in the international market, but productivity losses caused by mites and other pests are still far from being a good mark. Despite the integrated pest mechanical aspect, only a few works on automatic classification have handled images with orange mite characteristics, which means tiny and noisy regions of interest. On the computational side, attention-based models have gained prominence in deep learning research, and, along with weakly supervised learning algorithms, they have improved tasks performed with some label restrictions. In agronomic research of pests and diseases, these techniques can improve classification performance while pointing out the location of mites and insects without specific labels, reducing deep learning development costs related to generating bounding boxes. In this context, this work proposes an attention-based activation map approach developed to improve the classification of tiny regions called Two-Weighted Activation Mapping, which also produces locations using feature map scores learned from class labels. We apply our method in a two-stage network process called Attention-based Multiple Instance Learning Guided by Saliency Maps. We analyze the proposed approach in two challenging datasets, the Citrus Pest Benchmark, which was captured directly in the field using magnifying glasses, and the Insect Pest, a large pest image benchmark. In addition, we evaluate and compare our models with weakly supervised methods, such as Attention-based Deep MIL and WILDCAT. The results show that our classifier is superior to literature methods that use tiny regions in their classification tasks, surpassing them in all scenarios by at least 16 percentage points. Moreover, our approach infers bounding box locations for salient insects, even training without any location labels.
△ Less
Submitted 2 October, 2021;
originally announced October 2021.
-
CIDEr-R: Robust Consensus-based Image Description Evaluation
Authors:
Gabriel Oliveira dos Santos,
Esther Luna Colombini,
Sandra Avila
Abstract:
This paper shows that CIDEr-D, a traditional evaluation metric for image description, does not work properly on datasets where the number of words in the sentence is significantly greater than those in the MS COCO Captions dataset. We also show that CIDEr-D has performance hampered by the lack of multiple reference sentences and high variance of sentence length. To bypass this problem, we introduc…
▽ More
This paper shows that CIDEr-D, a traditional evaluation metric for image description, does not work properly on datasets where the number of words in the sentence is significantly greater than those in the MS COCO Captions dataset. We also show that CIDEr-D has performance hampered by the lack of multiple reference sentences and high variance of sentence length. To bypass this problem, we introduce CIDEr-R, which improves CIDEr-D, making it more flexible in dealing with datasets with high sentence length variance. We demonstrate that CIDEr-R is more accurate and closer to human judgment than CIDEr-D; CIDEr-R is more robust regarding the number of available references. Our results reveal that using Self-Critical Sequence Training to optimize CIDEr-R generates descriptive captions. In contrast, when CIDEr-D is optimized, the generated captions' length tends to be similar to the reference length. However, the models also repeat several times the same word to increase the sentence length.
△ Less
Submitted 28 September, 2021;
originally announced September 2021.
-
C/2014 UN271 (Bernardinelli-Bernstein): the nearly spherical cow of comets
Authors:
Pedro H. Bernardinelli,
Gary M. Bernstein,
Benjamin T. Montet,
Robert Weryk,
Richard Wainscoat,
M. Aguena,
S. Allam,
F. Andrade-Oliveira,
J. Annis,
S. Avila,
E. Bertin,
D. Brooks,
D. L. Burke,
A. Carnero Rosell,
M. Carrasco Kind,
J. Carretero,
R. Cawthon,
C. Conselice,
M. Costanzi,
L. N. da Costa,
M. E. S. Pereira,
J. De Vicente,
H. T. Diehl,
S. Everett,
I. Ferrero
, et al. (42 additional authors not shown)
Abstract:
C/2014 UN271 (Bernardinelli-Bernstein) is a comet incoming from the Oort cloud which is remarkable in having the brightest (and presumably largest) nucleus of any well-measured comet, and having been discovered at heliocentric distance $r_h\approx29$ au farther than any Oort-cloud member. We describe the properties that can be inferred from images recorded until the first reports of activity in Ju…
▽ More
C/2014 UN271 (Bernardinelli-Bernstein) is a comet incoming from the Oort cloud which is remarkable in having the brightest (and presumably largest) nucleus of any well-measured comet, and having been discovered at heliocentric distance $r_h\approx29$ au farther than any Oort-cloud member. We describe the properties that can be inferred from images recorded until the first reports of activity in June 2021. The orbit has $i=95^\circ,$ with perihelion of 10.97 au to be reached in 2031, and previous aphelion at $40,400\pm260$ au. Backwards integration of the orbit under a standard Galactic tidal model and known stellar encounters suggests this is a pristine new comet, with a perihelion of $q\approx18$ au on its previous perihelion passage 3.5 Myr ago. The photometric data show an unresolved nucleus with absolute magnitude $H_r=8.0,$ colors that are typical of comet nuclei or Damocloids, and no secular trend as it traversed the range 34--23 au. For $r$-band geometric albedo $p_r,$ this implies a diameter of $150 (p_r/0.04)^{-0.5}$ km. There is strong evidence of brightness fluctuations at $\pm0.2$ mag level, but no rotation period can be discerned. A coma consistent with a ``stationary' $1/ρ$ surface-brightness distribution grew in scattering cross-section at an exponential rate from $A f ρ\approx1$ m to $\approx150$ m as the comet approached from 28 to 20 au. The activity is consistent with a simple model of sublimation of a surface species in radiative equilibrium with the Sun. The inferred enthalpy of sublimation matches those of $CO_2$ and $NH_3$. More-volatile species -- $N_2,$ $CH_4,$ and $CO$ -- must be far less abundant on the sublimating surfaces.
△ Less
Submitted 22 September, 2021; v1 submitted 20 September, 2021;
originally announced September 2021.
-
Dark Energy Survey Year 3 Results: Galaxy Sample for BAO Measurement
Authors:
A. Carnero Rosell,
M. Rodriguez-Monroy,
M. Crocce,
J. Elvin-Poole,
A. Porredon,
I. Ferrero,
J. Mena-Fernandez,
R. Cawthon,
J. De Vicente,
E. Gaztanaga,
A. J. Ross,
E. Sanchez,
I. Sevilla-Noarbe,
O. Alves,
F. Andrade-Oliveira,
J. Asorey,
S. Avila,
A. Brandao-Souza,
H. Camacho,
K. C. Chan,
A. Ferte,
J. Muir,
W. Riquelme,
R. Rosenfeld,
D. Sanchez Cid
, et al. (84 additional authors not shown)
Abstract:
In this paper we present and validate the galaxy sample used for the analysis of the Baryon Acoustic Oscillation signal (BAO) in the Dark Energy Survey (DES) Y3 data. The definition is based on a colour and redshift-dependent magnitude cut optimized to select galaxies at redshifts higher than 0.5, while ensuring a high quality photometric redshift determination. The sample covers $\approx 4100$ sq…
▽ More
In this paper we present and validate the galaxy sample used for the analysis of the Baryon Acoustic Oscillation signal (BAO) in the Dark Energy Survey (DES) Y3 data. The definition is based on a colour and redshift-dependent magnitude cut optimized to select galaxies at redshifts higher than 0.5, while ensuring a high quality photometric redshift determination. The sample covers $\approx 4100$ square degrees to a depth of $i = 22.3 \ (AB)$ at $10σ$. It contains 7,031,993 galaxies in the redshift range from $z$= 0.6 to 1.1, with a mean effective redshift of 0.835. Photometric redshifts are estimated with the machine learning algorithm DNF, and are validated using the VIPERS PDR2 sample. We find a mean redshift bias of $z_{\mathrm{bias}} \approx 0.01$ and a mean uncertainty, in units of $1+z$, of $σ_{68} \approx 0.03$. We evaluate the galaxy population of the sample, showing it is mostly built upon Elliptical to Sbc types. Furthermore, we find a low level of stellar contamination of $\lesssim 4\%$. We present the method used to mitigate the effect of spurious clustering coming from observing conditions and other large-scale systematics. We apply it to the DES Y3 BAO sample and calculate sample weights that are used to get a robust estimate of the galaxy clustering signal. This paper is one of a series dedicated to the analysis of the BAO signal in the DES Y3 data. In the companion papers, Ferrero et al. (2021) and DES Collaboration (2021), we present the galaxy mock catalogues used to calibrate the analysis and the angular diameter distance constraints obtained through the fitting to the BAO scale, respectively. The galaxy sample, masks and additional material will be released in the public DES data repository upon acceptance.
△ Less
Submitted 21 October, 2021; v1 submitted 12 July, 2021;
originally announced July 2021.
-
Dark Energy Survey Year 3 Results: A 2.7% measurement of Baryon Acoustic Oscillation distance scale at redshift 0.835
Authors:
DES Collaboration,
T. M. C. Abbott,
M. Aguena,
S. Allam,
F. Andrade-Oliveira,
J. Asorey,
S. Avila,
G. M. Bernstein,
E. Bertin,
A. Brandao-Souza,
D. Brooks,
D. L. Burke,
J. Calcino,
H. Camacho,
A. Carnero Rosell,
D. Carollo,
M. Carrasco Kind,
J. Carretero,
F. J. Castander,
R. Cawthon,
K. C. Chan,
A. Choi,
C. Conselice,
M. Costanzi,
M. Crocce
, et al. (86 additional authors not shown)
Abstract:
We present angular diameter measurements obtained by measuring the position of Baryon Acoustic Oscillations (BAO) in an optimised sample of galaxies from the first three years of Dark Energy Survey data (DES Y3). The sample consists of 7 million galaxies distributed over a footprint of 4100 deg$^2$ with $0.6 < z_{\rm photo} < 1.1$ and a typical redshift uncertainty of $0.03(1+z)$. The sample selec…
▽ More
We present angular diameter measurements obtained by measuring the position of Baryon Acoustic Oscillations (BAO) in an optimised sample of galaxies from the first three years of Dark Energy Survey data (DES Y3). The sample consists of 7 million galaxies distributed over a footprint of 4100 deg$^2$ with $0.6 < z_{\rm photo} < 1.1$ and a typical redshift uncertainty of $0.03(1+z)$. The sample selection is the same as in the BAO measurement with the first year of DES data, but the analysis presented here uses three times the area, extends to higher redshift and makes a number of improvements, including a fully analytical BAO template, the use of covariances from both theory and simulations, and an extensive pre-unblinding protocol. We used two different statistics: angular correlation function and power spectrum, and validate our pipeline with an ensemble of over 1500 realistic simulations. Both statistics yield compatible results. We combine the likelihoods derived from angular correlations and spherical harmonics to constrain the ratio of comoving angular diameter distance $D_M$ at the effective redshift of our sample to the sound horizon scale at the drag epoch. We obtain $D_M(z_{\rm eff}=0.835)/r_{\rm d} = 18.92 \pm 0.51$, which is consistent with, but smaller than, the Planck prediction assuming flat \lcdm, at the level of $2.3 σ$. The analysis was performed blind and is robust to changes in a number of analysis choices. It represents the most precise BAO distance measurement from imaging data to date, and is competitive with the latest transverse ones from spectroscopic samples at $z>0.75$. When combined with DES 3x2pt + SNIa, they lead to improvements in $H_0$ and $Ω_m$ constraints by $\sim 20\%$
△ Less
Submitted 18 March, 2022; v1 submitted 9 July, 2021;
originally announced July 2021.
-
Dark Energy Survey Year 3 Results: Galaxy mock catalogs for BAO analysis
Authors:
I. Ferrero,
M. Crocce,
I. Tutusaus,
A. Porredon,
L. Blot,
P. Fosalba,
A. Carnero Rosell,
S. Avila,
A. Izard,
J. Elvin-Poole,
K. C. Chan,
H. Camacho,
R. Rosenfeld,
E. Sanchez,
P. Tallada-Crespí,
J. Carretero,
I. Sevilla-Noarbe,
E. Gaztanaga,
F. Andrade-Oliveira,
J. De Vicente,
J. Mena-Fernández,
A. J. Ross,
D. Sanchez Cid,
A. Ferté,
A. Brandao-Souza
, et al. (61 additional authors not shown)
Abstract:
The calibration and validation of scientific analysis in simulations is a fundamental tool to ensure unbiased and robust results in observational cosmology. In particular, mock galaxy catalogs are a crucial resource to achieve these goals in the measurement of baryon acoustic oscillation (BAO) in the clustering of galaxies. Here we present a set of 1952 galaxy mock catalogs designed to mimic the D…
▽ More
The calibration and validation of scientific analysis in simulations is a fundamental tool to ensure unbiased and robust results in observational cosmology. In particular, mock galaxy catalogs are a crucial resource to achieve these goals in the measurement of baryon acoustic oscillation (BAO) in the clustering of galaxies. Here we present a set of 1952 galaxy mock catalogs designed to mimic the Dark Energy Survey (DES) Year 3 BAO sample over its full photometric redshift range $0.6<z_{\rm photo}<1.1$. The mocks are based upon 488 ICE-COLA fast $N$-body simulations of full-sky light cones and were created by populating halos with galaxies, using a hybrid halo occupation distribution - halo abundance matching model. This model has ten free parameters, which were determined, for the first time, using an automatic likelihood minimization procedure. We also introduced a novel technique to assign photometric redshift for simulated galaxies, following a two-dimensional probability distribution with VIMOS Public Extragalactic Redshift Survey (VIPERS) data. The calibration was designed to match the observed abundance of galaxies as a function of photometric redshift, the distribution of photometric redshift errors, and the clustering amplitude on scales smaller than those used for BAO measurements. An exhaustive analysis was done to ensure that the mocks reproduce the input properties. Finally, mocks were tested by comparing the angular correlation function $w(θ)$, angular power spectrum $C_\ell$, and projected clustering $ξ_p(r_\perp)$ to theoretical predictions and data. The impact of volume replication in the estimate of the covariance is also investigated. The success in accurately reproducing the photometric redshift uncertainties and the galaxy clustering as a function of redshift render this mock creation pipeline as a benchmark for future analyses of photometric galaxy surveys.
△ Less
Submitted 13 December, 2021; v1 submitted 9 July, 2021;
originally announced July 2021.