Detecting the edges of galaxies with deep learning
Authors:
Jesús Fernández,
Fernando Buitrago,
Benjamín Sahelices
Abstract:
Galaxy edges or truncations are low-surface-brightness (LSB) features located in the galaxy outskirts that delimit the distance up to where the gas density enables efficient star formation. As such, they can be interpreted as a non-arbitrary measure of galaxy size, an interpretation reinforced by the smaller scatter in the galaxy mass-size relation compared with other size proxies. However, this novel metric poses several problems, namely the need for deep imaging and the need to compare the surface brightness, color, and mass profiles to derive the edge position. While the first hurdle has already been overcome by new ultra-deep galaxy observations, we propose the use of machine learning (ML) algorithms to determine the position of these features in very large datasets. We compare the semantic segmentation produced by our deep learning (DL) models with the results obtained by humans for HST observations of a sample of 1052 massive (M$_{\rm stellar}$ > 10$^{10}$ M$_{\odot}$) galaxies at $z < 1$. In addition, we introduce the concept of astronomic augmentations to endow the network inputs with physical meaning. Our findings suggest that performance similar to that of humans could be routinely achieved. The best results are obtained by combining the outputs of several neural networks using ensemble learning. Additionally, we find that edge-aware loss functions allow the networks to focus their optimization on the galaxy boundaries. The experiments reveal a close similarity between the segmentations produced by the models and those produced by humans. The best single model achieves an average Dice score of 0.8969, while the best ensemble reaches an average Dice score of 0.9104. This methodology will be widely used on future datasets, such as that of Euclid, to derive scaling relations that are expected to closely follow the galaxy mass assembly.
Submitted 18 December, 2023;
originally announced December 2023.
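The Dice score quoted in the abstract above measures the overlap between a predicted mask and a reference mask: Dice = 2|A ∩ B| / (|A| + |B|). The following is a minimal sketch of that metric, together with one simple way of combining several networks. The pixel-wise averaging of probability maps, the 0.5 threshold, the function names, and the toy 3x3 masks are all assumptions made for illustration; the abstract does not state how the ensemble was built.

```python
def dice_coefficient(pred, truth):
    """Dice = 2|A ∩ B| / (|A| + |B|) for two binary masks given as nested lists."""
    flat_pred = [int(bool(v)) for row in pred for v in row]
    flat_truth = [int(bool(v)) for row in truth for v in row]
    if len(flat_pred) != len(flat_truth):
        raise ValueError("masks must contain the same number of pixels")
    intersection = sum(p & t for p, t in zip(flat_pred, flat_truth))
    total = sum(flat_pred) + sum(flat_truth)
    if total == 0:
        return 1.0  # convention chosen here: two empty masks agree perfectly
    return 2.0 * intersection / total


def threshold_mask(prob_map, threshold=0.5):
    """Turn a per-pixel probability map into a binary mask."""
    return [[1 if p >= threshold else 0 for p in row] for row in prob_map]


def ensemble_mask(prob_maps, threshold=0.5):
    """Hypothetical ensemble: average the probability maps pixel by pixel, then threshold."""
    n = len(prob_maps)
    rows, cols = len(prob_maps[0]), len(prob_maps[0][0])
    mean_map = [
        [sum(m[i][j] for m in prob_maps) / n for j in range(cols)]
        for i in range(rows)
    ]
    return threshold_mask(mean_map, threshold)


# Toy data (invented for illustration): a human-drawn mask and two model outputs.
human = [[0, 1, 1],
         [0, 1, 1],
         [0, 0, 0]]
model_a = [[0.1, 0.9, 0.8],
           [0.2, 0.7, 0.4],
           [0.1, 0.2, 0.1]]
model_b = [[0.2, 0.8, 0.9],
           [0.1, 0.6, 0.8],
           [0.3, 0.1, 0.2]]

dice_a = dice_coefficient(threshold_mask(model_a), human)
dice_ensemble = dice_coefficient(ensemble_mask([model_a, model_b]), human)
print(f"model A: {dice_a:.4f}, ensemble: {dice_ensemble:.4f}")
```

In this toy case model A misses one edge pixel on its own, while averaging it with model B recovers that pixel, which is the kind of gain an ensemble is meant to provide.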
Identification of tidal features in deep optical galaxy images with Convolutional Neural Networks
Authors:
H. Domínguez Sánchez,
G. Martin,
I. Damjanov,
F. Buitrago,
M. Huertas-Company,
C. Bottrell,
M. Bernardi,
J. H. Knapen,
J. Vega-Ferrero,
R. Hausen,
E. Kado-Fong,
D. Población-Criado,
H. Souchereau,
O. K. Leste,
B. Robertson,
B. Sahelices,
K. V. Johnston
Abstract:
Interactions between galaxies leave distinguishable imprints in the form of tidal features, which hold important clues about their mass assembly. Unfortunately, these structures are difficult to detect because they are low-surface-brightness features, so deep observations are needed. Upcoming surveys promise an increase of several orders of magnitude in depth and sky coverage, for which automated methods for tidal feature detection will become mandatory. We test the ability of a convolutional neural network to reproduce human visual classifications of tidal features. We train on $\sim$6000 simulated images classified by professional astronomers. The mock Subaru Hyper Suprime-Cam (HSC) images include variations in redshift, projection angle, and limiting surface brightness ($\mu_{\rm lim}$ = 26-35 mag arcsec$^{-2}$). We obtain satisfactory results for the test sample, with accuracy, precision, and recall values of Acc=0.84, P=0.72, and R=0.85, respectively. While the accuracy and precision are roughly constant across all surface brightness limits, the recall (completeness) is significantly affected by image depth. The recovery rate depends strongly on the type of tidal feature: we recover all the images showing shell features and 87% of the tidal streams, whereas the fractions are below 75% for mergers, tidal tails, and bridges. When applied to real HSC images, the performance of the model worsens significantly. We speculate that this is due to the lack of realism of the simulations and take it as a warning against applying deep learning models to different data domains without prior testing on the actual data.
Submitted 6 March, 2023;
originally announced March 2023.
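The accuracy, precision, and recall values quoted in the abstract above follow from the four counts of a binary confusion matrix. The following is a minimal sketch of how they are computed. The function name and the toy labels are invented for illustration and do not reproduce the paper's data.

```python
def classification_metrics(y_true, y_pred):
    """Return (accuracy, precision, recall) for binary labels (1 = tidal feature present)."""
    if len(y_true) != len(y_pred):
        raise ValueError("label lists must have the same length")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    accuracy = (tp + tn) / len(y_true) if y_true else 0.0
    # Precision: fraction of predicted detections that are real.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall (completeness): fraction of real features that are recovered.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, precision, recall


# Toy labels (invented): human classification vs. network prediction.
human_labels = [1, 1, 1, 0, 0, 0, 0, 1]
cnn_labels = [1, 1, 0, 1, 0, 0, 0, 1]

acc, prec, rec = classification_metrics(human_labels, cnn_labels)
print(f"Acc={acc:.2f}, P={prec:.2f}, R={rec:.2f}")
```

Because recall depends only on the objects that truly host a feature, it is the metric that drops when faint features fall below the detection limit, consistent with the depth dependence described in the abstract.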