-
ATAT: Astronomical Transformer for time series And Tabular data
Authors:
G. Cabrera-Vives,
D. Moreno-Cartagena,
N. Astorga,
I. Reyes-Jainaga,
F. Förster,
P. Huijse,
J. Arredondo,
A. M. Muñoz Arancibia,
A. Bayo,
M. Catelan,
P. A. Estévez,
P. Sánchez-Sáez,
A. Álvarez,
P. Castellanos,
P. Gallardo,
A. Moya,
D. Rodriguez-Mancini
Abstract:
The advent of next-generation survey instruments, such as the Vera C. Rubin Observatory and its Legacy Survey of Space and Time (LSST), is opening a window for new research in time-domain astronomy. The Extended LSST Astronomical Time-Series Classification Challenge (ELAsTiCC) was created to test the capacity of brokers to deal with a simulated LSST stream. We describe ATAT, the Astronomical Transformer for time series And Tabular data, a classification model conceived by the ALeRCE alert broker to classify light-curves from next-generation alert streams. ATAT was tested in production during the first round of the ELAsTiCC campaigns. ATAT consists of two Transformer models that encode light curves and features using novel time modulation and quantile feature tokenizer mechanisms, respectively. ATAT was trained on different combinations of light curves, metadata, and features calculated over the light curves. We compare ATAT against the current ALeRCE classifier, a Balanced Hierarchical Random Forest (BHRF) trained on human-engineered features derived from light curves and metadata. When trained on light curves and metadata, ATAT achieves a macro F1-score of 82.9 +- 0.4 in 20 classes, outperforming the BHRF model trained on 429 features, which achieves a macro F1-score of 79.4 +- 0.1. The use of Transformer multimodal architectures, combining light curves and tabular data, opens new possibilities for classifying alerts from a new generation of large etendue telescopes, such as the Vera C. Rubin Observatory, in real-world brokering scenarios.
Submitted 16 May, 2024; v1 submitted 5 May, 2024;
originally announced May 2024.
-
Efficient Gravitational-Wave Model for Fully-Precessing and Moderately-Eccentric, Compact Binary Inspirals
Authors:
J. Nijaid Arredondo,
Antoine Klein,
Nicolás Yunes
Abstract:
Future gravitational-wave detectors, especially the Laser Interferometer Space Antenna (LISA), will be sensitive to black hole binaries formed in astrophysical environments that promote large eccentricities and spin precession. Gravitational-wave templates that include both effects have only recently begun to be developed. The Efficient Fully Precessing Eccentric (EFPE) family is one such model, covering the inspiral stage with small-eccentricity-expanded gravitational-wave amplitudes accurate for eccentricities $e < 0.3$. In this work, we extend this model to cover a larger range of eccentricities. The new EFPE_ME model is able to accurately represent the leading-order gravitational-wave amplitudes to $e \leq 0.8$. Comparing the EFPE and the EFPE_ME models in the LISA band, however, reveals that there is no significant difference when $e_0 \leq 0.5$ for binaries at 4 years before merger, as radiation reaction circularizes supermassive black hole binaries too quickly. This suggests that the EFPE model may have a larger regime of validity in eccentricity space than previously thought, making it suitable for some inspiral parameter estimation with LISA data. On the other hand, for systems with $e_0 > 0.5$, the deviations between the models are significant, particularly for binaries with total masses below $10^5\, \mathrm{M}_{\odot}$. This suggests that the EFPE_ME model will be crucial to avoid systematic bias in parameter estimation with LISA in the future, once this model has been hybridized to include the merger and ringdown.
Submitted 9 February, 2024;
originally announced February 2024.
-
Multi-scale stamps for real-time classification of alert streams
Authors:
Ignacio Reyes-Jainaga,
Francisco Förster,
Alejandra M. Muñoz Arancibia,
Guillermo Cabrera-Vives,
Amelia Bayo,
Franz E. Bauer,
Javier Arredondo,
Esteban Reyes,
Giuliano Pignata,
A. M. Mourão,
Javier Silva-Farfán,
Lluís Galbany,
Alex Álvarez,
Nicolás Astorga,
Pablo Castellanos,
Pedro Gallardo,
Alberto Moya,
Diego Rodríguez
Abstract:
In recent years, automatic classifiers of image cutouts (also called "stamps") have shown to be key for fast supernova discovery. The Vera C. Rubin Observatory will distribute about ten million alerts with their respective stamps each night, enabling the discovery of approximately one million supernovae each year. A growing source of confusion for these classifiers is the presence of satellite glints, sequences of point-like sources produced by rotating satellites or debris. The currently planned Rubin stamps will have a size smaller than the typical separation between these point sources. Thus, a larger field of view stamp could enable the automatic identification of these sources. However, the distribution of larger stamps would be limited by network bandwidth restrictions. We evaluate the impact of using image stamps of different angular sizes and resolutions for the fast classification of events (AGNs, asteroids, bogus, satellites, SNe, and variable stars), using data from the Zwicky Transient Facility. We compare four scenarios: three with the same number of pixels (small field of view with high resolution, large field of view with low resolution, and a multi-scale proposal) and a scenario with the full stamp that has a larger field of view and higher resolution. Compared to small field of view stamps, our multi-scale strategy reduces misclassifications of satellites as asteroids or supernovae, performing on par with high-resolution stamps that are 15 times heavier. We encourage Rubin and its Science Collaborations to consider the benefits of implementing multi-scale stamps as a possible update to the alert specification.
Submitted 14 July, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Persistent and occasional: searching for the variable population of the ZTF/4MOST sky using ZTF data release 11
Authors:
P. Sánchez-Sáez,
J. Arredondo,
A. Bayo,
P. Arévalo,
F. E. Bauer,
G. Cabrera-Vives,
M. Catelan,
P. Coppi,
P. A. Estévez,
F. Förster,
L. Hernández-García,
P. Huijse,
R. Kurtev,
P. Lira,
A. M. Muñoz Arancibia,
G. Pignata
Abstract:
We present a variability, color and morphology based classifier, designed to identify transients, persistently variable, and non-variable sources, from the Zwicky Transient Facility (ZTF) Data Release 11 (DR11) light curves of extended and point sources. The main motivation to develop this model was to identify active galactic nuclei (AGN) at different redshift ranges to be observed by the 4MOST ChANGES project. Still, it serves as a more general time-domain astronomy study. The model uses nine colors computed from CatWISE and PS1, a morphology score from PS1, and 61 single-band variability features computed from the ZTF DR11 g and r light curves. We trained two versions of the model, one for each ZTF band. We used a hierarchical local classifier per parent node approach, where each node was composed of a balanced random forest model. We adopted a 17-class taxonomy, including non-variable stars and galaxies, three transient classes, five classes of stochastic variables, and seven classes of periodic variables. The macro-averaged precision, recall and F1-score are 0.61, 0.75, and 0.62 for the g-band model, and 0.60, 0.74, and 0.61 for the r-band model. When grouping the four AGN classes into one single class, its precision, recall, and F1-score are 1.00, 0.95, and 0.97, respectively, for both the g and r bands. We applied the model to all the sources in the ZTF/4MOST overlapping sky, avoiding ZTF fields covering the Galactic bulge, including 86,576,577 light curves in the g-band and 140,409,824 in the r-band. Only 0.73% of the g-band light curves and 2.62% of the r-band light curves were classified as stochastic, periodic, or transient with high probability ($P_{init}\geq0.9$). We found that, in general, more reliable results are obtained when using the g-band model. Using the latter, we identified 384,242 AGN candidates, 287,156 of which have $P_{init}\geq0.9$.
Submitted 17 April, 2023;
originally announced April 2023.
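The macro-averaged precision, recall, and F1-score quoted in the abstract above can be illustrated with a short sketch. It assumes the standard definition, in which each metric is computed per class and then averaged with equal weight per class; the abstract does not spell out the exact formula, and the function name `macro_scores` and the toy labels are hypothetical.

```python
from collections import Counter


def macro_scores(y_true, y_pred):
    """Return macro-averaged (precision, recall, F1).

    Assumption: every class that appears in y_true gets equal weight,
    regardless of how many sources it contains.
    """
    classes = sorted(set(y_true))
    true_count = Counter(y_true)   # sources that really belong to each class
    pred_count = Counter(y_pred)   # sources assigned to each class
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    precisions, recalls, f1s = [], [], []
    for c in classes:
        prec = hits[c] / pred_count[c] if pred_count[c] else 0.0
        rec = hits[c] / true_count[c]
        f1 = 2 * prec * rec / (prec + rec) if (prec + rec) else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f1s.append(f1)

    n = len(classes)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


# Toy labels, for illustration only (not data from the paper).
y_true = ["AGN", "AGN", "SN", "SN", "RRL", "RRL"]
y_pred = ["AGN", "SN", "SN", "SN", "RRL", "AGN"]
precision, recall, f1 = macro_scores(y_true, y_pred)
```

Under this definition the macro F1-score is the mean of the per-class F1-scores, so it need not equal the harmonic mean of the macro precision and macro recall. That would be consistent with the g-band values quoted above (0.61, 0.75, 0.62), whose harmonic mean would be about 0.67.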
-
DELIGHT: Deep Learning Identification of Galaxy Hosts of Transients using Multi-resolution Images
Authors:
Francisco Förster,
Alejandra M. Muñoz Arancibia,
Ignacio Reyes,
Alexander Gagliano,
Dylan Britt,
Sara Cuellar-Carrillo,
Felipe Figueroa-Tapia,
Ava Polzin,
Yara Yousef,
Javier Arredondo,
Diego Rodríguez-Mancini,
Javier Correa-Orellana,
Amelia Bayo,
Franz E. Bauer,
Márcio Catelan,
Guillermo Cabrera-Vives,
Raya Dastidar,
Pablo A. Estévez,
Giuliano Pignata,
Lorena Hernandez-Garcia,
Pablo Huijse,
Esteban Reyes,
Paula Sánchez-Sáez,
Mauricio Ramirez,
Daniela Grandón
, et al. (3 additional authors not shown)
Abstract:
We present DELIGHT, or Deep Learning Identification of Galaxy Hosts of Transients, a new algorithm designed to automatically and in real-time identify the host galaxies of extragalactic transients. The proposed algorithm receives as input compact, multi-resolution images centered at the position of a transient candidate and outputs two-dimensional offset vectors that connect the transient with the center of its predicted host. The multi-resolution input consists of a set of images with the same number of pixels, but with progressively larger pixel sizes and fields of view. A sample of \nSample galaxies visually identified by the ALeRCE broker team was used to train a convolutional neural network regression model. We show that this method is able to correctly identify both relatively large ($10\arcsec < r < 60\arcsec$) and small ($r \le 10\arcsec$) apparent size host galaxies using much less information (32 kB) than with a large, single-resolution image (920 kB). The proposed method has fewer catastrophic errors in recovering the position and is more complete and has less contamination ($< 0.86\%$) recovering the cross-matched redshift than other state-of-the-art methods. The more efficient representation provided by multi-resolution input images could allow for the identification of transient host galaxies in real-time, if adopted in alert streams from new generation of large etendue telescopes such as the Vera C. Rubin Observatory.
Submitted 8 August, 2022;
originally announced August 2022.
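The multi-resolution input described in the abstract above (images with the same number of pixels but progressively larger pixel sizes and fields of view) can be sketched as follows. This is only an illustration, not the DELIGHT implementation: it assumes a square image, a factor of two between levels, and block averaging for the downsampling, none of which are stated in the abstract. All function names are hypothetical.

```python
def center_crop(image, size):
    """Cut a size x size window from the center of a square image."""
    start = (len(image) - size) // 2
    return [row[start:start + size] for row in image[start:start + size]]


def block_average(image, factor):
    """Downsample by averaging non-overlapping factor x factor blocks."""
    n = len(image) // factor
    out = []
    for i in range(n):
        row = []
        for j in range(n):
            total = 0.0
            for di in range(factor):
                for dj in range(factor):
                    total += image[i * factor + di][j * factor + dj]
            row.append(total / factor ** 2)
        out.append(row)
    return out


def multi_scale_stamps(image, out_size, n_levels):
    """Build n_levels stamps of out_size x out_size pixels each.

    Level k covers a field of view 2**k times wider than level 0, with
    pixels 2**k times larger (assumed scaling; requires an image side of
    at least out_size * 2**(n_levels - 1)).
    """
    stamps = []
    for level in range(n_levels):
        factor = 2 ** level
        crop = center_crop(image, out_size * factor)
        stamps.append(block_average(crop, factor))
    return stamps


# Toy 8x8 image whose pixel value encodes its position.
image = [[i * 8 + j for j in range(8)] for i in range(8)]
stamps = multi_scale_stamps(image, out_size=2, n_levels=3)
```

Every stamp in the result has the same 2x2 shape, so the storage cost grows with the number of levels rather than with the area of the widest field of view.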
-
Searching for changing-state AGNs in massive datasets -- I: applying deep learning and anomaly detection techniques to find AGNs with anomalous variability behaviours
Authors:
P. Sánchez-Sáez,
H. Lira,
L. Martí,
N. Sánchez-Pi,
J. Arredondo,
F. E. Bauer,
A. Bayo,
G. Cabrera-Vives,
C. Donoso-Oliva,
P. A. Estévez,
S. Eyheramendy,
F. Förster,
L. Hernández-García,
A. M. Muñoz Arancibia,
M. Pérez-Carrasco,
M. Sepúlveda,
J. R. Vergara
Abstract:
The classic classification scheme for Active Galactic Nuclei (AGNs) was recently challenged by the discovery of the so-called changing-state (changing-look) AGNs (CSAGNs). The physical mechanism behind this phenomenon is still a matter of open debate and the samples are too small and of serendipitous nature to provide robust answers. In order to tackle this problem, we need to design methods that are able to detect AGN right in the act of changing-state. Here we present an anomaly detection (AD) technique designed to identify AGN light curves with anomalous behaviors in massive datasets. The main aim of this technique is to identify CSAGN at different stages of the transition, but it can also be used for more general purposes, such as cleaning massive datasets for AGN variability analyses. We used light curves from the Zwicky Transient Facility data release 5 (ZTF DR5), containing a sample of 230,451 AGNs of different classes. The ZTF DR5 light curves were modeled with a Variational Recurrent Autoencoder (VRAE) architecture, that allowed us to obtain a set of attributes from the VRAE latent space that describes the general behaviour of our sample. These attributes were then used as features for an Isolation Forest (IF) algorithm, that is an anomaly detector for a "one class" kind of problem. We used the VRAE reconstruction errors and the IF anomaly score to select a sample of 8,809 anomalies. These anomalies are dominated by bogus candidates, but we were able to identify 75 promising CSAGN candidates.
Submitted 12 July, 2021; v1 submitted 14 June, 2021;
originally announced June 2021.
-
Neutron Stars in the Effective Fly-By Framework: $f$-Mode Re-summation
Authors:
Jose Nijaid Arredondo,
Nicholas Loutrel
Abstract:
Eccentric compact binaries pose not only a challenge for gravitational wave detectors, but also provide a probe into the nuclear equation of state if one of the objects is a neutron star. At the short pericenter passage, tidal interactions excite f-modes on the star, which in turn emit their own gravitational waves. We derive an analytic waveform for these stellar oscillations within the effective fly-by framework, modeling the emission to leading post-Newtonian order. At this order, the f-mode response can be written in a Fourier decomposition in terms of orbital harmonics, with the amplitudes of each harmonic depending on Hansen coefficients. Re-summing the harmonics of the f-mode results in a simple decaying harmonic oscillator, with the amplitude now determined by a Hansen coefficient of complex harmonic number. We compute the match between the re-summed f-mode and numerical integrations of the tidal response, and find ${\cal{M}} > 0.98$ for systems with high orbital eccentricity $(e > 0.8)$ and low semi-latus rectum $(p < 15M)$ for three equations of state. We further compare our model to modes generated from subsequent pericenter passages under the effect of radiation reaction, and develop an accurate model to time pericenter passages. We show how the timing model can be used to specify initial conditions to accurately track the f-mode excitation across multiple pericenter passages.
Submitted 8 February, 2022; v1 submitted 26 January, 2021;
originally announced January 2021.
-
Alert Classification for the ALeRCE Broker System: The Light Curve Classifier
Authors:
P. Sánchez-Sáez,
I. Reyes,
C. Valenzuela,
F. Förster,
S. Eyheramendy,
F. Elorrieta,
F. E. Bauer,
G. Cabrera-Vives,
P. A. Estévez,
M. Catelan,
G. Pignata,
P. Huijse,
D. De Cicco,
P. Arévalo,
R. Carrasco-Davis,
J. Abril,
R. Kurtev,
J. Borissova,
J. Arredondo,
E. Castillo-Navarrete,
D. Rodriguez,
D. Ruz-Mieres,
A. Moya,
L. Sabatini-Gacitúa,
C. Sepúlveda-Cobo
, et al. (1 additional author not shown)
Abstract:
We present the first version of the ALeRCE (Automatic Learning for the Rapid Classification of Events) broker light curve classifier. ALeRCE is currently processing the Zwicky Transient Facility (ZTF) alert stream, in preparation for the Vera C. Rubin Observatory. The ALeRCE light curve classifier uses variability features computed from the ZTF alert stream, and colors obtained from AllWISE and ZTF photometry. We apply a Balanced Random Forest algorithm with a two-level scheme, where the top level classifies each source as periodic, stochastic, or transient, and the bottom level further resolves each of these hierarchical classes, amongst 15 total classes. This classifier corresponds to the first attempt to classify multiple classes of stochastic variables (including core- and host-dominated active galactic nuclei, blazars, young stellar objects, and cataclysmic variables) in addition to different classes of periodic and transient sources, using real data. We created a labeled set using various public catalogs (such as the Catalina Surveys and Gaia DR2 variable stars catalogs, and the Million Quasars catalog), and we classify all objects with $\geq6$ $g$-band or $\geq6$ $r$-band detections in ZTF (868,371 sources as of 2020/06/09), providing updated classifications for sources with new alerts every day. For the top level we obtain macro-averaged precision and recall scores of 0.96 and 0.99, respectively, and for the bottom level we obtain macro-averaged precision and recall scores of 0.57 and 0.76, respectively. Updated classifications from the light curve classifier can be found at the ALeRCE Explorer website (http://alerce.online).
Submitted 19 November, 2020; v1 submitted 7 August, 2020;
originally announced August 2020.
-
Alert Classification for the ALeRCE Broker System: The Real-time Stamp Classifier
Authors:
Rodrigo Carrasco-Davis,
Esteban Reyes,
Camilo Valenzuela,
Francisco Förster,
Pablo A. Estévez,
Giuliano Pignata,
Franz E. Bauer,
Ignacio Reyes,
Paula Sánchez-Sáez,
Guillermo Cabrera-Vives,
Susana Eyheramendy,
Márcio Catelan,
Javier Arredondo,
Ernesto Castillo-Navarrete,
Diego Rodríguez-Mancini,
Daniela Ruz-Mieres,
Alberto Moya,
Luis Sabatini-Gacitúa,
Cristóbal Sepúlveda-Cobo,
Ashish A. Mahabal,
Javier Silva-Farfán,
Ernesto Camacho-Iñiquez,
Lluís Galbany
Abstract:
We present a real-time stamp classifier of astronomical events for the ALeRCE (Automatic Learning for the Rapid Classification of Events) broker. The classifier is based on a convolutional neural network, trained on alerts ingested from the Zwicky Transient Facility (ZTF). Using only the science, reference and difference images of the first detection as inputs, along with the metadata of the alert as features, the classifier is able to correctly classify alerts from active galactic nuclei, supernovae (SNe), variable stars, asteroids and bogus classes, with high accuracy ($\sim$94%) in a balanced test set. In order to find and analyze SN candidates selected by our classifier from the ZTF alert stream, we designed and deployed a visualization tool called SN Hunter, where relevant information about each possible SN is displayed for the experts to choose among candidates to report to the Transient Name Server database. From June 26th 2019 to February 28th 2021, we have reported 6846 SN candidates to date (11.8 candidates per day on average), of which 971 have been confirmed spectroscopically. Our ability to report objects using only a single detection means that 70% of the reported SNe occurred within one day after the first detection. ALeRCE has only reported candidates not otherwise detected or selected by other groups, therefore adding new early transients to the bulk of objects available for early follow-up. Our work represents an important milestone toward rapid alert classifications with the next generation of large etendue telescopes, such as the Vera C. Rubin Observatory.
Submitted 3 June, 2021; v1 submitted 7 August, 2020;
originally announced August 2020.
-
The Automatic Learning for the Rapid Classification of Events (ALeRCE) Alert Broker
Authors:
F. Förster,
G. Cabrera-Vives,
E. Castillo-Navarrete,
P. A. Estévez,
P. Sánchez-Sáez,
J. Arredondo,
F. E. Bauer,
R. Carrasco-Davis,
M. Catelan,
F. Elorrieta,
S. Eyheramendy,
P. Huijse,
G. Pignata,
E. Reyes,
I. Reyes,
D. Rodríguez-Mancini,
D. Ruz-Mieres,
C. Valenzuela,
I. Alvarez-Maldonado,
N. Astorga,
J. Borissova,
A. Clocchiatti,
D. De Cicco,
C. Donoso-Oliva,
M. J. Graham
, et al. (15 additional authors not shown)
Abstract:
We introduce the Automatic Learning for the Rapid Classification of Events (ALeRCE) broker, an astronomical alert broker designed to provide a rapid and self-consistent classification of large etendue telescope alert streams, such as that provided by the Zwicky Transient Facility (ZTF) and, in the future, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). ALeRCE is a Chilean-led broker run by an interdisciplinary team of astronomers and engineers, working to become intermediaries between survey and follow-up facilities. ALeRCE uses a pipeline which includes the real-time ingestion, aggregation, cross-matching, machine learning (ML) classification, and visualization of the ZTF alert stream. We use two classifiers: a stamp-based classifier, designed for rapid classification, and a light-curve-based classifier, which uses the multi-band flux evolution to achieve a more refined classification. We describe in detail our pipeline, data products, tools and services, which are made public for the community (see https://alerce.science). Since we began operating our real-time ML classification of the ZTF alert stream in early 2019, we have grown a large community of active users around the globe. We describe our results to date, including the real-time processing of $9.7\times10^7$ alerts, the stamp classification of $1.9\times10^7$ objects, the light curve classification of $8.5\times10^5$ objects, the report of 3088 supernova candidates, and different experiments using LSST-like alert streams. Finally, we discuss the challenges ahead to go from a single-stream of alerts such as ZTF to a multi-stream ecosystem dominated by LSST.
Submitted 7 August, 2020;
originally announced August 2020.