-
Novel bivariate autoregressive model for predicting and forecasting irregularly observed time series
Authors:
Felipe Elorrieta,
Susana Eyheramendy,
Wilfredo Palma,
Cesar Ojeda
Abstract:
In several disciplines it is common to find time series measured at irregular observational times. In particular, in astronomy there are a large number of surveys that gather information over irregular time gaps and in more than one passband. Some examples are Pan-STARRS, ZTF and also the LSST. However, current commonly used time series models that estimate the time dependency in astronomical light curves consider the information of each band separately (e.g., CIAR, IAR and CARMA models), disregarding the dependency that might exist between different passbands. In this paper we propose a novel bivariate model for irregularly sampled time series, called the bivariate irregular autoregressive (BIAR) model. The BIAR model assumes an autoregressive structure on each time series, is stationary, and allows estimation of the autocorrelation, the cross-correlation and the contemporary correlation between two unequally spaced time series. We implemented the BIAR model on light curves, in the g and r bands, obtained from the ZTF alerts processed by the ALeRCE broker. We show that if the light curves of the two bands are highly correlated, the bivariate model yields more accurate forecasts and predictions than a similar method that uses only univariate information. Further, the estimated parameters of the BIAR are useful to characterize long-period variable stars and to distinguish between classes of stochastic objects, providing promising features that can be used for classification purposes.
Submitted 25 April, 2021;
originally announced April 2021.
-
Alert Classification for the ALeRCE Broker System: The Light Curve Classifier
Authors:
P. Sánchez-Sáez,
I. Reyes,
C. Valenzuela,
F. Förster,
S. Eyheramendy,
F. Elorrieta,
F. E. Bauer,
G. Cabrera-Vives,
P. A. Estévez,
M. Catelan,
G. Pignata,
P. Huijse,
D. De Cicco,
P. Arévalo,
R. Carrasco-Davis,
J. Abril,
R. Kurtev,
J. Borissova,
J. Arredondo,
E. Castillo-Navarrete,
D. Rodriguez,
D. Ruz-Mieres,
A. Moya,
L. Sabatini-Gacitúa,
C. Sepúlveda-Cobo
, et al. (1 additional author not shown)
Abstract:
We present the first version of the ALeRCE (Automatic Learning for the Rapid Classification of Events) broker light curve classifier. ALeRCE is currently processing the Zwicky Transient Facility (ZTF) alert stream, in preparation for the Vera C. Rubin Observatory. The ALeRCE light curve classifier uses variability features computed from the ZTF alert stream, and colors obtained from AllWISE and ZTF photometry. We apply a Balanced Random Forest algorithm with a two-level scheme, where the top level classifies each source as periodic, stochastic, or transient, and the bottom level further resolves each of these hierarchical classes, amongst 15 total classes. This classifier corresponds to the first attempt to classify multiple classes of stochastic variables (including core- and host-dominated active galactic nuclei, blazars, young stellar objects, and cataclysmic variables) in addition to different classes of periodic and transient sources, using real data. We created a labeled set using various public catalogs (such as the Catalina Surveys and Gaia DR2 variable stars catalogs, and the Million Quasars catalog), and we classify all objects with $\geq6$ $g$-band or $\geq6$ $r$-band detections in ZTF (868,371 sources as of 2020/06/09), providing updated classifications for sources with new alerts every day. For the top level we obtain macro-averaged precision and recall scores of 0.96 and 0.99, respectively, and for the bottom level we obtain macro-averaged precision and recall scores of 0.57 and 0.76, respectively. Updated classifications from the light curve classifier can be found at the ALeRCE Explorer website (http://alerce.online).
Submitted 19 November, 2020; v1 submitted 7 August, 2020;
originally announced August 2020.
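The two-level scheme described in the abstract above can be illustrated with a small sketch. The abstract does not say how the two levels are combined; the code below assumes, purely for illustration, that the final probability of a class is the top-level probability of its branch multiplied by the bottom-level probability of the class within that branch. The function name `combine_two_level`, the subclass labels under "periodic" and "transient", and all numbers are hypothetical.

```python
def combine_two_level(top_probs, bottom_probs):
    """Combine hierarchical class probabilities.

    top_probs:    dict mapping branch name -> P(branch)
    bottom_probs: dict mapping branch name -> dict of P(class | branch)
    Returns a flat dict mapping class name -> P(branch) * P(class | branch).
    This multiplication rule is an assumption made for this sketch.
    """
    combined = {}
    for branch, class_probs in bottom_probs.items():
        for cls, prob in class_probs.items():
            combined[cls] = top_probs[branch] * prob
    return combined


# Hypothetical classifier outputs for a single source.
top = {"periodic": 0.2, "stochastic": 0.7, "transient": 0.1}
bottom = {
    "periodic": {"periodic_A": 0.6, "periodic_B": 0.4},      # placeholder labels
    "stochastic": {"blazar": 0.5, "young stellar object": 0.3,
                   "cataclysmic variable": 0.2},
    "transient": {"transient_A": 1.0},                        # placeholder label
}

final = combine_two_level(top, bottom)
best_class = max(final, key=final.get)
print(best_class, round(final[best_class], 3))
```

Because each bottom-level distribution sums to one within its branch, the combined probabilities also sum to one, so the flat output can be read as a single distribution over all classes.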
-
The Automatic Learning for the Rapid Classification of Events (ALeRCE) Alert Broker
Authors:
F. Förster,
G. Cabrera-Vives,
E. Castillo-Navarrete,
P. A. Estévez,
P. Sánchez-Sáez,
J. Arredondo,
F. E. Bauer,
R. Carrasco-Davis,
M. Catelan,
F. Elorrieta,
S. Eyheramendy,
P. Huijse,
G. Pignata,
E. Reyes,
I. Reyes,
D. Rodríguez-Mancini,
D. Ruz-Mieres,
C. Valenzuela,
I. Alvarez-Maldonado,
N. Astorga,
J. Borissova,
A. Clocchiatti,
D. De Cicco,
C. Donoso-Oliva,
M. J. Graham
, et al. (15 additional authors not shown)
Abstract:
We introduce the Automatic Learning for the Rapid Classification of Events (ALeRCE) broker, an astronomical alert broker designed to provide a rapid and self-consistent classification of large etendue telescope alert streams, such as that provided by the Zwicky Transient Facility (ZTF) and, in the future, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). ALeRCE is a Chilean-led broker run by an interdisciplinary team of astronomers and engineers, working to become intermediaries between survey and follow-up facilities. ALeRCE uses a pipeline which includes the real-time ingestion, aggregation, cross-matching, machine learning (ML) classification, and visualization of the ZTF alert stream. We use two classifiers: a stamp-based classifier, designed for rapid classification, and a light-curve-based classifier, which uses the multi-band flux evolution to achieve a more refined classification. We describe in detail our pipeline, data products, tools and services, which are made public for the community (see https://alerce.science). Since we began operating our real-time ML classification of the ZTF alert stream in early 2019, we have grown a large community of active users around the globe. We describe our results to date, including the real-time processing of $9.7\times10^7$ alerts, the stamp classification of $1.9\times10^7$ objects, the light curve classification of $8.5\times10^5$ objects, the report of 3088 supernova candidates, and different experiments using LSST-like alert streams. Finally, we discuss the challenges ahead to go from a single-stream of alerts such as ZTF to a multi-stream ecosystem dominated by LSST.
Submitted 7 August, 2020;
originally announced August 2020.
-
Discrete-time autoregressive model for unequally spaced time-series observations
Authors:
Felipe Elorrieta,
Susana Eyheramendy,
Wilfredo Palma
Abstract:
Most time-series models assume that the data come from observations that are equally spaced in time. However, this assumption does not hold in many diverse scientific fields, such as astronomy, finance, and climatology, among others. There are some techniques that fit unequally spaced time series, such as the continuous-time autoregressive moving average (CARMA) processes. These models are defined as the solution of a stochastic differential equation. In astronomical time series, it is not uncommon for the time gaps between observations to be large. Therefore, an alternative suitable approach to modeling astronomical time series with large gaps between observations should be based on the solution of a difference equation of a discrete process. In this work we propose a novel model to fit irregular time series called the complex irregular autoregressive (CIAR) model that is represented directly as a discrete-time process. We show that the model is weakly stationary and that it can be represented as a state-space system, allowing efficient maximum likelihood estimation based on the Kalman recursions. Furthermore, we show via Monte Carlo simulations that the finite sample performance of the parameter estimation is accurate. The proposed methodology is applied to light curves from periodic variable stars, illustrating how the model can be implemented to detect poor adjustment of the harmonic model. This can occur when the period has not been accurately estimated or when the variable stars are multiperiodic. Last, we show how the CIAR model, through its state-space representation, allows unobserved measurements to be forecast.
Submitted 26 June, 2019;
originally announced June 2019.
-
An irregular discrete time series model to identify residuals with autocorrelation in astronomical light curves
Authors:
Susana Eyheramendy,
Felipe Elorrieta,
Wilfredo Palma
Abstract:
Time series observations are ubiquitous in astronomy, and are generated to distinguish between different types of supernovae, to detect and characterize extrasolar planets and to classify variable stars. These time series are usually modeled using a parametric and/or physical model that assumes independent and homoscedastic errors, but in many cases these assumptions are not accurate and there remains a temporal dependency structure in the errors. This can occur, for example, when the proposed model cannot explain all the variability of the data or when the parameters of the model are not properly estimated. In this work we define an autoregressive model for irregular discrete-time series, based on the discrete-time representation of the continuous autoregressive model of order 1. We show that the model is ergodic and stationary. We further propose a maximum likelihood estimation procedure and assess the finite sample performance by Monte Carlo simulations. We implement the model on real and simulated data from Gaussian as well as other distributions, showing that the model can flexibly adapt to different data distributions. We apply the irregular autoregressive model to the residuals of a transit of an extrasolar planet to illustrate errors that remain with temporal structure. We also apply this model to residuals of a harmonic fit of light curves from variable stars to illustrate how the model can be used to detect incorrect parameter estimation.
Submitted 11 September, 2018;
originally announced September 2018.
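As a rough illustration of the idea in the abstract above, the sketch below simulates and fits a first-order autoregressive process on irregular observation times. It assumes the standard exact discretization of a continuous autoregressive model of order 1, y(t_j) = phi^(t_j - t_{j-1}) y(t_{j-1}) + sigma sqrt(1 - phi^(2 (t_j - t_{j-1}))) eps_j with standard Gaussian eps_j and 0 < phi < 1; whether this matches the paper's exact parameterization is an assumption. The function names, the grid-search estimator, and the simulated sampling times are hypothetical choices made for this example, not the authors' implementation.

```python
import math
import random


def simulate_iar(times, phi, sigma=1.0, seed=None):
    """Simulate the assumed irregular AR(1) process at the given times."""
    rng = random.Random(seed)
    y = [sigma * rng.gauss(0.0, 1.0)]           # stationary start, variance sigma^2
    for prev_t, t in zip(times, times[1:]):
        decay = phi ** (t - prev_t)             # memory shrinks as the gap grows
        innovation_sd = sigma * math.sqrt(1.0 - decay ** 2)
        y.append(decay * y[-1] + innovation_sd * rng.gauss(0.0, 1.0))
    return y


def neg_log_likelihood(phi, times, y, sigma=1.0):
    """Gaussian negative log-likelihood of the assumed model (sigma known)."""
    nll = 0.5 * (math.log(2.0 * math.pi * sigma ** 2) + y[0] ** 2 / sigma ** 2)
    for j in range(1, len(y)):
        decay = phi ** (times[j] - times[j - 1])
        var = sigma ** 2 * (1.0 - decay ** 2)   # conditional variance of y[j]
        resid = y[j] - decay * y[j - 1]         # one-step prediction error
        nll += 0.5 * (math.log(2.0 * math.pi * var) + resid ** 2 / var)
    return nll


def fit_phi(times, y, sigma=1.0, steps=99):
    """Maximum likelihood estimate of phi by a simple grid search on (0, 1)."""
    grid = [(k + 1) / (steps + 1) for k in range(steps)]
    return min(grid, key=lambda p: neg_log_likelihood(p, times, y, sigma))


# Irregular sampling times with gaps drawn uniformly from [0.2, 3.0].
gap_rng = random.Random(1)
times = [0.0]
for _ in range(1999):
    times.append(times[-1] + gap_rng.uniform(0.2, 3.0))

series = simulate_iar(times, phi=0.8, seed=2)
print("estimated phi:", fit_phi(times, series))
```

A grid search keeps the example dependency-free; a numerical optimizer would be the more usual choice in practice. Applied to the residuals of a fitted light-curve model, an estimate of phi well above zero would suggest remaining temporal structure, which is the diagnostic use the abstract describes.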
-
A Near-infrared RR Lyrae census along the southern Galactic plane: the Milky Way's stellar fossil brought to light
Authors:
István Dékány,
Gergely Hajdu,
Eva K. Grebel,
Márcio Catelan,
Felipe Elorrieta,
Susana Eyheramendy,
Daniel Majaess,
Andrés Jordán
Abstract:
RR Lyrae stars (RRLs) are tracers of the Milky Way's fossil record, holding valuable information on its formation and early evolution. Owing to the high interstellar extinction endemic to the Galactic plane, distant RRLs lying at low Galactic latitudes have been elusive. We attained a census of 1892 high-confidence RRLs by exploiting the near-infrared photometric database of the VVV survey's disk footprint spanning $\sim$70$^\circ$ of Galactic longitude, using a machine-learned classifier. Novel data-driven methods were employed to accurately characterize their spatial distribution using sparsely sampled multi-band photometry. The RRL metallicity distribution function (MDF) was derived from their $K_s$-band light curve parameters using machine-learning methods. The MDF shows remarkable structural similarities to both the spectroscopic MDF of red clump giants and the MDF of bulge RRLs. We model the MDF with a multi-component density distribution and find that the number density of stars associated with the different model components systematically changes with both the Galactocentric radius and vertical distance from the Galactic plane, equivalent to weak metallicity gradients. Based on the consistency with results from the ARGOS survey, three MDF modes are attributed to the old disk populations, while the most metal-poor RRLs are probably halo interlopers. We propose that the dominant [Fe/H] component with a mean of $-1$ dex might correspond to the outskirts of an ancient Galactic spheroid or classical bulge component residing in the central Milky Way. The physical origins of the RRLs in this study need to be verified by kinematical information.
Submitted 4 April, 2018;
originally announced April 2018.
-
A machine learned classifier for RR Lyrae in the VVV survey
Authors:
Felipe Elorrieta,
Susana Eyheramendy,
Andrés Jordán,
István Dékány,
Márcio Catelan,
Rodolfo Angeloni,
Javier Alonso-García,
Rodrigo Contreras-Ramos,
Felipe Gran,
Gergely Hajdu,
Néstor Espinoza,
Roberto K. Saito,
Dante Minniti
Abstract:
Variable stars of RR Lyrae type are a prime tool to obtain distances to old stellar populations in the Milky Way, and one of the main aims of the Vista Variables in the Via Lactea (VVV) near-infrared survey is to use them to map the structure of the Galactic Bulge. Due to the large number of expected sources, this requires an automated mechanism for selecting RR Lyrae, and particularly those of the more easily recognized type ab (i.e., fundamental-mode pulsators), from the 10^6-10^7 variables expected in the VVV survey area. In this work we describe a supervised machine-learned classifier constructed for assigning a score to a K_s-band VVV light curve that indicates its likelihood of being an ab-type RR Lyrae. We describe the key steps in the construction of the classifier, which were the choice of features, training set, selection of aperture and family of classifiers. We find that the AdaBoost family of classifiers consistently gives the best performance for our problem, and obtain a classifier based on the AdaBoost algorithm that achieves a harmonic mean between false positives and false negatives of ~7% for typical VVV light curve sets. This performance is estimated using cross-validation and through the comparison to two independent datasets that were classified by human experts.
Submitted 18 October, 2016;
originally announced October 2016.
-
Mapping the outer bulge with RRab stars from the VVV Survey
Authors:
F. Gran,
D. Minniti,
R. K. Saito,
M. Zoccali,
O. A. Gonzalez,
C. Navarrete,
M. Catelan,
R. Contreras Ramos,
F. Elorrieta,
S. Eyheramendy,
A. Jordán
Abstract:
The VISTA Variables in the Vía Láctea (VVV) is a near-IR time-domain survey of the Galactic bulge and southern plane. One of the main goals of this survey is to reveal the 3D structure of the Milky Way through its variable stars. In particular, RR Lyrae stars have been discovered in large numbers in the inner regions of the bulge ($-8^\circ \lesssim b \lesssim -1^\circ$) by optical surveys such as OGLE and MACHO, leaving an unexplored window of more than $\sim 47$ sq deg ($-10.0^\circ \lesssim \ell \lesssim +10.7^\circ$ and $-10.3^\circ \lesssim b \lesssim -8.0^\circ$) observed by the VVV Survey. Our goal is to characterize the RR Lyrae stars in the outer bulge in terms of their periods, amplitudes, Fourier coefficients, and distances, in order to evaluate the 3D structure of the bulge in this area. The distance distribution of RR Lyrae stars will be compared to that of red clump stars, which are known to trace an X-shaped structure, in order to determine if these two different stellar populations share the same Galactic distribution. We report the detection of more than 1000 RR Lyrae ab-type stars in the VVV Survey located in the outskirts of the Galactic bulge. Some of them are possibly associated with the Sagittarius Dwarf Spheroidal Galaxy. We calculated colors, reddening, extinction, and distances of the detected RR Lyrae stars in order to determine the outer bulge 3D structure. Our main result is that, at the low Galactic latitudes mapped here, the RR Lyrae stars trace a centrally concentrated spheroidal distribution. This is a noticeably different spatial distribution from the one traced by red clump stars, which are known to follow a bar and X-shaped structure. We estimate the completeness of our RRab sample at $80\%$ for $K_{\rm s}\lesssim15$ mag.
Submitted 5 April, 2016;
originally announced April 2016.