-
A data science platform to enable time-domain astronomy
Authors:
Michael W. Coughlin,
Joshua S. Bloom,
Guy Nir,
Sarah Antier,
Theophile Jegou du Laz,
Stéfan van der Walt,
Arien Crellin-Quick,
Thomas Culino,
Dmitry A. Duev,
Daniel A. Goldstein,
Brian F. Healy,
Viraj Karambelkar,
Jada Lilleboe,
Kyung Min Shin,
Leo P. Singer,
Tomas Ahumada,
Shreya Anand,
Eric C. Bellm,
Richard Dekany,
Matthew J. Graham,
Mansi M. Kasliwal,
Ivona Kostadinova,
R. Weizmann Kiendrebeogo,
Shrinivas R. Kulkarni,
Sydney Jenkins
, et al. (28 additional authors not shown)
Abstract:
SkyPortal is an open-source software package designed to efficiently discover interesting transients, manage follow-up, perform characterization, and visualize the results. By enabling fast access to archival and catalog data, cross-matching heterogeneous data streams, and the triggering and monitoring of on-demand observations for further characterization, a SkyPortal-based platform has been oper…
▽ More
SkyPortal is an open-source software package designed to efficiently discover interesting transients, manage follow-up, perform characterization, and visualize the results. By enabling fast access to archival and catalog data, cross-matching heterogeneous data streams, and the triggering and monitoring of on-demand observations for further characterization, a SkyPortal-based platform has been operating at scale for 2 yr for the Zwicky Transient Facility Phase II community, with hundreds of users, containing tens of millions of time-domain sources, interacting with dozens of telescopes, and enabling community reporting. While SkyPortal emphasizes rich user experiences (UX) across common frontend workflows, recognizing that scientific inquiry is increasingly performed programmatically, SkyPortal also surfaces an extensive and well-documented API system. From backend and frontend software to data science analysis tools and visualization frameworks, the SkyPortal design emphasizes the re-use and leveraging of best-in-class approaches, with a strong extensibility ethos. For instance, SkyPortal now leverages ChatGPT large-language models (LLMs) to automatically generate and surface source-level human-readable summaries. With the imminent re-start of the next-generation of gravitational wave detectors, SkyPortal now also includes dedicated multi-messenger features addressing the requirements of rapid multi-messenger follow-up: multi-telescope management, team/group organizing interfaces, and cross-matching of multi-messenger data streams with time-domain optical surveys, with interfaces sufficiently intuitive for the newcomers to the field. (abridged)
△ Less
Submitted 14 June, 2023; v1 submitted 28 April, 2023;
originally announced May 2023.
-
SN 2019zrk, a bright SN 2009ip analog with a precursor
Authors:
Claes Fransson,
Jesper Sollerman,
Nora L. Strotjohann,
Sheng Yang,
Steve Schulze,
Cristina Barbarino,
Erik C. Kool,
Eran O. Ofek,
Arien Crellin-Quick,
Kishalay De,
Andrew J. Drake,
Christoffer Fremling,
Avishay Gal-Yam,
Anna Y. Q. Ho,
Mansi M. Kasliwal
Abstract:
We present photometric and spectroscopic observations of the Type IIn supernova SN 2019zrk (also known as ZTF20aacbyec). The SN shows a $\gtrsim$ 100 day precursor, with a slow rise, followed by a rapid rise to M $\sim -19.2$ in the $r$ and $g$ bands. The post-peak light-curve decline is well fit with an exponential decay with a timescale of $\sim 39$ days, but it shows prominent undulations, with…
▽ More
We present photometric and spectroscopic observations of the Type IIn supernova SN 2019zrk (also known as ZTF20aacbyec). The SN shows a $\gtrsim$ 100 day precursor, with a slow rise, followed by a rapid rise to M $\sim -19.2$ in the $r$ and $g$ bands. The post-peak light-curve decline is well fit with an exponential decay with a timescale of $\sim 39$ days, but it shows prominent undulations, with an amplitude of $\sim 1$ mag. Both the light curve and spectra are dominated by an interaction with a dense circumstellar medium (CSM), probably from previous mass ejections. The spectra evolve from a scattering-dominated Type IIn spectrum to a spectrum with strong P-Cygni absorptions. The expansion velocity is high, $\sim 16,000$ km s$^{-1}$, even in the last spectra. The last spectrum $\sim 110$ days after the main eruption reveals no evidence for advanced nucleosynthesis. From analysis of the spectra and light curves, we estimate the mass-loss rate to be $\sim 4 \times 10^{-2}$ M$_\odot$ yr$^{-1}$ for a CSM velocity of 100 km s$^{-1}$, and a CSM mass of $\gtrsim 1$ M$_\odot$. We find strong similarities for both the precursor, general light curve, and spectral evolution with SN 2009ip and similar SNe, although SN 2019zrk displays a brighter peak magnitude. Different scenarios for the nature of the 09ip-class of SNe, based on pulsational pair instability eruptions, wave heating, and mergers, are discussed. }
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
HEALPix Alchemy: Fast All-Sky Geometry and Image Arithmetic in a Relational Database for Multimessenger Astronomy Brokers
Authors:
Leo P. Singer,
B. Parazin,
Michael W. Coughlin,
Joshua S. Bloom,
Arien Crellin-Quick,
Daniel A. Goldstein,
Stéfan van der Walt
Abstract:
Efficient searches for electromagnetic counterparts to gravitational wave, high-energy neutrino, and gamma-ray burst events demand rapid processing of image arithmetic and geometry set operations in a database to cross-match galaxy catalogs, observation footprints, and all-sky images. Here we introduce HEALPix Alchemy, an open-source, pure Python implementation of a set of methods that enables rap…
▽ More
Efficient searches for electromagnetic counterparts to gravitational wave, high-energy neutrino, and gamma-ray burst events demand rapid processing of image arithmetic and geometry set operations in a database to cross-match galaxy catalogs, observation footprints, and all-sky images. Here we introduce HEALPix Alchemy, an open-source, pure Python implementation of a set of methods that enables rapid all-sky geometry calculations. HEALPix Alchemy is built upon HEALPix, a spatial indexing strategy that is widely used in astronomical databases as well as the native format of LIGO-Virgo-KAGRA gravitational-wave sky localization maps. Our approach leverages new multirange types built into the PostgreSQL 14 database engine. This enables fast all-sky queries against probabilistic multimessenger event localizations and telescope survey footprints. Questions such as "What are the galaxies contained within the 90% credible region of an event?" and "What is the rank-ordered list of the fields within an observing footprint with the highest probability of containing the event?" can be performed in less than a few seconds on commodity hardware using off-the-shelf cloud-managed database implementations without server-side database extensions. Common queries scale roughly linearly with the number of telescope pointings. As the number of fields grows into the hundreds or thousands, HEALPix Alchemy is orders of magnitude faster than other implementations. HEALPix Alchemy is now used as the spatial geometry engine within SkyPortal, which forms the basis of the Zwicky Transient Facility transient marshal, called Fritz.
△ Less
Submitted 27 April, 2022; v1 submitted 13 December, 2021;
originally announced December 2021.
-
Fast-transient Searches in Real Time with ZTFReST: Identification of Three Optically-discovered Gamma-ray Burst Afterglows and New Constraints on the Kilonova Rate
Authors:
Igor Andreoni,
Michael W. Coughlin,
Erik C. Kool,
Mansi M. Kasliwal,
Harsh Kumar,
Varun Bhalerao,
Ana Sagués Carracedo,
Anna Y. Q. Ho,
Peter T. H. Pang,
Divita Saraogi,
Kritti Sharma,
Vedant Shenoy,
Eric Burns,
Tomás Ahumada,
Shreya Anand,
Leo P. Singer,
Daniel A. Perley,
Kishalay De,
U. C. Fremling,
Eric C. Bellm,
Mattia Bulla,
Arien Crellin-Quick,
Tim Dietrich,
Andrew Drake,
Dmitry A. Duev
, et al. (10 additional authors not shown)
Abstract:
While optical surveys regularly discover slow transients like supernovae on their own, the most common way to discover extragalactic fast transients, fading away in a few nights, is via follow-up observations of gamma-ray burst and gravitational-wave triggers. However, wide-field surveys have the potential to also identify rapidly fading transients independently of such external triggers. The volu…
▽ More
While optical surveys regularly discover slow transients like supernovae on their own, the most common way to discover extragalactic fast transients, fading away in a few nights, is via follow-up observations of gamma-ray burst and gravitational-wave triggers. However, wide-field surveys have the potential to also identify rapidly fading transients independently of such external triggers. The volumetric survey speed of the Zwicky Transient Facility (ZTF) makes it sensitive to faint and fast-fading objects as kilonovae, the optical counterparts to binary neutron stars and neutron star-black hole mergers, out to almost 200Mpc. We introduce an open-source software infrastructure, the ZTF REaltime Search and Triggering, ZTFReST, designed to identify kilonovae and fast optical transients in ZTF data. Using the ZTF alert stream combined with forced photometry, we have implemented automated candidate ranking based on their photometric evolution and fitting to kilonova models. Automated triggering of follow-up systems, such as Las Cumbres Observatory, has also been implemented. In 13 months of science validation, we found several extragalactic fast transients independent of any external trigger (though some counterparts were identified later), including at least one supernova with post-shock cooling emission, two known afterglows with an associated gamma-ray burst, two known afterglows without any known gamma-ray counterpart, and three new fast-declining sources (ZTF20abtxwfx, ZTF20acozryr, and ZTF21aagwbjr) that are likely associated with GRB200817A, GRB201103B, and GRB210204A. However, we have not found any objects which appear to be kilonovae; therefore, we constrain the rate of GW170817-like kilonovae to $R < 900$Gpc$^{-3}$yr$^{-1}$. A framework such as ZTFReST could become a prime tool for kilonova and fast transient discovery with the Vera C. Rubin Observatory.
△ Less
Submitted 13 April, 2021;
originally announced April 2021.
-
Construction of a Calibrated Probabilistic Classification Catalog: Application to 50k Variable Sources in the All-Sky Automated Survey
Authors:
Joseph W. Richards,
Dan L. Starr,
Adam A. Miller,
Joshua S. Bloom,
Nathaniel R. Butler,
Henrik Brink,
Arien Crellin-Quick
Abstract:
With growing data volumes from synoptic surveys, astronomers must become more abstracted from the discovery and introspection processes. Given the scarcity of follow-up resources, there is a particularly sharp onus on the frameworks that replace these human roles to provide accurate and well-calibrated probabilistic classification catalogs. Such catalogs inform the subsequent follow-up, allowing c…
▽ More
With growing data volumes from synoptic surveys, astronomers must become more abstracted from the discovery and introspection processes. Given the scarcity of follow-up resources, there is a particularly sharp onus on the frameworks that replace these human roles to provide accurate and well-calibrated probabilistic classification catalogs. Such catalogs inform the subsequent follow-up, allowing consumers to optimize the selection of specific sources for further study and permitting rigorous treatment of purities and efficiencies for population studies. Here, we describe a process to produce a probabilistic classification catalog of variability with machine learning from a multi-epoch photometric survey. In addition to producing accurate classifications, we show how to estimate calibrated class probabilities, and motivate the importance of probability calibration. We also introduce a methodology for feature-based anomaly detection, which allows discovery of objects in the survey that do not fit within the predefined class taxonomy. Finally, we apply these methods to sources observed by the All Sky Automated Survey (ASAS), and unveil the Machine-learned ASAS Classification Catalog (MACC), which is a 28-class probabilistic classification catalog of 50,124 ASAS sources. We estimate that MACC achieves a sub-20% classification error rate, and demonstrate that the class posterior probabilities are reasonably calibrated. MACC classifications compare favorably to the classifications of several previous domain-specific ASAS papers and to the ASAS Catalog of Variable Stars, which had classified only 24% of those sources into one of 12 science classes. The MACC is publicly available at http://www.bigmacc.info.
△ Less
Submitted 24 April, 2012; v1 submitted 18 April, 2012;
originally announced April 2012.
-
On Machine-Learned Classification of Variable Stars with Sparse and Noisy Time-Series Data
Authors:
Joseph W. Richards,
Dan L. Starr,
Nathaniel R. Butler,
Joshua S. Bloom,
John M. Brewer,
Arien Crellin-Quick,
Justin Higgins,
Rachel Kennedy,
Maxime Rischard
Abstract:
With the coming data deluge from synoptic surveys, there is a growing need for frameworks that can quickly and automatically produce calibrated classification probabilities for newly-observed variables based on a small number of time-series measurements. In this paper, we introduce a methodology for variable-star classification, drawing from modern machine-learning techniques. We describe how to h…
▽ More
With the coming data deluge from synoptic surveys, there is a growing need for frameworks that can quickly and automatically produce calibrated classification probabilities for newly-observed variables based on a small number of time-series measurements. In this paper, we introduce a methodology for variable-star classification, drawing from modern machine-learning techniques. We describe how to homogenize the information gleaned from light curves by selection and computation of real-numbered metrics ("feature"), detail methods to robustly estimate periodic light-curve features, introduce tree-ensemble methods for accurate variable star classification, and show how to rigorously evaluate the classification results using cross validation. On a 25-class data set of 1542 well-studied variable stars, we achieve a 22.8% overall classification error using the random forest classifier; this represents a 24% improvement over the best previous classifier on these data. This methodology is effective for identifying samples of specific science classes: for pulsational variables used in Milky Way tomography we obtain a discovery efficiency of 98.2% and for eclipsing systems we find an efficiency of 99.1%, both at 95% purity. We show that the random forest (RF) classifier is superior to other machine-learned methods in terms of accuracy, speed, and relative immunity to features with no useful class information; the RF classifier can also be used to estimate the importance of each feature in classification. Additionally, we present the first astronomical use of hierarchical classification methods to incorporate a known class taxonomy in the classifier, which further reduces the catastrophic error rate to 7.8%. Excluding low-amplitude sources, our overall error rate improves to 14%, with a catastrophic error rate of 3.5%.
△ Less
Submitted 10 January, 2011;
originally announced January 2011.