-
DESI Constraints on Exponential Quintessence
Authors:
Omar F. Ramadan,
Jeremy Sakstein,
David Rubin
Abstract:
The DESI collaboration have recently analyzed their first year of data, finding a preference for thawing dark energy scenarios when using parameterized equations of state for dark energy. We investigate whether this preference persists when the data is analyzed within the context of a well-studied field theory model of thawing dark energy, exponential quintessence. No preference for this model ove…
▽ More
The DESI collaboration have recently analyzed their first year of data, finding a preference for thawing dark energy scenarios when using parameterized equations of state for dark energy. We investigate whether this preference persists when the data is analyzed within the context of a well-studied field theory model of thawing dark energy, exponential quintessence. No preference for this model over $Λ$CDM is found, and both models are poorer fits to the data than the Chevallier-Polarski-Linder $w_0$--$w_a$ parameterization. We demonstrate that the worse fit is due to a lack of sharp features in the potential that results in a slowly-evolving dark energy equation of state that does not have enough freedom to simultaneously fit the combination of the supernovae, DESI, and cosmic microwave background data. Our analysis provides guidance for constructing dynamical dark energy models that are able to better accommodate the data.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
The DEHVILS in the Details: Type Ia Supernova Hubble Residual Comparisons and Mass Step Analysis in the Near-Infrared
Authors:
Erik R. Peterson,
Daniel Scolnic,
David O. Jones,
Aaron Do,
Brodie Popovic,
Adam G. Riess,
Arianna Dwomoh,
Joel Johansson,
David Rubin,
Bruno O. Sánchez,
Benjamin J. Shappee,
John L. Tonry,
R. Brent Tully,
Maria Vincenzi
Abstract:
Measurements of Type Ia Supernovae (SNe Ia) in the near-infrared (NIR) have been used both as an alternate path to cosmology compared to optical measurements and as a method of constraining key systematics for the larger optical studies. With the DEHVILS sample, the largest published NIR sample with consistent NIR coverage of maximum light across three NIR bands ($Y$, $J$, and $H$), we check three…
▽ More
Measurements of Type Ia Supernovae (SNe Ia) in the near-infrared (NIR) have been used both as an alternate path to cosmology compared to optical measurements and as a method of constraining key systematics for the larger optical studies. With the DEHVILS sample, the largest published NIR sample with consistent NIR coverage of maximum light across three NIR bands ($Y$, $J$, and $H$), we check three key systematics: (i) the reduction in Hubble residual scatter as compared to the optical, (ii) the measurement of a "mass step" or lack thereof and its implications, and (iii) the ability to distinguish between various dust models by analyzing correlations between Hubble residuals in the NIR and optical. We produce accurate simulations of the DEHVILS sample and find, contrary to assumptions in the literature, it is $\textit{harder}$ to differentiate between various dust models than previously understood. Additionally, we find that fitting with the current SALT3 model does not yield accurate wavelength-dependent stretch-luminosity correlations, and we propose a limited solution for this problem. From the data, we see that (i) the standard deviation of Hubble residual values from NIR bands treated as standard candles are 0.007-0.042 mag smaller than those in the optical, (ii) the NIR mass step is not constrainable with the current sample size from DEHVILS, and (iii) Hubble residuals in the NIR and optical are correlated in both the simulations and the data. We test a few variations on the number and combinations of filters and data samples, and we observe that none of our findings or conclusions are significantly impacted by these modifications.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Hawai`i Supernova Flows: A Peculiar Velocity Survey Using Over a Thousand Supernovae in the Near-Infrared
Authors:
Aaron Do,
Benjamin J. Shappee,
Thomas de Jaeger,
David Rubin,
R. Brent Tully,
John L. Tonry,
Erik R. Peterson,
David O. Jones,
Dan Scolnic,
Christopher R. Burns,
Kaisey S. Mandel
Abstract:
We introduce the Hawai`i Supernova Flows project and present summary statistics of the first 1218 astronomical transients observed, 669 of which are spectroscopically classified Type Ia Supernovae (SNe Ia). Our project is designed to obtain systematics-limited distances to SNe Ia while consuming minimal dedicated observational resources. This growing sample will provide increasing resolution into…
▽ More
We introduce the Hawai`i Supernova Flows project and present summary statistics of the first 1218 astronomical transients observed, 669 of which are spectroscopically classified Type Ia Supernovae (SNe Ia). Our project is designed to obtain systematics-limited distances to SNe Ia while consuming minimal dedicated observational resources. This growing sample will provide increasing resolution into peculiar velocities as a function of position on the sky and redshift, allowing us to more accurately map the structure of dark matter. This can be used to derive cosmological parameters such as $σ_8$ and can be compared with large scale flow maps from other methods such as luminosity-line width or luminosity-velocity dispersion correlations in galaxies. Additionally, our photometry will provide a valuable test bed for analyses of SNe Ia incorporating near-infrared data. In this survey paper, we describe the methodology used to select targets, collect and reduce data, and calculate distances.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Detailed Report on the Measurement of the Positive Muon Anomalous Magnetic Moment to 0.20 ppm
Authors:
D. P. Aguillard,
T. Albahri,
D. Allspach,
A. Anisenkov,
K. Badgley,
S. Baeßler,
I. Bailey,
L. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
E. Barzi,
F. Bedeschi,
M. Berz,
M. Bhattacharya,
H. P. Binney,
P. Bloom,
J. Bono,
E. Bottalico,
T. Bowcock,
S. Braun,
M. Bressler,
G. Cantatore,
R. M. Carey,
B. C. K. Casey
, et al. (168 additional authors not shown)
Abstract:
We present details on a new measurement of the muon magnetic anomaly, $a_μ= (g_μ-2)/2$. The result is based on positive muon data taken at Fermilab's Muon Campus during the 2019 and 2020 accelerator runs. The measurement uses $3.1$ GeV$/c$ polarized muons stored in a $7.1$-m-radius storage ring with a $1.45$ T uniform magnetic field. The value of $ a_μ$ is determined from the measured difference b…
▽ More
We present details on a new measurement of the muon magnetic anomaly, $a_μ= (g_μ-2)/2$. The result is based on positive muon data taken at Fermilab's Muon Campus during the 2019 and 2020 accelerator runs. The measurement uses $3.1$ GeV$/c$ polarized muons stored in a $7.1$-m-radius storage ring with a $1.45$ T uniform magnetic field. The value of $ a_μ$ is determined from the measured difference between the muon spin precession frequency and its cyclotron frequency. This difference is normalized to the strength of the magnetic field, measured using Nuclear Magnetic Resonance (NMR). The ratio is then corrected for small contributions from beam motion, beam dispersion, and transient magnetic fields. We measure $a_μ= 116 592 057 (25) \times 10^{-11}$ (0.21 ppm). This is the world's most precise measurement of this quantity and represents a factor of $2.2$ improvement over our previous result based on the 2018 dataset. In combination, the two datasets yield $a_μ(\text{FNAL}) = 116 592 055 (24) \times 10^{-11}$ (0.20 ppm). Combining this with the measurements from Brookhaven National Laboratory for both positive and negative muons, the new world average is $a_μ$(exp) $ = 116 592 059 (22) \times 10^{-11}$ (0.19 ppm).
△ Less
Submitted 22 May, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
Conditionally Affinely Invariant Rerandomization and its Admissibility
Authors:
Zhen Zhong,
Donald Rubin
Abstract:
Rerandomization utilizes modern computing ability to search for covariate balance improved experimental design while adhering to the randomization principle originally advocated by RA Fisher. Conditionally affinely invariant rerandomization has the ``Equal Percent Variance Reducing'' property on subsets of conditionally ellipsoidally symmetric covariates. It is suitable to deal with covariates of…
▽ More
Rerandomization utilizes modern computing ability to search for covariate balance improved experimental design while adhering to the randomization principle originally advocated by RA Fisher. Conditionally affinely invariant rerandomization has the ``Equal Percent Variance Reducing'' property on subsets of conditionally ellipsoidally symmetric covariates. It is suitable to deal with covariates of varying importance or mixed types and usually produces multiple balance scores. ``Unified'' and `` intersection'' methods are common ways of deciding on multiple scores. In general, `` intersection'' methods are computationally more efficient but asymptotically inadmissible. As computational cost is not a major concern in experimental design, we recommend ``unified'' methods to build admissible criteria for rerandomization
△ Less
Submitted 17 February, 2024;
originally announced February 2024.
-
What limits performance of weakly supervised deep learning for chest CT classification?
Authors:
Fakrul Islam Tushar,
Vincent M. D'Anniballe,
Geoffrey D. Rubin,
Joseph Y. Lo
Abstract:
Weakly supervised learning with noisy data has drawn attention in the medical imaging community due to the sparsity of high-quality disease labels. However, little is known about the limitations of such weakly supervised learning and the effect of these constraints on disease classification performance. In this paper, we test the effects of such weak supervision by examining model tolerance for th…
▽ More
Weakly supervised learning with noisy data has drawn attention in the medical imaging community due to the sparsity of high-quality disease labels. However, little is known about the limitations of such weakly supervised learning and the effect of these constraints on disease classification performance. In this paper, we test the effects of such weak supervision by examining model tolerance for three conditions. First, we examined model tolerance for noisy data by incrementally increasing error in the labels within the training data. Second, we assessed the impact of dataset size by varying the amount of training data. Third, we compared performance differences between binary and multi-label classification. Results demonstrated that the model could endure up to 10% added label error before experiencing a decline in disease classification performance. Disease classification performance steadily rose as the amount of training data was increased for all disease classes, before experiencing a plateau in performance at 75% of training data. Last, the binary model outperformed the multilabel model in every disease category. However, such interpretations may be misleading, as the binary model was heavily influenced by co-occurring diseases and may not have learned the specific features of the disease in the image. In conclusion, this study may help the medical imaging community understand the benefits and risks of weak supervision with noisy labels. Such studies demonstrate the need to build diverse, large-scale datasets and to develop explainable and responsible AI.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Union Through UNITY: Cosmology with 2,000 SNe Using a Unified Bayesian Framework
Authors:
David Rubin,
Greg Aldering,
Marc Betoule,
Andy Fruchter,
Xiaosheng Huang,
Alex G. Kim,
Chris Lidman,
Eric Linder,
Saul Perlmutter,
Pilar Ruiz-Lapuente,
Nao Suzuki
Abstract:
Type Ia supernovae (SNe Ia) were instrumental in establishing the acceleration of the universe's expansion. By virtue of their combination of distance reach, precision, and prevalence, they continue to provide key cosmological constraints, complementing other cosmological probes. Individual SN surveys cover only over about a factor of two in redshift, so compilations of multiple SN datasets are st…
▽ More
Type Ia supernovae (SNe Ia) were instrumental in establishing the acceleration of the universe's expansion. By virtue of their combination of distance reach, precision, and prevalence, they continue to provide key cosmological constraints, complementing other cosmological probes. Individual SN surveys cover only over about a factor of two in redshift, so compilations of multiple SN datasets are strongly beneficial. We assemble an updated "Union" compilation of 2087 cosmologically useful SNe Ia from 24 datasets ("Union3"). We take care to put all SNe on the same distance scale and update the light-curve fitting with SALT3 to use the full rest-frame optical. Over the next few years, the number of cosmologically useful SNe Ia will increase by more than a factor of ten, and keeping systematic uncertainties subdominant will be more challenging than ever. We discuss the importance of treating outliers, selection effects, light-curve shape and color populations and standardization relations, unexplained dispersion, and heterogeneous observations simultaneously. We present an updated Bayesian framework, called UNITY1.5 (Unified Nonlinear Inference for Type-Ia cosmologY), that incorporates significant improvements in our ability to model selection effects, standardization, and systematic uncertainties compared to earlier analyses. As an analysis byproduct, we also recover the posterior of the SN-only peculiar-velocity field, although we do not interpret it in this work. We compute updated cosmological constraints with Union3 and UNITY1.5, finding weak 1.7--2.6sigma tension with LambdaCDM and possible evidence for thawing dark energy. We release our binned SN distances to the community.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Approaches to lowering the cost of large space telescopes
Authors:
Ewan S Douglas,
Greg Aldering,
Greg W. Allan,
Ramya Anche,
Roger Angel,
Cameron C. Ard,
Supriya Chakrabarti,
Laird M. Close,
Kevin Derby,
Jerry Edelstein,
John Ford,
Jessica Gersh-Range,
Sebastiaan Y. Haffert,
Patrick J. Ingraham,
Hyukmo Kang,
Douglas M. Kelly,
Daewook Kim,
Michael Lesser,
Jarron M. Leisenring,
Yu-Chia Lin,
Jared R. Males,
Buddy Martin,
Bianca Alondra Payan,
Sai Krishanth P. M.,
David Rubin
, et al. (4 additional authors not shown)
Abstract:
New development approaches, including launch vehicles and advances in sensors, computing, and software, have lowered the cost of entry into space, and have enabled a revolution in low-cost, high-risk Small Satellite (SmallSat) missions. To bring about a similar transformation in larger space telescopes, it is necessary to reconsider the full paradigm of space observatories. Here we will review the…
▽ More
New development approaches, including launch vehicles and advances in sensors, computing, and software, have lowered the cost of entry into space, and have enabled a revolution in low-cost, high-risk Small Satellite (SmallSat) missions. To bring about a similar transformation in larger space telescopes, it is necessary to reconsider the full paradigm of space observatories. Here we will review the history of space telescope development and cost drivers, and describe an example conceptual design for a low cost 6.5 m optical telescope to enable new science when operated in space at room temperature. It uses a monolithic primary mirror of borosilicate glass, drawing on lessons and tools from decades of experience with ground-based observatories and instruments, as well as flagship space missions. It takes advantage, as do large launch vehicles, of increased computing power and space-worthy commercial electronics in low-cost active predictive control systems to maintain stability. We will describe an approach that incorporates science and trade study results that address driving requirements such as integration and testing costs, reliability, spacecraft jitter, and wavefront stability in this new risk-tolerant "LargeSat" context.
△ Less
Submitted 19 October, 2023; v1 submitted 10 September, 2023;
originally announced September 2023.
-
Towards more scientific meta-analyses
Authors:
Lily H. Zhang,
Menelaos Konstantinidis,
Marie-Abèle Bind,
Donald B. Rubin
Abstract:
Meta-analysis can be a critical part of the research process, often serving as the primary analysis on which the practitioners, policymakers, and individuals base their decisions. However, current literature synthesis approaches to meta-analysis typically estimate a different quantity than what is implicitly intended; concretely, standard approaches estimate the average effect of a treatment for a…
▽ More
Meta-analysis can be a critical part of the research process, often serving as the primary analysis on which the practitioners, policymakers, and individuals base their decisions. However, current literature synthesis approaches to meta-analysis typically estimate a different quantity than what is implicitly intended; concretely, standard approaches estimate the average effect of a treatment for a population of imperfect studies, rather than the true scientific effect that would be measured in a population of hypothetical perfect studies. We advocate for an alternative method, called response-surface meta-analysis, which models the relationship between the quality of the study design as predictor variables and its reported estimated effect size as the outcome variable in order to estimate the effect size obtained by the hypothetical ideal study. The idea was first introduced by Rubin several decades ago, and here we provide a practical implementation. First, we reintroduce the idea of response-surface meta-analysis, highlighting its focus on a scientifically-motivated estimand while proposing a straightforward implementation. Then we compare the approach to traditional meta-analysis techniques used in practice. We then implement response-surface meta-analysis and contrast its results with existing literature-synthesis approaches on both simulated data and a real-world example published by the Cochrane Collaboration. We conclude by detailing the primary challenges in the implementation of response-surface meta-analysis and offer some suggestions to tackle these challenges.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Measurement of the Positive Muon Anomalous Magnetic Moment to 0.20 ppm
Authors:
D. P. Aguillard,
T. Albahri,
D. Allspach,
A. Anisenkov,
K. Badgley,
S. Baeßler,
I. Bailey,
L. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
E. Barzi,
F. Bedeschi,
M. Berz,
M. Bhattacharya,
H. P. Binney,
P. Bloom,
J. Bono,
E. Bottalico,
T. Bowcock,
S. Braun,
M. Bressler,
G. Cantatore,
R. M. Carey,
B. C. K. Casey
, et al. (166 additional authors not shown)
Abstract:
We present a new measurement of the positive muon magnetic anomaly, $a_μ\equiv (g_μ- 2)/2$, from the Fermilab Muon $g\!-\!2$ Experiment using data collected in 2019 and 2020. We have analyzed more than 4 times the number of positrons from muon decay than in our previous result from 2018 data. The systematic error is reduced by more than a factor of 2 due to better running conditions, a more stable…
▽ More
We present a new measurement of the positive muon magnetic anomaly, $a_μ\equiv (g_μ- 2)/2$, from the Fermilab Muon $g\!-\!2$ Experiment using data collected in 2019 and 2020. We have analyzed more than 4 times the number of positrons from muon decay than in our previous result from 2018 data. The systematic error is reduced by more than a factor of 2 due to better running conditions, a more stable beam, and improved knowledge of the magnetic field weighted by the muon distribution, $\tildeω'^{}_p$, and of the anomalous precession frequency corrected for beam dynamics effects, $ω_a$. From the ratio $ω_a / \tildeω'^{}_p$, together with precisely determined external parameters, we determine $a_μ= 116\,592\,057(25) \times 10^{-11}$ (0.21 ppm). Combining this result with our previous result from the 2018 data, we obtain $a_μ\text{(FNAL)} = 116\,592\,055(24) \times 10^{-11}$ (0.20 ppm). The new experimental world average is $a_μ(\text{Exp}) = 116\,592\,059(22)\times 10^{-11}$ (0.19 ppm), which represents a factor of 2 improvement in precision.
△ Less
Submitted 4 October, 2023; v1 submitted 11 August, 2023;
originally announced August 2023.
-
Roman CCS White Paper: Measuring Type Ia Supernovae Discovered in the Roman High Latitude Time Domain Survey
Authors:
Rebekah Hounsell,
Dan Scolnic,
Dillon Brout,
Benjamin Rose,
Ori Fox,
Masao Sako,
Phillip Macias,
Bhavin Joshi,
Susana Desutua,
David Rubin,
Stefano Casertano,
Saul Perlmutter,
Greg Aldering,
Kaisey Mandel,
Megan Sosey,
Nao Suzuki,
Russell Ryan
Abstract:
We motivate the cosmological science case of measuring Type Ia supernovae with the Nancy Grace Roman Space Telescope as part of the High Latitude Time Domain Survey. We discuss previously stated requirements for the science, and a baseline survey strategy. We discuss the various areas that must still be optimized and point to the other white papers that consider these topics in detail. Overall, th…
▽ More
We motivate the cosmological science case of measuring Type Ia supernovae with the Nancy Grace Roman Space Telescope as part of the High Latitude Time Domain Survey. We discuss previously stated requirements for the science, and a baseline survey strategy. We discuss the various areas that must still be optimized and point to the other white papers that consider these topics in detail. Overall, the baseline case should enable an exquisite measurement of dark energy using SNe Ia from z=0.1 to z>2, and further optimization should only strengthen this once-in-a-generation experiment.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Roman CCS White Paper: Options to Increase the Coverage Area of Prism Time Series in the High-Latitude Time Domain Core Community Survey
Authors:
Benjamin Rose,
Sebastian Gomez,
Rebekah Hounsell,
Bhavin Joshi,
David Rubin,
Dan Scolnic,
Masao Sako
Abstract:
The current reference High-latitude time domain survey increases the completeness of transients with prism temporal time series data by adjusting the ratio of prism-to-imaging time. However, there are two other nobs that allow for a more complete prism coverage: prism cadence and exposure time. In this white paper, we discuss how changes to the prism cadence and exposure time -- in order to increa…
▽ More
The current reference High-latitude time domain survey increases the completeness of transients with prism temporal time series data by adjusting the ratio of prism-to-imaging time. However, there are two other nobs that allow for a more complete prism coverage: prism cadence and exposure time. In this white paper, we discuss how changes to the prism cadence and exposure time -- in order to increase the fraction of observed transients with spectral time series -- affect supernova cosmology, transient typing and template building, and the study of rare transients.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
Roman CCS White Paper: Considerations for Selecting Fields for the Roman High-latitude Time Domain Core Community Survey
Authors:
Benjamin Rose,
Greg Aldering,
Rebekah Hounsell,
Bhavin Joshi,
David Rubin,
Dan Scolnic,
Saul Perlmutter,
Susana Deustua,
Masao Sako
Abstract:
In this white paper, we review five top considerations for selecting locations of the fields of the Roman High-latitude Time Domain Survey. Based on these considerations, we recommend Akari Deep Field South (ADFS)/Euclid Deep Field South (EDFS) in the Southern Hemisphere has it avoids bright stars, has minimal Milky Way dust, is in Roman Continuous viewing zone, overlaps with multiple past and fut…
▽ More
In this white paper, we review five top considerations for selecting locations of the fields of the Roman High-latitude Time Domain Survey. Based on these considerations, we recommend Akari Deep Field South (ADFS)/Euclid Deep Field South (EDFS) in the Southern Hemisphere has it avoids bright stars, has minimal Milky Way dust, is in Roman Continuous viewing zone, overlaps with multiple past and future surveys, and minimal zodiacal background variation. In the North, Extended Groth Strip (EGS) is good except for its zodiacal variation and Supernova/Acceleration Probe North (SNAP-N) and European Large Area Infrared Space Observatory Survey-North 1 (ELAIS N-1) are good except for their synergistic archival data.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
Roman CCS White Paper: Optimizing the HLTDS Cadence at Fixed Depth
Authors:
David Rubin,
Ben Rose,
Rebekah Hounsell,
Masao Sako,
Greg Aldering,
Dan Scolnic,
Saul Perlmutter
Abstract:
The current proposal for the High Latitude Time Domain Survey (HLTDS) is two tiers (wide and deep) of multi-band imaging and prism spectroscopy with a cadence of five days (Rose et al., 2021). The five-day cadence is motivated by the desire to measure mid-redshift SNe where time dilation is modest as well as to better photometrically characterize the transients detected. This white paper does not…
▽ More
The current proposal for the High Latitude Time Domain Survey (HLTDS) is two tiers (wide and deep) of multi-band imaging and prism spectroscopy with a cadence of five days (Rose et al., 2021). The five-day cadence is motivated by the desire to measure mid-redshift SNe where time dilation is modest as well as to better photometrically characterize the transients detected. This white paper does not provide a conclusion as to the best cadence for the HLTDS. Rather, it collects a set of considerations that should be used for a careful study of cadence by a future committee optimizing the Roman survey. This study should optimize the HLTDS for both SN Ia cosmology and other transient science.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
Roman CCS White Paper: Balanced Prism Plus Filter Cadence in the High Latitude Time Domain Survey Core Community Survey
Authors:
Greg Aldering,
David Rubin,
Benjamin Rose,
Rebekah Hounsell,
Saul Perlmutter,
Susana Deustua
Abstract:
The Nancy Grace Roman Space Telescope's (RST) Wide Field Imager (WFI) is equipped with a slitless prism that can be used for spectroscopic discovery and follow-up of explosive transients at high redshift as part of its High Latitude Time Domain Survey. This is new and unique spectroscopic capability, not only for its original purpose for cosmology, but also for other types of explosive transients.…
▽ More
The Nancy Grace Roman Space Telescope's (RST) Wide Field Imager (WFI) is equipped with a slitless prism that can be used for spectroscopic discovery and follow-up of explosive transients at high redshift as part of its High Latitude Time Domain Survey. This is new and unique spectroscopic capability, not only for its original purpose for cosmology, but also for other types of explosive transients. This white paper is intended to help make this new capability more clear to the community. The depth of the RST prism compared to ground-based spectrographs is explored, showing that the RST prism will be unrivaled in the observer-frame NIR. The influence of the selected sky locations on the speed and homogeneity of a RST prism survey is also estimated. This unique new capability should be considered when balancing the HLTDS time devoted to cadenced imaging and spectroscopy.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
Roman CCS White Paper: Identifying high-redshift pair-instability supernovae by adding sparse F213 filter observations
Authors:
Takashi Moriya,
Ori D. Fox,
Robert Quimby,
Steve Schulze,
Ashley Villar,
Armin Rest,
Norman Grogin,
Sebastian Gomez,
David Rubin,
Matt Siebert,
Susan Kassin,
Eniko Regos,
Lou Strolger,
Anton Koekemoer,
Steven Finkelstein,
Suvi Gezari,
Seppo Mattila,
Tea Temim,
Melissa Shahbandeh,
Bob Williams,
Ting-Wan Chen,
Isobel Hook,
Justin Pierel,
Masami Ouchi,
Yuichi Harikane
Abstract:
Pair-instability supernovae (PISNe) are explosions of very massive stars that may have played a critical role in the chemical evolution and reionization of the early Universe. In order to quantify their roles, it is required to know the PISN event rate at z > 6. Although Roman Space Telescope has a capability to discover PISNe at z > 6, identifying rare high-redshift PISN candidates among many oth…
▽ More
Pair-instability supernovae (PISNe) are explosions of very massive stars that may have played a critical role in the chemical evolution and reionization of the early Universe. In order to quantify their roles, it is required to know the PISN event rate at z > 6. Although Roman Space Telescope has a capability to discover PISNe at z > 6, identifying rare high-redshift PISN candidates among many other transients is challenging. In order to efficiently identify PISN candidates at z > 6, we propose to add sparse F213 observations reaching 26.5 mag (or deeper) every half year in the High Latitude Time Domain Survey. By adding the F213 information, PISNe at z > 6 can be efficiently identified in the color-magnitude diagram.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
Towards trustworthy seizure onset detection using workflow notes
Authors:
Khaled Saab,
Siyi Tang,
Mohamed Taha,
Christopher Lee-Messer,
Christopher Ré,
Daniel Rubin
Abstract:
A major barrier to deploying healthcare AI models is their trustworthiness. One form of trustworthiness is a model's robustness across different subgroups: while existing models may exhibit expert-level performance on aggregate metrics, they often rely on non-causal features, leading to errors in hidden subgroups. To take a step closer towards trustworthy seizure onset detection from EEG, we propo…
▽ More
A major barrier to deploying healthcare AI models is their trustworthiness. One form of trustworthiness is a model's robustness across different subgroups: while existing models may exhibit expert-level performance on aggregate metrics, they often rely on non-causal features, leading to errors in hidden subgroups. To take a step closer towards trustworthy seizure onset detection from EEG, we propose to leverage annotations that are produced by healthcare personnel in routine clinical workflows -- which we refer to as workflow notes -- that include multiple event descriptions beyond seizures. Using workflow notes, we first show that by scaling training data to an unprecedented level of 68,920 EEG hours, seizure onset detection performance significantly improves (+12.3 AUROC points) compared to relying on smaller training sets with expensive manual gold-standard labels. Second, we reveal that our binary seizure onset detection model underperforms on clinically relevant subgroups (e.g., up to a margin of 6.5 AUROC points between pediatrics and adults), while having significantly higher false positives on EEG clips showing non-epileptiform abnormalities compared to any EEG clip (+19 FPR points). To improve model robustness to hidden subgroups, we train a multilabel model that classifies 26 attributes other than seizures, such as spikes, slowing, and movement artifacts. We find that our multilabel model significantly improves overall seizure onset detection performance (+5.9 AUROC points) while greatly improving performance among subgroups (up to +8.3 AUROC points), and decreases false positives on non-epileptiform abnormalities by 8 FPR points. Finally, we propose a clinical utility metric based on false positives per 24 EEG hours and find that our multilabel model improves this clinical utility metric by a factor of 2x across different clinical settings.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
The Intel Neuromorphic DNS Challenge
Authors:
Jonathan Timcheck,
Sumit Bam Shrestha,
Daniel Ben Dayan Rubin,
Adam Kupryjanow,
Garrick Orchard,
Lukasz Pindor,
Timothy Shea,
Mike Davies
Abstract:
A critical enabler for progress in neuromorphic computing research is the ability to transparently evaluate different neuromorphic solutions on important tasks and to compare them to state-of-the-art conventional solutions. The Intel Neuromorphic Deep Noise Suppression Challenge (Intel N-DNS Challenge), inspired by the Microsoft DNS Challenge, tackles a ubiquitous and commercially relevant task: r…
▽ More
A critical enabler for progress in neuromorphic computing research is the ability to transparently evaluate different neuromorphic solutions on important tasks and to compare them to state-of-the-art conventional solutions. The Intel Neuromorphic Deep Noise Suppression Challenge (Intel N-DNS Challenge), inspired by the Microsoft DNS Challenge, tackles a ubiquitous and commercially relevant task: real-time audio denoising. Audio denoising is likely to reap the benefits of neuromorphic computing due to its low-bandwidth, temporal nature and its relevance for low-power devices. The Intel N-DNS Challenge consists of two tracks: a simulation-based algorithmic track to encourage algorithmic innovation, and a neuromorphic hardware (Loihi 2) track to rigorously evaluate solutions. For both tracks, we specify an evaluation methodology based on energy, latency, and resource consumption in addition to output audio quality. We make the Intel N-DNS Challenge dataset scripts and evaluation code freely accessible, encourage community participation with monetary prizes, and release a neuromorphic baseline solution which shows promising audio quality, high power efficiency, and low resource consumption when compared to Microsoft NsNet2 and a proprietary Intel denoising model used in production. We hope the Intel N-DNS Challenge will hasten innovation in neuromorphic algorithms research, especially in the area of training tools and methods for real-time signal processing. We expect the winners of the challenge will demonstrate that for problems like audio denoising, significant gains in power and resources can be realized on neuromorphic devices available today compared to conventional state-of-the-art solutions.
△ Less
Submitted 1 August, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Bayesian Criterion for Re-randomization
Authors:
Zhaoyang Liu,
Tingxuan Han,
Donald B. Rubin,
Ke Deng
Abstract:
Re-randomization has gained popularity as a tool for experiment-based causal inference due to its superior covariate balance and statistical efficiency compared to classic randomized experiments. However, the basic re-randomization method, known as ReM, and many of its extensions have been deemed sub-optimal as they fail to prioritize covariates that are more strongly associated with potential out…
▽ More
Re-randomization has gained popularity as a tool for experiment-based causal inference due to its superior covariate balance and statistical efficiency compared to classic randomized experiments. However, the basic re-randomization method, known as ReM, and many of its extensions have been deemed sub-optimal as they fail to prioritize covariates that are more strongly associated with potential outcomes. To address this limitation and design more efficient re-randomization procedures, a more precise quantification of covariate heterogeneity and its impact on the causal effect estimator is in a great appeal. This work fills in this gap with a Bayesian criterion for re-randomization and a series of novel re-randomization procedures derived under such a criterion. Both theoretical analyses and numerical studies show that the proposed re-randomization procedures under the Bayesian criterion outperform existing ReM-based procedures significantly in effectively balancing covariates and precisely estimating the unknown causal effect.
△ Less
Submitted 19 September, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
-
Estimating the Instantaneous Reproduction Number With Imperfect Data: A Method to Account for Case-Reporting Variation and Serial Interval Uncertainty
Authors:
Gary Hettinger,
David Rubin,
Jing Huang
Abstract:
During an infectious disease outbreak, public health decision-makers require real-time monitoring of disease transmission to respond quickly and intelligently. In these settings, a key measure of transmission is the instantaneous time-varying reproduction number, $R_t$. Estimation of this number using a Time-Since-Infection model relies on case-notification data and the distribution of the serial…
▽ More
During an infectious disease outbreak, public health decision-makers require real-time monitoring of disease transmission to respond quickly and intelligently. In these settings, a key measure of transmission is the instantaneous time-varying reproduction number, $R_t$. Estimation of this number using a Time-Since-Infection model relies on case-notification data and the distribution of the serial interval on the target population. However, in practice, case-notification data may contain measurement error due to variation in case reporting while available serial interval estimates may come from studies on non-representative populations.
We propose a new data-driven method that accounts for particular forms of case-reporting measurement error and can incorporate multiple partially representative serial interval estimates into the transmission estimation process. In addition, we provide practical tools for automatically identifying measurement error patterns and determining when measurement error may not be adequately accounted for. We illustrate the potential bias undertaken by methods that ignore these practical concerns through a variety of simulated outbreaks. We then demonstrate the use of our method on data from the COVID-19 pandemic to estimate transmission and explore the relationships between social distancing, temperature, and transmission.
△ Less
Submitted 23 March, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Exploring Image Augmentations for Siamese Representation Learning with Chest X-Rays
Authors:
Rogier van der Sluijs,
Nandita Bhaskhar,
Daniel Rubin,
Curtis Langlotz,
Akshay Chaudhari
Abstract:
Image augmentations are quintessential for effective visual representation learning across self-supervised learning techniques. While augmentation strategies for natural imaging have been studied extensively, medical images are vastly different from their natural counterparts. Thus, it is unknown whether common augmentation strategies employed in Siamese representation learning generalize to medic…
▽ More
Image augmentations are quintessential for effective visual representation learning across self-supervised learning techniques. While augmentation strategies for natural imaging have been studied extensively, medical images are vastly different from their natural counterparts. Thus, it is unknown whether common augmentation strategies employed in Siamese representation learning generalize to medical images and to what extent. To address this challenge, in this study, we systematically assess the effect of various augmentations on the quality and robustness of the learned representations. We train and evaluate Siamese Networks for abnormality detection on chest X-Rays across three large datasets (MIMIC-CXR, CheXpert and VinDR-CXR). We investigate the efficacy of the learned representations through experiments involving linear probing, fine-tuning, zero-shot transfer, and data efficiency. Finally, we identify a set of augmentations that yield robust representations that generalize well to both out-of-distribution data and diseases, while outperforming supervised baselines using just zero-shot transfer and linear probes by up to 20%. Our code is available at https://github.com/StanfordMIMI/siaug.
△ Less
Submitted 10 July, 2023; v1 submitted 29 January, 2023;
originally announced January 2023.
-
The DEHVILS Survey Overview and Initial Data Release: High-Quality Near-Infrared Type Ia Supernova Light Curves at Low Redshift
Authors:
Erik R. Peterson,
David O. Jones,
Daniel Scolnic,
Bruno O. Sánchez,
Aaron Do,
Adam G. Riess,
Sam M. Ward,
Arianna Dwomoh,
Thomas de Jaeger,
Saurabh W. Jha,
Kaisey S. Mandel,
Justin D. R. Pierel,
Brodie Popovic,
Benjamin M. Rose,
David Rubin,
Benjamin J. Shappee,
Stephen Thorp,
John L. Tonry,
R. Brent Tully,
Maria Vincenzi
Abstract:
While the sample of optical Type Ia Supernova (SN Ia) light curves (LCs) usable for cosmological parameter measurements surpasses 2000, the sample of published, cosmologically viable near-infrared (NIR) SN Ia LCs, which have been shown to be good "standard candles," is still $\lesssim$ 200. Here, we present high-quality NIR LCs for 83 SNe Ia ranging from $0.002 < z < 0.09$ as a part of the Dark En…
▽ More
While the sample of optical Type Ia Supernova (SN Ia) light curves (LCs) usable for cosmological parameter measurements surpasses 2000, the sample of published, cosmologically viable near-infrared (NIR) SN Ia LCs, which have been shown to be good "standard candles," is still $\lesssim$ 200. Here, we present high-quality NIR LCs for 83 SNe Ia ranging from $0.002 < z < 0.09$ as a part of the Dark Energy, H$_0$, and peculiar Velocities using Infrared Light from Supernovae (DEHVILS) survey. Observations are taken using UKIRT's WFCAM, where the median depth of the images is 20.7, 20.1, and 19.3 mag (Vega) for $Y$, $J$, and $H$-bands, respectively. The median number of epochs per SN Ia is 18 for all three bands ($YJH$) combined and 6 for each band individually. We fit 47 SN Ia LCs that pass strict quality cuts using three LC models, SALT3, SNooPy, and BayeSN and find scatter on the Hubble diagram to be comparable to or better than scatter from optical-only fits in the literature. Fitting NIR-only LCs, we obtain standard deviations ranging from 0.128-0.135 mag. Additionally, we present a refined calibration method for transforming 2MASS magnitudes to WFCAM magnitudes using HST CALSPEC stars that results in a 0.03 mag shift in the WFCAM $Y$-band magnitudes.
△ Less
Submitted 10 April, 2023; v1 submitted 27 January, 2023;
originally announced January 2023.
-
Early-Phase Local-Area Model for Pandemics Using Limited Data: A SARS-CoV-2 Application
Authors:
Jiasheng Shi,
Jeffrey S. Morris,
David M. Rubin,
Jing Huang
Abstract:
The emergence of novel infectious agents presents challenges to statistical models of disease transmission. These challenges arise from limited, poor-quality data and an incomplete understanding of the agent. Moreover, outbreaks manifest differently across regions due to various factors, making it imperative for models to factor in regional specifics. In this work, we offer a model that effectivel…
▽ More
The emergence of novel infectious agents presents challenges to statistical models of disease transmission. These challenges arise from limited, poor-quality data and an incomplete understanding of the agent. Moreover, outbreaks manifest differently across regions due to various factors, making it imperative for models to factor in regional specifics. In this work, we offer a model that effectively utilizes constrained data resources to estimate disease transmission rates at the local level, especially during the early outbreak phase when primarily infection counts and aggregated local characteristics are accessible. This model merges a pathogen transmission methodology based on daily infection numbers with regression techniques, drawing correlations between disease transmission and local-area factors, such as demographics, health policies, behavior, and even climate, to estimate and forecast daily infections. We incorporate the quasi-score method and an error term to navigate potential data concerns and mistaken assumptions. Additionally, we introduce an online estimator that facilitates real-time data updates, complemented by an iterative algorithm for parameter estimation. This approach facilitates real-time analysis of disease transmission when data quality is suboptimal and knowledge of the infectious pathogen is limited. It is particularly useful in the early stages of outbreaks, providing support for local decision-making.
△ Less
Submitted 18 March, 2024; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Modeling Multivariate Biosignals With Graph Neural Networks and Structured State Space Models
Authors:
Siyi Tang,
Jared A. Dunnmon,
Liangqiong Qu,
Khaled K. Saab,
Tina Baykaner,
Christopher Lee-Messer,
Daniel L. Rubin
Abstract:
Multivariate biosignals are prevalent in many medical domains, such as electroencephalography, polysomnography, and electrocardiography. Modeling spatiotemporal dependencies in multivariate biosignals is challenging due to (1) long-range temporal dependencies and (2) complex spatial correlations between the electrodes. To address these challenges, we propose representing multivariate biosignals as…
▽ More
Multivariate biosignals are prevalent in many medical domains, such as electroencephalography, polysomnography, and electrocardiography. Modeling spatiotemporal dependencies in multivariate biosignals is challenging due to (1) long-range temporal dependencies and (2) complex spatial correlations between the electrodes. To address these challenges, we propose representing multivariate biosignals as time-dependent graphs and introduce GraphS4mer, a general graph neural network (GNN) architecture that improves performance on biosignal classification tasks by modeling spatiotemporal dependencies in biosignals. Specifically, (1) we leverage the Structured State Space architecture, a state-of-the-art deep sequence model, to capture long-range temporal dependencies in biosignals and (2) we propose a graph structure learning layer in GraphS4mer to learn dynamically evolving graph structures in the data. We evaluate our proposed model on three distinct biosignal classification tasks and show that GraphS4mer consistently improves over existing models, including (1) seizure detection from electroencephalographic signals, outperforming a previous GNN with self-supervised pre-training by 3.1 points in AUROC; (2) sleep staging from polysomnographic signals, a 4.1 points improvement in macro-F1 score compared to existing sleep staging models; and (3) 12-lead electrocardiogram classification, outperforming previous state-of-the-art models by 2.7 points in macro-F1 score.
△ Less
Submitted 29 April, 2023; v1 submitted 20 November, 2022;
originally announced November 2022.
-
ATCON: Attention Consistency for Vision Models
Authors:
Ali Mirzazadeh,
Florian Dubost,
Maxwell Pike,
Krish Maniar,
Max Zuo,
Christopher Lee-Messer,
Daniel Rubin
Abstract:
Attention--or attribution--maps methods are methods designed to highlight regions of the model's input that were discriminative for its predictions. However, different attention maps methods can highlight different regions of the input, with sometimes contradictory explanations for a prediction. This effect is exacerbated when the training set is small. This indicates that either the model learned…
▽ More
Attention--or attribution--maps methods are methods designed to highlight regions of the model's input that were discriminative for its predictions. However, different attention maps methods can highlight different regions of the input, with sometimes contradictory explanations for a prediction. This effect is exacerbated when the training set is small. This indicates that either the model learned incorrect representations or that the attention maps methods did not accurately estimate the model's representations. We propose an unsupervised fine-tuning method that optimizes the consistency of attention maps and show that it improves both classification performance and the quality of attention maps. We propose an implementation for two state-of-the-art attention computation methods, Grad-CAM and Guided Backpropagation, which relies on an input masking technique. We also show results on Grad-CAM and Integrated Gradients in an ablation study. We evaluate this method on our own dataset of event detection in continuous video recordings of hospital patients aggregated and curated for this work. As a sanity check, we also evaluate the proposed method on PASCAL VOC and SVHN. With the proposed method, with small training sets, we achieve a 6.6 points lift of F1 score over the baselines on our video dataset, a 2.9 point lift of F1 score on PASCAL, and a 1.8 points lift of mean Intersection over Union over Grad-CAM for weakly supervised detection on PASCAL. Those improved attention maps may help clinicians better understand vision model predictions and ease the deployment of machine learning systems into clinical care. We share part of the code for this article at the following repository: https://github.com/alimirzazadeh/SemisupervisedAttention.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
The Spectroscopic Classification of Astronomical Transients (SCAT) Survey: Overview, Pipeline Description, Initial Results, and Future Plans
Authors:
M. A. Tucker,
B. J. Shappee,
M. E. Huber,
A. V. Payne,
A. Do,
J. T. Hinkle,
T. de Jaeger,
C. Ashall,
D. D. Desai,
W. B. Hoogendam,
G. Aldering,
K. Auchettl,
C. Baranec,
J. Bulger,
K. Chambers,
M. Chun,
K. W. Hodapp,
T. B. Lowe,
L. McKay,
R. Rampy,
D. Rubin,
J. L. Tonry
Abstract:
We present the Spectroscopic Classification of Astronomical Transients (SCAT) survey, which is dedicated to spectrophotometric observations of transient objects such as supernovae and tidal disruption events. SCAT uses the SuperNova Integral-Field Spectrograph (SNIFS) on the University of Hawai'i 2.2-meter (UH2.2m) telescope. SNIFS was designed specifically for accurate transient spectrophotometry…
▽ More
We present the Spectroscopic Classification of Astronomical Transients (SCAT) survey, which is dedicated to spectrophotometric observations of transient objects such as supernovae and tidal disruption events. SCAT uses the SuperNova Integral-Field Spectrograph (SNIFS) on the University of Hawai'i 2.2-meter (UH2.2m) telescope. SNIFS was designed specifically for accurate transient spectrophotometry, including absolute flux calibration and host-galaxy removal. We describe the data reduction and calibration pipeline including spectral extraction, telluric correction, atmospheric characterization, nightly photometricity, and spectrophotometric precision. We achieve $\lesssim 5\%$ spectrophotometry across the full optical wavelength range ($3500-9000~Å$) under photometric conditions. The inclusion of photometry from the SNIFS multi-filter mosaic imager allows for decent spectrophotometric calibration ($10-20\%$) even under unfavorable weather/atmospheric conditions. SCAT obtained $\approx 640$ spectra of transients over the first 3 years of operations, including supernovae of all types, active galactic nuclei, cataclysmic variables, and rare transients such as superluminous supernovae and tidal disruption events. These observations will provide the community with benchmark spectrophotometry to constrain the next generation of hydrodynamic and radiative transfer models.
△ Less
Submitted 29 November, 2022; v1 submitted 17 October, 2022;
originally announced October 2022.
-
Bump Morphology of the CMAGIC Diagram
Authors:
L. Aldoroty,
L. Wang,
P. Hoeflich,
J. Yang,
N. Suntzeff,
G. Aldering,
P. Antilogus,
C. Aragon,
S. Bailey,
C. Baltay,
S. Bongard,
K. Boone,
C. Buton,
Y. Copin,
S. Dixon,
D. Fouchez,
E. Gangler,
R. Gupta,
B. Hayden,
Mitchell Karmen,
A. G. Kim,
M. Kowalski,
D. Küsters,
P. -F. Léget,
F. Mondon
, et al. (16 additional authors not shown)
Abstract:
We apply the color-magnitude intercept calibration method (CMAGIC) to the Nearby Supernova Factory SNe Ia spectrophotometric dataset. The currently existing CMAGIC parameters are the slope and intercept of a straight line fit to the first linear region in the color-magnitude diagram, which occurs over a span of approximately 30 days after maximum brightness. We define a new parameter, $ω_{XY}$, th…
▽ More
We apply the color-magnitude intercept calibration method (CMAGIC) to the Nearby Supernova Factory SNe Ia spectrophotometric dataset. The currently existing CMAGIC parameters are the slope and intercept of a straight line fit to the first linear region in the color-magnitude diagram, which occurs over a span of approximately 30 days after maximum brightness. We define a new parameter, $ω_{XY}$, the size of the ``bump'' feature near maximum brightness for arbitrary filters $X$ and $Y$. We find a significant correlation between the slope of the first linear region, $β_{XY, 1}$, in the CMAGIC diagram and $ω_{XY}$. These results may be used to our advantage, as they are less affected by extinction than parameters defined as a function of time. Additionally, $ω_{XY}$ is computed independently of templates. We find that current empirical templates are successful at reproducing the features described in this work, particularly SALT3, which correctly exhibits the negative correlation between slope and bump size seen in our data. In 1-D simulations, we show that the correlation between the size of the bump feature and $β_{XY, 1}$ can be understood as a result of chemical mixing due to large-scale Rayleigh-Taylor instabilities.
△ Less
Submitted 22 June, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Constraints on Cosmological Parameters with a Sample of Type Ia Supernovae from JWST
Authors:
Jia Lu,
Lifan Wang,
Xingzhuo Chen,
David Rubin,
Saul Perlmutter,
Dietrich Baade,
Jeremy Mould,
Jozsef Vinko,
Eniko Regos,
Anton M. Koekemoer
Abstract:
We investigate the potential of using a sample of very high-redshift ($2\lesssim z \lesssim6$) (VHZ) Type Ia supernovae (SNe~Ia) attainable by the James Webb Space Telescope (JWST) on constraining cosmological parameters. At such high redshifts, the age of the universe is young enough that the VHZ SNIa sample comprises the very first SNe~Ia of the universe, with progenitors among the very first ge…
▽ More
We investigate the potential of using a sample of very high-redshift ($2\lesssim z \lesssim6$) (VHZ) Type Ia supernovae (SNe~Ia) attainable by the James Webb Space Telescope (JWST) on constraining cosmological parameters. At such high redshifts, the age of the universe is young enough that the VHZ SNIa sample comprises the very first SNe~Ia of the universe, with progenitors among the very first generation of low mass stars that the universe has made. We show that the VHZ SNe~Ia can be used to disentangle systematic effects due to the luminosity distance evolution with redshifts intrinsic to SNIa standardization. Assuming that the systematic evolution can be described by a linear or logarithmic formula, we found that the coefficients of this dependence can be determined accurately and decoupled from cosmological models. Systematic evolution as large as 0.15 mag and 0.45 mag out to $z=5$ can be robustly separated from popular cosmological models for the linear and logarithmic evolution, respectively. The VHZ SNe~Ia will lay the foundation for quantifying the systematic redshift evolution of SNIa luminosity distance scales. When combined with SNIa surveys at comparatively lower redshifts, the VHZ SNe~Ia allow for a precise measurement of the history of the expansion of the universe from $z\sim 0$ to the epoch approaching reionization.
△ Less
Submitted 2 November, 2022; v1 submitted 3 October, 2022;
originally announced October 2022.
-
Cosmicflows-4
Authors:
R. Brent Tully,
Ehsan Kourkchi,
Hélène M. Courtois,
Gagandeep S. Anand,
John P. Blakeslee,
Dillon Brout,
Thomas de Jaeger,
Alexandra Dupuy,
Daniel Guinet,
Cullan Howlett,
Joseph B. Jensen,
Daniel Pomarède,
Luca Rizzi,
David Rubin,
Khaled Said,
Daniel Scolnic,
Benjamin E. Stahl
Abstract:
With Cosmicflows-4, distances are compiled for 55,877 galaxies gathered into 38,065 groups. Eight methodologies are employed, with the largest numbers coming from the correlations between the photometric and kinematic properties of spiral galaxies (TF) and elliptical galaxies (FP). Supernovae that arise from degenerate progenitors (type Ia Sne) are an important overlapping component. Smaller contr…
▽ More
With Cosmicflows-4, distances are compiled for 55,877 galaxies gathered into 38,065 groups. Eight methodologies are employed, with the largest numbers coming from the correlations between the photometric and kinematic properties of spiral galaxies (TF) and elliptical galaxies (FP). Supernovae that arise from degenerate progenitors (type Ia Sne) are an important overlapping component. Smaller contributions come from distance estimates from the surface brightness fluctuations of elliptical galaxies and the luminosities and expansion rates of core collapse supernovae (SNII). Cepheid period-luminosity relation and tip of the red giant branch observations founded on local stellar parallax measurements along with the geometric maser distance to NGC 4258 provide the absolute scaling of distances. The assembly of galaxies into groups is an important feature of the study in facilitating overlaps between methodologies. Merging between multiple contributions within a methodology and between methodologies is carried out with Bayesian Markov chain Monte Carlo procedures. The final assembly of distances is compatible with a value of the Hubble constant of $H_0=74.6$ km s$^{-1}$ Mpc$^{-1}$ with the small statistical error of $\pm 0.8$ km s$^{-1}$ Mpc$^{-1}$ but a large potential systematic error of ~3 km s$^{-1}$ Mpc$^{-1}$. Peculiar velocities can be inferred from the measured distances. The interpretation of the field of peculiar velocities is complex because of large errors on individual components and invites analyses beyond the scope of this study.
△ Less
Submitted 28 December, 2022; v1 submitted 22 September, 2022;
originally announced September 2022.
-
SALT3-NIR: Taking the Open-Source Type Ia Supernova Model to Longer Wavelengths for Next-Generation Cosmological Measurements
Authors:
J. D. R. Pierel,
D. O. Jones,
W. D. Kenworthy,
M. Dai,
R. Kessler,
C. Ashall,
A. Do,
E. R. Peterson,
B. J. Shappee,
M. R. Siebert,
T. Barna,
T. G. Brink,
J. Burke,
A. Calamida,
Y. Camacho-Neves,
T. de Jaeger,
A. V. Filippenko,
R. J. Foley,
L. Galbany,
O. D. Fox,
S. Gomez,
D. Hiramatsu,
R. Hounsell,
D. A. Howell,
S. W. Jha
, et al. (10 additional authors not shown)
Abstract:
A large fraction of Type Ia supernova (SN Ia) observations over the next decade will be in the near-infrared (NIR), at wavelengths beyond the reach of the current standard light-curve model for SN Ia cosmology, SALT3 ($\sim 2800$--8700$A$ central filter wavelength). To harness this new SN Ia sample and reduce future light-curve standardization systematic uncertainties, we train SALT3 at NIR wavele…
▽ More
A large fraction of Type Ia supernova (SN Ia) observations over the next decade will be in the near-infrared (NIR), at wavelengths beyond the reach of the current standard light-curve model for SN Ia cosmology, SALT3 ($\sim 2800$--8700$A$ central filter wavelength). To harness this new SN Ia sample and reduce future light-curve standardization systematic uncertainties, we train SALT3 at NIR wavelengths (SALT3-NIR) up to 2 $μ$m with the open-source model-training software SALTShaker, which can easily accommodate future observations. Using simulated data we show that the training process constrains the NIR model to $\sim 2$--3% across the phase range ($-20$ to $50$ days). We find that Hubble residual (HR) scatter is smaller using the NIR alone or optical+NIR compared to optical alone, by up to $\sim 30$% depending on filter choice (95% confidence). There is significant correlation between NIR light-curve stretch measurements and luminosity, with stretch and color corrections often improving HR scatter by up to $\sim20%$. For SN Ia observations expected from the \textit{Roman Space Telescope}, SALT3-NIR increases the amount of usable data in the SALT framework by $\sim 20$% at redshift $z\lesssim0.4$ and by $\sim 50$% at $z\lesssim0.15$. The SALT3-NIR model is part of the open-source {\tt SNCosmo} and {\tt SNANA} SN Ia cosmology packages.
△ Less
Submitted 31 October, 2022; v1 submitted 12 September, 2022;
originally announced September 2022.
-
Catalytic Priors: Using Synthetic Data to Specify Prior Distributions in Bayesian Analysis
Authors:
Dongming Huang,
Feicheng Wang,
Donald B. Rubin,
S. C. Kou
Abstract:
Catalytic prior distributions provide general, easy-to-use, and interpretable specifications of prior distributions for Bayesian analysis. They are particularly beneficial when the observed data are inadequate to stably estimate a complex target model. A catalytic prior distribution is constructed by augmenting the observed data with synthetic data that are sampled from the predictive distribution…
▽ More
Catalytic prior distributions provide general, easy-to-use, and interpretable specifications of prior distributions for Bayesian analysis. They are particularly beneficial when the observed data are inadequate to stably estimate a complex target model. A catalytic prior distribution is constructed by augmenting the observed data with synthetic data that are sampled from the predictive distribution of a simpler model estimated from the observed data. We illustrate the usefulness of the catalytic prior approach using an example from labor economics. In the example, the resulting Bayesian inference reflects many important aspects of the observed data, and the estimation accuracy and predictive performance of the inference based on the catalytic prior are superior to, or comparable to, that of other commonly used prior distributions. We further explore the connection between the catalytic prior approach and a few popular regularization methods. We expect the catalytic prior approach to be useful in many applications.
△ Less
Submitted 22 September, 2023; v1 submitted 30 August, 2022;
originally announced August 2022.
-
Contrastive learning-based pretraining improves representation and transferability of diabetic retinopathy classification models
Authors:
Minhaj Nur Alam,
Rikiya Yamashita,
Vignav Ramesh,
Tejas Prabhune,
Jennifer I. Lim,
R. V. P. Chan,
Joelle Hallak,
Theodore Leng,
Daniel Rubin
Abstract:
Self supervised contrastive learning based pretraining allows development of robust and generalized deep learning models with small, labeled datasets, reducing the burden of label generation. This paper aims to evaluate the effect of CL based pretraining on the performance of referrable vs non referrable diabetic retinopathy (DR) classification. We have developed a CL based framework with neural s…
▽ More
Self supervised contrastive learning based pretraining allows development of robust and generalized deep learning models with small, labeled datasets, reducing the burden of label generation. This paper aims to evaluate the effect of CL based pretraining on the performance of referrable vs non referrable diabetic retinopathy (DR) classification. We have developed a CL based framework with neural style transfer (NST) augmentation to produce models with better representations and initializations for the detection of DR in color fundus images. We compare our CL pretrained model performance with two state of the art baseline models pretrained with Imagenet weights. We further investigate the model performance with reduced labeled training data (down to 10 percent) to test the robustness of the model when trained with small, labeled datasets. The model is trained and validated on the EyePACS dataset and tested independently on clinical data from the University of Illinois, Chicago (UIC). Compared to baseline models, our CL pretrained FundusNet model had higher AUC (CI) values (0.91 (0.898 to 0.930) vs 0.80 (0.783 to 0.820) and 0.83 (0.801 to 0.853) on UIC data). At 10 percent labeled training data, the FundusNet AUC was 0.81 (0.78 to 0.84) vs 0.58 (0.56 to 0.64) and 0.63 (0.60 to 0.66) in baseline models, when tested on the UIC dataset. CL based pretraining with NST significantly improves DL classification performance, helps the model generalize well (transferable from EyePACS to UIC data), and allows training with small, annotated datasets, therefore reducing ground truth annotation burden of the clinicians.
△ Less
Submitted 24 August, 2022;
originally announced August 2022.
-
TRUST-LAPSE: An Explainable and Actionable Mistrust Scoring Framework for Model Monitoring
Authors:
Nandita Bhaskhar,
Daniel L. Rubin,
Christopher Lee-Messer
Abstract:
Continuous monitoring of trained ML models to determine when their predictions should and should not be trusted is essential for their safe deployment. Such a framework ought to be high-performing, explainable, post-hoc and actionable. We propose TRUST-LAPSE, a "mistrust" scoring framework for continuous model monitoring. We assess the trustworthiness of each input sample's model prediction using…
▽ More
Continuous monitoring of trained ML models to determine when their predictions should and should not be trusted is essential for their safe deployment. Such a framework ought to be high-performing, explainable, post-hoc and actionable. We propose TRUST-LAPSE, a "mistrust" scoring framework for continuous model monitoring. We assess the trustworthiness of each input sample's model prediction using a sequence of latent-space embeddings. Specifically, (a) our latent-space mistrust score estimates mistrust using distance metrics (Mahalanobis distance) and similarity metrics (cosine similarity) in the latent-space and (b) our sequential mistrust score determines deviations in correlations over the sequence of past input representations in a non-parametric, sliding-window based algorithm for actionable continuous monitoring. We evaluate TRUST-LAPSE via two downstream tasks: (1) distributionally shifted input detection, and (2) data drift detection. We evaluate across diverse domains - audio and vision using public datasets and further benchmark our approach on challenging, real-world electroencephalograms (EEG) datasets for seizure detection. Our latent-space mistrust scores achieve state-of-the-art results with AUROCs of 84.1 (vision), 73.9 (audio), and 77.1 (clinical EEGs), outperforming baselines by over 10 points. We expose critical failures in popular baselines that remain insensitive to input semantic content, rendering them unfit for real-world model monitoring. We show that our sequential mistrust scores achieve high drift detection rates; over 90% of the streams show < 20% error for all domains. Through extensive qualitative and quantitative evaluations, we show that our mistrust scores are more robust and provide explainability for easy adoption into practice.
△ Less
Submitted 12 July, 2023; v1 submitted 22 July, 2022;
originally announced July 2022.
-
A Probabilistic Autoencoder for Type Ia Supernovae Spectral Time Series
Authors:
George Stein,
Uros Seljak,
Vanessa Bohm,
G. Aldering,
P. Antilogus,
C. Aragon,
S. Bailey,
C. Baltay,
S. Bongard,
K. Boone,
C. Buton,
Y. Copin,
S. Dixon,
D. Fouchez,
E. Gangler,
R. Gupta,
B. Hayden,
W. Hillebrandt,
M. Karmen,
A. G. Kim,
M. Kowalski,
D. Kusters,
P. F. Leget,
F. Mondon,
J. Nordin
, et al. (15 additional authors not shown)
Abstract:
We construct a physically-parameterized probabilistic autoencoder (PAE) to learn the intrinsic diversity of type Ia supernovae (SNe Ia) from a sparse set of spectral time series. The PAE is a two-stage generative model, composed of an Auto-Encoder (AE) which is interpreted probabilistically after training using a Normalizing Flow (NF). We demonstrate that the PAE learns a low-dimensional latent sp…
▽ More
We construct a physically-parameterized probabilistic autoencoder (PAE) to learn the intrinsic diversity of type Ia supernovae (SNe Ia) from a sparse set of spectral time series. The PAE is a two-stage generative model, composed of an Auto-Encoder (AE) which is interpreted probabilistically after training using a Normalizing Flow (NF). We demonstrate that the PAE learns a low-dimensional latent space that captures the nonlinear range of features that exists within the population, and can accurately model the spectral evolution of SNe Ia across the full range of wavelength and observation times directly from the data. By introducing a correlation penalty term and multi-stage training setup alongside our physically-parameterized network we show that intrinsic and extrinsic modes of variability can be separated during training, removing the need for the additional models to perform magnitude standardization. We then use our PAE in a number of downstream tasks on SNe Ia for increasingly precise cosmological analyses, including automatic detection of SN outliers, the generation of samples consistent with the data distribution, and solving the inverse problem in the presence of noisy and incomplete data to constrain cosmological distance measurements. We find that the optimal number of intrinsic model parameters appears to be three, in line with previous studies, and show that we can standardize our test sample of SNe Ia with an RMS of $0.091 \pm 0.010$ mag, which corresponds to $0.074 \pm 0.010$ mag if peculiar velocity contributions are removed. Trained models and codes are released at \href{https://github.com/georgestein/suPAErnova}{github.com/georgestein/suPAErnova}
△ Less
Submitted 15 July, 2022;
originally announced July 2022.
-
Evaluating and Optimizing a Slitless Prism for Nancy Grace Roman Space Telescope SN Cosmology
Authors:
David Rubin,
Greg Aldering,
Tri L. Astraatmadja,
Charlie Baltay,
Aleksandar Cikota,
Susana E. Deustua,
Sam Dixon,
Andrew Fruchter,
L. Galbany,
Rebekah Hounsell,
Saul Perlmutter,
Ben Rose
Abstract:
This work presents a set of studies addressing the use of the low-dispersion slitless prism on Roman for SN spectroscopy as part of the Roman High Latitude Time Domain Survey (HLTDS). We find SN spectral energy distributions including prism data carry more information than imaging alone at fixed total observing time, improving redshift measurements and sub-typing of SNe. The Roman field of view wi…
▽ More
This work presents a set of studies addressing the use of the low-dispersion slitless prism on Roman for SN spectroscopy as part of the Roman High Latitude Time Domain Survey (HLTDS). We find SN spectral energy distributions including prism data carry more information than imaging alone at fixed total observing time, improving redshift measurements and sub-typing of SNe. The Roman field of view will typically include ~ 10 SNe Ia at observable redshifts at a range of phases (the multiplexing of host galaxies is much greater as they are always present), building up SN spectral time series without targeted observations. We show that fitting these time series extracts more information than stacking the data over all the phases, resulting in a large improvement in precision for SN Ia subclassification measurements. A prism on Roman thus significantly enhances scientific opportunities for the mission, and is particularly important for the Roman SN cosmology program to provide the systematics-controlled measurement that is a focus of the Roman dark energy mission. Optimizing the prism parameters, we conclude that the blue cutoff should be set as blue as the prism image quality allows (~ 7500A), the red cutoff should be set to ~ 18000A to minimize thermal background, and the two-pixel dispersion should be >~ 70.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
The Importance of Background Information for Out of Distribution Generalization
Authors:
Jupinder Parmar,
Khaled Saab,
Brian Pogatchnik,
Daniel Rubin,
Christopher Ré
Abstract:
Domain generalization in medical image classification is an important problem for trustworthy machine learning to be deployed in healthcare. We find that existing approaches for domain generalization which utilize ground-truth abnormality segmentations to control feature attributions have poor out-of-distribution (OOD) performance relative to the standard baseline of empirical risk minimization (E…
▽ More
Domain generalization in medical image classification is an important problem for trustworthy machine learning to be deployed in healthcare. We find that existing approaches for domain generalization which utilize ground-truth abnormality segmentations to control feature attributions have poor out-of-distribution (OOD) performance relative to the standard baseline of empirical risk minimization (ERM). We investigate what regions of an image are important for medical image classification and show that parts of the background, that which is not contained in the abnormality segmentation, provides helpful signal. We then develop a new task-specific mask which covers all relevant regions. Utilizing this new segmentation mask significantly improves the performance of the existing methods on the OOD test sets. To obtain better generalization results than ERM, we find it necessary to scale up the training data size in addition to the usage of these task-specific masks.
△ Less
Submitted 17 June, 2022;
originally announced June 2022.
-
Label-Efficient Self-Supervised Federated Learning for Tackling Data Heterogeneity in Medical Imaging
Authors:
Rui Yan,
Liangqiong Qu,
Qingyue Wei,
Shih-Cheng Huang,
Liyue Shen,
Daniel Rubin,
Lei Xing,
Yuyin Zhou
Abstract:
The collection and curation of large-scale medical datasets from multiple institutions is essential for training accurate deep learning models, but privacy concerns often hinder data sharing. Federated learning (FL) is a promising solution that enables privacy-preserving collaborative learning among different institutions, but it generally suffers from performance deterioration due to heterogeneou…
▽ More
The collection and curation of large-scale medical datasets from multiple institutions is essential for training accurate deep learning models, but privacy concerns often hinder data sharing. Federated learning (FL) is a promising solution that enables privacy-preserving collaborative learning among different institutions, but it generally suffers from performance deterioration due to heterogeneous data distributions and a lack of quality labeled data. In this paper, we present a robust and label-efficient self-supervised FL framework for medical image analysis. Our method introduces a novel Transformer-based self-supervised pre-training paradigm that pre-trains models directly on decentralized target task datasets using masked image modeling, to facilitate more robust representation learning on heterogeneous data and effective knowledge transfer to downstream models. Extensive empirical results on simulated and real-world medical imaging non-IID federated datasets show that masked image modeling with Transformers significantly improves the robustness of models against various degrees of data heterogeneity. Notably, under severe data heterogeneity, our method, without relying on any additional pre-training data, achieves an improvement of 5.06%, 1.53% and 4.58% in test accuracy on retinal, dermatology and chest X-ray classification compared to the supervised baseline with ImageNet pre-training. In addition, we show that our federated self-supervised pre-training methods yield models that generalize better to out-of-distribution data and perform more effectively when fine-tuning with limited labeled data, compared to existing FL algorithms. The code is available at https://github.com/rui-yan/SSL-FL.
△ Less
Submitted 11 January, 2023; v1 submitted 17 May, 2022;
originally announced May 2022.
-
Masked Co-attentional Transformer reconstructs 100x ultra-fast/low-dose whole-body PET from longitudinal images and anatomically guided MRI
Authors:
Yan-Ran,
Wang,
Liangqiong Qu,
Natasha Diba Sheybani,
Xiaolong Luo,
Jiangshan Wang,
Kristina Elizabeth Hawk,
Ashok Joseph Theruvath,
Sergios Gatidis,
Xuerong Xiao,
Allison Pribnow,
Daniel Rubin,
Heike E. Daldrup-Link
Abstract:
Despite its tremendous value for the diagnosis, treatment monitoring and surveillance of children with cancer, whole body staging with positron emission tomography (PET) is time consuming and associated with considerable radiation exposure. 100x (1% of the standard clinical dosage) ultra-low-dose/ultra-fast whole-body PET reconstruction has the potential for cancer imaging with unprecedented speed…
▽ More
Despite its tremendous value for the diagnosis, treatment monitoring and surveillance of children with cancer, whole body staging with positron emission tomography (PET) is time consuming and associated with considerable radiation exposure. 100x (1% of the standard clinical dosage) ultra-low-dose/ultra-fast whole-body PET reconstruction has the potential for cancer imaging with unprecedented speed and improved safety, but it cannot be achieved by the naive use of machine learning techniques. In this study, we utilize the global similarity between baseline and follow-up PET and magnetic resonance (MR) images to develop Masked-LMCTrans, a longitudinal multi-modality co-attentional CNN-Transformer that provides interaction and joint reasoning between serial PET/MRs of the same patient. We mask the tumor area in the referenced baseline PET and reconstruct the follow-up PET scans. In this manner, Masked-LMCTrans reconstructs 100x almost-zero radio-exposure whole-body PET that was not possible before. The technique also opens a new pathway for longitudinal radiology imaging reconstruction, a significantly under-explored area to date. Our model was trained and tested with Stanford PET/MRI scans of pediatric lymphoma patients and evaluated externally on PET/MRI images from Tübingen University. The high image quality of the reconstructed 100x whole-body PET images resulting from the application of Masked-LMCTrans will substantially advance the development of safer imaging approaches and shorter exam-durations for pediatric patients, as well as expand the possibilities for frequent longitudinal monitoring of these patients by PET.
△ Less
Submitted 9 May, 2022;
originally announced May 2022.
-
Uniform Recalibration of Common Spectrophotometry Standard Stars onto the CALSPEC System using the SuperNova Integral Field Spectrograph
Authors:
David Rubin,
G. Aldering,
P. Antilogus,
C. Aragon,
S. Bailey,
C. Baltay,
S. Bongard,
K. Boone,
C. Buton,
Y. Copin,
S. Dixon,
D. Fouchez,
E. Gangler,
R. Gupta,
B. Hayden,
W. Hillebrandt,
A. G. Kim,
M. Kowalski,
D. Kuesters,
P. -F. Leget,
F. Mondon,
J. Nordin,
R. Pain,
E. Pecontal,
R. Pereira
, et al. (13 additional authors not shown)
Abstract:
We calibrate spectrophotometric optical spectra of 32 stars commonly used as standard stars, referenced to 14 stars already on the HST-based CALSPEC flux system. Observations of CALSPEC and non-CALSPEC stars were obtained with the SuperNova Integral Field Spectrograph over the wavelength range 3300 A to 9400 A as calibration for the Nearby Supernova Factory cosmology experiment. In total, this ana…
▽ More
We calibrate spectrophotometric optical spectra of 32 stars commonly used as standard stars, referenced to 14 stars already on the HST-based CALSPEC flux system. Observations of CALSPEC and non-CALSPEC stars were obtained with the SuperNova Integral Field Spectrograph over the wavelength range 3300 A to 9400 A as calibration for the Nearby Supernova Factory cosmology experiment. In total, this analysis used 4289 standard-star spectra taken on photometric nights. As a modern cosmology analysis, all pre-submission methodological decisions were made with the flux scale and external comparison results blinded. The large number of spectra per star allows us to treat the wavelength-by-wavelength calibration for all nights simultaneously with a Bayesian hierarchical model, thereby enabling a consistent treatment of the Type Ia supernova cosmology analysis and the calibration on which it critically relies. We determine the typical per-observation repeatability (median 14 mmag for exposures >~ 5 s), the Maunakea atmospheric transmission distribution (median dispersion of 7 mmag with uncertainty 1 mmag), and the scatter internal to our CALSPEC reference stars (median of 8 mmag). We also check our standards against literature filter photometry, finding generally good agreement over the full 12-magnitude range. Overall, the mean of our system is calibrated to the mean of CALSPEC at the level of ~ 3 mmag. With our large number of observations, careful crosschecks, and 14 reference stars, our results are the best calibration yet achieved with an integral-field spectrograph, and among the best calibrated surveys.
△ Less
Submitted 21 June, 2022; v1 submitted 2 May, 2022;
originally announced May 2022.
-
Multimodal spatiotemporal graph neural networks for improved prediction of 30-day all-cause hospital readmission
Authors:
Siyi Tang,
Amara Tariq,
Jared Dunnmon,
Umesh Sharma,
Praneetha Elugunti,
Daniel Rubin,
Bhavik N. Patel,
Imon Banerjee
Abstract:
Measures to predict 30-day readmission are considered an important quality factor for hospitals as accurate predictions can reduce the overall cost of care by identifying high risk patients before they are discharged. While recent deep learning-based studies have shown promising empirical results on readmission prediction, several limitations exist that may hinder widespread clinical utility, such…
▽ More
Measures to predict 30-day readmission are considered an important quality factor for hospitals as accurate predictions can reduce the overall cost of care by identifying high risk patients before they are discharged. While recent deep learning-based studies have shown promising empirical results on readmission prediction, several limitations exist that may hinder widespread clinical utility, such as (a) only patients with certain conditions are considered, (b) existing approaches do not leverage data temporality, (c) individual admissions are assumed independent of each other, which is unrealistic, (d) prior studies are usually limited to single source of data and single center data. To address these limitations, we propose a multimodal, modality-agnostic spatiotemporal graph neural network (MM-STGNN) for prediction of 30-day all-cause hospital readmission that fuses multimodal in-patient longitudinal data. By training and evaluating our methods using longitudinal chest radiographs and electronic health records from two independent centers, we demonstrate that MM-STGNN achieves AUROC of 0.79 on both primary and external datasets. Furthermore, MM-STGNN significantly outperforms the current clinical reference standard, LACE+ score (AUROC=0.61), on the primary dataset. For subset populations of patients with heart and vascular disease, our model also outperforms baselines on predicting 30-day readmission (e.g., 3.7 point improvement in AUROC in patients with heart disease). Lastly, qualitative model interpretability analysis indicates that while patients' primary diagnoses were not explicitly used to train the model, node features crucial for model prediction directly reflect patients' primary diagnoses. Importantly, our MM-STGNN is agnostic to node feature modalities and could be utilized to integrate multimodal data for triaging patients in various downstream resource allocation tasks.
△ Less
Submitted 14 April, 2022;
originally announced April 2022.
-
Configuration and Collection Factors for Side-Channel Disassembly
Authors:
Random Gwinn,
Mark Matties,
Aviel D. Rubin
Abstract:
Myriad uses, methodologies, and channels have been explored for side-channel analysis. However, specific implementation considerations are often unpublished. This paper explores select test configuration and collection parameters, such as input voltage, shunt resistance, sample rate, and microcontroller clock frequency, along with their impact on side-channel analysis performance. The analysis use…
▽ More
Myriad uses, methodologies, and channels have been explored for side-channel analysis. However, specific implementation considerations are often unpublished. This paper explores select test configuration and collection parameters, such as input voltage, shunt resistance, sample rate, and microcontroller clock frequency, along with their impact on side-channel analysis performance. The analysis use case considered is instruction disassembly and classification using the microcontroller power side-channel. An ATmega328P microcontroller and a subset of the AVR instruction set are used in the experiments as the Device Under Test (DUT). A time-series convolutional neural network (CNN) is used to evaluate classification performance at clock-cycle fidelity. We conclude that configuration and collection parameters have a meaningful impact on performance, especially where the instruction-trace's signal to noise ratio (SNR) is impacted. Additionally, data collection and analysis well above the Nyquist rate is required for side-channel disassembly. We also found that 7V input voltage with 1 kiloohm shunt and a sample rate of 250-500 MSa/s provided optimal performance in our application, with diminishing returns or in some cases degradation at higher levels.
△ Less
Submitted 10 April, 2022;
originally announced April 2022.
-
Supervised Machine Learning Algorithm for Detecting Consistency between Reported Findings and the Conclusions of Mammography Reports
Authors:
Alexander Berdichevsky,
Mor Peleg,
Daniel L. Rubin
Abstract:
Objective. Mammography reports document the diagnosis of patients' conditions. However, many reports contain non-standard terms (non-BI-RADS descriptors) and incomplete statements, which can lead to conclusions that are not well-supported by the reported findings. Our aim was to develop a tool to detect such discrepancies by comparing the reported conclusions to those that would be expected based…
▽ More
Objective. Mammography reports document the diagnosis of patients' conditions. However, many reports contain non-standard terms (non-BI-RADS descriptors) and incomplete statements, which can lead to conclusions that are not well-supported by the reported findings. Our aim was to develop a tool to detect such discrepancies by comparing the reported conclusions to those that would be expected based on the reported radiology findings. Materials and Methods. A deidentified data set from an academic hospital containing 258 mammography reports supplemented by 120 reports found on the web was used for training and evaluation. Spell checking and term normalization was used to unambiguously determine the reported BI-RADS descriptors. The resulting data were input into seven classifiers that classify mammography reports, based on their Findings sections, into seven BI-RADS final assessment categories. Finally, the semantic similarity score of a report to each BI-RADS category is reported. Results. Our term normalization algorithm correctly identified 97% of the BI-RADS descriptors in mammography reports. Our system provided 76% precision and 83% recall in correctly classifying the reports according to BI-RADS final assessment category. Discussion. The strength of our approach relies on providing high importance to BI-RADS terms in the summarization phase, on the semantic similarity that considers the complex data representation, and on the classification into all seven BI-RADs categories. Conclusion. BI-RADS descriptors and expected final assessment categories could be automatically detected by our approach with fairly good accuracy, which could be used to make users aware that their reported findings do not match well with their conclusion.
△ Less
Submitted 28 February, 2022;
originally announced February 2022.
-
Co-occurring Diseases Heavily Influence the Performance of Weakly Supervised Learning Models for Classification of Chest CT
Authors:
Fakrul Islam Tushar,
Vincent M. D'Anniballe,
Geoffrey D. Rubin,
Ehsan Samei,
Joseph Y. Lo
Abstract:
Despite the potential of weakly supervised learning to automatically annotate massive amounts of data, little is known about its limitations for use in computer-aided diagnosis (CAD). For CT specifically, interpreting the performance of CAD algorithms can be challenging given the large number of co-occurring diseases. This paper examines the effect of co-occurring diseases when training classifica…
▽ More
Despite the potential of weakly supervised learning to automatically annotate massive amounts of data, little is known about its limitations for use in computer-aided diagnosis (CAD). For CT specifically, interpreting the performance of CAD algorithms can be challenging given the large number of co-occurring diseases. This paper examines the effect of co-occurring diseases when training classification models by weakly supervised learning, specifically by comparing multi-label and multiple binary classifiers using the same training data. Our results demonstrated that the binary model outperformed the multi-label classification in every disease category in terms of AUC. However, this performance was heavily influenced by co-occurring diseases in the binary model, suggesting it did not always learn the correct appearance of the specific disease. For example, binary classification of lung nodules resulted in an AUC of < 0.65 when there were no other co-occurring diseases, but when lung nodules co-occurred with emphysema, the performance reached AUC > 0.80. We hope this paper revealed the complexity of interpreting disease classification performance in weakly supervised models and will encourage researchers to examine the effect of co-occurring diseases on classification performance in the future.
△ Less
Submitted 23 February, 2022;
originally announced February 2022.
-
GIGA-Lens: Fast Bayesian Inference for Strong Gravitational Lens Modeling
Authors:
A. Gu,
X. Huang,
W. Sheu,
G. Aldering,
A. S. Bolton,
K. Boone,
A. Dey,
A. Filipp,
E. Jullo,
S. Perlmutter,
D. Rubin,
E. F. Schlafly,
D. J. Schlegel,
Y. Shu,
S. H. Suyu
Abstract:
We present GIGA-Lens: a gradient-informed, GPU-accelerated Bayesian framework for modeling strong gravitational lensing systems, implemented in TensorFlow and JAX. The three components, optimization using multi-start gradient descent, posterior covariance estimation with variational inference, and sampling via Hamiltonian Monte Carlo, all take advantage of gradient information through automatic di…
▽ More
We present GIGA-Lens: a gradient-informed, GPU-accelerated Bayesian framework for modeling strong gravitational lensing systems, implemented in TensorFlow and JAX. The three components, optimization using multi-start gradient descent, posterior covariance estimation with variational inference, and sampling via Hamiltonian Monte Carlo, all take advantage of gradient information through automatic differentiation and massive parallelization on graphics processing units (GPUs). We test our pipeline on a large set of simulated systems and demonstrate in detail its high level of performance. The average time to model a single system on four Nvidia A100 GPUs is 105 seconds. The robustness, speed, and scalability offered by this framework make it possible to model the large number of strong lenses found in current surveys and present a very promising prospect for the modeling of $\mathcal{O}(10^5)$ lensing systems expected to be discovered in the era of the Vera C. Rubin Observatory, Euclid, and the Nancy Grace Roman Space Telescope.
△ Less
Submitted 15 February, 2022;
originally announced February 2022.
-
Causal inference from treatment-control studies having an additional factor with unknown assignment mechanism
Authors:
Nicole E. Pashley,
Kristen B. Hunter,
Katy McKeough,
Donald B. Rubin,
Tirthankar Dasgupta
Abstract:
Consider a situation with two treatments, the first of which is randomized but the second is not, and the multifactor version of this. Interest is in treatment effects, defined using standard factorial notation. We define estimators for the treatment effects and explore their properties when there is information about the nonrandomized treatment assignment and when there is no information on the a…
▽ More
Consider a situation with two treatments, the first of which is randomized but the second is not, and the multifactor version of this. Interest is in treatment effects, defined using standard factorial notation. We define estimators for the treatment effects and explore their properties when there is information about the nonrandomized treatment assignment and when there is no information on the assignment of the nonrandomized treatment. We show when and how hidden treatments can bias estimators and inflate their sampling variances.
△ Less
Submitted 7 February, 2022;
originally announced February 2022.
-
Automated Detection of Patients in Hospital Video Recordings
Authors:
Siddharth Sharma,
Florian Dubost,
Christopher Lee-Messer,
Daniel Rubin
Abstract:
In a clinical setting, epilepsy patients are monitored via video electroencephalogram (EEG) tests. A video EEG records what the patient experiences on videotape while an EEG device records their brainwaves. Currently, there are no existing automated methods for tracking the patient's location during a seizure, and video recordings of hospital patients are substantially different from publicly avai…
▽ More
In a clinical setting, epilepsy patients are monitored via video electroencephalogram (EEG) tests. A video EEG records what the patient experiences on videotape while an EEG device records their brainwaves. Currently, there are no existing automated methods for tracking the patient's location during a seizure, and video recordings of hospital patients are substantially different from publicly available video benchmark datasets. For example, the camera angle can be unusual, and patients can be partially covered with bedding sheets and electrode sets. Being able to track a patient in real-time with video EEG would be a promising innovation towards improving the quality of healthcare. Specifically, an automated patient detection system could supplement clinical oversight and reduce the resource-intensive efforts of nurses and doctors who need to continuously monitor patients. We evaluate an ImageNet pre-trained Mask R-CNN, a standard deep learning model for object detection, on the task of patient detection using our own curated dataset of 45 videos of hospital patients. The dataset was aggregated and curated for this work. We show that without fine-tuning, ImageNet pre-trained Mask R-CNN models perform poorly on such data. By fine-tuning the models with a subset of our dataset, we observe a substantial improvement in patient detection performance, with a mean average precision of 0.64. We show that the results vary substantially depending on the video clip.
△ Less
Submitted 28 November, 2021;
originally announced November 2021.
-
RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR
Authors:
Yuyin Zhou,
Shih-Cheng Huang,
Jason Alan Fries,
Alaa Youssef,
Timothy J. Amrhein,
Marcello Chang,
Imon Banerjee,
Daniel Rubin,
Lei Xing,
Nigam Shah,
Matthew P. Lungren
Abstract:
Despite the routine use of electronic health record (EHR) data by radiologists to contextualize clinical history and inform image interpretation, the majority of deep learning architectures for medical imaging are unimodal, i.e., they only learn features from pixel-level information. Recent research revealing how race can be recovered from pixel data alone highlights the potential for serious bias…
▽ More
Despite the routine use of electronic health record (EHR) data by radiologists to contextualize clinical history and inform image interpretation, the majority of deep learning architectures for medical imaging are unimodal, i.e., they only learn features from pixel-level information. Recent research revealing how race can be recovered from pixel data alone highlights the potential for serious biases in models which fail to account for demographics and other key patient attributes. Yet the lack of imaging datasets which capture clinical context, inclusive of demographics and longitudinal medical history, has left multimodal medical imaging underexplored. To better assess these challenges, we present RadFusion, a multimodal, benchmark dataset of 1794 patients with corresponding EHR data and high-resolution computed tomography (CT) scans labeled for pulmonary embolism. We evaluate several representative multimodal fusion models and benchmark their fairness properties across protected subgroups, e.g., gender, race/ethnicity, age. Our results suggest that integrating imaging and EHR data can improve classification performance and robustness without introducing large disparities in the true positive rate between population groups.
△ Less
Submitted 26 November, 2021; v1 submitted 23 November, 2021;
originally announced November 2021.
-
Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence
Authors:
Xiang Bai,
Hanchen Wang,
Liya Ma,
Yongchao Xu,
Jiefeng Gan,
Ziwei Fan,
Fan Yang,
Ke Ma,
Jiehua Yang,
Song Bai,
Chang Shu,
Xinyu Zou,
Renhao Huang,
Changzheng Zhang,
Xiaowu Liu,
Dandan Tu,
Chuou Xu,
Wenqing Zhang,
Xi Wang,
Anguo Chen,
Yu Zeng,
Dehua Yang,
Ming-Wei Wang,
Nagaraj Holalkere,
Neil J. Halin
, et al. (21 additional authors not shown)
Abstract:
Artificial intelligence (AI) provides a promising substitution for streamlining COVID-19 diagnoses. However, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalised model in clinical practices. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI),…
▽ More
Artificial intelligence (AI) provides a promising substitution for streamlining COVID-19 diagnoses. However, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalised model in clinical practices. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI), where the AI model can be distributedly trained and independently executed at each host institution under a federated learning framework (FL) without data sharing. Here we show that our FL model outperformed all the local models by a large yield (test sensitivity /specificity in China: 0.973/0.951, in the UK: 0.730/0.942), achieving comparable performance with a panel of professional radiologists. We further evaluated the model on the hold-out (collected from another two hospitals leaving out the FL) and heterogeneous (acquired with contrast materials) data, provided visual explanations for decisions made by the model, and analysed the trade-offs between the model performance and the communication costs in the federated training process. Our study is based on 9,573 chest computed tomography scans (CTs) from 3,336 patients collected from 23 hospitals located in China and the UK. Collectively, our work advanced the prospects of utilising federated learning for privacy-preserving AI in digital health.
△ Less
Submitted 17 November, 2021;
originally announced November 2021.
-
Efficient Neuromorphic Signal Processing with Loihi 2
Authors:
Garrick Orchard,
E. Paxon Frady,
Daniel Ben Dayan Rubin,
Sophia Sanborn,
Sumit Bam Shrestha,
Friedrich T. Sommer,
Mike Davies
Abstract:
The biologically inspired spiking neurons used in neuromorphic computing are nonlinear filters with dynamic state variables -- very different from the stateless neuron models used in deep learning. The next version of Intel's neuromorphic research processor, Loihi 2, supports a wide range of stateful spiking neuron models with fully programmable dynamics. Here we showcase advanced spiking neuron m…
▽ More
The biologically inspired spiking neurons used in neuromorphic computing are nonlinear filters with dynamic state variables -- very different from the stateless neuron models used in deep learning. The next version of Intel's neuromorphic research processor, Loihi 2, supports a wide range of stateful spiking neuron models with fully programmable dynamics. Here we showcase advanced spiking neuron models that can be used to efficiently process streaming data in simulation experiments on emulated Loihi 2 hardware. In one example, Resonate-and-Fire (RF) neurons are used to compute the Short Time Fourier Transform (STFT) with similar computational complexity but 47x less output bandwidth than the conventional STFT. In another example, we describe an algorithm for optical flow estimation using spatiotemporal RF neurons that requires over 90x fewer operations than a conventional DNN-based solution. We also demonstrate promising preliminary results using backpropagation to train RF neurons for audio classification tasks. Finally, we show that a cascade of Hopf resonators - a variant of the RF neuron - replicates novel properties of the cochlea and motivates an efficient spike-based spectrogram encoder.
△ Less
Submitted 5 November, 2021;
originally announced November 2021.
-
A Reference Survey for Supernova Cosmology with the Nancy Grace Roman Space Telescope
Authors:
B. M. Rose,
C. Baltay,
R. Hounsell,
P. Macias,
D. Rubin,
D. Scolnic,
G. Aldering,
R. Bohlin,
M. Dai,
S. E. Deustua,
R. J. Foley,
A. Fruchter,
L. Galbany,
S. W. Jha,
D. O. Jones,
B. A. Joshi,
P. L. Kelly,
R. Kessler,
R. P. Kirshner,
K. S. Mandel,
S. Perlmutter,
J. Pierel,
H. Qu,
D. Rabinowitz,
A. Rest
, et al. (11 additional authors not shown)
Abstract:
This note presents an initial survey design for the Nancy Grace Roman High-latitude Time Domain Survey. This is not meant to be a final or exhaustive list of all the survey strategy choices, but instead presents a viable path towards achieving the desired precision and accuracy of dark energy measurements using Type Ia supernovae (SNe Ia). We describe a survey strategy that use six filters (RZYJH…
▽ More
This note presents an initial survey design for the Nancy Grace Roman High-latitude Time Domain Survey. This is not meant to be a final or exhaustive list of all the survey strategy choices, but instead presents a viable path towards achieving the desired precision and accuracy of dark energy measurements using Type Ia supernovae (SNe Ia). We describe a survey strategy that use six filters (RZYJH and F) and the prism on the Roman Wide Field Instrument. This survey has two tiers, one "wide" which targets SNe Ia at redshifts up to 1 and one "deep" targeting redshifts up to 1.7; for each, four filters are used (with Y and J used in both tiers). We propose one field each in the north and south continuous viewing zones, and expect to obtain high-quality distances of $\sim$12,000 SNe Ia with $\sim$5,000 at z > 1. We propose a wide-tier area of $\sim$19 deg$^2$ and a deep tier of $\sim$5 deg$^2$. Exposure times range from 100 s to 900 s for imaging and 900 s to 3600 s for the prism. These exposure times would reach $\sim$25.5 mag and $\sim$26.5 mag for the wide and deep tiers respectively, with deep co-add stacks reaching $\sim$28 mag and $\sim$29 mag. The total survey spans two years, with a total allocation time of six months, and a cadence of $\sim$5 days.
△ Less
Submitted 4 November, 2021;
originally announced November 2021.