-
Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU Transformers
Authors:
Cody Wild,
Jesper Anderson
Abstract:
Previous work has demonstrated that MLPs within ReLU Transformers exhibit high levels of sparsity, with many of their activations equal to zero for any given token. We build on that work to more deeply explore how token-level sparsity evolves over the course of training, and how it connects to broader sparsity patterns over the course of a sequence or batch, demonstrating that the different layers within small transformers exhibit distinctly layer-specific patterns on both of these fronts. In particular, we demonstrate that the first and last layer of the network have distinctive and in many ways inverted relationships to sparsity, and explore implications for the structure of feature representations being learned at different depths of the model. We additionally explore the phenomenon of ReLU dimensions "turning off", and show evidence suggesting that "neuron death" is being primarily driven by the dynamics of training, rather than simply occurring randomly or accidentally as a result of outliers.
Submitted 10 July, 2024;
originally announced July 2024.
-
Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control
Authors:
Bruce D. Lee,
Leonardo F. Toso,
Thomas T. Zhang,
James Anderson,
Nikolai Matni
Abstract:
Representation learning is a powerful tool that enables learning over large multitudes of agents or domains by enforcing that all agents operate on a shared set of learned features. However, many robotics or controls applications that would benefit from collaboration operate in settings with changing environments and goals, whereas most guarantees for representation learning are stated for static settings. Toward rigorously establishing the benefit of representation learning in dynamic settings, we analyze the regret of multi-task representation learning for linear-quadratic control. This setting introduces unique challenges. Firstly, we must account for and balance the $\textit{misspecification}$ introduced by an approximate representation. Secondly, we cannot rely on the parameter update schemes of single-task online LQR, for which least-squares often suffices, and must devise a novel scheme to ensure sufficient improvement. We demonstrate that for settings where exploration is "benign", the regret of any agent after $T$ timesteps scales as $\tilde O(\sqrt{T/H})$, where $H$ is the number of agents. In settings with "difficult" exploration, the regret scales as $\tilde{\mathcal O}(\sqrt{d_u d_θ} \sqrt{T} + T^{3/4}/H^{1/5})$, where $d_x$ is the state-space dimension, $d_u$ is the input dimension, and $d_θ$ is the task-specific parameter count. In both cases, by comparing to the minimax single-task regret $\tilde{\mathcal O}(\sqrt{d_x d_u^2}\sqrt{T})$, we see a benefit of a large number of agents. Notably, in the difficult exploration case, by sharing a representation across tasks, the effective task-specific parameter count can often be small $d_θ< d_x d_u$. Lastly, we provide numerical validation of the trends we predict.
Submitted 8 July, 2024;
originally announced July 2024.
-
Geophysical Observations of the 24 September 2023 OSIRIS-REx Sample Return Capsule Re-Entry
Authors:
Elizabeth A. Silber,
Daniel C. Bowman,
Chris G. Carr,
David P. Eisenberg,
Brian R. Elbing,
Benjamin Fernando,
Milton A. Garcés,
Robert Haaser,
Siddharth Krishnamoorthy,
Charles A. Langston,
Yasuhiro Nishikawa,
Jeremy Webster,
Jacob F. Anderson,
Stephen Arrowsmith,
Sonia Bazargan,
Luke Beardslee,
Brant Beck,
Jordan W. Bishop,
Philip Blom,
Grant Bracht,
David L. Chichester,
Anthony Christe,
Kenneth Cummins,
James Cutts,
Lisa Danielson
, et al. (57 additional authors not shown)
Abstract:
Sample Return Capsules (SRCs) entering Earth's atmosphere at hypervelocity from interplanetary space are a valuable resource for studying meteor phenomena. The 24 September 2023 arrival of the OSIRIS-REx (Origins, Spectral Interpretation, Resource Identification, and Security-Regolith Explorer) SRC provided an unprecedented chance for geophysical observations of a well-characterized source with known parameters, including timing and trajectory. A collaborative effort involving researchers from 16 institutions executed a carefully planned geophysical observational campaign at strategically chosen locations, deploying over 400 ground-based sensors encompassing infrasound, seismic, distributed acoustic sensing (DAS), and GPS technologies. Additionally, balloons equipped with infrasound sensors were launched to capture signals at higher altitudes. This campaign (the largest of its kind so far) yielded a wealth of invaluable data anticipated to fuel scientific inquiry for years to come. The success of the observational campaign is evidenced by the near-universal detection of signals across instruments, both proximal and distal. This paper presents a comprehensive overview of the collective scientific effort, field deployment, and preliminary findings. The early findings have the potential to inform future space missions and terrestrial campaigns, contributing to our understanding of meteoroid interactions with planetary atmospheres. Furthermore, the dataset collected during this campaign will improve entry and propagation models as well as augment the study of atmospheric dynamics and shock phenomena generated by meteoroids and similar sources.
Submitted 2 July, 2024;
originally announced July 2024.
-
SoK: Web Authentication in the Age of End-to-End Encryption
Authors:
Jenny Blessing,
Daniel Hugenroth,
Ross J. Anderson,
Alastair R. Beresford
Abstract:
The advent of end-to-end encrypted (E2EE) messaging and backup services has brought new challenges for usable authentication. Compared to regular web services, the nature of E2EE implies that the provider cannot recover data for users who have forgotten passwords or lost devices. Therefore, new forms of robustness and recoverability are required, leading to a plethora of solutions ranging from randomly-generated recovery codes to threshold-based social verification. These implications also spread to new forms of authentication and legacy web services: passwordless authentication ("passkeys") has become a promising candidate to replace passwords altogether, but passkeys are inherently device-bound. However, users expect that they can log in from multiple devices and recover their passwords in case of device loss--prompting providers to sync credentials to cloud storage using E2EE, resulting in the very same authentication challenges of regular E2EE services. Hence, E2EE authentication quickly becomes relevant not only for a niche group of dedicated E2EE enthusiasts but for the general public using the passwordless authentication techniques promoted by their device vendors. In this paper we systematize existing research literature and industry practice relating to security, privacy, usability, and recoverability of E2EE authentication. We investigate authentication and recovery schemes in all widely-used E2EE web services and survey passwordless authentication deployment in the top-200 most popular websites. Finally, we present concrete research directions based on observed gaps between industry deployment and academic literature.
Submitted 26 June, 2024;
originally announced June 2024.
-
Design and Validation of a Cold Load for Characterization of CMB-S4 Detectors
Authors:
Cesiley L. King,
Ian Gullet,
Adam J. Anderson,
Bradford A. Benson,
Rick Bihary,
Haichen Fan,
Johanna M. Nagy,
Hogan Nguyen,
John E. Ruhl,
Sara M. Simon
Abstract:
We present the design and validation of a variable temperature cryogenic blackbody source, hereinafter called a cold load, that will be used to characterize detectors to be deployed by CMB-S4, the next-generation ground-based cosmic microwave background (CMB) experiment. Although cold loads have been used for detector characterization by previous CMB experiments, this cold load has three novel design features: (1) the ability to operate from the 1 K stage of a dilution refrigerator (DR), (2) a 3He gas-gap heat switch to reduce cooling time, and (3) the ability to couple small external optical signals to measure detector optical time constants under low optical loading. The efficacy of this design was validated using a 150 GHz detector array previously deployed by the Spider experiment. Thermal tests showed that the cold load can be heated to temperatures required for characterizing CMB-S4's detectors without significantly impacting the temperatures of other cryogenic stages when mounted to the DR's 1 K stage. Additionally, optical tests demonstrated that external signals can be coupled to a detector array through the cold load without imparting a significant optical load on the detectors, which will enable measurements of the CMB-S4 detectors' optical time constants.
Submitted 25 June, 2024;
originally announced June 2024.
-
Chirality Effects in Molecular Chainmail
Authors:
Alexander R. Klotz,
Caleb J. Anderson,
Michael S. Dimitriyev
Abstract:
Motivated by the observation of positive Gaussian curvature in kinetoplast DNA networks, we consider the effect of linking chirality in square lattice molecular chainmail networks using Langevin dynamics simulations and constrained gradient optimization. Linking chirality here refers to ordering of over-under versus under-over linkages between a loop and its neighbors. We consider fully alternating linking, maximally non-alternating, and partially non-alternating linking chiralities. We find that in simulations of polymer chainmail networks, the linking chirality dictates the sign of the Gaussian curvature of the final state of the chainmail membranes. Alternating networks have positive Gaussian curvature, similar to what is observed in kinetoplast DNA networks. Maximally non-alternating networks form isotropic membranes with negative Gaussian curvature. Partially non-alternating networks form flat diamond-shaped sheets which undergo a thermal folding transition when sufficiently large, similar to the crumpling transition in tethered membranes. We further investigate this topology-curvature relationship on geometric grounds by considering the tightest possible configurations and the constraints that must be satisfied to achieve them.
Submitted 19 June, 2024;
originally announced June 2024.
-
The Design, Implementation, and Performance of the LZ Calibration Systems
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (179 additional authors not shown)
Abstract:
LUX-ZEPLIN (LZ) is a tonne-scale experiment searching for direct dark matter interactions and other rare events. It is located at the Sanford Underground Research Facility (SURF) in Lead, South Dakota, USA. The core of the LZ detector is a dual-phase xenon time projection chamber (TPC), designed with the primary goal of detecting Weakly Interacting Massive Particles (WIMPs) via their induced low energy nuclear recoils. Surrounding the TPC, two veto detectors immersed in an ultra-pure water tank enable reducing background events to enhance the discovery potential. Intricate calibration systems are purposely designed to precisely understand the responses of these three detector volumes to various types of particle interactions and to demonstrate LZ's ability to discriminate between signals and backgrounds. In this paper, we present a comprehensive discussion of the key features, requirements, and performance of the LZ calibration systems, which play a crucial role in enabling LZ's WIMP-search and its broad science program. The thorough description of these calibration systems, with an emphasis on their novel aspects, is valuable for future calibration efforts in direct dark matter and other rare-event search experiments.
Submitted 20 June, 2024; v1 submitted 2 May, 2024;
originally announced June 2024.
-
Discovery and Extensive Follow-Up of SN 2024ggi, a nearby type IIP supernova in NGC 3621
Authors:
Ting-Wan Chen,
Sheng Yang,
Shubham Srivastav,
Takashi J. Moriya,
Stephen J. Smartt,
Sofia Rest,
Armin Rest,
Hsing Wen Lin,
Hao-Yu Miao,
Yu-Chi Cheng,
Amar Aryan,
Chia-Yu Cheng,
Morgan Fraser,
Li-Ching Huang,
Meng-Han Lee,
Cheng-Han Lai,
Yu Hsuan Liu,
Aiswarya Sankar. K,
Ken W. Smith,
Heloise F. Stevance,
Ze-Ning Wang,
Joseph P. Anderson,
Charlotte R. Angus,
Thomas de Boer,
Kenneth Chambers
, et al. (23 additional authors not shown)
Abstract:
We present the discovery and early observations of the nearby Type II supernova (SN) 2024ggi in NGC 3621 at 6.64 +/- 0.3 Mpc. The SN was caught 5.8 (+1.9 -2.9) hours after its explosion by the ATLAS survey. Early-phase, high-cadence, and multi-band photometric follow-up was performed by the Kinder (Kilonova Finder) project, collecting over 1000 photometric data points within a week. The combined o- and r-band light curves show a rapid rise of 3.3 magnitudes in 13.7 hours, much faster than SN 2023ixf (another recent, nearby, and well-observed SN II). Between 13.8 and 18.8 hours after explosion SN 2024ggi became bluer, with u-g colour dropping from 0.53 to 0.15 mag. The rapid blueward evolution indicates a wind shock breakout (SBO) scenario. No hour-long brightening expected for the SBO from a bare stellar surface was detected during our observations. The classification spectrum, taken 17 hours after the SN explosion, shows flash features of high-ionization species such as Balmer lines, He I, C III, and N III. Detailed light curve modeling reveals critical insights into the properties of the circumstellar material (CSM). Our favoured model has an explosion energy of 2 x 10^51 erg, a mass-loss rate of 10^-3 solar_mass/yr (with an assumed 10 km/s wind), and a confined CSM radius of 6 x 10^14 cm. The corresponding CSM mass is 0.4 solar_mass. Comparisons with SN 2023ixf highlight that SN 2024ggi has a smaller CSM density, resulting in a faster rise and fainter UV flux. The extensive dataset and the involvement of citizen astronomers underscore that a collaborative network is essential for SBO searches, leading to more precise and comprehensive SN characterizations.
Submitted 13 June, 2024;
originally announced June 2024.
-
FireBench: A High-fidelity Ensemble Simulation Framework for Exploring Wildfire Behavior and Data-driven Modeling
Authors:
Qing Wang,
Matthias Ihme,
Cenk Gazen,
Yi-Fan Chen,
John Anderson
Abstract:
Background. Wildfire research uses ensemble methods to analyze fire behaviors and assess uncertainties. Nonetheless, current research methods are either confined to simple models or complex simulations with limits. Modern computing tools could allow for efficient, high-fidelity ensemble simulations. Aims. This study proposes a high-fidelity ensemble wildfire simulation framework for studying wildfire behavior, ML tasks, fire-risk assessment, and uncertainty analysis. Methods. In this research, we present a simulation framework that integrates the Swirl-Fire large-eddy simulation tool for wildfire predictions with the Vizier optimization platform for automated run-time management of ensemble simulations and large-scale batch processing. All simulations are executed on tensor-processing units to enhance computational efficiency. Key results. A dataset of 117 simulations is created, each with 1.35 billion mesh points. The simulations are compared to existing experimental data and show good agreement in terms of fire rate of spread. Computations are done for fire acceleration, mean rate of spread, and fireline intensity. Conclusions. Strong coupling between these two parameters is observed for the fire spread and intermittency. A critical Froude number that delineates fires from plume-driven to convection-driven is identified and confirmed with literature observations. Implications. The ensemble simulation framework is efficient in facilitating parametric wildfire studies.
Submitted 12 June, 2024;
originally announced June 2024.
-
Probing the Scalar WIMP-Pion Coupling with the first LUX-ZEPLIN data
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. J. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (178 additional authors not shown)
Abstract:
Weakly interacting massive particles (WIMPs) may interact with a virtual pion that is exchanged between nucleons. This interaction channel is important to consider in models where the spin-independent isoscalar channel is suppressed. Using data from the first science run of the LUX-ZEPLIN dark matter experiment, containing 60 live days of data in a 5.5 tonne fiducial mass of liquid xenon, we report the results on a search for WIMP-pion interactions. We observe no significant excess and set an upper limit of $1.5\times10^{-46}$ cm$^2$ at a 90% confidence level for a WIMP mass of 33 GeV/c$^2$ for this interaction.
Submitted 4 June, 2024;
originally announced June 2024.
-
Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous Environments
Authors:
Han Wang,
Sihong He,
Zhili Zhang,
Fei Miao,
James Anderson
Abstract:
We explore a Federated Reinforcement Learning (FRL) problem where $N$ agents collaboratively learn a common policy without sharing their trajectory data. To date, existing FRL work has primarily focused on agents operating in the same or "similar" environments. In contrast, our problem setup allows for arbitrarily large levels of environment heterogeneity. To obtain the optimal policy which maximizes the average performance across all potentially completely different environments, we propose two algorithms: FedSVRPG-M and FedHAPG-M. In contrast to existing results, we demonstrate that both FedSVRPG-M and FedHAPG-M, both of which leverage momentum mechanisms, can exactly converge to a stationary point of the average performance function, regardless of the magnitude of environment heterogeneity. Furthermore, by incorporating the benefits of variance-reduction techniques or Hessian approximation, both algorithms achieve state-of-the-art convergence results, characterized by a sample complexity of $\mathcal{O}\left(ε^{-\frac{3}{2}}/N\right)$. Notably, our algorithms enjoy linear convergence speedups with respect to the number of agents, highlighting the benefit of collaboration among agents in finding a common policy.
Submitted 29 May, 2024;
originally announced May 2024.
-
The JWST Resolved Stellar Populations Early Release Science Program VII. Stress Testing the NIRCam Exposure Time Calculator
Authors:
A. Savino,
M. Gennaro,
A. E. Dolphin,
D. R. Weisz,
M. Correnti,
J. Anderson,
R. Beaton,
M. L. Boyer,
R. E. Cohen,
A. A. Cole,
M. J. Durbin,
C. T. Garling,
M. C. Geha,
K. M. Gilbert,
J. Kalirai,
N. Kallivayalil,
K. B. W. McQuinn,
M. J. B. Newman,
H. Richstein,
E. D. Skillman,
J. T. Warfield,
B. F. Williams
Abstract:
We empirically assess estimates from v3.0 of the JWST NIRCam Exposure Time Calculator (ETC) using observations of resolved stars in Local Group targets taken as part of the Resolved Stellar Populations Early Release Science (ERS) Program. For bright stars, we find that: (i) purely Poissonian estimates of the signal-to-noise ratio (SNR) are in good agreement between the ETC and observations, but non-ideal effects (e.g., flat field uncertainties) are the current limiting factor in the photometric precision that can be achieved; (ii) source position offsets, relative to the detector pixels, have a large impact on the ETC saturation predictions and introducing sub-pixel dithers in the observation design can improve the saturation limits by up to ~1 mag. For faint stars, for which the sky dominates the error budget, we find that the choice in ETC extraction strategy (e.g., aperture size relative to point spread function size) can affect the exposure time estimates by up to a factor of 5. We provide guidelines for configuring the ETC aperture photometry to produce SNR predictions in line with the ERS data. Finally, we quantify the effects of crowding on the SNRs over a large dynamic range in stellar density and provide guidelines for approximating the effects of crowding on SNRs predicted by the ETC.
Submitted 27 May, 2024;
originally announced May 2024.
-
The Data Acquisition System of the LZ Dark Matter Detector: FADR
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (190 additional authors not shown)
Abstract:
The Data Acquisition System (DAQ) for the LUX-ZEPLIN (LZ) dark matter detector is described. The signals from 745 PMTs, distributed across three subsystems, are sampled with 100-MHz 32-channel digitizers (DDC-32s). A basic waveform analysis is carried out on the on-board Field Programmable Gate Arrays (FPGAs) to extract information about the observed scintillation and electroluminescence signals. This information is used to determine if the digitized waveforms should be preserved for offline analysis.
The system is designed around the Kintex-7 FPGA. In addition to digitizing the PMT signals and providing basic event selection in real time, the flexibility provided by the use of FPGAs allows us to monitor the performance of the detector and the DAQ in parallel to normal data acquisition.
The hardware and software/firmware of this FPGA-based Architecture for Data acquisition and Realtime monitoring (FADR) are discussed and performance measurements are described.
Submitted 23 May, 2024;
originally announced May 2024.
-
SN 2023zaw: the low-energy explosion of an ultra-stripped star, with non-radioactive heating
Authors:
Thomas Moore,
James Gillanders,
Matt Nicholl,
Mark Huber,
Stephen Smartt,
Shubham Srivastav,
Heloise Stevance,
Ting-Wan Chen,
Kenneth Chambers,
Joseph Anderson,
Michael Fulton,
Samantha Oates,
Charlotte Angus,
Giuliano Pignata,
Nicolas Erasmus,
Hua Gao,
Joanna Bulger,
Chien-Cheng Lin,
Thomas Lowe,
Eugene Magnier,
Paloma Minguez,
Chow-Choong Ngeow,
Xinyue Sheng,
Stuart A. Sim,
Ken Smith
, et al. (4 additional authors not shown)
Abstract:
Most stripped envelope supernova progenitors are formed through binary interaction, losing hydrogen and/or helium from their outer layers. An emerging class of supernovae with the highest degree of envelope-stripping is thought to be the product of stripping by a neutron star (NS) companion. However, relatively few examples are known and the outcomes of such systems can be diverse and are poorly understood at present. Here, we present spectroscopic observations and high cadence multi-band photometry of SN 2023zaw, a low-ejecta-mass, rapidly evolving supernova. SN 2023zaw was discovered in a nearby spiral galaxy at D = 39.7 Mpc, with significant Milky Way extinction, $E(B-V) = 0.21$, and significant (but uncertain) host extinction. Bayesian evidence comparison reveals that nickel is not the only power source and an additional energy source is required to explain our observations. Our models suggest an ejecta mass of $M_{\rm ej} \sim 0.07\,\rm M_\odot$ and a synthesised nickel mass of $M_{\rm Ni} \sim 0.007\,\rm M_\odot$ are required to explain the explosion. However, additional heating from a magnetar or interaction with circumstellar material is required to power the early light curve.
Submitted 22 May, 2024;
originally announced May 2024.
-
Data-Efficient and Robust Task Selection for Meta-Learning
Authors:
Donglin Zhan,
James Anderson
Abstract:
Meta-learning methods typically learn tasks under the assumption that all tasks are equally important. However, this assumption is often not valid. In real-world applications, tasks can vary both in their importance during different training stages and in whether they contain noisy labeled data or not, making a uniform approach suboptimal. To address these issues, we propose the Data-Efficient and Robust Task Selection (DERTS) algorithm, which can be incorporated into both gradient and metric-based meta-learning algorithms. DERTS selects weighted subsets of tasks from task pools by minimizing the approximation error of the full gradient of task pools in the meta-training stage. The selected tasks are efficient for rapid training and robust towards noisy label scenarios. Unlike existing algorithms, DERTS does not require any architecture modification for training and can handle noisy label data in both the support and query sets. Analysis of DERTS shows that the algorithm follows similar training dynamics as learning on the full task pools. Experiments show that DERTS outperforms existing sampling strategies for meta-learning on both gradient-based and metric-based meta-learning algorithms in limited data budget and noisy task settings.
Submitted 11 May, 2024;
originally announced May 2024.
-
Fast-moving stars around an intermediate-mass black hole in Omega Centauri
Authors:
Maximilian Häberle,
Nadine Neumayer,
Anil Seth,
Andrea Bellini,
Mattia Libralato,
Holger Baumgardt,
Matthew Whitaker,
Antoine Dumont,
Mayte Alfaro Cuello,
Jay Anderson,
Callie Clontz,
Nikolay Kacharov,
Sebastian Kamann,
Anja Feldmeier-Krause,
Antonino Milone,
Maria Selina Nitschai,
Renuka Pechetti,
Glenn van de Ven
Abstract:
Black holes have been found over a wide range of masses, from stellar remnants with masses of 5-150 solar masses (Msun), to those found at the centers of galaxies with $M>10^5$ Msun. However, only a few debated candidate black holes exist between 150 and $10^5$ Msun. Determining the population of these intermediate-mass black holes is an important step towards understanding supermassive black hole formation in the early universe. Several studies have claimed the detection of a central black hole in $ω$ Centauri, the Milky Way's most massive globular cluster. However, these studies have been questioned due to the possible mass contribution of stellar mass black holes, their sensitivity to the cluster center, and the lack of fast-moving stars above the escape velocity. Here we report observations of seven fast-moving stars in the central 3 arcseconds (0.08 pc) of $ω$ Centauri. The velocities of the fast-moving stars are significantly higher than the expected central escape velocity of the star cluster, so their presence can only be explained by being bound to a massive black hole. From the velocities alone, we can infer a firm lower limit of the black hole mass of $\sim$8,200 Msun, making this a compelling candidate for an intermediate-mass black hole in the local universe.
Submitted 12 July, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
ATClean: A Novel Method for Detecting Low-Luminosity Transients and Application to Pre-explosion Counterparts from SN 2023ixf
Authors:
S. Rest,
A. Rest,
C. D. Kilpatrick,
J. E. Jencson,
S. von Coelln,
L. Strolger,
S. Smartt,
J. P. Anderson,
A. Clocchiatti,
D. A. Coulter,
L. Denneau,
S. Gomez,
A. Heinze,
R. Ridden-Harper,
K. W. Smith,
B. Stalder,
J. L. Tonry,
Q. Wang,
Y. Zenati
Abstract:
In an effort to search for faint sources of emission over arbitrary timescales, we present a novel method for analyzing forced photometry light curves in difference imaging from optical surveys. Our method, "ATLAS Clean" or ATClean, utilizes the reported fluxes, uncertainties, and fits to the point-spread function from difference images to quantify the statistical significance of individual measurements. We apply this method to control light curves across the image to determine whether any source of flux is present in the data for a range of specific timescales. From ATLAS $o$-band imaging at the site of the Type II supernova (SN) 2023ixf in M101 from 2015-2023, we show that this method accurately reproduces the 3$σ$ flux limits produced from other, more computationally expensive methods. We derive limits for emission on timescales of 5 days and 80-300 days at the site of SN 2023ixf, which are 19.8 and 21.3 mag, respectively. The latter limits rule out variability for unextinguished red supergiants (RSG) with initial masses $>$22 $M_{\odot}$, comparable to the most luminous predictions for the SN 2023ixf progenitor system. We also compare our limits to short timescale outbursts, similar to those expected for Type IIn SN progenitor stars or the Type II SN 2020tlf, and rule out outburst ejecta masses of $>$0.021 $M_{\odot}$, much lower than the inferred mass of circumstellar matter around SN 2023ixf in the literature. In the future, these methods can be applied to any forced point-spread function photometry on difference imaging from other surveys, such as Rubin optical imaging.
Submitted 6 May, 2024;
originally announced May 2024.
-
Cryogenic optical beam steering for superconducting device calibration
Authors:
K. Stifter,
H. Magoon,
A. J. Anderson,
D. J. Temples,
N. A. Kurinsky,
C. Stoughton,
I. Hernandez,
A. Nuñez,
K. Anyang,
R. Linehan,
M. R. Young,
P. Barry,
D. Baxter,
D. Bowring,
G. Cancelo,
A. Chou,
K. R. Dibert,
E. Figueroa-Feliciano,
L. Hsu,
R. Khatiwada,
S. D. Mork,
L. Stefanazzi,
N. Tabassum,
S. Uemura,
B. A. Young
Abstract:
We have developed a calibration system based on a micro-electromechanical systems (MEMS) mirror that is capable of delivering an optical beam over a wavelength range of 180 -- 2000 nm (0.62 -- 6.89 eV) in a sub-Kelvin environment. This portable, integrated system can steer the beam over a $\sim$3 cm $\times$ 3 cm area on the surface of any sensor with a precision of $\sim$100 $μ$m, enabling characterization of device response as a function of position. This fills a critical need in the landscape of calibration tools for sub-Kelvin devices, including those used for dark matter detection and quantum computing. These communities have a shared goal of understanding the impact of ionizing radiation on device performance, which can be pursued with our system. This paper describes the design of the first-generation calibration system and the results from successfully testing its performance at room temperature and 20 mK.
Submitted 3 May, 2024;
originally announced May 2024.
-
JWST Imaging of the Closest Globular Clusters -- III. Multiple Populations along the low-mass Main Sequence stars of NGC 6397
Authors:
M. Scalco,
M. Libralato,
R. Gerasimov,
L. R. Bedin,
E. Vesperini,
D. Nardiello,
A. Bellini,
M. Griggio,
D. Apai,
M. Salaris,
A. Burgasser,
J. Anderson
Abstract:
Thanks to its exceptional near-infrared photometry, JWST can effectively contribute to the discovery, characterization, and understanding of multiple stellar populations in globular clusters, especially at low masses where the Hubble Space Telescope (HST) faces limitations. This paper continues the efforts of the JWST GO-1979 program in exploring the faintest members of the globular cluster NGC 6397. Here we show that the combination of HST and JWST data allows us to identify two groups of MS stars (MSa, the first-generation, and MSb, the second-generation group). We measured the ratio between the two groups and combined it with measurements from the literature focused on more central fields and more massive stars compared to our study. We find that the MSa and MSb stars are present in a $\approx$30-70 ratio regardless of the distance from the centre of the cluster and the mass of the stars used so far.
Submitted 3 July, 2024; v1 submitted 2 May, 2024;
originally announced May 2024.
-
JWST Imaging of the Closest Globular Clusters -- II. Discovery of Brown Dwarfs in NGC 6397 and Measurement of Age from the Brown Dwarf Cooling Sequence, using SANDee - a New Grid of Model Isochrones across the Hydrogen-Burning Limit
Authors:
Roman Gerasimov,
Luigi R. Bedin,
Adam J. Burgasser,
Daniel Apai,
Domenico Nardiello,
Efrain Alvarado III,
Jay Anderson
Abstract:
Globular clusters contain vast repositories of metal-poor stars that represent some of the oldest stellar generations in the Universe. The archaeological footprint of early Galactic evolution may be retained in the measurable properties of globular clusters, such as their ages, mass functions and chemical abundances. Until recently, all photometric studies of globular clusters were restricted to stellar members. Now, the sensitivity of JWST can extend this analysis to the substellar regime. If detected in sufficient numbers, brown dwarf members can provide tight constraints on the properties of their parent population. We present SANDee - a new grid of stellar models that accurately represent the color-magnitude diagrams of globular clusters across the hydrogen-burning limit at a wide range of metallicities. Using JWST NIRCam photometry and the new models, we identify three brown dwarfs in the globular cluster NGC 6397 with effective temperatures of 1300-1800 K, confirmed by both proper motion and model fitting. We use the observed luminosities of discovered brown dwarfs to obtain the first age estimate of a globular cluster from its substellar cooling sequence: 13.4 +/- 3.3 Gyr. We also derive the local mass function of the cluster across the hydrogen-burning limit and find it to be top-heavy, suggesting extensive dynamical evolution. We expect that the constraints on both age and mass function of NGC 6397 derived in this work can be greatly improved by a second epoch of NIRCam imaging in the same field.
Submitted 31 May, 2024; v1 submitted 2 May, 2024;
originally announced May 2024.
-
JWST Imaging of the Closest Globular Clusters -- I. Possible Infrared Excess Among White Dwarfs in NGC 6397
Authors:
L. R. Bedin,
D. Nardiello,
M. Salaris,
M. Libralato,
P. Bergeron,
A. J. Burgasser,
D. Apai,
M. Griggio,
M. Scalco,
J. Anderson,
R. Gerasimov,
A. Bellini
Abstract:
We present James Webb Space Telescope observations of the globular cluster NGC 6397 and use them to extend to infrared wavelengths the characterization of the cluster's entire white dwarf (WD) cooling sequence (CS). The data allows us to probe fundamental astrophysical WD properties and to search for evidence in their colors for (or against) the existence of ancient planetary systems. The existing archival Hubble Space Telescope imaging data obtained ~18 years ago reach ultra-deep optical magnitudes (V~31) and allow us to derive a near-perfect separation between field and cluster members. We detect an apparent split in the lower part of the WD CS of NGC 6397. The red part of the WD CS, containing about 25% of the total, exhibits significant IR-excess of up to Delta m_F322W2 ~ 0.5 mag. These infrared excesses require both theoretical and observational follow-ups to confirm their veracity and to ascertain their true nature.
Submitted 2 May, 2024;
originally announced May 2024.
-
Market Power and Withholding Behavior of Energy Storage Units
Authors:
Yiqian Wu,
Bolun Xu,
James Anderson
Abstract:
Electricity markets are experiencing a rapid increase in energy storage unit participation. Unlike conventional generation resources, quantifying the competitive operation and identifying if a storage unit is exercising market power is challenging, particularly in the context of multi-interval bidding strategies. We present a framework to differentiate strategic capacity withholding behaviors attributed to market power from inherent competitive bidding in storage unit strategies. Our framework evaluates the profitability of strategic storage unit participation, analyzing bidding behaviors as both price takers and price makers using a self-scheduling model, and investigates how they leverage market inefficiencies. Specifically, we propose a price sensitivity model derived from the linear supply function equilibrium model to examine the price-anticipating bidding strategy, effectively capturing the influence of market power. We introduce a sufficient ex-post analysis for market operators to identify potential exploitative behaviors by monitoring instances of withholding within the bidding profiles, ensuring market resilience and competitiveness. We discuss and verify applicability of the proposed framework to realistic settings. Our analysis substantiates commonly observed economic bidding behaviors of storage units. Furthermore, it demonstrates that significant price volatility offers considerable profit opportunities not only for participants possessing market power but also for typical strategic profit seekers.
Submitted 2 May, 2024;
originally announced May 2024.
-
Optical Spectroscopy of Type Ia Supernovae by the Carnegie Supernova Projects I and II
Authors:
N. Morrell,
M. M. Phillips,
G. Folatelli,
M. D. Stritzinger,
M. Hamuy,
N. B. Suntzeff,
E. Y. Hsiao,
F. Taddia,
C. R. Burns,
P. Hoeflich,
C. Ashall,
C. Contreras,
L. Galbany,
J. Lu,
A. L. Piro,
J. Anais,
E. Baron,
A. Burrow,
L. Busta,
A. Campillay,
S. Castellón,
C. Corco,
T. Diamond,
W. L. Freedman,
C. González
, et al. (35 additional authors not shown)
Abstract:
We present the second and final release of optical spectroscopy of Type Ia Supernovae (SNe Ia) obtained during the first and second phases of the Carnegie Supernova Project (CSP-I and CSP-II). The newly released data consist of 148 spectra of 30 SNe Ia observed in the course of the CSP-I, and 234 spectra of 127 SNe Ia obtained during the CSP-II. We also present 216 optical spectra of 46 historical SNe Ia, including 53 spectra of 30 SNe Ia observed by the Calán/Tololo Supernova Survey. We combine these observations with previously published CSP data and publicly-available spectra to compile a large sample of measurements of spectroscopic parameters at maximum light, consisting of pseudo-equivalent widths and expansion velocities of selected features, for 232 CSP and historical SNe Ia (including more than 1000 spectra). Finally, we review some of the strongest correlations between spectroscopic and photometric properties of SNe Ia. Specifically, we define two samples: one consisting of SNe Ia discovered by targeted searches (most of them CSP-I objects) and the other composed of SNe Ia discovered by untargeted searches, which includes most of the CSP-II objects. The analysed correlations are similar for both samples. We find a larger incidence of SNe Ia belonging to the Cool (CL) and Broad Line (BL) Branch subtypes among the events discovered by targeted searches; Shallow Silicon (SS) SNe Ia are present with similar frequencies in both samples, while Core Normal (CN) SNe Ia are more frequent in untargeted searches.
Submitted 7 May, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
Capabilities of Gemini Models in Medicine
Authors:
Khaled Saab,
Tao Tu,
Wei-Hung Weng,
Ryutaro Tanno,
David Stutz,
Ellery Wulczyn,
Fan Zhang,
Tim Strother,
Chunjong Park,
Elahe Vedadi,
Juanma Zambrano Chaves,
Szu-Yeu Hu,
Mike Schaekermann,
Aishwarya Kamath,
Yong Cheng,
David G. T. Barrett,
Cathy Cheung,
Basil Mustafa,
Anil Palepu,
Daniel McDuff,
Le Hou,
Tomer Golany,
Luyang Liu,
Jean-baptiste Alayrac,
Neil Houlsby
, et al. (42 additional authors not shown)
Abstract:
Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-Gemini, a family of highly capable multimodal models that are specialized in medicine with the ability to seamlessly use web search, and that can be efficiently tailored to novel modalities using custom encoders. We evaluate Med-Gemini on 14 medical benchmarks, establishing new state-of-the-art (SoTA) performance on 10 of them, and surpass the GPT-4 model family on every benchmark where a direct comparison is viable, often by a wide margin. On the popular MedQA (USMLE) benchmark, our best-performing Med-Gemini model achieves SoTA performance of 91.1% accuracy, using a novel uncertainty-guided search strategy. On 7 multimodal benchmarks including NEJM Image Challenges and MMMU (health & medicine), Med-Gemini improves over GPT-4V by an average relative margin of 44.5%. We demonstrate the effectiveness of Med-Gemini's long-context capabilities through SoTA performance on a needle-in-a-haystack retrieval task from long de-identified health records and medical video question answering, surpassing prior bespoke methods using only in-context learning. Finally, Med-Gemini's performance suggests real-world utility by surpassing human experts on tasks such as medical text summarization, alongside demonstrations of promising potential for multimodal medical dialogue, medical research and education. Taken together, our results offer compelling evidence for Med-Gemini's potential, although further rigorous evaluation will be crucial before real-world deployment in this safety-critical domain.
Submitted 1 May, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
Constraints On Covariant WIMP-Nucleon Effective Field Theory Interactions from the First Science Run of the LUX-ZEPLIN Experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
F. Alder,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
E. E. Barillier,
J. W. Bargemann,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. J. Bishop,
G. M. Blockinger,
B. Boxer
, et al. (179 additional authors not shown)
Abstract:
The first science run of the LUX-ZEPLIN (LZ) experiment, a dual-phase xenon time projection chamber operating in the Sanford Underground Research Facility in South Dakota, USA, has reported leading limits on spin-independent WIMP-nucleon interactions and interactions described from a non-relativistic effective field theory (NREFT). Using the same 5.5 t fiducial mass and 60 live days of exposure, we report on the results of a relativistic extension to the NREFT. We present constraints on couplings from covariant interactions arising from the coupling of vector, axial currents, and electric dipole moments of the nucleon to the magnetic and electric dipole moments of the WIMP which cannot be described by recasting previous results described by an NREFT. Using a profile-likelihood ratio analysis, in an energy region between 0 keV$_\text{nr}$ and 270 keV$_\text{nr}$, we report 90% confidence level exclusion limits on the coupling strength of five interactions in both the isoscalar and isovector bases.
Submitted 26 April, 2024;
originally announced April 2024.
-
Simultaneous Chandra and HST observations of the quiescent neutron-star low-mass X-ray binaries in 47 Tucanae
Authors:
Maureen van den Berg,
Liliana Rivera Sandoval,
Craig O. Heinke,
Haldan N. Cohn,
Phyllis M. Lugger,
Jonathan E. Grindlay,
Peter D. Edmonds,
Jay Anderson,
Andrei Catuneanu
Abstract:
We present simultaneous Chandra X-ray Observatory and Hubble Space Telescope observations of three certain (X5, X7, W37) and two likely (X4, W17) quiescent neutron-star low-mass X-ray binaries (qLMXBs) in the globular cluster 47 Tuc. We study these systems in the X-ray, optical and near-ultraviolet (NUV) using the simultaneous data and additional non-contemporaneous HST data. We have discovered a blue and variable NUV counterpart to W17. We have not securely identified the eclipsing qLMXB W37 in the optical or NUV. Deeper high-resolution imaging is needed to further investigate the faint NUV excess near the centre of the W37 error circle. We suggest that a previously identified optical astrometric match to X7 is likely the true counterpart. The H$\alpha$ emission and the location of the counterpart in the colour-magnitude diagram indicate that the secondary is probably a non-degenerate, H-rich star. This is consistent with previous results from fitting X7's X-ray spectrum. In X4, the simultaneous X-ray and optical behaviour supports the earlier suggestion that the X-ray variability is driven by changes in accretion rate. The X-ray eclipses in X5 coincide with minima in the optical/NUV light curves. Comparison of the 47 Tuc qLMXBs with the cataclysmic variables (CVs) in the cluster confirms that overall the qLMXBs have larger X-ray-to-optical flux ratios. Based on their optical/NUV colors, we conclude that the accretion disks in the qLMXBs are less prominent than in CVs. This makes the ratio of X-ray flux to excess blue optical flux a powerful discriminator between CVs and qLMXBs.
Submitted 23 April, 2024;
originally announced April 2024.
-
Physical Properties of Type II Supernovae Inferred from ZTF and ATLAS Photometric Data
Authors:
Javier Silva-Farfán,
Francisco Förster,
Takashi J. Moriya,
L. Hernández-García,
A. M. Muñoz Arancibia,
P. Sánchez-Sáez,
Joseph P. Anderson,
John L. Tonry,
Alejandro Clocchiatti
Abstract:
We report an analysis of a sample of 186 spectroscopically confirmed Type II supernova (SN) light curves (LCs) obtained from a combination of Zwicky Transient Facility (ZTF) and Asteroid Terrestrial-impact Last Alert System (ATLAS) observations. We implement a method to infer physical parameters from these LCs using hydrodynamic models that take into account the progenitor mass, the explosion energy, and the presence of circumstellar matter (CSM). The CSM is modelled via the mass loss rate, wind acceleration at the surface of the progenitor star with a $β$ velocity law, and the CSM radius. We also infer the time of explosion, attenuation (A$_V$), and the redshift for each SN. Our results favor low-mass progenitor stars (M$_{ZAMS}$\,$<$14\,$M_\odot$) with a dense CSM ($\dot{M}$ $>$ 10$^{-3}$ [M$_\odot$ yr$^{-1}$], a CSM radius of $\sim$ 10$^{15}$ cm, and $β$ $>$ 2). Additionally, we find that the redshift inferred from the supernova LCs is significantly more accurate than that inferred using the host galaxy photometric redshift, suggesting that this method could be used to infer more accurate host galaxy redshifts from large samples of SNe II in the LSST era. Lastly, we compare our results with similar works from the literature.
Submitted 3 June, 2024; v1 submitted 19 April, 2024;
originally announced April 2024.
-
Asynchronous Heterogeneous Linear Quadratic Regulator Design
Authors:
Leonardo F. Toso,
Han Wang,
James Anderson
Abstract:
We address the problem of designing an LQR controller in a distributed setting, where M similar but not identical systems share their locally computed policy gradient (PG) estimates with a server that aggregates the estimates and computes a controller that, on average, performs well on all systems. Learning in a distributed setting has the potential to offer statistical benefits - multiple datasets can be leveraged simultaneously to produce more accurate policy gradient estimates. However, the interplay of heterogeneous trajectory data and varying levels of local computational power introduce bias to the aggregated PG descent direction, and prevents us from fully exploiting the parallelism in the distributed computation. The latter stems from synchronous aggregation, where straggler systems negatively impact the runtime. To address this, we propose an asynchronous policy gradient algorithm for LQR control design. By carefully controlling the "staleness" in the asynchronous aggregation, we show that the designed controller converges to each system's $ε$-near optimal controller up to a heterogeneity bias. Furthermore, we prove that our asynchronous approach obtains exact local convergence at a sub-linear rate.
Submitted 13 April, 2024;
originally announced April 2024.
-
oMEGACat II -- Photometry and proper motions for 1.4 million stars in Omega Centauri and its rotation in the plane of the sky
Authors:
Maximilian Häberle,
Nadine Neumayer,
Andrea Bellini,
Mattia Libralato,
Callie Clontz,
Anil C. Seth,
Maria Selina Nitschai,
Sebastian Kamann,
Mayte Alfaro-Cuello,
Jay Anderson,
Stefan Dreizler,
Anja Feldmeier-Krause,
Nikolay Kacharov,
Marilyn Latour,
Antonino Milone,
Renuka Pechetti,
Glenn van de Ven,
Karina Voggel
Abstract:
Omega Centauri ($ω$ Cen) is the most massive globular cluster of the Milky Way. It is thought to be the nucleus of an accreted dwarf galaxy because of its high mass and its complex stellar populations. To decipher its formation history and study its dynamics, we created the most comprehensive kinematic catalog for its inner region, by analyzing both archival and new Hubble Space Telescope (HST) data. Our catalog contains 1 395 781 proper-motion measurements out to the half-light radius of the cluster ($\sim$5.0') and down to $m_{F625W}\approx$25. The typical baseline for our proper-motion measurements is 20 years, leading to a median 1D proper motion precision of $\sim$11 $μ$as yr$^{-1}$ for stars with $m_{F625W}\approx$18 mag, with even better precision ($\sim$6.6 $μ$as yr$^{-1}$) achieved in the extensively observed centermost (r$<$1.5') region. In addition to our astrometric measurements, we also obtained precise HST photometry in seven filters spanning from the ultraviolet to the near-infrared. This allows detailed color-magnitude-diagram studies and the separation of the multiple stellar populations of the cluster. In this work, we describe the data reduction used to obtain both the photometric and the proper-motion measurements. We also illustrate the creation and the content of our catalog, which is made publicly available. Finally, we present measurements of the plane-of-sky rotation of $ω$ Cen in the previously unprobed inner few arcminutes and a precise measurement of the inclination $i = (43.6\pm1.5)^\circ$.
Submitted 4 April, 2024;
originally announced April 2024.
-
Mass calibration of DES Year-3 clusters via SPT-3G CMB cluster lensing
Authors:
B. Ansarinejad,
S. Raghunathan,
T. M. C. Abbott,
P. A. R. Ade,
M. Aguena,
O. Alves,
A. J. Anderson,
F. Andrade-Oliveira,
M. Archipley,
L. Balkenhol,
K. Benabed,
A. N. Bender,
B. A. Benson,
E. Bertin,
F. Bianchini,
L. E. Bleem,
S. Bocquet,
F. R. Bouchet,
D. Brooks,
L. Bryant,
D. L. Burke,
E. Camphuis,
J. E. Carlstrom,
A. Carnero Rosell,
J. Carretero
, et al. (120 additional authors not shown)
Abstract:
We measure the stacked lensing signal in the direction of galaxy clusters in the Dark Energy Survey Year 3 (DES Y3) redMaPPer sample, using cosmic microwave background (CMB) temperature data from SPT-3G, the third-generation CMB camera on the South Pole Telescope (SPT). We estimate the lensing signal using temperature maps constructed from the initial 2 years of data from the SPT-3G 'Main' survey, covering 1500 deg$^2$ of the Southern sky. We then use this signal as a proxy for the mean cluster mass of the DES sample. In this work, we employ three versions of the redMaPPer catalogue: a Flux-Limited sample containing 8865 clusters, a Volume-Limited sample with 5391 clusters, and a Volume&Redshift-Limited sample with 4450 clusters. For the three samples, we find the mean cluster masses to be ${M}_{200{\rm{m}}}=1.66\pm0.13$ [stat.]$\pm0.03$ [sys.], $1.97\pm0.18$ [stat.]$\pm0.05$ [sys.], and $2.11\pm0.20$ [stat.]$\pm0.05$ [sys.]$\times{10}^{14}\ {\rm{M}}_{\odot }$, respectively. This is a factor of $\sim2$ improvement relative to the precision of measurements with previous generations of SPT surveys and the most constraining cluster mass measurements using CMB cluster lensing to date. Overall, we find no significant tensions between our results and masses given by redMaPPer mass-richness scaling relations of previous works, which were calibrated using CMB cluster lensing, optical weak lensing, and velocity dispersion measurements from various combinations of DES, SDSS and Planck data. We then divide our sample into 3 redshift and 3 richness bins, finding no significant tensions with optical weak-lensing calibrated masses in these bins. We forecast a $5.7\%$ constraint on the mean cluster mass of the DES Y3 sample with the complete SPT-3G surveys when using both temperature and polarization data and including an additional $\sim1400$ deg$^2$ of observations from the 'Extended' SPT-3G survey.
Submitted 12 June, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Testing the $\mathbf{Λ}$CDM Cosmological Model with Forthcoming Measurements of the Cosmic Microwave Background with SPT-3G
Authors:
K. Prabhu,
S. Raghunathan,
M. Millea,
G. Lynch,
P. A. R. Ade,
E. Anderes,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
L. Balkenhol,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
P. M. Chichura,
T. -L. Chou,
A. Coerver
, et al. (76 additional authors not shown)
Abstract:
We forecast constraints on cosmological parameters enabled by three surveys conducted with SPT-3G, the third-generation camera on the South Pole Telescope. The surveys cover separate regions of 1500, 2650, and 6000 ${\rm deg}^{2}$ to different depths, in total observing 25% of the sky. These regions will be measured to white noise levels of roughly 2.5, 9, and 12 $μ{\rm K-arcmin}$, respectively, in CMB temperature units at 150 GHz by the end of 2024. The survey also includes measurements at 95 and 220 GHz, which have noise levels a factor of ~1.2 and 3.5 times higher than 150 GHz, respectively, with each band having a polarization noise level ~$\sqrt{\text{2}}$ times higher than the temperature noise. We use a novel approach to obtain the covariance matrices for jointly and optimally estimated gravitational lensing potential bandpowers and unlensed CMB temperature and polarization bandpowers. We demonstrate the ability to test the $Λ{\rm CDM}$ model via the consistency of cosmological parameters constrained independently from SPT-3G and Planck data, and consider the improvement in constraints on $Λ{\rm CDM}$ extension parameters from a joint analysis of SPT-3G and Planck data. The $Λ{\rm CDM}$ cosmological parameters are typically constrained with uncertainties up to ~2 times smaller with SPT-3G data, compared to Planck, with the two data sets measuring significantly different angular scales and polarization levels, providing additional tests of the standard cosmological model.
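The noise figures quoted in this abstract can be tabulated with a short sketch. It takes the 150 GHz temperature noise of each survey region as given, applies the approximate band factors (1.2 for 95 GHz, 3.5 for 220 GHz), and scales by $\sqrt{2}$ for polarization. The names `T150_UK_ARCMIN`, `BAND_FACTOR`, and `noise_levels` are illustrative, and the outputs are only as precise as the rounded factors in the abstract.

```python
import math

# 150 GHz temperature white-noise levels (uK-arcmin), keyed by survey area (deg^2),
# as quoted in the abstract.
T150_UK_ARCMIN = {1500: 2.5, 2650: 9.0, 6000: 12.0}

# Approximate noise of each band relative to 150 GHz, as quoted in the abstract.
BAND_FACTOR = {95: 1.2, 150: 1.0, 220: 3.5}


def noise_levels(t150):
    """Return {band_GHz: (temperature_noise, polarization_noise)} in uK-arcmin."""
    levels = {}
    for band, factor in BAND_FACTOR.items():
        temperature = factor * t150
        # Polarization noise is ~sqrt(2) times the temperature noise.
        levels[band] = (temperature, math.sqrt(2.0) * temperature)
    return levels


for area, t150 in T150_UK_ARCMIN.items():
    for band, (t_noise, p_noise) in noise_levels(t150).items():
        print(f"{area} deg^2, {band} GHz: T ~ {t_noise:.1f}, P ~ {p_noise:.1f} uK-arcmin")
```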
Submitted 5 July, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
UPSS: a User-centric Private Storage System with its applications
Authors:
Arastoo Bozorgi,
Mahya Soleimani Jadidi,
Jonathan Anderson
Abstract:
Strong confidentiality, integrity, user control, reliability and performance are critical requirements in privacy-sensitive applications. Such applications would benefit from a data storage and sharing infrastructure that provides these properties even in decentralized topologies with untrusted storage backends, but users today are forced to choose between systemic security properties and system reliability or performance. As an alternative to this status quo we present UPSS: the user-centric private sharing system, a cryptographic storage system that can be used as a conventional filesystem or as the foundation for security-sensitive applications such as redaction with integrity and private revision control. We demonstrate that both the security and performance properties of UPSS exceed those of existing cryptographic filesystems and that its performance is comparable to mature conventional filesystems - in some cases, even superior. Whether used directly via its Rust API or as a conventional filesystem, UPSS provides strong security and practical performance on untrusted storage.
Submitted 23 March, 2024;
originally announced March 2024.
-
Probing Electrical Properties of A Silicon Nanocrystal Thin Film Using X-ray Photoelectron Spectroscopy
Authors:
Amrit Laudari,
Sameera Pathiranage,
Salim A. Thomas,
Reed J. Petersen,
Kenneth J. Anderson,
Todd A. Pringle,
Erik K. Hobbie,
Nuri Oncel
Abstract:
We performed X-ray photoelectron spectroscopy (XPS) measurements on a thin film of Si nanocrystals (SiNCs) while applying DC or AC external biases to extract the resistance and the capacitance of the thin film. The measurement consists of the application of 10 V DC or square wave pulses of 10 V amplitude to the sample at various frequencies ranging from 0.01 Hz to 1 MHz while recording X-ray photoemission data. To analyze the data, we propose three different models with varying degrees of accuracy. The calculated capacitance of SiNCs agrees with the experimental value in the literature.
Submitted 21 March, 2024;
originally announced March 2024.
-
High-precision astrometry with VVV -- II. A near-infrared extension of Gaia into the Galactic plane
Authors:
M. Griggio,
M. Libralato,
A. Bellini,
L. R. Bedin,
J. Anderson,
L. C. Smith,
D. Minniti
Abstract:
Aims. We use near-infrared, ground-based data from the VISTA Variables in the Via Lactea (VVV) survey to indirectly extend the astrometry provided by the Gaia catalog to objects in heavily-extincted regions towards the Galactic bulge and plane that are beyond Gaia's reach. Methods. We make use of the state-of-the-art techniques developed for high-precision astrometry and photometry with the Hubble Space Telescope to process the VVV data. We employ empirical, spatially-variable, effective point-spread functions and local transformations to mitigate the effects of systematic errors, like residual geometric distortion and image motion, and to improve measurements in crowded fields and for faint stars. We also anchor our astrometry to the absolute reference frame of the Gaia Data Release 3. Results. We measure between 20 and 60 times more sources than Gaia in the region surrounding the Galactic center, obtaining a single-exposure precision of about 12 mas and a proper-motion precision of better than 1 mas yr$^{-1}$ for bright, unsaturated sources. Our astrometry provides an extension of Gaia into the Galactic center. We publicly release the astro-photometric catalogs of the two VVV fields considered in this work, which contain a total of $\sim$ 3.5 million sources. Our catalogs cover $\sim$ 3 sq. degrees, about 0.5% of the entire VVV survey area.
Submitted 25 March, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Unveiling MOA-2007-BLG-192: An M Dwarf Hosting a Likely Super-Earth
Authors:
Sean K. Terry,
Jean-Philippe Beaulieu,
David P. Bennett,
Euan Hamdorf,
Aparna Bhattacharya,
Viveka Chaudhry,
Andrew A. Cole,
Naoki Koshimoto,
Jay Anderson,
Etienne Bachelet,
Joshua W. Blackman,
Ian A. Bond,
Jessica R. Lu,
Jean Baptiste Marquette,
Clement Ranc,
Natalia E. Rektsini,
Kailash Sahu,
Aikaterini Vandorou
Abstract:
We present an analysis of high angular resolution images of the microlensing target MOA-2007-BLG-192 using Keck adaptive optics and the Hubble Space Telescope. The planetary host star is robustly detected as it separates from the background source star in nearly all of the Keck and Hubble data. The amplitude and direction of the lens-source separation allow us to break a degeneracy related to the microlensing parallax and source radius crossing time. Thus, we are able to reduce the number of possible solutions by a factor of ${\sim}2$, demonstrating the power of high angular resolution follow-up imaging for events with sparse light curve coverage. Following Bennett et al. 2023, we apply constraints from the high resolution imaging on the light curve modeling to find host star and planet masses of $M_{\textrm{host}} = 0.28 \pm 0.04M_{\odot}$ and $m_p = 12.49^{+65.47}_{-8.03}M_{\oplus}$ at a distance from Earth of $D_L = 2.16 \pm 0.30\,$kpc. This work illustrates the necessity for the Nancy Grace Roman Galactic Exoplanet Survey (RGES) to use its own high resolution imaging to inform light curve modeling for microlensing planets that the mission discovers.
Submitted 18 March, 2024;
originally announced March 2024.
-
Narrow absorption lines from intervening material in supernovae I. Measurements and temporal evolution
Authors:
Santiago González-Gaitán,
Claudia P. Gutiérrez,
Joseph P. Anderson,
Antonia Morales-Garoffolo,
Lluis Galbany,
Sabyasashi Goswami,
Ana M. Mourao,
Seppo Mattila,
Mark Sullivan
Abstract:
Narrow absorption features in nearby supernova (SN) spectra are a powerful diagnostic of the slow-moving material in the line of sight: they are extensively used to infer dust extinction from the host galaxies, and they can also serve in the detection of circumstellar material originating from the SN progenitor and present in the vicinity of the explosion. Despite their wide use, very few studies have examined the biases of the methods to characterize narrow lines, and not many statistical analyses exist. This is the first paper of a series in which we present a statistical analysis of narrow lines of SN spectra of various resolutions. We develop a robust automated methodology to measure the equivalent width (EW) and velocity of narrow absorption lines from intervening material in the line of sight of SNe, including Na I D, Ca II H&K, K I and diffuse interstellar bands (DIBs). We carefully study systematic biases in heterogeneous spectra from the literature by simulating different signal-to-noise, spectral resolution, slit size and orientation and present the real capabilities and limitations of using low- and mid-resolution spectra to study these lines. In particular, we find that the measurement of the equivalent width of the narrow lines in low-resolution spectra is highly affected by the evolving broad P-Cygni profiles of the SN ejecta, both for core-collapse and type Ia SNe, inducing a conspicuous apparent evolution. We thus present an easy way to detect and exclude those cases to obtain more robust and reliable measurements. Finally, after considering all possible effects, we analyse the temporal evolution of the narrow features in a large sample of nearby SNe to detect any possible variation in their EWs over time. We find no time evolution of the narrow line features in our large sample for all SN types.
Submitted 18 March, 2024;
originally announced March 2024.
-
The $\textit{HST}$ Large Programme on NGC$\,$6752 -- V. Differences in Luminosity and Mass Functions among Multiple Stellar Populations
Authors:
M. Scalco,
R. Gerasimov,
L. R. Bedin,
E. Vesperini,
D. Nardiello,
M. Salaris,
A. Burgasser,
J. Anderson,
M. Libralato,
A. Bellini,
P. Rosati
Abstract:
We exploit the astro-photometric dataset of the multi-epoch infrared parallel field of a $\textit{Hubble Space Telescope}$ Large Programme aimed at studying the faintest stars of the globular cluster NGC$\,$6752 to determine the luminosity and mass functions of the multiple stellar populations of this cluster. Thanks to the measurement of proper motions and deeper completeness, the results presented in this paper represent a significant improvement over those of previous studies. We successfully derived membership probabilities reaching stars as faint as $m_{\rm F160W} \sim 25$, allowing us to reliably distinguish the three main stellar populations detected within this cluster. We employed a new set of model isochrones that have been individually fit to the colour-magnitude diagram of each population. We present a comprehensive analysis of the luminosity and mass functions for three stellar populations within NGC$\,$6752. Notably, our findings reveal differences in the present-day luminosity and mass functions of first-generation and second-generation stars; these differences are consistent with the manifestation of the effects of dynamical processes acting on populations with different initial spatial distributions. Finally, we publicly release the catalogues with positions, photometry, proper motions, and membership probabilities, as well as the stacked-image atlases and all newly calculated stellar models.
Submitted 5 March, 2024;
originally announced March 2024.
-
Performance of a modular ton-scale pixel-readout liquid argon time projection chamber
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1340 additional authors not shown)
Abstract:
The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations.
Submitted 5 March, 2024;
originally announced March 2024.
-
Final Moments II: Observational Properties and Physical Modeling of CSM-Interacting Type II Supernovae
Authors:
W. V. Jacobson-Galán,
L. Dessart,
K. W. Davis,
C. D. Kilpatrick,
R. Margutti,
R. J. Foley,
R. Chornock,
G. Terreran,
D. Hiramatsu,
M. Newsome,
E. Padilla Gonzalez,
C. Pellegrino,
D. A. Howell,
A. V. Filippenko,
J. P. Anderson,
C. R. Angus,
K. Auchettl,
K. A. Bostroem,
T. G. Brink,
R. Cartier,
D. A. Coulter,
T. de Boer,
M. R. Drout,
N. Earl,
K. Ertini
, et al. (30 additional authors not shown)
Abstract:
We present ultraviolet/optical/near-infrared observations and modeling of Type II supernovae (SNe II) whose early-time ($δt < 2$ days) spectra show transient, narrow emission lines from shock ionization of confined ($r < 10^{15}$ cm) circumstellar material (CSM). The observed electron-scattering broadened line profiles (i.e., IIn-like) of HI, He I/II, C III/IV, and N III/IV/V from the CSM persist on a characteristic timescale ($t_{\rm IIn}$) that marks a transition to a lower-density CSM and the emergence of Doppler-broadened features from the fast-moving SN ejecta. Our sample, the largest to date, consists of 39 SNe with early-time IIn-like features in addition to 35 "comparison" SNe with no evidence of early-time IIn-like features, all with ultraviolet observations. The total sample consists of 50 unpublished objects with 474 previously unpublished spectra and 50 multiband light curves, collected primarily through the Young Supernova Experiment and Global Supernova Project collaborations. For all sample objects, we find a significant correlation between peak ultraviolet brightness and both $t_{\rm IIn}$ and the rise time, as well as evidence for enhanced peak luminosities in SNe II with IIn-like features. We quantify mass-loss rates and CSM density for the sample through matching of peak multiband absolute magnitudes, rise times, $t_{\rm IIn}$ and optical SN spectra with a grid of radiation hydrodynamics and non-local thermodynamic equilibrium (nLTE) radiative-transfer simulations. For our grid of models, all with the same underlying explosion, there is a trend between the duration of the electron-scattering broadened line profiles and inferred mass-loss rate: $t_{\rm IIn} \approx 3.8[\dot{M}/(0.01 \textrm{M}_{\odot} \textrm{yr}^{-1})]$ days.
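The linear trend quoted at the end of this abstract, $t_{\rm IIn} \approx 3.8[\dot{M}/(0.01\,{\rm M}_{\odot}\,{\rm yr}^{-1})]$ days, can be evaluated in either direction with a minimal sketch. The function names are illustrative, and the relation is only the approximate trend stated for the authors' model grid, not a general law.

```python
# Approximate trend from the abstract: t_IIn ~ 3.8 * (Mdot / 0.01 Msun/yr) days.
SLOPE_DAYS = 3.8          # days at the reference mass-loss rate
REFERENCE_MDOT = 0.01     # Msun per year


def t_iin_days(mdot_msun_per_yr):
    """Duration of IIn-like features (days) for a given mass-loss rate."""
    return SLOPE_DAYS * (mdot_msun_per_yr / REFERENCE_MDOT)


def mdot_from_t_iin(t_days):
    """Invert the trend: mass-loss rate (Msun/yr) implied by a given t_IIn."""
    return REFERENCE_MDOT * t_days / SLOPE_DAYS


for mdot in (0.001, 0.01, 0.1):
    print(f"Mdot = {mdot:g} Msun/yr -> t_IIn ~ {t_iin_days(mdot):.2f} d")
```

Under this trend, a tenfold change in mass-loss rate corresponds to a tenfold change in the duration of the IIn-like features.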
Submitted 4 March, 2024;
originally announced March 2024.
-
First Constraints on the Epoch of Reionization Using the non-Gaussianity of the Kinematic Sunyaev-Zel'dovich Effect from the South Pole Telescope and Herschel-SPIRE Observations
Authors:
S. Raghunathan,
P. A. R. Ade,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
J. E. Austermann,
L. Balkenhol,
J. A. Beall,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
J. Bock,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
H. C. Chiang,
P. M. Chichura,
T. -L. Chou,
R. Citron
, et al. (97 additional authors not shown)
Abstract:
We report results from an analysis aimed at detecting the trispectrum of the kinematic Sunyaev-Zel'dovich (kSZ) effect by combining data from the South Pole Telescope (SPT) and Herschel-SPIRE experiments over a 100 ${\rm deg}^{2}$ field. The SPT observations combine data from the previous and current surveys, namely SPTpol and SPT-3G, to achieve depths of 4.5, 3, and 16 $μ{\rm K-arcmin}$ in bands centered at 95, 150, and 220 GHz. For SPIRE, we include data from the 600 and 857 GHz bands. We reconstruct the velocity-induced large-scale correlation of the small-scale kSZ signal with a quadratic estimator that uses two cosmic microwave background (CMB) temperature maps, constructed by optimally combining data from all the frequency bands. We reject the null hypothesis of a zero trispectrum at the $10.3σ$ level. However, the measured trispectrum contains contributions from both the kSZ and other undesired components, such as CMB lensing and astrophysical foregrounds, with kSZ being sub-dominant. We use the Agora simulations to estimate the expected signal from CMB lensing and astrophysical foregrounds. After accounting for the contributions from CMB lensing and foreground signals, we do not detect an excess kSZ-only trispectrum and use this non-detection to set constraints on reionization. By applying a prior based on observations of the Gunn-Peterson trough, we obtain an upper limit on the duration of reionization of $Δz_{\rm re, 50} < 4.5$ (95% C.L.). We find these constraints are fairly robust to foreground assumptions. This trispectrum measurement is independent of, but consistent with, Planck's optical depth measurement. This result is the first constraint on the epoch of reionization using the non-Gaussian nature of the kSZ signal.
Submitted 4 March, 2024;
originally announced March 2024.
-
Coloring locally sparse graphs
Authors:
James Anderson,
Abhishek Dhawan,
Aiya Kuchukova
Abstract:
A graph $G$ is $k$-locally sparse if for each vertex $v \in V(G)$, the subgraph induced by its neighborhood contains at most $k$ edges. Alon, Krivelevich, and Sudakov showed that for $f > 0$ if a graph $G$ of maximum degree $Δ$ is $Δ^2/f$-locally-sparse, then $χ(G) = O\left(Δ/\log f\right)$. We introduce a more general notion of local sparsity by defining graphs $G$ to be $(k, F)$-locally-sparse for some graph $F$ if for each vertex $v \in V(G)$ the subgraph induced by the neighborhood of $v$ contains at most $k$ copies of $F$. Employing the Rödl nibble method, we prove the following generalization of the above result: for every bipartite graph $F$, if $G$ is $(k, F)$-locally-sparse, then $χ(G) = O\left( Δ/\log\left(Δk^{-1/|V(F)|}\right)\right)$. This improves upon results of Davies, Kang, Pirot, and Sereni who consider the case when $F$ is a path. Our results also recover the best known bound on $χ(G)$ when $G$ is $K_{1, t, t}$-free for $t \geq 4$, and hold for list and correspondence coloring in the more general so-called "color-degree" setting.
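The definition of $k$-local sparsity at the start of this abstract can be illustrated with a small sketch that computes the smallest such $k$ for a graph given as an adjacency dictionary. The function name `local_sparsity` and the example graphs are illustrative; the sketch covers only the edge-counting definition, not the $(k, F)$ generalization or any coloring bound.

```python
from itertools import combinations


def local_sparsity(adj):
    """Smallest k such that the graph is k-locally sparse.

    `adj` maps each vertex to the set of its neighbours (undirected, no loops).
    For each vertex, count the edges of the subgraph induced by its
    neighbourhood, then take the maximum over all vertices.
    """
    worst = 0
    for neighbours in adj.values():
        edges_inside = sum(
            1 for a, b in combinations(sorted(neighbours), 2) if b in adj[a]
        )
        worst = max(worst, edges_inside)
    return worst


# A triangle {0, 1, 2} with a pendant vertex 3 attached to vertex 2.
triangle_with_tail = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}

# The complete graph on four vertices: every neighbourhood induces a triangle.
k4 = {v: {u for u in range(4) if u != v} for v in range(4)}

print(local_sparsity(triangle_with_tail))  # 1
print(local_sparsity(k4))                  # 3
```

A triangle-free graph has local sparsity 0 under this definition, since no two neighbours of any vertex are adjacent.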
Submitted 1 March, 2024; v1 submitted 29 February, 2024;
originally announced February 2024.
-
StarCoder 2 and The Stack v2: The Next Generation
Authors:
Anton Lozhkov,
Raymond Li,
Loubna Ben Allal,
Federico Cassano,
Joel Lamy-Poirier,
Nouamane Tazi,
Ao Tang,
Dmytro Pykhtar,
Jiawei Liu,
Yuxiang Wei,
Tianyang Liu,
Max Tian,
Denis Kocetkov,
Arthur Zucker,
Younes Belkada,
Zijian Wang,
Qian Liu,
Dmitry Abulkhanov,
Indraneil Paul,
Zhuang Li,
Wen-Ding Li,
Megan Risdal,
Jia Li,
Jian Zhu,
Terry Yue Zhuo
, et al. (41 additional authors not shown)
Abstract:
The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data sources, such as GitHub pull requests, Kaggle notebooks, and code documentation. This results in a training set that is 4x larger than the first StarCoder dataset. We train StarCoder2 models with 3B, 7B, and 15B parameters on 3.3 to 4.3 trillion tokens and thoroughly evaluate them on a comprehensive set of Code LLM benchmarks. We find that our small model, StarCoder2-3B, outperforms other Code LLMs of similar size on most benchmarks, and also outperforms StarCoderBase-15B. Our large model, StarCoder2-15B, significantly outperforms other models of comparable size. In addition, it matches or outperforms CodeLlama-34B, a model more than twice its size. Although DeepSeekCoder-33B is the best-performing model at code completion for high-resource languages, we find that StarCoder2-15B outperforms it on math and code reasoning benchmarks, as well as several low-resource languages. We make the model weights available under an OpenRAIL license and ensure full transparency regarding the training data by releasing the SoftWare Heritage persistent IDentifiers (SWHIDs) of the source code data.
Submitted 29 February, 2024;
originally announced February 2024.
-
Searching for late-time interaction signatures in Type Ia supernovae from the Zwicky Transient Facility
Authors:
Jacco H. Terwel,
Kate Maguire,
Georgios Dimitriadis,
Mat Smith,
Simeon Reusch,
Leander Lacroix,
Lluís Galbany,
Umut Burgaz,
Luke Harvey,
Steve Schulze,
Mickael Rigault,
Steven L. Groom,
David Hale,
Mansi M. Kasliwal,
Young-Lo Kim,
Josiah Purdum,
Ben Rusholme,
Jesper Sollerman,
Joseph P. Anderson,
Ting-Wan Chen,
Christopher Frohmaier,
Mariusz Gromadzki,
Tomás E. Müller-Bravo,
Matt Nicholl,
Shubham Srivastav
, et al. (1 additional authors not shown)
Abstract:
The nature of the progenitor systems and explosion mechanisms that give rise to Type Ia supernovae (SNe Ia) are still debated. The interaction signature of circumstellar material (CSM) being swept up by expanding ejecta can constrain the type of system from which it was ejected. Most previous studies have focused on finding CSM ejected shortly before the SN Ia explosion still residing close to the explosion site, resulting in short delay times until the interaction starts. We use a sample of 3627 SNe Ia from the Zwicky Transient Facility discovered between 2018 and 2020 and search for interaction signatures over 100 days after peak brightness. By binning the late-time light curve data to push the detection limit as deep as possible, we identify potential late-time rebrightening in 3 SNe Ia (SN 2018grt, SN 2019dlf, SN 2020tfc). The late-time detections occur between 550 and 1450 d after peak brightness, have mean absolute $r$-band magnitudes of -16.4 to -16.8 mag and last up to a few hundred days, significantly brighter than the late-time CSM interaction discovered in the prototype SN 2015cp. The late-time detections all occur within 0.8 kpc of the host nucleus and are not easily explained by nuclear activity, another transient at a similar sky position, or data quality issues. This suggests environment or specific progenitor characteristics playing a role in producing potential CSM signatures in these SNe Ia. By simulating the ZTF survey we estimate that <0.5 per cent of normal SNe Ia display late-time strong H $α$-dominated CSM interaction. This is equivalent to an absolute rate of $8_{-4}^{+20}$ to $54_{-26}^{+91}$ Gpc$^{-3}$ yr$^{-1}$ assuming a constant SN Ia rate of $2.4\times10^{-5}$ Mpc$^{-3}$ yr$^{-1}$ for $z \leq 0.1$. Weaker interaction signatures, more similar to the strength seen in SN 2015cp, could be more common but are difficult to constrain with our survey depth.
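The rate figures in this abstract mix Mpc$^{-3}$ and Gpc$^{-3}$ units, so a short unit-conversion sketch may help. It converts the assumed SN Ia rate to Gpc$^{-3}$ yr$^{-1}$ and expresses the two quoted central values (8 and 54) as fractions of it. The variable and function names are illustrative, and the quoted uncertainties are deliberately ignored.

```python
MPC3_PER_GPC3 = 1e9  # (1000 Mpc)^3 = 1e9 Mpc^3

# Constant SN Ia rate assumed in the abstract, in Mpc^-3 yr^-1.
SN_IA_RATE_PER_MPC3 = 2.4e-5
sn_ia_rate_per_gpc3 = SN_IA_RATE_PER_MPC3 * MPC3_PER_GPC3  # about 2.4e4 Gpc^-3 yr^-1


def fraction_of_sn_ia(rate_per_gpc3):
    """Fraction of the SN Ia rate represented by an absolute rate in Gpc^-3 yr^-1."""
    return rate_per_gpc3 / sn_ia_rate_per_gpc3


for central_rate in (8.0, 54.0):
    percent = 100.0 * fraction_of_sn_ia(central_rate)
    print(f"{central_rate:g} Gpc^-3 yr^-1 -> {percent:.3f} per cent of SNe Ia")
```

The central values correspond to roughly 0.03 and 0.2 per cent of the assumed SN Ia rate, both below the 0.5 per cent figure quoted in the abstract.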
Submitted 26 February, 2024;
originally announced February 2024.
-
Examining the Unique Online Risk Experiences and Mental Health Outcomes of LGBTQ+ versus Heterosexual Youth
Authors:
Tangila Tanni,
Mamtaj Akter,
Joshua Anderson,
Mary Amon,
Pamela Wisniewski
Abstract:
We collected and analyzed Instagram direct messages (DMs) from 173 youth aged 13-21 (including 86 LGBTQ+ youth). We examined youth's risk-flagged social media trace data with their self-reported mental health outcomes to examine how the differing online experiences of LGBTQ+ youth compare with their heterosexual counterparts. We found that LGBTQ+ youth experienced significantly more high-risk online interactions compared to heterosexual youth. LGBTQ+ youth reported overall poorer mental health, with online harassment specifically amplifying Self-Harm and Injury. LGBTQ+ youth's mental well-being linked positively to sexual messages, unlike heterosexual youth. Qualitatively, we found that most of the risk-flagged messages of LGBTQ+ youth were sexually motivated; however, a silver lining was that they sought support for their sexual identity from peers on the platform. The study highlights the importance of tailored online safety and inclusive design for LGBTQ+ youth, with implications for CHI community advancements in fostering supportive online environments.
Submitted 14 February, 2024;
originally announced February 2024.
-
New constraints on ultraheavy dark matter from the LZ experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer,
C. A. J. Brew
, et al. (174 additional authors not shown)
Abstract:
Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal freeze-out, but they have generally been less explored experimentally. In this work, we present a re-analysis of the first science run (SR1) of the LZ experiment, with an exposure of $0.9$ tonne$\times$year, to search for ultraheavy particle dark matter. The signal topology consists of multiple energy deposits in the active region of the detector forming a straight line, from which the velocity of the incoming particle can be reconstructed on an event-by-event basis. Zero events with this topology were observed after applying the data selection calibrated on a simulated sample of signal-like events. New experimental constraints are derived, which rule out previously unexplored regions of the dark matter parameter space of spin-independent interactions beyond a mass of 10$^{17}$ GeV/$c^2$.
Submitted 13 February, 2024;
originally announced February 2024.
-
The JWST Resolved Stellar Populations Early Release Science Program V. DOLPHOT Stellar Photometry for NIRCam and NIRISS
Authors:
Daniel R. Weisz,
Andrew E. Dolphin,
Alessandro Savino,
Kristen B. W. McQuinn,
Max J. B. Newman,
Benjamin F. Williams,
Nitya Kallivayalil,
Jay Anderson,
Martha L. Boyer,
Matteo Correnti,
Marla C. Geha,
Karin M. Sandstrom,
Andrew A. Cole,
Jack T. Warfield,
Evan D. Skillman,
Roger E. Cohen,
Rachael Beaton,
Alessandro Bressan,
Alberto Bolatto,
Michael Boylan-Kolchin,
Alyson M. Brooks,
James S. Bullock,
Charlie Conroy,
Michael C. Cooper,
Julianne J. Dalcanton,
et al. (16 additional authors not shown)
Abstract:
We present NIRCam and NIRISS modules for DOLPHOT, a widely-used crowded field stellar photometry package. We describe details of the modules including pixel masking, astrometric alignment, star finding, photometry, catalog creation, and artificial star tests (ASTs). We tested these modules using NIRCam and NIRISS images of M92 (a Milky Way globular cluster), Draco II (an ultra-faint dwarf galaxy), and WLM (a star-forming dwarf galaxy). DOLPHOT's photometry is highly precise and the color-magnitude diagrams are deeper and have better definition than anticipated during original program design in 2017. The primary systematic uncertainties in DOLPHOT's photometry arise from mismatches in the model and observed point spread functions (PSFs) and aperture corrections, each contributing $\lesssim0.01$ mag to the photometric error budget. Version 1.2 of WebbPSF models, which include charge diffusion and interpixel capacitance effects, significantly reduced PSF-related uncertainties. We also observed minor ($\lesssim0.05$ mag) chip-to-chip variations in NIRCam's zero points, which will be addressed by the JWST flux calibration program. Globular cluster observations are crucial for photometric calibration. Temporal variations in the photometry are generally $\lesssim0.01$ mag, although rare large misalignment events can introduce errors up to 0.08 mag. We provide recommended DOLPHOT parameters, guidelines for photometric reduction, and advice for improved observing strategies. Our ERS DOLPHOT data products are available on MAST, complemented by comprehensive online documentation and tutorials for using DOLPHOT with JWST imaging data.
Submitted 5 February, 2024;
originally announced February 2024.
-
Doping Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
H. Amar Es-sghir,
P. Amedo,
J. Anderson,
D. A. Andrade,
C. Andreopoulos,
et al. (1300 additional authors not shown)
Abstract:
Doping of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first doping test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 770 t of total liquid argon mass with 410 t of fiducial mass. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected. We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon doping can substantially recover light losses due to contamination of the liquid argon by nitrogen.
Submitted 9 February, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
HII regions and diffuse ionized gas in the AMUSING++ Compilation: I. Catalogue presentation
Authors:
A. Z. Lugo-Aranda,
S. F. Sánchez,
J. K. Barrera-Ballesteros,
C. López-Cobá,
C. Espinosa-Ponce,
L. Galbany,
Joseph P. Anderson
Abstract:
We present a catalog of $\sim$52,000 extragalactic HII regions and their spectroscopic properties obtained using Integral Field Spectroscopy (IFS) from MUSE observations. The sample analyzed in this study contains 678 galaxies within the nearby Universe (0.004 < z < 0.06) covering different morphological types and a wide range of stellar masses (6 < log(M$_{*}$/M$_{\odot}$) < 13). Each galaxy was analyzed using the Pipe3D and pyHIIextractor codes to obtain information on the ionized gas and underlying stellar populations. Specifically, the fluxes, equivalent widths, velocities and velocity dispersions of 30 emission lines covering the wavelength range from $\lambda$4750 Å to $\lambda$9300 Å were extracted and were used to estimate luminosity-weighted ages and metallicities of the underlying stellar populations from each HII region (of the original sample we detect HII regions in 539 galaxies). In addition, we introduce and apply a novel method, independent of any intrinsic physical property, to estimate and decontaminate the contribution of the diffuse ionized gas. Using the final catalog, we explore the dependence of properties of the HII regions on different local and global galaxy parameters: (i) Hubble type, (ii) stellar mass, (iii) galactocentric distance, and (iv) the age and metallicity of the underlying/neighbour stellar populations. We confirm known relations between properties of the HII regions and the underlying stellar populations (in particular with the age) uncovered using data of lower spatial and spectral resolution. Furthermore, we describe the existence of two main families of diffuse ionized gas different for galaxies host or not of HII region
Submitted 28 January, 2024;
originally announced January 2024.
-
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
Authors:
Chenyu Zhang,
Han Wang,
Aritra Mitra,
James Anderson
Abstract:
Federated reinforcement learning (FRL) has emerged as a promising paradigm for reducing the sample complexity of reinforcement learning tasks by exploiting information from different agents. However, when each agent interacts with a potentially different environment, little to nothing is known theoretically about the non-asymptotic performance of FRL algorithms. The lack of such results can be attributed to various technical challenges and their intricate interplay: Markovian sampling, linear function approximation, multiple local updates to save communication, heterogeneity in the reward functions and transition kernels of the agents' MDPs, and continuous state-action spaces. Moreover, in the on-policy setting, the behavior policies vary with time, further complicating the analysis. In response, we introduce FedSARSA, a novel federated on-policy reinforcement learning scheme, equipped with linear function approximation, to address these challenges and provide a comprehensive finite-time error analysis. Notably, we establish that FedSARSA converges to a policy that is near-optimal for all agents, with the extent of near-optimality proportional to the level of heterogeneity. Furthermore, we prove that FedSARSA leverages agent collaboration to enable linear speedups as the number of agents increases, which holds for both fixed and adaptive step-size configurations.
Submitted 14 April, 2024; v1 submitted 26 January, 2024;
originally announced January 2024.
-
How Beginning Programmers and Code LLMs (Mis)read Each Other
Authors:
Sydney Nguyen,
Hannah McLean Babe,
Yangtian Zi,
Arjun Guha,
Carolyn Jane Anderson,
Molly Q Feldman
Abstract:
Generative AI models, specifically large language models (LLMs), have made strides towards the long-standing goal of text-to-code generation. This progress has invited numerous studies of user interaction. However, less is known about the struggles and strategies of non-experts, for whom each step of the text-to-code problem presents challenges: describing their intent in natural language, evaluating the correctness of generated code, and editing prompts when the generated code is incorrect. This paper presents a large-scale controlled study of how 120 beginning coders across three academic institutions approach writing and editing prompts. A novel experimental design allows us to target specific steps in the text-to-code process and reveals that beginners struggle with writing and editing prompts, even for problems at their skill level and when correctness is automatically determined. Our mixed-methods evaluation provides insight into student processes and perceptions with key implications for non-expert Code LLM use within and outside of education.
Submitted 7 July, 2024; v1 submitted 26 January, 2024;
originally announced January 2024.