-
Tunable frequency conversion in doped photonic crystal fiber pumped near degeneracy
Authors:
Leah R Murphy,
Mateusz J Olszewski,
Petros Androvitsaneas,
Miguel Alvarez Perez,
Will A M Smith,
Anthony J Bennett,
Peter J Mosley,
Alex O C Davis
Abstract:
Future quantum networks will rely on the ability to coherently transfer optically encoded quantum information between different wavelength bands. Bragg-scattering four-wave mixing in optical fiber is a promising route to achieving this, but requires fibers with precise dispersion control and broadband transmission at signal, target and pump wavelengths. Here we introduce a photonic crystal fiber w…
▽ More
Future quantum networks will rely on the ability to coherently transfer optically encoded quantum information between different wavelength bands. Bragg-scattering four-wave mixing in optical fiber is a promising route to achieving this, but requires fibers with precise dispersion control and broadband transmission at signal, target and pump wavelengths. Here we introduce a photonic crystal fiber with a germanium-doped core featuring group velocity matching at 1550 nm, the telecoms C-band, and 920 nm, within the emission range of efficient single photon sources based on InAs quantum dots. With low chromatic walk-off and good optical guidance even at long wavelengths, large lengths of this fiber are used to achieve nanometer-scale frequency shifts between wavelengths around 920 nm with up to 79.4\% internal conversion efficiency, allowing dissimilar InAs dots to be interfaced. We also show how cascading this frequency conversion can be used to generate a frequency comb away from telecoms wavelengths. Finally, we use the fiber to demonstrate tunable frequency conversion of weak classical signals around 918 nm to the telecoms C-band.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
TeV Analysis of a Source Rich Region with HAWC Observatory: Is HESS J1809-193 a Potential Hadronic PeVatron?
Authors:
A. Albert,
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
R. Babu,
E. Belmont-Moreno,
A. Bernal,
M. Breuhaus,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
J. Cotzomi,
E. De la Fuente,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois,
C. Espinoza,
K. L. Fan,
K. Fang,
B. Fick,
N. Fraija
, et al. (57 additional authors not shown)
Abstract:
HESS J1809-193 is an unidentified TeV source, first detected by the High Energy Stereoscopic System (H.E.S.S.) Collaboration. The emission originates in a source-rich region that includes several Supernova Remnants (SNR) and Pulsars (PSR) including SNR G11.1+0.1, SNR G11.0-0.0, and the young radio pulsar J1809-1917. Originally classified as a pulsar wind nebula (PWN) candidate, recent studies show…
▽ More
HESS J1809-193 is an unidentified TeV source, first detected by the High Energy Stereoscopic System (H.E.S.S.) Collaboration. The emission originates in a source-rich region that includes several Supernova Remnants (SNR) and Pulsars (PSR) including SNR G11.1+0.1, SNR G11.0-0.0, and the young radio pulsar J1809-1917. Originally classified as a pulsar wind nebula (PWN) candidate, recent studies show the peak of the TeV region overlapping with a system of molecular clouds. This resulted in the revision of the original leptonic scenario to look for alternate hadronic scenarios. Marked as a potential PeVatron candidate, this region has been studied extensively by H.E.S.S. due to its emission extending up-to several tens of TeV. In this work, we use 2398 days of data from the High Altitude Water Cherenkov (HAWC) observatory to carry out a systematic source search for the HESS J1809-193 region. We were able to resolve emission detected as an extended component (modelled as a Symmetric Gaussian with a 1 $σ$ radius of 0.21 $^\circ$) with no clear cutoff at high energies and emitting photons up-to 210 TeV. We model the multi-wavelength observations for the region HESS J1809-193 using a time-dependent leptonic model and a lepto-hadronic model. Our model indicates that both scenarios could explain the observed data within the region of HESS J1809-193.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Authors:
Orevaoghene Ahia,
Sachin Kumar,
Hila Gonen,
Valentin Hoffman,
Tomasz Limisiewicz,
Yulia Tsvetkov,
Noah A. Smith
Abstract:
In multilingual settings, non-Latin scripts and low-resource languages are usually disadvantaged in terms of language models' utility, efficiency, and cost. Specifically, previous studies have reported multiple modeling biases that the current tokenization algorithms introduce to non-Latin script languages, the main one being over-segmentation. In this work, we propose MAGNET; multilingual adaptiv…
▽ More
In multilingual settings, non-Latin scripts and low-resource languages are usually disadvantaged in terms of language models' utility, efficiency, and cost. Specifically, previous studies have reported multiple modeling biases that the current tokenization algorithms introduce to non-Latin script languages, the main one being over-segmentation. In this work, we propose MAGNET; multilingual adaptive gradient-based tokenization to reduce over-segmentation via adaptive gradient-based subword tokenization. MAGNET learns to predict segment boundaries between byte tokens in a sequence via sub-modules within the model, which act as internal boundary predictors (tokenizers). Previous gradient-based tokenization methods aimed for uniform compression across sequences by integrating a single boundary predictor during training and optimizing it end-to-end through stochastic reparameterization alongside the next token prediction objective. However, this approach still results in over-segmentation for non-Latin script languages in multilingual settings. In contrast, MAGNET offers a customizable architecture where byte-level sequences are routed through language-script-specific predictors, each optimized for its respective language script. This modularity enforces equitable segmentation granularity across different language scripts compared to previous methods. Through extensive experiments, we demonstrate that in addition to reducing segmentation disparities, MAGNET also enables faster language modelling and improves downstream utility.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Relative Sensitivities and Correlation of Factors Introducing Uncertainty in Radiotherapy Dosimetry Audits
Authors:
Padmini Krishnadas,
Spencer Angus Thomas,
Jessica Goldring,
Nadia A. S. Smith,
Mohammad Hussein
Abstract:
Dosimetry audits are carried out to determine how well radiotherapy is delivered to the patient. It is also used to understand the uncertainty introduced into the measurement result when using different computational models. As measurement procedures are becoming increasingly complex with technological advancements, it is harder to establish sources of variability in measurements and understand if…
▽ More
Dosimetry audits are carried out to determine how well radiotherapy is delivered to the patient. It is also used to understand the uncertainty introduced into the measurement result when using different computational models. As measurement procedures are becoming increasingly complex with technological advancements, it is harder to establish sources of variability in measurements and understand if they stem from true differences in measurands or in the measurement pipelines themselves. The gamma index calculation is a widely accepted metric used for the comparison of measured and predicted doses in radiotherapy. However, various steps in the measurement pipeline can introduce variation in the measurement result. In this paper, we perform a sensitivity and correlation analysis to investigate the influence of various input factors (i.e. setting) in gamma index calculations on the uncertainty introduced in dosimetry audits. We identify a number of factors where standardization will improve measurements by reducing variability in outputs. Furthermore, we also compare gamma index metrics and similarities across audit sites.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
General Relativistic effects and the NIR variability of Sgr A* II: A systematic approach to temporal asymmetry
Authors:
Sebastiano D. von Fellenberg,
Gunther Witzel,
Michi Bauboeck,
Hui-Hsuan Chung,
Nicola Marchili,
Greg Martinez,
Matteo Sadun-Bordoni,
Guillaume Bourdarot,
Tuan Do,
Antonia Drescher,
Giovanni Fazio,
Frank Eisenhauer,
Reinhard Genzel,
Stefan Gillessen,
Joseph L. Hora,
Felix Mang,
Thomas Ott,
Howard A. Smith,
Eduardo Ros,
Diogo C. Ribeiro,
Felix Widmann,
S. P. Willner,
J. Anton Zensus
Abstract:
A systematic study, based on the third-moment structure function, of Sgr A*'s variability finds an exponential rise time $τ_{1,\rm{obs}}=14.8^{+0.4}_{-1.5}~\mathrm{minutes}$ and decay time $τ_{2,\rm{obs}}=13.1^{+1.3}_{-1.4}~\mathrm{minutes}$. This symmetry of the flux-density variability is consistent with earlier work, and we interpret it as caused by the dominance of Doppler boosting, as opposed…
▽ More
A systematic study, based on the third-moment structure function, of Sgr A*'s variability finds an exponential rise time $τ_{1,\rm{obs}}=14.8^{+0.4}_{-1.5}~\mathrm{minutes}$ and decay time $τ_{2,\rm{obs}}=13.1^{+1.3}_{-1.4}~\mathrm{minutes}$. This symmetry of the flux-density variability is consistent with earlier work, and we interpret it as caused by the dominance of Doppler boosting, as opposed to gravitational lensing, in Sgr~A*'s light curve. A relativistic, semi-physical model of Sgr~A* confirms an inclination angle $i<45$ degrees. The model also shows that the emission of the intrinsic radiative process can have some asymmetry even though the observed emission does not. The third-moment structure function, which is a measure of the skewness of the light-curve increments, may be a useful summary statistic in other contexts of astronomy because it senses only temporal asymmetry, i.e., it averages to zero for any temporally symmetric signal.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
MUSE: Machine Unlearning Six-Way Evaluation for Language Models
Authors:
Weijia Shi,
Jaechan Lee,
Yangsibo Huang,
Sadhika Malladi,
Jieyu Zhao,
Ari Holtzman,
Daogao Liu,
Luke Zettlemoyer,
Noah A. Smith,
Chiyuan Zhang
Abstract:
Language models (LMs) are trained on vast amounts of text data, which may include private and copyrighted content. Data owners may request the removal of their data from a trained model due to privacy or copyright concerns. However, exactly unlearning only these datapoints (i.e., retraining with the data removed) is intractable in modern-day models. This has led to the development of many approxim…
▽ More
Language models (LMs) are trained on vast amounts of text data, which may include private and copyrighted content. Data owners may request the removal of their data from a trained model due to privacy or copyright concerns. However, exactly unlearning only these datapoints (i.e., retraining with the data removed) is intractable in modern-day models. This has led to the development of many approximate unlearning algorithms. The evaluation of the efficacy of these algorithms has traditionally been narrow in scope, failing to precisely quantify the success and practicality of the algorithm from the perspectives of both the model deployers and the data owners. We address this issue by proposing MUSE, a comprehensive machine unlearning evaluation benchmark that enumerates six diverse desirable properties for unlearned models: (1) no verbatim memorization, (2) no knowledge memorization, (3) no privacy leakage, (4) utility preservation on data not intended for removal, (5) scalability with respect to the size of removal requests, and (6) sustainability over sequential unlearning requests. Using these criteria, we benchmark how effectively eight popular unlearning algorithms on 7B-parameter LMs can unlearn Harry Potter books and news articles. Our results demonstrate that most algorithms can prevent verbatim memorization and knowledge memorization to varying degrees, but only one algorithm does not lead to severe privacy leakage. Furthermore, existing algorithms fail to meet deployer's expectations because they often degrade general model utility and also cannot sustainably accommodate successive unlearning requests or large-scale content removal. Our findings identify key issues with the practicality of existing unlearning algorithms on language models, and we release our benchmark to facilitate further evaluations: muse-bench.github.io
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Measurement and analysis of the $^{246}$Cm and $^{248}$Cm neutron capture cross-sections at the EAR2 of the n TOF facility
Authors:
V. Alcayne,
A. Kimura,
E. Mendoza,
D. Cano-Ott,
O. Aberle,
F. Álvarez-Velarde,
S. Amaducci,
J. Andrzejewski,
L. Audouin,
V. Bécares,
V. Babiano-Suarez,
M. Bacak,
M. Barbagallo,
F. Bečvář,
G. Bellia,
E. Berthoumieux,
J. Billowes,
D. Bosnar,
A. Brown,
M. Busso,
M. Caamaño,
L. Caballero-Ontanaya,
F. Calviño,
M. Calviani,
A. Casanovas
, et al. (108 additional authors not shown)
Abstract:
The $^{246}$Cm(n,$γ$) and $^{248}$Cm(n,$γ$) cross-sections have been measured at the Experimental Area 2 (EAR2) of the n_TOF facility at CERN with three C$_6$D$_6$ detectors. This measurement is part of a collective effort to improve the capture cross-section data for Minor Actinides (MAs), which are required to estimate the production and transmutation rates of these isotopes in light water react…
▽ More
The $^{246}$Cm(n,$γ$) and $^{248}$Cm(n,$γ$) cross-sections have been measured at the Experimental Area 2 (EAR2) of the n_TOF facility at CERN with three C$_6$D$_6$ detectors. This measurement is part of a collective effort to improve the capture cross-section data for Minor Actinides (MAs), which are required to estimate the production and transmutation rates of these isotopes in light water reactors and innovative reactor systems. In particular, the neutron capture in $^{246}$Cm and $^{248}$Cm open the path for the formation of other Cm isotopes and heavier elements such as Bk and Cf and the knowledge of (n,$γ$) cross-sections of these Cm isotopes plays an important role in the transport, transmutation and storage of the spent nuclear fuel. The reactions $^{246}$Cm(n,$γ$) and $^{248}$Cm(n,$γ$) have been the two first capture measurements analyzed at n_TOF EAR2. Until this experiment and two recent measurements performed at J-PARC, there was only one set of data of the capture cross-sections of $^{246}$Cm and $^{248}$Cm, that was obtained in 1969 in an underground nuclear explosion experiment. In the measurement at n_TOF a total of 13 resonances of $^{246}$Cm between 4 and 400 eV and 5 of $^{248}$Cm between 7 and 100 eV have been identified and fitted. The radiative kernels obtained for $^{246}$Cm are compatible with JENDL-5, but some of them are not with JENDL-4, which has been adopted by JEFF-3.3 and ENDF/B-VIII.0. The radiative kernels obtained for the first three $^{248}$Cm resonances are compatible with JENDL-5, however, the other two are not compatible with any other evaluation and are 20% and 60% larger than JENDL-5.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Characterisation of the Warm-Jupiter TOI-1130 system with CHEOPS and photo-dynamical approach
Authors:
L. Borsato,
D. Degen,
A. Leleu,
M. J. Hooton,
J. A. Egger,
A. Bekkelien,
A. Brandeker,
A. Collier Cameron,
M. N. Günther,
V. Nascimbeni,
C. M. Persson,
A. Bonfanti,
T. G. Wilson,
A. C. M. Correia,
T. Zingales,
T. Guillot,
A. H. M. J. Triaud,
G. Piotto,
D. Gandolfi,
L. Abe,
Y. Alibert,
R. Alonso,
T. Bárczy,
D. Barrado Navascues,
S. C. C. Barros
, et al. (71 additional authors not shown)
Abstract:
Among the thousands of exoplanets discovered to date, approximately a few hundred gas giants on short-period orbits are classified as "lonely" and only a few are in a multi-planet system with a smaller companion on a close orbit. The processes that formed multi-planet systems hosting gas giants on close orbits are poorly understood, and only a few examples of this kind of system have been observed…
▽ More
Among the thousands of exoplanets discovered to date, approximately a few hundred gas giants on short-period orbits are classified as "lonely" and only a few are in a multi-planet system with a smaller companion on a close orbit. The processes that formed multi-planet systems hosting gas giants on close orbits are poorly understood, and only a few examples of this kind of system have been observed and well characterised. Within the contest of multi-planet system hosting gas-giant on short orbits, we characterise TOI-1130 system by measuring masses and orbital parameters. This is a 2-transiting planet system with a Jupiter-like planet (c) on a 8.35 days orbit and a Neptune-like planet (b) on an inner (4.07 days) orbit. Both planets show strong anti-correlated transit timing variations (TTVs). Furthermore, radial velocity (RV) analysis showed an additional linear trend, a possible hint of a non-transiting candidate planet on a far outer orbit. Since 2019, extensive transit and radial velocity observations of the TOI-1130 have been acquired using TESS and various ground-based facilities. We present a new photo-dynamical analysis of all available transit and RV data, with the addition of new CHEOPS and ASTEP+ data that achieve the best precision to date on the planetary radii and masses and on the timings of each transit. We were able to model interior structure of planet b constraining the presence of a gaseous envelope of H/He, while it was not possible to assess the possible water content. Furthermore, we analysed the resonant state of the two transiting planets, and we found that they lie just outside the resonant region. This could be the result of the tidal evolution that the system underwent. We obtained both masses of the planets with a precision less than 1.5%, and radii with a precision of about 1% and 3% for planet b and c, respectively.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Observation of the Galactic Center PeVatron Beyond 100 TeV with HAWC
Authors:
A. Albert,
R. Alfaro,
C. Alvarez,
A. Andrés,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
A. Bernal,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
C. de León,
D. Depaoli,
N. Di Lalla,
N. Di Lalla,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois
, et al. (78 additional authors not shown)
Abstract:
We report an observation of ultra-high energy (UHE) gamma rays from the Galactic Center region, using seven years of data collected by the High-Altitude Water Cherenkov (HAWC) Observatory. The HAWC data are best described as a point-like source (HAWC J1746-2856) with a power-law spectrum ($\mathrm{d}N/\mathrm{d}E=φ(E/26 \,\text{TeV})^γ$), where $γ=-2.88 \pm 0.15_{\text{stat}} - 0.1_{\text{sys}} $…
▽ More
We report an observation of ultra-high energy (UHE) gamma rays from the Galactic Center region, using seven years of data collected by the High-Altitude Water Cherenkov (HAWC) Observatory. The HAWC data are best described as a point-like source (HAWC J1746-2856) with a power-law spectrum ($\mathrm{d}N/\mathrm{d}E=φ(E/26 \,\text{TeV})^γ$), where $γ=-2.88 \pm 0.15_{\text{stat}} - 0.1_{\text{sys}} $ and $φ=1.5 \times 10^{-15}$ (TeV cm$^{2}$s)$^{-1}$ $\pm\, 0.3_{\text{stat}}\,^{+0.08_{\text{sys}}}_{-0.13_{\text{sys}}}$ extending from 6 to 114 TeV. We find no evidence of a spectral cutoff up to $100$ TeV using HAWC data. Two known point-like gamma-ray sources are spatially coincident with the HAWC gamma-ray excess: Sgr A$^{*}$ (HESS J1745-290) and the Arc (HESS J1746-285). We subtract the known flux contribution of these point sources from the measured flux of HAWC J1746-2856 to exclude their contamination and show that the excess observed by HAWC remains significant ($>$5$σ$) with the spectrum extending to $>$100 TeV. Our result supports that these detected UHE gamma rays can originate via hadronic interaction of PeV cosmic-ray protons with the dense ambient gas and confirms the presence of a proton PeVatron at the Galactic Center.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
An Upper Limit on the Photoproduction Cross Section of the Spin-Exotic $π_1(1600)$
Authors:
F. Afzal,
C. S. Akondi,
M. Albrecht,
M. Amaryan,
S. Arrigo,
V. Arroyave,
A. Asaturyan,
A. Austregesilo,
Z. Baldwin,
F. Barbosa,
J. Barlow,
E. Barriga,
R. Barsotti,
D. Barton,
V. Baturin,
V. V. Berdnikov,
T. Black,
W. Boeglin,
M. Boer,
W. J. Briscoe,
T. Britton,
S. Cao,
E. Chudakov,
G. Chung,
P. L. Cole
, et al. (124 additional authors not shown)
Abstract:
The spin-exotic hybrid meson $π_{1}(1600)$ is predicted to have a large decay rate to the $ωππ$ final state. Using 76.6~pb$^{-1}$ of data collected with the GlueX detector, we measure the cross sections for the reactions $γp \to ωπ^+ π^- p$, $γp \to ωπ^0 π^0 p$, and $γp\toωπ^-π^0Δ^{++}$ in the range $E_γ=$ 8-10 GeV. Using isospin conservation, we set the first upper limits on the photoproduction c…
▽ More
The spin-exotic hybrid meson $π_{1}(1600)$ is predicted to have a large decay rate to the $ωππ$ final state. Using 76.6~pb$^{-1}$ of data collected with the GlueX detector, we measure the cross sections for the reactions $γp \to ωπ^+ π^- p$, $γp \to ωπ^0 π^0 p$, and $γp\toωπ^-π^0Δ^{++}$ in the range $E_γ=$ 8-10 GeV. Using isospin conservation, we set the first upper limits on the photoproduction cross sections of the $π^{0}_{1}(1600)$ and $π^{-}_{1}(1600)$. We combine these limits with lattice calculations of decay widths and find that photoproduction of $η'π$ is the most sensitive two-body system to search for the $π_1(1600)$.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Understanding the Emission and Morphology of the Unidentified Gamma-Ray Source TeV J2032+4130
Authors:
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
C. de León,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois,
J. C. Díaz-Vélez,
K. Engel,
T. Ergin,
C. Espinoza
, et al. (56 additional authors not shown)
Abstract:
The first TeV gamma-ray source with no lower energy counterparts, TeV J2032+4130, was discovered by HEGRA. It appears in the third HAWC catalog as 3HWC J2031+415 and it is a bright TeV gamma-ray source whose emission has previously been resolved as 2 sources: HAWC J2031+415 and HAWC J2030+409. While HAWC J2030+409 has since been associated with the \emph{Fermi-LAT} Cygnus Cocoon, no such associati…
▽ More
The first TeV gamma-ray source with no lower energy counterparts, TeV J2032+4130, was discovered by HEGRA. It appears in the third HAWC catalog as 3HWC J2031+415 and it is a bright TeV gamma-ray source whose emission has previously been resolved as 2 sources: HAWC J2031+415 and HAWC J2030+409. While HAWC J2030+409 has since been associated with the \emph{Fermi-LAT} Cygnus Cocoon, no such association for HAWC J2031+415 has yet been found. In this work, we investigate the spectrum and energy-dependent morphology of HAWC J2031+415. We associate HAWC J2031+415 with the pulsar PSR J2032+4127 and perform a combined multi-wavelength analysis using radio, X-ray, and $γ$-ray emission. We conclude that HAWC J2031+415 and, by extension, TeV J2032+4130 are most probably a pulsar wind nebula (PWN) powered by PSR J2032+4127.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Large-Amplitude, Easy-Plane Spin-Orbit Torque Oscillators Driven by Out-of-Plane Spin Current: A Micromagnetic Study
Authors:
Daniel Kubler,
David A. Smith,
Tommy Nguyen,
Fernando Ramos-Diaz,
Satoru Emori,
Vivek P. Amin
Abstract:
Spin torque oscillators are spintronic devices that generate a periodic output signal from a non-periodic input, making them promising candidates for applications like microwave communications and neuromorphic computing. However, traditional spin torque oscillators suffer from a limited precessional cone angle and thermal stability, as well as a need for an applied bias magnetic field. We use micr…
▽ More
Spin torque oscillators are spintronic devices that generate a periodic output signal from a non-periodic input, making them promising candidates for applications like microwave communications and neuromorphic computing. However, traditional spin torque oscillators suffer from a limited precessional cone angle and thermal stability, as well as a need for an applied bias magnetic field. We use micromagnetic simulations to demonstrate a novel spin torque oscillator that relies on spin-orbit effects in ferromagnets to overcome these limitations. The key mechanism behind this oscillator is the generation of an out-of-plane spin current, in which both the spin flow and the spin orientation are out-of-plane. The torque from this spin current enables easy-plane coherent magnetic precession with a large cone angle and high thermal stability over a micron-scale lateral area. Moreover, the precession occurs about an internal field in the free layer, thereby eliminating the need for an external bias field. We demonstrate the feasibility of an easy-plane spin-orbit torque oscillator at room temperature over a wide parameter space, including the ratio of the out-of-plane spin current to the conventional spin-Hall spin current, presenting exciting possibilities for this novel spintronic device.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
On the Precision of the Spectral Profile Bound for the Mixing Time of Continuous State Markov Chains
Authors:
Elnaz Karimian Sichani,
Aaron Smith
Abstract:
We investigate the sharpness of the spectral profile bound presented by Goel et al. and Chen et al. on the $L^{2}$ mixing time of Markov chains on continuous state spaces. We show that the bound provided by Chen et al. is sharp up to a factor of $\log\log$ of the initial density. This result extends the findings of Kozma, which showed the analogous result for the original spectral profile bound of…
▽ More
We investigate the sharpness of the spectral profile bound presented by Goel et al. and Chen et al. on the $L^{2}$ mixing time of Markov chains on continuous state spaces. We show that the bound provided by Chen et al. is sharp up to a factor of $\log\log$ of the initial density. This result extends the findings of Kozma, which showed the analogous result for the original spectral profile bound of Goel et al. for Markov chains on finite state spaces. Kozma shows that the spectral profile bound is sharp up to a multiplicative factor of $\log(\log(π_{min}))$, where $π_{\min}$ is the smallest value of the probability mass function of the stationary distribution. We discuss the application of our primary finding to the comparison of Markov chains. Our main result can be used as a comparison bound, indicating that it is possible to compare chains even when only non-spectral bounds exist for a known chain.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Mind the Gap: Analyzing Lacunae with Transformer-Based Transcription
Authors:
Jaydeep Borkar,
David A. Smith
Abstract:
Historical documents frequently suffer from damage and inconsistencies, including missing or illegible text resulting from issues such as holes, ink problems, and storage damage. These missing portions or gaps are referred to as lacunae. In this study, we employ transformer-based optical character recognition (OCR) models trained on synthetic data containing lacunae in a supervised manner. We demo…
▽ More
Historical documents frequently suffer from damage and inconsistencies, including missing or illegible text resulting from issues such as holes, ink problems, and storage damage. These missing portions or gaps are referred to as lacunae. In this study, we employ transformer-based optical character recognition (OCR) models trained on synthetic data containing lacunae in a supervised manner. We demonstrate their effectiveness in detecting and restoring lacunae, achieving a success rate of 65%, compared to a base model lacking knowledge of lacunae, which achieves only 5% restoration. Additionally, we investigate the mechanistic properties of the model, such as the log probability of transcription, which can identify lacunae and other errors (e.g., mistranscriptions due to complex writing or ink issues) in line images without directly inspecting the image. This capability could be valuable for scholars seeking to distinguish images containing lacunae or errors from clean ones. Although we explore the potential of attention mechanisms in flagging lacunae and transcription errors, our findings suggest it is not a significant factor. Our work highlights a promising direction in utilizing transformer-based OCR models for restoring or analyzing damaged historical documents.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Transfer printing micro-assembly of silicon photonic crystal cavity arrays: beating the fabrication tolerance limit
Authors:
Sean P. Bommer,
Christopher Panuski,
Benoit Guilhabert,
Zhongyi Xia,
Jack A. Smith,
Martin D. Dawson,
Dirk Englund,
Michael J. Strain
Abstract:
Photonic crystal cavities (PhCCs) can confine optical fields in ultra-small volumes, enabling efficient light-matter interactions for quantum and non-linear optics, sensing and all-optical signal processing. The inherent nanometric tolerances of micro-fabrication platforms can induce cavity resonant wavelength shifts two-orders of magnitude larger than cavity linewidths, prohibiting fabrication of…
▽ More
Photonic crystal cavities (PhCCs) can confine optical fields in ultra-small volumes, enabling efficient light-matter interactions for quantum and non-linear optics, sensing and all-optical signal processing. The inherent nanometric tolerances of micro-fabrication platforms can induce cavity resonant wavelength shifts two-orders of magnitude larger than cavity linewidths, prohibiting fabrication of arrays of nominally identical devices. We address this device variability by fabricating PhCCs as releasable pixels that can be transferred from their native substrate to a receiver where ordered micro-assembly can overcome the inherent fabrication variance. We demonstrate the measurement, binning and transfer of 119 PhCCs in a single session, producing spatially ordered arrays of PhCCs, sorted by resonant wavelength. Furthermore, the rapid in-situ measurement of the devices enables measurements of the PhCCs dynamic response to the print process for the first time, showing plastic and elastic effects in the seconds to hours range.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Authors:
Orevaoghene Ahia,
Anuoluwapo Aremu,
Diana Abagyan,
Hila Gonen,
David Ifeoluwa Adelani,
Daud Abolade,
Noah A. Smith,
Yulia Tsvetkov
Abstract:
Yorùbá an African language with roughly 47 million speakers encompasses a continuum with several dialects. Recent efforts to develop NLP technologies for African languages have focused on their standard dialects, resulting in disparities for dialects and varieties for which there are little to no resources or tools. We take steps towards bridging this gap by introducing a new high-quality parallel…
▽ More
Yorùbá an African language with roughly 47 million speakers encompasses a continuum with several dialects. Recent efforts to develop NLP technologies for African languages have focused on their standard dialects, resulting in disparities for dialects and varieties for which there are little to no resources or tools. We take steps towards bridging this gap by introducing a new high-quality parallel text and speech corpus YORÙLECT across three domains and four regional Yorùbá dialects. To develop this corpus, we engaged native speakers, travelling to communities where these dialects are spoken, to collect text and speech data. Using our newly created corpus, we conducted extensive experiments on (text) machine translation, automatic speech recognition, and speech-to-text translation. Our results reveal substantial performance disparities between standard Yorùbá and the other dialects across all tasks. However, we also show that with dialect-adaptive finetuning, we are able to narrow this gap. We believe our dataset and experimental analysis will contribute greatly to developing NLP tools for Yorùbá and its dialects, and potentially for other African languages, by improving our understanding of existing challenges and offering a high-quality dataset for further development. We release YORÙLECT dataset and models publicly under an open license.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Decoding-Time Language Model Alignment with Multiple Objectives
Authors:
Ruizhe Shi,
Yifang Chen,
Yushi Hu,
Alisa Liu,
Hannaneh Hajishirzi,
Noah A. Smith,
Simon Du
Abstract:
Aligning language models (LMs) to human preferences has emerged as a critical pursuit, enabling these models to better serve diverse user needs. Existing methods primarily focus on optimizing LMs for a single reward function, limiting their adaptability to varied objectives. Here, we propose $\textbf{multi-objective decoding (MOD)}$, a decoding-time algorithm that outputs the next token from a lin…
▽ More
Aligning language models (LMs) to human preferences has emerged as a critical pursuit, enabling these models to better serve diverse user needs. Existing methods primarily focus on optimizing LMs for a single reward function, limiting their adaptability to varied objectives. Here, we propose $\textbf{multi-objective decoding (MOD)}$, a decoding-time algorithm that outputs the next token from a linear combination of predictions of all base models, for any given weightings over different objectives. We exploit a common form among a family of $f$-divergence regularized alignment approaches (such as PPO, DPO, and their variants) to identify a closed-form solution by Legendre transform, and derive an efficient decoding strategy. Theoretically, we show why existing approaches can be sub-optimal even in natural settings and obtain optimality guarantees for our method. Empirical results demonstrate the effectiveness of the algorithm. For example, compared to a parameter-merging baseline, MOD achieves 12.8% overall reward improvement when equally optimizing towards $3$ objectives. Moreover, we experiment with MOD on combining three fully-finetuned LLMs of different model sizes, each aimed at different objectives such as safety, coding, and general user preference. Unlike traditional methods that require careful curation of a mixture of datasets to achieve comprehensive improvement, we can quickly experiment with preference weightings using MOD to find the best combination of models. Our best combination reduces toxicity on Toxigen to nearly 0% and achieves 7.9--33.3% improvement across other three metrics ($\textit{i.e.}$, Codex@1, GSM-COT, BBH-COT).
△ Less
Submitted 28 June, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Evaluating Copyright Takedown Methods for Language Models
Authors:
Boyi Wei,
Weijia Shi,
Yangsibo Huang,
Noah A. Smith,
Chiyuan Zhang,
Luke Zettlemoyer,
Kai Li,
Peter Henderson
Abstract:
Language models (LMs) derive their capabilities from extensive training on diverse data, including potentially copyrighted material. These models can memorize and generate content similar to their training data, posing potential concerns. Therefore, model creators are motivated to develop mitigation methods that prevent generating protected content. We term this procedure as copyright takedowns fo…
▽ More
Language models (LMs) derive their capabilities from extensive training on diverse data, including potentially copyrighted material. These models can memorize and generate content similar to their training data, posing potential concerns. Therefore, model creators are motivated to develop mitigation methods that prevent generating protected content. We term this procedure as copyright takedowns for LMs, noting the conceptual similarity to (but legal distinction from) the DMCA takedown This paper introduces the first evaluation of the feasibility and side effects of copyright takedowns for LMs. We propose CoTaEval, an evaluation framework to assess the effectiveness of copyright takedown methods, the impact on the model's ability to retain uncopyrightable factual knowledge from the training data whose recitation is embargoed, and how well the model maintains its general utility and efficiency. We examine several strategies, including adding system prompts, decoding-time filtering interventions, and unlearning approaches. Our findings indicate that no tested method excels across all metrics, showing significant room for research in this unique problem setting and indicating potential unresolved challenges for live policy proposals.
△ Less
Submitted 11 July, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Unveiling the internal structure and formation history of the three planets transiting HIP 29442 (TOI-469) with CHEOPS
Authors:
J. A. Egger,
H. P. Osborn,
D. Kubyshkina,
C. Mordasini,
Y. Alibert,
M. N. Günther,
M. Lendl,
A. Brandeker,
A. Heitzmann,
A. Leleu,
M. Damasso,
A. Bonfanti,
T. G. Wilson,
S. G. Sousa,
J. Haldemann,
L. Delrez,
M. J. Hooton,
T. Zingales,
R. Luque,
R. Alonso,
J. Asquier,
T. Bárczy,
D. Barrado Navascues,
S. C. C. Barros,
W. Baumjohann
, et al. (69 additional authors not shown)
Abstract:
Multiplanetary systems spanning the radius valley are ideal testing grounds for exploring the proposed explanations for the observed bimodality in the radius distribution of close-in exoplanets. One such system is HIP 29442 (TOI-469), an evolved K0V star hosting two super-Earths and a sub-Neptune. We observe HIP 29442 with CHEOPS for a total of 9.6 days, which we model jointly with 2 sectors of TE…
▽ More
Multiplanetary systems spanning the radius valley are ideal testing grounds for exploring the proposed explanations for the observed bimodality in the radius distribution of close-in exoplanets. One such system is HIP 29442 (TOI-469), an evolved K0V star hosting two super-Earths and a sub-Neptune. We observe HIP 29442 with CHEOPS for a total of 9.6 days, which we model jointly with 2 sectors of TESS data to derive planetary radii of $3.410\pm0.046$, $1.551\pm0.045$ and $1.538\pm0.049$ R$_\oplus$ for planets b, c and d, which orbit HIP 29442 with periods of 13.6, 3.5 and 6.4 days. For planet d, this value deviates by more than 3 sigma from the median value reported in the discovery paper, leading us to conclude that caution is required when using TESS photometry to determine the radii of small planets with low per-transit S/N and large gaps between observations. Given the high precision of these new radii, combining them with published RVs from ESPRESSO and HIRES provides us with ideal conditions to investigate the internal structure and formation pathways of the planets in the system. We introduce the publicly available code plaNETic, a fast and robust neural network-based Bayesian internal structure modelling framework. We then apply hydrodynamic models to explore the upper atmospheric properties of these inferred structures. Finally, we identify planetary system analogues in a synthetic population generated with the Bern model for planet formation and evolution. Based on this analysis, we find that the planets likely formed on opposing sides of the water iceline from a protoplanetary disk with an intermediate solid mass. We finally report that the observed parameters of the HIP 29442 system are compatible with both a scenario where the second peak in the bimodal radius distribution corresponds to sub-Neptunes with a pure H/He envelope as well as a scenario with water-rich sub-Neptunes.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Low-Crosstalk, Silicon-Fabricated Optical Waveguides for Laser Delivery to Matter Qubits
Authors:
Clayton L. Craft,
Nicholas J. Barton,
Andrew C. Klug,
Kenneth Scalzi,
Ian Wildemann,
Pramod Asagodu,
Joseph D. Broz,
Nikola L. Porto,
Michael Macalik,
Anthony Rizzo,
Garrett Percevault,
Christopher C. Tison,
A. Matthew Smith,
Michael L. Fanto,
James Schneeloch,
Erin Sheridan,
Dylan Heberle,
Andrew Brownell,
Vijay S. S. Sundaram,
Venkatesh Deenadayalan,
Matthew van Niekerk,
Evan Manfreda-Schulz,
Gregory A. Howland,
Stefan F. Preble,
Daniel Coleman
, et al. (8 additional authors not shown)
Abstract:
Reliable control of quantum information in matter-based qubits requires precisely applied external fields, and unaccounted for spatial cross-talk of these fields between adjacent qubits leads to loss of fidelity. We report a CMOS foundry-produced, micro-fabricated silicon nitride (Si3N4) optical waveguide for addressing a chain of eight, unequally-spaced trapped barium ions with crosstalk compatib…
▽ More
Reliable control of quantum information in matter-based qubits requires precisely applied external fields, and unaccounted for spatial cross-talk of these fields between adjacent qubits leads to loss of fidelity. We report a CMOS foundry-produced, micro-fabricated silicon nitride (Si3N4) optical waveguide for addressing a chain of eight, unequally-spaced trapped barium ions with crosstalk compatible with scalable quantum information processing. The crosstalk mitigation techniques incorporated into the chip design result in a reduction of the measured optical field by at least 50.8(1.3) dB between adjacent waveguide outputs near 650 nm and similar behavior for devices designed for 493 nm and 585 nm. The waveguide outputs near 650 nm, along with a global laser near 493 nm were used to laser-cool a chain of eight barium-138 ions, and a camera imaged the resulting fluorescence at 493 nm.
△ Less
Submitted 27 June, 2024; v1 submitted 25 June, 2024;
originally announced June 2024.
-
Sparse Bayesian multidimensional scaling(s)
Authors:
Ami Sheth,
Aaron Smith,
Andrew J. Holbrook
Abstract:
Bayesian multidimensional scaling (BMDS) is a probabilistic dimension reduction tool that allows one to model and visualize data consisting of dissimilarities between pairs of objects. Although BMDS has proven useful within, e.g., Bayesian phylogenetic inference, its likelihood and gradient calculations require a burdensome order of $N^2$ floating-point operations, where $N$ is the number of data…
▽ More
Bayesian multidimensional scaling (BMDS) is a probabilistic dimension reduction tool that allows one to model and visualize data consisting of dissimilarities between pairs of objects. Although BMDS has proven useful within, e.g., Bayesian phylogenetic inference, its likelihood and gradient calculations require a burdensome order of $N^2$ floating-point operations, where $N$ is the number of data points. Thus, BMDS becomes impractical as $N$ grows large. We propose and compare two sparse versions of BMDS (sBMDS) that apply log-likelihood and gradient computations to subsets of the observed dissimilarity matrix data. Landmark sBMDS (L-sBMDS) extracts columns, while banded sBMDS (B-sBMDS) extracts diagonals of the data. These sparse variants let one specify a time complexity between $N^2$ and $N$. Under simplified settings, we prove posterior consistency for subsampled distance matrices. Through simulations, we examine the accuracy and computational efficiency across all models using both the Metropolis-Hastings and Hamiltonian Monte Carlo algorithms. We observe approximately 3-fold, 10-fold and 40-fold speedups with negligible loss of accuracy, when applying the sBMDS likelihoods and gradients to 500, 1,000 and 5,000 data points with 50 bands (landmarks); these speedups only increase with the size of data considered. Finally, we apply the sBMDS variants to the phylogeographic modeling of multiple influenza subtypes to better understand how these strains spread through global air transportation networks.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
X-raying the zeta Tau binary system
Authors:
Yael Naze,
Christian Motch,
G. Rauw,
Myron A. Smith,
Jan Robrade
Abstract:
The Be star zeta Tau was recently reported to be a gamma Cas analog; that is, it displays an atypical (bright and hard) X-ray emission. The origin of these X-rays remains debated.The first X-ray observations indicated a very large absorption of the hot plasma component (N_H~ 10^{23}/cm^2). This is most probably related to the edge-on configuration of the zeta Tau disk. If the X-ray emission arises…
▽ More
The Be star zeta Tau was recently reported to be a gamma Cas analog; that is, it displays an atypical (bright and hard) X-ray emission. The origin of these X-rays remains debated.The first X-ray observations indicated a very large absorption of the hot plasma component (N_H~ 10^{23}/cm^2). This is most probably related to the edge-on configuration of the zeta Tau disk. If the X-ray emission arises close to the companion, an orbital modulation of the absorption could be detected as the disk comes in and out of the line of sight. New XMM-Newton data were obtained to characterize the high-energy properties of zeta Tau in more detail. They are complemented by previous Chandra and SRG/eROSITA observations as well as by optical spectroscopy and TESS photometry. The high-quality XMM-Newton data reveal the presence of a faint soft X-ray emission, which appears in line with that recorded for non-gamma Cas Be stars. In addition, zeta Tau exhibits significant short-term variability at all energies, with larger amplitudes at lower frequencies (``red noise''), as is found in X-ray data of other gamma Cas stars. Transient variability (softness dip, low-frequency signal) may also be detected at some epochs. In addition, between X-ray exposures, large variations in the spectra are detected in the 1.5-4.keV energy band. They are due to large changes in absorption toward the hottest (9keV) plasma. These changes are not correlated with either the orbital phase or the depth of the shell absorption of the Halpha line. These observed properties are examined in the light of proposed gamma Cas models.
△ Less
Submitted 11 July, 2024; v1 submitted 21 June, 2024;
originally announced June 2024.
-
Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG
Authors:
William Merrill,
Noah A. Smith,
Yanai Elazar
Abstract:
How novel are texts generated by language models (LMs) relative to their training corpora? In this work, we investigate the extent to which modern LMs generate $n$-grams from their training data, evaluating both (i) the probability LMs assign to complete training $n$-grams and (ii) $n$-novelty, the proportion of $n$-grams generated by an LM that did not appear in the training data (for arbitrarily…
▽ More
How novel are texts generated by language models (LMs) relative to their training corpora? In this work, we investigate the extent to which modern LMs generate $n$-grams from their training data, evaluating both (i) the probability LMs assign to complete training $n$-grams and (ii) $n$-novelty, the proportion of $n$-grams generated by an LM that did not appear in the training data (for arbitrarily large $n$). To enable arbitrary-length $n$-gram search over a corpus in constant time, we develop Rusty-DAWG, a novel search tool inspired by indexing of genomic data. We compare the novelty of LM-generated text to human-written text and explore factors that affect generation novelty, focusing on the Pythia models. We find that, for $n > 4$, LM-generated text is less novel than human-written text, though it is more novel for smaller $n$. Larger LMs and more constrained decoding strategies both decrease novelty. Finally, we show that LMs complete $n$-grams with lower loss if they are more frequent in the training data. Overall, our results reveal factors influencing the novelty of LM-generated text, and we release Rusty-DAWG to facilitate further pretraining data research.
△ Less
Submitted 25 June, 2024; v1 submitted 18 June, 2024;
originally announced June 2024.
-
Measurement of Spin-Density Matrix Elements in $Δ^{++}(1232)$ photoproduction
Authors:
F. Afzal,
C. S. Akondi,
M. Albrecht,
M. Amaryan,
S. Arrigo,
V. Arroyave,
A. Asaturyan,
A. Austregesilo,
Z. Baldwin,
F. Barbosa,
J. Barlow,
E. Barriga,
R. Barsotti,
D. Barton,
V. Baturin,
V. V. Berdnikov,
T. Black,
W. Boeglin,
M. Boer,
W. J. Briscoe,
T. Britton,
S. Cao,
E. Chudakov,
G. Chung,
P. L. Cole
, et al. (124 additional authors not shown)
Abstract:
We report the measurement of spin-density matrix elements of the $Δ^{++}(1232)$ in the photoproduction reaction $γp \to π^-Δ^{++}(1232)$ with the GlueX experiment in Hall D at Jefferson Lab. The measurement used a linearly polarized photon beam with $E_γ=8.2-8.8$~GeV and the statistical precision exceeds the previous measurement from SLAC by three orders of magnitude for the momentum transfer squa…
▽ More
We report the measurement of spin-density matrix elements of the $Δ^{++}(1232)$ in the photoproduction reaction $γp \to π^-Δ^{++}(1232)$ with the GlueX experiment in Hall D at Jefferson Lab. The measurement used a linearly polarized photon beam with $E_γ=8.2-8.8$~GeV and the statistical precision exceeds the previous measurement from SLAC by three orders of magnitude for the momentum transfer squared region $-t < 1.4$ GeV$^2$. The data are sensitive to the previously undetermined relative sign between couplings in existing Regge exchange models. Linear combinations of the extracted SDMEs allow for a decomposition into natural and unnatural exchange amplitudes, which shows that the unnatural exchange plays an important role in the low $-t$ region.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
A framework for developing a knowledge management platform
Authors:
Marie Lisandra Zepeda Mendoza,
Sonali Agarwal,
James A. Blackshaw,
Vanesa Bol,
Audrey Fazzi,
Filippo Fiorini,
Amy Louise Foreman,
Nancy George,
Brett R. Johnson,
Brian Martin,
Dave McComb,
Euphemia Mutasa-Gottgens,
Helen Parkinson,
Martin Romacker,
Rolf Russell,
Valérien Ségard,
Shawn Zheng Kai Tan,
Wei Kheng Teh,
F. P. Winstanley,
Benedict Wong,
Adrian M. Smith
Abstract:
Knowledge management (KM) involves collecting, organizing, storing, and disseminating information to improve decision-making, innovation, and performance. Implementing KM at scale has become essential for organizations to effectively leverage vast accessible data. This paper is a compilation of concepts that emerged from KM workshops hosted by EMBL-EBI, attended by SMEs and industry. We provide gu…
▽ More
Knowledge management (KM) involves collecting, organizing, storing, and disseminating information to improve decision-making, innovation, and performance. Implementing KM at scale has become essential for organizations to effectively leverage vast accessible data. This paper is a compilation of concepts that emerged from KM workshops hosted by EMBL-EBI, attended by SMEs and industry. We provide guidance on envisioning, executing, evaluating, and evolving knowledge management platforms. We emphasize essential considerations such as setting knowledge domain boundaries and measuring success, as well as the importance of making knowledge accessible for downstream applications and non-computational users and highlights necessary personal and organizational skills for success. We stress the importance of collaboration and the need for convergence on shared principles and commitment to provide or seek resources to advance KM. The community is invited to join the journey of KM and contribute to the advancement of the field by applying and improving on the guidelines described.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
PLATO's signal and noise budget
Authors:
Anko Börner,
Carsten Paproth,
Juan Cabrera,
Martin Pertenais,
Heike Rauer,
J. Miguel Mas-Hesse,
Isabella Pagano,
Jose Lorenzo Alvarez,
Anders Erikson,
Denis Grießbach,
Yves Levillain,
Demetrio Magrin,
Valery Mogulsky,
Sami-Matias Niemi,
Thibaut Prod'homme,
Sara Regibo,
Joris De Ridder,
Steve Rockstein,
Reza Samadi,
Dimitri Serrano-Velarde,
Alan Smith,
Peter Verhoeve,
Dave Walton
Abstract:
ESA's PLATO mission aims the detection and characterization of terrestrial planets around solar-type stars as well as the study of host star properties. The noise-to-signal ratio (NSR) is the main performance parameter of the PLATO instrument, which consists of 24 Normal Cameras and 2 Fast Cameras. In order to justify, verify and breakdown NSR-relevant requirements the software simulator PINE was…
▽ More
ESA's PLATO mission aims the detection and characterization of terrestrial planets around solar-type stars as well as the study of host star properties. The noise-to-signal ratio (NSR) is the main performance parameter of the PLATO instrument, which consists of 24 Normal Cameras and 2 Fast Cameras. In order to justify, verify and breakdown NSR-relevant requirements the software simulator PINE was developed. PINE models the signal pathway from a target star to the digital output of a camera based on physical models and considers the major noise contributors. In this paper, the simulator's coarse mode is introduced which allows fast performance analyses on instrument level. The added value of PINE is illustrated by exemplary applications.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Vertically Graded FeNi Alloys with Low Damping and a Sizeable Spin-Orbit Torque
Authors:
Rachel E. Maizel,
Shuang Wu,
Purnima P. Balakrishnan,
Alexander J. Grutter,
Christy J. Kinane,
Andrew J. Caruana,
Prabandha Nakarmi,
Bhuwan Nepal,
David A. Smith,
Youngmin Lim,
Julia L. Jones,
Wyatt C. Thomas,
Jing Zhao,
F. Marc Michel,
Tim Mewes,
Satoru Emori
Abstract:
Energy-efficient spintronic devices require a large spin-orbit torque (SOT) and low damping to excite magnetic precession. In conventional devices with heavy-metal/ferromagnet bilayers, reducing the ferromagnet thickness to $\sim$1 nm enhances the SOT but dramatically increases damping. Here, we investigate an alternative approach based on a 10 nm thick single-layer ferromagnet to attain both low…
▽ More
Energy-efficient spintronic devices require a large spin-orbit torque (SOT) and low damping to excite magnetic precession. In conventional devices with heavy-metal/ferromagnet bilayers, reducing the ferromagnet thickness to $\sim$1 nm enhances the SOT but dramatically increases damping. Here, we investigate an alternative approach based on a 10 nm thick single-layer ferromagnet to attain both low damping and a sizable SOT. Instead of relying on a single interface, we continuously break the bulk inversion symmetry with a vertical compositional gradient of two ferromagnetic elements: Fe with low intrinsic damping and Ni with sizable spin-orbit coupling. We find low effective damping parameters of $α_\mathrm{eff} < 5\times10^{-3}$ in the FeNi alloy films, despite the steep compositional gradients. Moreover, we reveal a sizable anti-damping SOT efficiency of $θ_\mathrm{AD} \approx 0.05$, even without an intentional compositional gradient. Through depth-resolved x-ray diffraction, we identify a lattice strain gradient as crucial symmetry breaking that underpins the SOT. Our findings provide fresh insights into damping and SOTs in single-layer ferromagnets for power-efficient spintronic devices.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Gap-gradient methods for solving generalized mixed integer inverse optimization: an application to political gerrymandering
Authors:
Ari J. Smith,
Justin J. Boutilier
Abstract:
Inverse optimization has received much attention in recent years, but little literature exists for solving generalized mixed integer inverse optimization. We propose a new approach for solving generalized mixed-integer inverse optimization problems based on sub-gradient methods. We characterize when a generalized inverse optimization problem can be solved using sub-gradient methods and we prove th…
▽ More
Inverse optimization has received much attention in recent years, but little literature exists for solving generalized mixed integer inverse optimization. We propose a new approach for solving generalized mixed-integer inverse optimization problems based on sub-gradient methods. We characterize when a generalized inverse optimization problem can be solved using sub-gradient methods and we prove that modifications to classic sub-gradient algorithms can return exact solutions in finite time. Our best implementation improves solution time by up to 90% compared to the best performing method from the literature. We then develop custom heuristic methods for graph-based inverse problems using a combination of graph coarsening and ensemble methods. Our heuristics are able to further reduce solution time by up to 52%, while still producing near-optimal solutions. Finally, we propose a new application domain - quantitatively identifying gerrymandering - for generalized inverse integer optimization. We apply our overall solution approach to analyze the congressional districts of the State of Iowa using real-world data. We find that the accepted districting marginally improves population imbalance at the cost of a significant increase in partisan efficiency gap. We argue that our approach can produce a more nuanced data-driven argument that a proposed districting should be considered gerrymandered.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Authors:
Yushi Hu,
Weijia Shi,
Xingyu Fu,
Dan Roth,
Mari Ostendorf,
Luke Zettlemoyer,
Noah A Smith,
Ranjay Krishna
Abstract:
Humans draw to facilitate reasoning: we draw auxiliary lines when solving geometry problems; we mark and circle when reasoning on maps; we use sketches to amplify our ideas and relieve our limited-capacity working memory. However, such actions are missing in current multimodal language models (LMs). Current chain-of-thought and tool-use paradigms only use text as intermediate reasoning steps. In t…
▽ More
Humans draw to facilitate reasoning: we draw auxiliary lines when solving geometry problems; we mark and circle when reasoning on maps; we use sketches to amplify our ideas and relieve our limited-capacity working memory. However, such actions are missing in current multimodal language models (LMs). Current chain-of-thought and tool-use paradigms only use text as intermediate reasoning steps. In this work, we introduce Sketchpad, a framework that gives multimodal LMs a visual sketchpad and tools to draw on the sketchpad. The LM conducts planning and reasoning according to the visual artifacts it has drawn. Different from prior work, which uses text-to-image models to enable LMs to draw, Sketchpad enables LMs to draw with lines, boxes, marks, etc., which is closer to human sketching and better facilitates reasoning. Sketchpad can also use specialist vision models during the sketching process (e.g., draw bounding boxes with object detection models, draw masks with segmentation models), to further enhance visual perception and reasoning. We experiment with a wide range of math tasks (including geometry, functions, graphs, and chess) and complex visual reasoning tasks. Sketchpad substantially improves performance on all tasks over strong base models with no sketching, yielding an average gain of 12.7% on math tasks, and 8.6% on vision tasks. GPT-4o with Sketchpad sets a new state of the art on all tasks, including V*Bench (80.3%), BLINK spatial reasoning (83.9%), and visual correspondence (80.8%). All codes and data are in https://visualsketchpad.github.io/.
△ Less
Submitted 10 July, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
Authors:
Hamish Ivison,
Yizhong Wang,
Jiacheng Liu,
Zeqiu Wu,
Valentina Pyatkin,
Nathan Lambert,
Noah A. Smith,
Yejin Choi,
Hannaneh Hajishirzi
Abstract:
Learning from preference feedback has emerged as an essential step for improving the generation quality and performance of modern language models (LMs). Despite its widespread use, the way preference-based learning is applied varies wildly, with differing data, learning algorithms, and evaluations used, making disentangling the impact of each aspect difficult. In this work, we identify four core a…
▽ More
Learning from preference feedback has emerged as an essential step for improving the generation quality and performance of modern language models (LMs). Despite its widespread use, the way preference-based learning is applied varies wildly, with differing data, learning algorithms, and evaluations used, making disentangling the impact of each aspect difficult. In this work, we identify four core aspects of preference-based learning: preference data, learning algorithm, reward model, and policy training prompts, systematically investigate the impact of these components on downstream model performance, and suggest a recipe for strong learning for preference feedback. Our findings indicate that all aspects are important for performance, with better preference data leading to the largest improvements, followed by the choice of learning algorithm, the use of improved reward models, and finally the use of additional unlabeled prompts for policy training. Notably, PPO outperforms DPO by up to 2.5% in math and 1.2% in general domains. High-quality preference data leads to improvements of up to 8% in instruction following and truthfulness. Despite significant gains of up to 5% in mathematical evaluation when scaling up reward models, we surprisingly observe marginal improvements in other categories.
We publicly release the code used for training (https://github.com/hamishivi/EasyLM) and evaluating (https://github.com/allenai/open-instruct) our models, along with the models and datasets themselves (https://huggingface.co/collections/allenai/tulu-v25-suite-66676520fd578080e126f618).
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
A green solvent system for precursor phase-engineered sequential deposition of stable formamidinium lead triiodide for perovskite solar cells
Authors:
Benjamin M. Gallant,
Philippe Holzhey,
Joel A. Smith,
Saqlain Choudhary,
Karim A. Elmestekawy,
Pietro Caprioglio,
Igal Levine,
Alex Sheader,
Fengning Yang,
Daniel T. W. Toolan,
Rachel C. Kilbride,
Augustin K. A. Zaininger,
James M. Ball,
M. Greyson Christoforo,
Nakita Noel,
Laura M. Herz,
Dominik J. Kubicki,
Henry J. Snaith
Abstract:
Perovskite solar cells (PSCs) offer an efficient, inexpensive alternative to current photovoltaic technologies, with the potential for manufacture via high-throughput coating methods. However, challenges for commercial-scale solution-processing of metal-halide perovskites include the use of harmful solvents, the expense of maintaining controlled atmospheric conditions, and the inherent instabiliti…
▽ More
Perovskite solar cells (PSCs) offer an efficient, inexpensive alternative to current photovoltaic technologies, with the potential for manufacture via high-throughput coating methods. However, challenges for commercial-scale solution-processing of metal-halide perovskites include the use of harmful solvents, the expense of maintaining controlled atmospheric conditions, and the inherent instabilities of PSCs under operation. Here, we address these challenges by introducing a high volatility, low toxicity, biorenewable solvent system to fabricate a range of 2D perovskites, which highly effective precursor phases for subsequent transformation to alpha-formamidinium lead triiodide (FAPbI3), fully processed under ambient conditions. PSCs utilising our FAPbI3 reproducibly show remarkable stability under illumination and elevated temperature (ISOS-L-2) and "damp heat" (ISOS-D-3) stressing, surpassing other state-of-the-art perovskite compositions. We determine that this enhancement is a consequence of the 2D precursor phase crystallisation route, which simultaneously avoids retention of residual low-volatility solvents (such as DMF and DMSO) and reduces the rate of degradation of FA+ in the material. Our findings highlight both the critical role of the initial crystallisation process in determining the operational stability of perovskite materials, and that neat FA+-based perovskites can be competitively stable despite the inherent metastability of the alpha-phase.
△ Less
Submitted 14 June, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
Studying $π^+π^-$ photoproduction beyond Pomeron exchange
Authors:
Łukasz Bibrzycki,
Nadine Hammoud,
Vincent Mathieu,
Robert J. Perry,
Alex Akridge,
César Fernández-Ramírez,
Gloria Montaña,
Alessandro Pilloni,
Arkaitz Rodas,
Vanamali Shastry,
Wyatt A. Smith,
Daniel Winney,
Adam P. Szczepaniak
Abstract:
Forward photoproduction of $π^+π^-$ pairs with invariant mass of the order of $m_ρ\sim 770$ MeV is traditionally understood to be produced via Pomeron exchange. Based on a detailed analysis of the CLAS photoproduction data, it is shown that the dynamics of two-pion photoproduction for $|t|\gtrsim 0.5$ GeV$^2$ cannot be explained by Pomeron exchange alone. This motivates the development of a new th…
▽ More
Forward photoproduction of $π^+π^-$ pairs with invariant mass of the order of $m_ρ\sim 770$ MeV is traditionally understood to be produced via Pomeron exchange. Based on a detailed analysis of the CLAS photoproduction data, it is shown that the dynamics of two-pion photoproduction for $|t|\gtrsim 0.5$ GeV$^2$ cannot be explained by Pomeron exchange alone. This motivates the development of a new theoretical model of two-pion photoproduction which incorporates both two-pion and pion-nucleon resonant contributions. After fitting free parameters, the model provides an excellent description of the low moments of the angular distribution measured at CLAS, and enables an assessment of the relative contributions of particular production mechanisms and an interpretation of the various features of the data in terms of these mechanisms.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
The PLATO Mission
Authors:
Heike Rauer,
Conny Aerts,
Juan Cabrera,
Magali Deleuil,
Anders Erikson,
Laurent Gizon,
Mariejo Goupil,
Ana Heras,
Jose Lorenzo-Alvarez,
Filippo Marliani,
Cesar Martin-Garcia,
J. Miguel Mas-Hesse,
Laurence O'Rourke,
Hugh Osborn,
Isabella Pagano,
Giampaolo Piotto,
Don Pollacco,
Roberto Ragazzoni,
Gavin Ramsay,
Stéphane Udry,
Thierry Appourchaux,
Willy Benz,
Alexis Brandeker,
Manuel Güdel,
Eduardo Janot-Pacheco
, et al. (801 additional authors not shown)
Abstract:
PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observati…
▽ More
PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observations from the ground, planets will be characterised for their radius, mass, and age with high accuracy (5 %, 10 %, 10 % for an Earth-Sun combination respectively). PLATO will provide us with a large-scale catalogue of well-characterised small planets up to intermediate orbital periods, relevant for a meaningful comparison to planet formation theories and to better understand planet evolution. It will make possible comparative exoplanetology to place our Solar System planets in a broader context. In parallel, PLATO will study (host) stars using asteroseismology, allowing us to determine the stellar properties with high accuracy, substantially enhancing our knowledge of stellar structure and evolution.
The payload instrument consists of 26 cameras with 12cm aperture each. For at least four years, the mission will perform high-precision photometric measurements. Here we review the science objectives, present PLATO's target samples and fields, provide an overview of expected core science performance as well as a description of the instrument and the mission profile at the beginning of the serial production of the flight cameras. PLATO is scheduled for a launch date end 2026. This overview therefore provides a summary of the mission to the community in preparation of the upcoming operational phases.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
Auditing Privacy Mechanisms via Label Inference Attacks
Authors:
Róbert István Busa-Fekete,
Travis Dick,
Claudio Gentile,
Andrés Muñoz Medina,
Adam Smith,
Marika Swanberg
Abstract:
We propose reconstruction advantage measures to audit label privatization mechanisms. A reconstruction advantage measure quantifies the increase in an attacker's ability to infer the true label of an unlabeled example when provided with a private version of the labels in a dataset (e.g., aggregate of labels from different users or noisy labels output by randomized response), compared to an attacke…
▽ More
We propose reconstruction advantage measures to audit label privatization mechanisms. A reconstruction advantage measure quantifies the increase in an attacker's ability to infer the true label of an unlabeled example when provided with a private version of the labels in a dataset (e.g., aggregate of labels from different users or noisy labels output by randomized response), compared to an attacker that only observes the feature vectors, but may have prior knowledge of the correlation between features and labels. We consider two such auditing measures: one additive, and one multiplicative. These incorporate previous approaches taken in the literature on empirical auditing and differential privacy. The measures allow us to place a variety of proposed privatization schemes -- some differentially private, some not -- on the same footing. We analyze these measures theoretically under a distributional model which encapsulates reasonable adversarial settings. We also quantify their behavior empirically on real and simulated prediction tasks. Across a range of experimental settings, we find that differentially private schemes dominate or match the privacy-utility tradeoff of more heuristic approaches.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
CHEOPS in-flight performance: A comprehensive look at the first 3.5 years of operations
Authors:
A. Fortier,
A. E. Simon,
C. Broeg,
G. Olofsson,
A. Deline,
T. G. Wilson,
P. F. L. Maxted,
A. Brandeker,
A. Collier Cameron,
M. Beck,
A. Bekkelien,
N. Billot,
A. Bonfanti,
G. Bruno,
J. Cabrera,
L. Delrez,
B. -O. Demory,
D. Futyan,
H. -G. Florén,
M. N. Günther,
A. Heitzmann,
S. Hoyer,
K. G. Isaak,
S. G. Sousa,
M. Stalport
, et al. (106 additional authors not shown)
Abstract:
CHEOPS is a space telescope specifically designed to monitor transiting exoplanets orbiting bright stars. In September 2023, CHEOPS completed its nominal mission and remains in excellent operational conditions. The mission has been extended until the end of 2026. Scientific and instrumental data have been collected throughout in-orbit commissioning and nominal operations, enabling a comprehensive…
▽ More
CHEOPS is a space telescope specifically designed to monitor transiting exoplanets orbiting bright stars. In September 2023, CHEOPS completed its nominal mission and remains in excellent operational conditions. The mission has been extended until the end of 2026. Scientific and instrumental data have been collected throughout in-orbit commissioning and nominal operations, enabling a comprehensive analysis of the mission's performance. In this article, we present the results of this analysis with a twofold goal. First, we aim to inform the scientific community about the present status of the mission and what can be expected as the instrument ages. Secondly, we intend for this publication to serve as a legacy document for future missions, providing insights and lessons learned from the successful operation of CHEOPS. To evaluate the instrument performance in flight, we developed a comprehensive monitoring and characterisation programme. It consists of dedicated observations that allow us to characterise the instrument's response. In addition to the standard collection of nominal science and housekeeping data, these observations provide input for detecting, modelling, and correcting instrument systematics, discovering and addressing anomalies, and comparing the instrument's actual performance with expectations. The precision of the CHEOPS measurements has enabled the mission objectives to be met and exceeded. Careful modelling of the instrumental systematics allows the data quality to be significantly improved during the light curve analysis phase, resulting in more precise scientific measurements. CHEOPS is compliant with the driving scientific requirements of the mission. Although visible, the ageing of the instrument has not affected the mission's performance.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Multiwavelength Observations of Sgr A*. II. 2019 July 21 and 26
Authors:
Joseph M. Michail,
Farhad Yusef-Zadeh,
Mark Wardle,
Devaky Kunneriath,
Joseph L. Hora,
Howard Bushouse,
Giovanni G. Fazio,
Sera Markoff,
Howard A. Smith
Abstract:
We report on the final two days of a multiwavelength campaign of Sgr A* observing in the radio, submillimeter, infrared, and X-ray bands in July 2019. Sgr A* was remarkably active, showing multiple flaring events across the electromagnetic spectrum. We detect a transient $\sim35$-minute periodicity feature in Spitzer Space Telescope light curves on 21 July 2019. Time-delayed emission was detected…
▽ More
We report on the final two days of a multiwavelength campaign of Sgr A* observing in the radio, submillimeter, infrared, and X-ray bands in July 2019. Sgr A* was remarkably active, showing multiple flaring events across the electromagnetic spectrum. We detect a transient $\sim35$-minute periodicity feature in Spitzer Space Telescope light curves on 21 July 2019. Time-delayed emission was detected in ALMA light curves, suggesting a hotspot within the accretion flow on a stable orbit. On the same night, we observe a decreased flux in the submillimeter light curve following an X-ray flare detected by the Chandra X-ray Observatory and model the feature with an adiabatically expanding synchrotron hotspot occulting the accretion flow. The event is produced by a plasma $0.55~R_{\text{S}}$ in radius with an electron spectrum $p=2.84$. It is threaded by a $\sim130$ Gauss magnetic field and expands at $0.6\%$ the speed of light. Finally, we reveal an unambiguous flare in the infrared, submillimeter, and radio, demonstrating that the variable emission is intrinsically linked. We jointly fit the radio and submillimeter light curves using an adiabatically expanding synchrotron hotspot and find it is produced by a plasma with an electron spectrum $p=0.59$, $187$ Gauss magnetic field, and radius $0.47~R_{\text{S}}$ that expands at $0.029c$. In both cases, the uncertainty in the appropriate lower and upper electron energy bounds may inflate the derived equipartition field strengths by a factor of 2 or more. Our results confirm that both synchrotron- and adiabatic-cooling processes are involved in the variable emission's evolution at submillimeter and infrared wavelengths.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
HIP 41378 observed by CHEOPS: Where is planet d?
Authors:
S. Sulis,
L. Borsato,
S. Grouffal,
H. P. Osborn,
A. Santerne,
A. Brandeker,
M. N. Günther,
A. Heitzmann,
M. Lendl,
M. Fridlund,
D. Gandolfi,
Y. Alibert,
R. Alonso,
T. Bárczy,
D. Barrado Navascues,
S. C. Barros,
W. Baumjohann,
T. Beck,
W. Benz,
M. Bergomi,
N. Billot,
A. Bonfanti,
C. Broeg,
A. Collier Cameron,
C. Corral van Damme
, et al. (62 additional authors not shown)
Abstract:
HIP 41378 d is a long-period planet that has only been observed to transit twice, three years apart, with K2. According to stability considerations and a partial detection of the Rossiter-McLaughlin effect, $P_\mathrm{d} = 278.36$ d has been determined to be the most likely orbital period. We targeted HIP 41378 d with CHEOPS at the predicted transit timing based on $P_\mathrm{d}= 278.36$ d, but th…
▽ More
HIP 41378 d is a long-period planet that has only been observed to transit twice, three years apart, with K2. According to stability considerations and a partial detection of the Rossiter-McLaughlin effect, $P_\mathrm{d} = 278.36$ d has been determined to be the most likely orbital period. We targeted HIP 41378 d with CHEOPS at the predicted transit timing based on $P_\mathrm{d}= 278.36$ d, but the observations show no transit. We find that large ($>22.4$ hours) transit timing variations (TTVs) could explain this non-detection during the CHEOPS observation window. We also investigated the possibility of an incorrect orbital solution, which would have major implications for our knowledge of this system. If $P_\mathrm{d} \neq 278.36$ d, the periods that minimize the eccentricity would be $101.22$ d and $371.14$ d. The shortest orbital period will be tested by TESS, which will observe HIP 41378 in Sector 88 starting in January 2025. Our study shows the importance of a mission like CHEOPS, which today is the only mission able to make long observations (i.e., from space) to track the ephemeris of long-period planets possibly affected by large TTVs.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Enhancing Exoplanet Ephemerides by Leveraging Professional and Citizen Science Data: A Test Case with WASP-77A b
Authors:
Federico R. Noguer,
Suber Corley,
Kyle A. Pearson,
Robert T. Zellem,
Molly N. Simon,
Jennifer A. Burt,
Isabela Huckabee,
Prune C. August,
Megan Weiner Mansfield,
Paul A. Dalba,
Peter C. B. Smith,
Timothy Banks,
Ira Bell,
Dominique Daniel,
Lindsay Dawson,
Jesús De Mula,
Marc Deldem,
Dimitrios Deligeorgopoulos,
Romina P. Di Sisto,
Roger Dymock,
Phil Evans,
Giulio Follero,
Martin J. F. Fowler,
Eduardo Fernández-Lajús,
Alex Hamrick
, et al. (20 additional authors not shown)
Abstract:
We present an updated ephemeris and physical parameters for the exoplanet WASP-77 A b. In this effort, we combine 64 ground- and space-based transit observations, 6 space-based eclipse observations, and 32 radial velocity observations to produce the most precise orbital solution to date for this target, aiding in the planning of James Webb Space Telescope (JWST) and Ariel observations and atmosphe…
▽ More
We present an updated ephemeris and physical parameters for the exoplanet WASP-77 A b. In this effort, we combine 64 ground- and space-based transit observations, 6 space-based eclipse observations, and 32 radial velocity observations to produce the most precise orbital solution to date for this target, aiding in the planning of James Webb Space Telescope (JWST) and Ariel observations and atmospheric studies. We report a new orbital period of 1.360029395 +- 5.7e-8 days, a new mid-transit time of 2459957.337860 +- 4.3e-5 BJDTDB (Barycentric Julian Date in the Barycentric Dynamical Time scale; arXiv:1005.4415) and a new mid-eclipse time of 2459956.658192 +- 6.7e-5 BJDTDB. Furthermore, the methods presented in this study reduce the uncertainties in the planet mass to 1.6654 +- 4.5e-3 Mjup and orbital period to 1.360029395 +- 5.7e-8 days by factors of 15.1 and 10.9, respectively. Through a joint fit analysis comparison of transit data taken by space-based and citizen science-led initiatives, our study demonstrates the power of including data collected by citizen scientists compared to a fit of the space-based data alone. Additionally, by including a vast array of citizen science data from ExoClock, Exoplanet Transit Database (ETD), and Exoplanet Watch, we can increase our observational baseline and thus acquire better constraints on the forward propagation of our ephemeris than what is achievable with TESS data alone.
△ Less
Submitted 4 June, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
The Construction of Large-scale Structure Catalogs for the Dark Energy Spectroscopic Instrument
Authors:
A. J. Ross,
J. Aguilar,
S. Ahlen,
S. Alam,
A. Anand,
S. Bailey,
D. Bianchi,
S. Brieden,
D. Brooks,
E. Burtin,
A. Carnero Rosell,
E. Chaussidon,
T. Claybaugh,
S. Cole,
K. Dawson,
A. de la Macorra,
A. de Mattia,
Arjun Dey,
Biprateep Dey,
P. Doel,
K. Fanning,
S. Ferraro,
J. Ereza,
A. Font-Ribera,
J. E. Forero-Romero
, et al. (59 additional authors not shown)
Abstract:
We present the technical details on how large-scale structure (LSS) catalogs are constructed from redshifts measured from spectra observed by the Dark Energy Spectroscopic Instrument (DESI). The LSS catalogs provide the information needed to determine the relative number density of DESI tracers as a function of redshift and celestial coordinates and, e.g., determine clustering statistics. We produ…
▽ More
We present the technical details on how large-scale structure (LSS) catalogs are constructed from redshifts measured from spectra observed by the Dark Energy Spectroscopic Instrument (DESI). The LSS catalogs provide the information needed to determine the relative number density of DESI tracers as a function of redshift and celestial coordinates and, e.g., determine clustering statistics. We produce catalogs that are weighted subsamples of the observed data, each matched to a weighted `random' catalog that forms an unclustered sampling of the probability density that DESI could have observed those data at each location.
Precise knowledge of the DESI observing history and associated hardware performance allows for a determination of the DESI footprint and the number of times DESI has covered it at sub-arcsecond level precision. This enables the completeness of any DESI sample to be modeled at this same resolution. The pipeline developed to create LSS catalogs has been designed to easily allow robustness tests and enable future improvements. We describe how it allows ongoing work improving the match between galaxy and random catalogs, such as including further information when assigning redshifts to randoms, accounting for fluctuations in target density, accounting for variation in the redshift success rate, and accommodating blinding schemes.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
The density-bounded twilight of starbursts in the early Universe
Authors:
William McClymont,
Sandro Tacchella,
Francesco D'Eugenio,
Callum Witten,
Xihan Ji,
Aaron Smith,
Roberto Maiolino,
Jan Scholtz,
Charlotte Simmonds,
Joris Witstok
Abstract:
The peculiar nebular emission displayed by galaxies in the early Universe presents a unique opportunity to gain insight into the regulation of star formation in extreme environments. We investigate 500 (109) galaxies with deep NIRSpec/PRISM observations from the JADES survey at $z>2$ ($z>5.3$), finding 52 (26) galaxies with Balmer line ratios more than $1σ$ inconsistent with Case B recombination.…
▽ More
The peculiar nebular emission displayed by galaxies in the early Universe presents a unique opportunity to gain insight into the regulation of star formation in extreme environments. We investigate 500 (109) galaxies with deep NIRSpec/PRISM observations from the JADES survey at $z>2$ ($z>5.3$), finding 52 (26) galaxies with Balmer line ratios more than $1σ$ inconsistent with Case B recombination. These anomalous Balmer emitters (ABEs) cannot be explained by dust attenuation, indicating a departure from Case B recombination. To address this discrepancy, we model density-bounded nebulae with the photoionisation code CLOUDY. Density-bounded nebulae show anomalous Balmer line ratios due to Lyman line pumping and a transition from the nebulae being optically thin to optically thick for Lyman lines with increasing cloud depth. The H$α$/H$β$ versus H$γ$/H$β$ trend of density-bounded models is robust to changes in stellar age of the ionising source, gas density, and ionisation parameter; however, increasing the stellar metallicity drives a turnover in the trend. This is due to stronger stellar absorption features around Ly$γ$ reducing H$β$ fluorescence, allowing density-bounded models to account for all observed Balmer line ratios. ABEs show higher [OIII]/[OII], have steeper ultra-violet slopes, are fainter, and are more preferentially Ly$α$ emitters than galaxies which are consistent with Case B and little dust. These findings suggest that ABEs are galaxies that have become density bounded during extreme quenching events, representing a transient phase of $\sim$20 Myr during a fast breathing mode of star formation.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
The role of excitation vector fields and all-polarisation state control of cavity magnonics
Authors:
Alban Joseph,
Jayakrishnan M. P. Nair,
Mawgan A. Smith,
Rory Holland,
Luke J. McLellan,
Isabella Boventer,
Tim Wolz,
Dmytro A. Bozhko,
Benedetta Flebus,
Martin P. Weides,
Rair Macedo
Abstract:
Recently the field of cavity magnonics, a field focused on controlling the interaction between magnons and confined microwave photons within microwave resonators, has drawn significant attention as it offers a platform for enabling advancements in quantum- and spin-based technologies. Here, we introduce excitation vector fields, whose polarisation and profile can be easily tuned in a two-port cavi…
▽ More
Recently the field of cavity magnonics, a field focused on controlling the interaction between magnons and confined microwave photons within microwave resonators, has drawn significant attention as it offers a platform for enabling advancements in quantum- and spin-based technologies. Here, we introduce excitation vector fields, whose polarisation and profile can be easily tuned in a two-port cavity setup, thus acting as an effective experimental knob to explore the coupled dynamics of cavity magnon-polaritons. Moreover, we develop theoretical models that accurately predict and reproduce the experimental results for any polarisation state and field profile within the cavity resonator. This versatile experimental platform offers a new avenue for controlling spin-photon interactions and as such also delivering a mechanism to readily control the exchange of information between hybrid systems.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Photo-dynamical characterisation of the TOI-178 resonant chain
Authors:
A. Leleu,
J. -B. Delisle,
L. Delrez,
E. M. Bryant,
A. Brandeker,
H. P. Osborn,
N. Hara,
T. G. Wilson,
N. Billot,
M. Lendl,
D. Ehrenreich,
H. Chakraborty,
M. N. Günther,
M. J. Hooton,
Y. Alibert,
R. Alonso,
D. R. Alves,
D. R. Anderson,
I. Apergis,
D. Armstrong,
T. Bárczy,
D. Barrado Navascues,
S. C. C. Barros,
M. P. Battley,
W. Baumjohann
, et al. (82 additional authors not shown)
Abstract:
The TOI-178 system consists of a nearby late K-dwarf transited by six planets in the super-Earth to mini-Neptune regime, with radii ranging from 1.2 to 2.9 earth radius and orbital periods between 1.9 and 20.7 days. All planets but the innermost one form a chain of Laplace resonances. The fine-tuning and fragility of such orbital configurations ensure that no significant scattering or collision ev…
▽ More
The TOI-178 system consists of a nearby late K-dwarf transited by six planets in the super-Earth to mini-Neptune regime, with radii ranging from 1.2 to 2.9 earth radius and orbital periods between 1.9 and 20.7 days. All planets but the innermost one form a chain of Laplace resonances. The fine-tuning and fragility of such orbital configurations ensure that no significant scattering or collision event has taken place since the formation and migration of the planets in the protoplanetary disc, hence providing important anchors for planet formation models. We aim to improve the characterisation of the architecture of this key system, and in particular the masses and radii of its planets. In addition, since this system is one of the few resonant chains that can be characterised by both photometry and radial velocities, we aim to use it as a test bench for the robustness of the planetary mass determination with each technique. We perform a global analysis of all available photometry and radial velocity. We also try different sets of priors on the masses and eccentricity, as well as different stellar activity models, to study their effects on the masses estimated by each method. We show how stellar activity is preventing us from obtaining a robust mass estimation for the three outer planets using radial velocity data alone. We also show that our joint photo-dynamical and radial velocity analysis resulted in a robust mass determination for planets c to g, with precision of 12% for the mass of planet c, and better than 10% for planets d to g. The new precisions on the radii range from 2 to 3%. The understanding of this synergy between photometric and radial velocity measurements will be valuable during the PLATO mission. We also show that TOI-178 is indeed currently locked in the resonant configuration, librating around an equilibrium of the chain.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Inference with non-differentiable surrogate loss in a general high-dimensional classification framework
Authors:
Muxuan Liang,
Yang Ning,
Maureen A Smith,
Ying-Qi Zhao
Abstract:
Penalized empirical risk minimization with a surrogate loss function is often used to derive a high-dimensional linear decision rule in classification problems. Although much of the literature focuses on the generalization error, there is a lack of valid inference procedures to identify the driving factors of the estimated decision rule, especially when the surrogate loss is non-differentiable. In…
▽ More
Penalized empirical risk minimization with a surrogate loss function is often used to derive a high-dimensional linear decision rule in classification problems. Although much of the literature focuses on the generalization error, there is a lack of valid inference procedures to identify the driving factors of the estimated decision rule, especially when the surrogate loss is non-differentiable. In this work, we propose a kernel-smoothed decorrelated score to construct hypothesis testing and interval estimations for the linear decision rule estimated using a piece-wise linear surrogate loss, which has a discontinuous gradient and non-regular Hessian. Specifically, we adopt kernel approximations to smooth the discontinuous gradient near discontinuity points and approximate the non-regular Hessian of the surrogate loss. In applications where additional nuisance parameters are involved, we propose a novel cross-fitted version to accommodate flexible nuisance estimates and kernel approximations. We establish the limiting distribution of the kernel-smoothed decorrelated score and its cross-fitted version in a high-dimensional setup. Simulation and real data analysis are conducted to demonstrate the validity and superiority of the proposed method.
△ Less
Submitted 19 May, 2024;
originally announced May 2024.
-
Nonperturbative aspects of the electromagnetic pion form factor at high energies
Authors:
Joint Physics Analysis Center,
:,
K. Quirion,
C. Fernández-Ramírez,
V. Mathieu,
G. Montaña,
R. J. Perry,
A. Pilloni,
A. Rodas,
V. Shastry,
W. A. Smith,
A. P. Szczepaniak,
D. Winney
Abstract:
The structure of hadronic form factors at high energies and their deviations from perturbative quantum chromodynamics provide insight on nonperturbative dynamics. Using an approach that is consistent with dispersion relations, we construct a model that simultaneously accounts for the pion wave function, gluonic exchanges, and quark Reggeization. In particular, we find that quark Reggeization can b…
▽ More
The structure of hadronic form factors at high energies and their deviations from perturbative quantum chromodynamics provide insight on nonperturbative dynamics. Using an approach that is consistent with dispersion relations, we construct a model that simultaneously accounts for the pion wave function, gluonic exchanges, and quark Reggeization. In particular, we find that quark Reggeization can be investigated at high energies by studying scaling violation of the form factor.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Sums of rational cubes and the $3$-Selmer group
Authors:
Peter Koymans,
Alexander Smith
Abstract:
Recently, Alpöge-Bhargava-Shnidman determined the average size of the $2$-Selmer group in the cubic twist family of any elliptic curve over $\mathbb{Q}$ with $j$-invariant $0$. We obtain the distribution of the $3$-Selmer groups in the same family. As a consequence, we improve their upper bound on the density of integers expressible as a sum of two rational cubes. Assuming a $3$-converse theorem,…
▽ More
Recently, Alpöge-Bhargava-Shnidman determined the average size of the $2$-Selmer group in the cubic twist family of any elliptic curve over $\mathbb{Q}$ with $j$-invariant $0$. We obtain the distribution of the $3$-Selmer groups in the same family. As a consequence, we improve their upper bound on the density of integers expressible as a sum of two rational cubes. Assuming a $3$-converse theorem, we also improve their lower bound on this density.
The $\sqrt{-3}$-Selmer group in this cubic twist family is well-known to be large, which poses significant challenges to the methods previously developed by the second author. We overcome this problem by strengthening the analytic core of these methods. Specifically, we prove a "trilinear large sieve" for an appropriate generalization of the classical Rédei symbol, then use this to control the restriction of the Cassels-Tate pairing to the $\sqrt{-3}$-Selmer groups in these twist families.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
RIGEL: Simulating dwarf galaxies at solar mass resolution with radiative transfer and feedback from individual massive stars
Authors:
Yunwei Deng,
Hui Li,
Boyuan Liu,
Rahul Kannan,
Aaron Smith,
Greg L. Bryan
Abstract:
We introduce the RIGEL model, a novel framework to self-consistently model the effects of stellar feedback in the multiphase ISM of dwarf galaxies with radiative transfer (RT) on a star-by-star basis. The RIGEL model integrates detailed implementations of feedback from individual massive stars into the RHD code, AREPO-RT. It forms individual massive stars from the resolved multiphase ISM by sampli…
▽ More
We introduce the RIGEL model, a novel framework to self-consistently model the effects of stellar feedback in the multiphase ISM of dwarf galaxies with radiative transfer (RT) on a star-by-star basis. The RIGEL model integrates detailed implementations of feedback from individual massive stars into the RHD code, AREPO-RT. It forms individual massive stars from the resolved multiphase ISM by sampling the IMF and tracks their evolution individually. The lifetimes, photon production rates, mass-loss rates, and wind velocities of these stars are determined by their initial masses and metallicities based on a library that incorporates a variety of stellar models. The RT equations are solved in seven spectral bins accounting for the IR to HeII ionizing bands, using an M1 RT scheme. The thermochemistry model tracks the non-equilibrium H, He chemistry and the equilibrium abundance of CI, CII, OI, OII, and CO to capture the thermodynamics of all ISM phases. We evaluate the performance of the RIGEL model using $1\,{\rm M}_\odot$ resolution simulations of isolated dwarf galaxies. We found that the SFR and ISRF show strong positive correlations to the metallicity of the galaxy. Photoionization and photoheating can reduce the SFR by an order of magnitude by removing the available cold-dense gas fuel for star formation. The ISRF also changes the thermal structure of the ISM. Radiative feedback occurs immediately after the birth of massive stars and rapidly disperses the molecular clouds within 1 Myr. As a consequence, radiative feedback reduces the age spread of star clusters to less than 2 Myr, prohibits the formation of massive star clusters, and shapes the cluster initial mass function to a steep power-law form with a slope of $\sim-2$. The mass-loading factor of the fiducial galaxy has a median of $\sim50$, while turning off radiative feedback reduces this factor by an order of magnitude.
△ Less
Submitted 16 May, 2024; v1 submitted 14 May, 2024;
originally announced May 2024.
-
Faithful Artin induction and the Chebotarev density theorem
Authors:
Robert J. Lemke Oliver,
Alexander Smith
Abstract:
Given a finite group G, we prove that the vector space spanned by the faithful irreducible characters of G is generated by the monomial characters in the vector space. As a consequence, we show that in any family of G-extensions of a fixed number field F, almost all are subject to a strong effective version of the Chebotarev density theorem. We use this version of the Chebotarev density theorem to…
▽ More
Given a finite group G, we prove that the vector space spanned by the faithful irreducible characters of G is generated by the monomial characters in the vector space. As a consequence, we show that in any family of G-extensions of a fixed number field F, almost all are subject to a strong effective version of the Chebotarev density theorem. We use this version of the Chebotarev density theorem to deduce several consequences for class groups in families of number fields.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
What Can Natural Language Processing Do for Peer Review?
Authors:
Ilia Kuznetsov,
Osama Mohammed Afzal,
Koen Dercksen,
Nils Dycke,
Alexander Goldberg,
Tom Hope,
Dirk Hovy,
Jonathan K. Kummerfeld,
Anne Lauscher,
Kevin Leyton-Brown,
Sheng Lu,
Mausam,
Margot Mieskes,
Aurélie Névéol,
Danish Pruthi,
Lizhen Qu,
Roy Schwartz,
Noah A. Smith,
Thamar Solorio,
Jingyan Wang,
Xiaodan Zhu,
Anna Rogers,
Nihar B. Shah,
Iryna Gurevych
Abstract:
The number of scientific articles produced every year is growing rapidly. Providing quality control over them is crucial for scientists and, ultimately, for the public good. In modern science, this process is largely delegated to peer review -- a distributed procedure in which each submission is evaluated by several independent experts in the field. Peer review is widely used, yet it is hard, time…
▽ More
The number of scientific articles produced every year is growing rapidly. Providing quality control over them is crucial for scientists and, ultimately, for the public good. In modern science, this process is largely delegated to peer review -- a distributed procedure in which each submission is evaluated by several independent experts in the field. Peer review is widely used, yet it is hard, time-consuming, and prone to error. Since the artifacts involved in peer review -- manuscripts, reviews, discussions -- are largely text-based, Natural Language Processing has great potential to improve reviewing. As the emergence of large language models (LLMs) has enabled NLP assistance for many new tasks, the discussion on machine-assisted peer review is picking up the pace. Yet, where exactly is help needed, where can NLP help, and where should it stand aside? The goal of our paper is to provide a foundation for the future efforts in NLP for peer-reviewing assistance. We discuss peer review as a general process, exemplified by reviewing at AI conferences. We detail each step of the process from manuscript submission to camera-ready revision, and discuss the associated challenges and opportunities for NLP assistance, illustrated by existing work. We then turn to the big challenges in NLP for peer review as a whole, including data acquisition and licensing, operationalization and experimentation, and ethical issues. To help consolidate community efforts, we create a companion repository that aggregates key datasets pertaining to peer review. Finally, we issue a detailed call for action for the scientific community, NLP and AI researchers, policymakers, and funding bodies to help bring the research in NLP for peer review forward. We hope that our work will help set the agenda for research in machine-assisted scientific quality control in the age of AI, within the NLP community and beyond.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Performance of the HAWC Observatory and TeV Gamma-Ray Measurements of the Crab Nebula with Improved Extensive Air Shower Reconstruction Algorithms
Authors:
A . Albert,
R. Alfaro,
C. Alvarez,
A . Andrés,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
C. de León,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
B. L . Dingus,
M. A. DuVernois,
K. Engel,
T. Ergin
, et al. (68 additional authors not shown)
Abstract:
The High-Altitude Water Cherenkov (HAWC) Gamma-Ray Observatory located on the side of the Sierra Negra volcano in Mexico, has been fully operational since 2015. The HAWC collaboration has recently significantly improved their extensive-air-shower reconstruction algorithms, which has notably advanced the observatory performance. The energy resolution for primary gamma rays with energies below 1~TeV…
▽ More
The High-Altitude Water Cherenkov (HAWC) Gamma-Ray Observatory located on the side of the Sierra Negra volcano in Mexico, has been fully operational since 2015. The HAWC collaboration has recently significantly improved their extensive-air-shower reconstruction algorithms, which has notably advanced the observatory performance. The energy resolution for primary gamma rays with energies below 1~TeV was improved by including a noise-suppression algorithm. Corrections have also been made to systematic errors in direction fitting related to the detector and shower plane inclinations, $\mathcal{O}(0.1^{\circ})$ biases in highly inclined showers, as well as enhancements to the core reconstruction. The angular resolution for gamma rays approaching the HAWC array from large zenith angles ($> 37^{\circ}$) has improved by a factor of four at the highest energies ($> 70$~TeV) as compared to previous reconstructions. The inclusion of a lateral distribution function fit to the extensive air shower footprint on the array to separate gamma-ray primaries from cosmic-ray ones, based on the resulting $χ^{2}$ values, improved the background rejection performance at all inclinations. At large zenith angles, the improvement in significance is a factor of four compared to previous HAWC publications. These enhancements have been verified by observing the Crab Nebula, which is an overhead source for the HAWC Observatory. We show that the sensitivity to Crab-like point sources ($E^{-2.63}$) with locations overhead to 30$^{\circ}$ zenith is comparable or less than 10\% of the Crab Nebula's flux between 2 and 50~TeV. Thanks to these improvements, HAWC can now detect more sources, including the Galactic Center.
△ Less
Submitted 1 July, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting
Authors:
Ola Shorinwa,
Johnathan Tucker,
Aliyah Smith,
Aiden Swann,
Timothy Chen,
Roya Firoozi,
Monroe Kennedy III,
Mac Schwager
Abstract:
We present Splat-MOVER, a modular robotics stack for open-vocabulary robotic manipulation, which leverages the editability of Gaussian Splatting (GSplat) scene representations to enable multi-stage manipulation tasks. Splat-MOVER consists of: (i) ASK-Splat, a GSplat representation that distills semantic and grasp affordance features into the 3D scene. ASK-Splat enables geometric, semantic, and aff…
▽ More
We present Splat-MOVER, a modular robotics stack for open-vocabulary robotic manipulation, which leverages the editability of Gaussian Splatting (GSplat) scene representations to enable multi-stage manipulation tasks. Splat-MOVER consists of: (i) ASK-Splat, a GSplat representation that distills semantic and grasp affordance features into the 3D scene. ASK-Splat enables geometric, semantic, and affordance understanding of 3D scenes, which is critical in many robotics tasks; (ii) SEE-Splat, a real-time scene-editing module using 3D semantic masking and infilling to visualize the motions of objects that result from robot interactions in the real-world. SEE-Splat creates a "digital twin" of the evolving environment throughout the manipulation task; and (iii) Grasp-Splat, a grasp generation module that uses ASK-Splat and SEE-Splat to propose affordance-aligned candidate grasps for open-world objects. ASK-Splat is trained in real-time from RGB images in a brief scanning phase prior to operation, while SEE-Splat and Grasp-Splat run in real-time during operation. We demonstrate the superior performance of Splat-MOVER in hardware experiments on a Kinova robot compared to two recent baselines in four single-stage, open-vocabulary manipulation tasks and in four multi-stage manipulation tasks, using the edited scene to reflect changes due to prior manipulation stages, which is not possible with existing baselines. The project page is available at https://splatmover.github.io, and the code for the project will be made available after review.
△ Less
Submitted 8 June, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.