-
Generating SROI^{-} Ontologies via Knowledge Graph Query Embedding Learning
Authors:
Yunjie He,
Daniel Hernandez,
Mojtaba Nayyeri,
Bo Xiong,
Yuqicheng Zhu,
Evgeny Kharlamov,
Steffen Staab
Abstract:
Query embedding approaches answer complex logical queries over incomplete knowledge graphs (KGs) by computing and operating on low-dimensional vector representations of entities, relations, and queries. However, current query embedding models heavily rely on excessively parameterized neural networks and cannot explain the knowledge learned from the graph. We propose a novel query embedding method,…
▽ More
Query embedding approaches answer complex logical queries over incomplete knowledge graphs (KGs) by computing and operating on low-dimensional vector representations of entities, relations, and queries. However, current query embedding models heavily rely on excessively parameterized neural networks and cannot explain the knowledge learned from the graph. We propose a novel query embedding method, AConE, which explains the knowledge learned from the graph in the form of SROI^{-} description logic axioms while being more parameter-efficient than most existing approaches. AConE associates queries to a SROI^{-} description logic concept. Every SROI^{-} concept is embedded as a cone in complex vector space, and each SROI^{-} relation is embedded as a transformation that rotates and scales cones. We show theoretically that AConE can learn SROI^{-} axioms, and defines an algebra whose operations correspond one to one to SROI^{-} description logic concept constructs. Our empirical study on multiple query datasets shows that AConE achieves superior results over previous baselines with fewer parameters. Notably on the WN18RR dataset, AConE achieves significant improvement over baseline models. We provide comprehensive analyses showing that the capability to represent axioms positively impacts the results of query answering.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
TeV Analysis of a Source Rich Region with HAWC Observatory: Is HESS J1809-193 a Potential Hadronic PeVatron?
Authors:
A. Albert,
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
R. Babu,
E. Belmont-Moreno,
A. Bernal,
M. Breuhaus,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
J. Cotzomi,
E. De la Fuente,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois,
C. Espinoza,
K. L. Fan,
K. Fang,
B. Fick,
N. Fraija
, et al. (57 additional authors not shown)
Abstract:
HESS J1809-193 is an unidentified TeV source, first detected by the High Energy Stereoscopic System (H.E.S.S.) Collaboration. The emission originates in a source-rich region that includes several Supernova Remnants (SNR) and Pulsars (PSR) including SNR G11.1+0.1, SNR G11.0-0.0, and the young radio pulsar J1809-1917. Originally classified as a pulsar wind nebula (PWN) candidate, recent studies show…
▽ More
HESS J1809-193 is an unidentified TeV source, first detected by the High Energy Stereoscopic System (H.E.S.S.) Collaboration. The emission originates in a source-rich region that includes several Supernova Remnants (SNR) and Pulsars (PSR) including SNR G11.1+0.1, SNR G11.0-0.0, and the young radio pulsar J1809-1917. Originally classified as a pulsar wind nebula (PWN) candidate, recent studies show the peak of the TeV region overlapping with a system of molecular clouds. This resulted in the revision of the original leptonic scenario to look for alternate hadronic scenarios. Marked as a potential PeVatron candidate, this region has been studied extensively by H.E.S.S. due to its emission extending up-to several tens of TeV. In this work, we use 2398 days of data from the High Altitude Water Cherenkov (HAWC) observatory to carry out a systematic source search for the HESS J1809-193 region. We were able to resolve emission detected as an extended component (modelled as a Symmetric Gaussian with a 1 $σ$ radius of 0.21 $^\circ$) with no clear cutoff at high energies and emitting photons up-to 210 TeV. We model the multi-wavelength observations for the region HESS J1809-193 using a time-dependent leptonic model and a lepto-hadronic model. Our model indicates that both scenarios could explain the observed data within the region of HESS J1809-193.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Interacting Circular Rydberg Atoms Trapped in Optical Tweezers
Authors:
Paul Méhaignerie,
Yohann Machu,
Andrés Durán Hernández,
Gautier Creutzer,
David J. Papoular,
Jean-Michel Raimond,
Clément Sayrin,
Michel Brune
Abstract:
Circular Rydberg atoms (CRAs), i.e., Rydberg atoms with maximal orbital momentum, ideally combine long coherence times and strong interactions, a key property of quantum systems, in particular for the development of quantum technologies. However, the dipole-dipole interaction between CRAs has not been observed so far. We report the measurement and characterization of the resonant dipole-dipole int…
▽ More
Circular Rydberg atoms (CRAs), i.e., Rydberg atoms with maximal orbital momentum, ideally combine long coherence times and strong interactions, a key property of quantum systems, in particular for the development of quantum technologies. However, the dipole-dipole interaction between CRAs has not been observed so far. We report the measurement and characterization of the resonant dipole-dipole interaction between two CRAs, individually trapped in optical tweezers, and find excellent agreement with theoretical predictions. We demonstrate a dynamic control over the strength of the interaction by tuning the orientation of an electric field. We use the interaction between the CRAs as a meter for the interatomic distance, and record the relative motion between two atoms in their traps. This motion, that we induce through the interaction between Rydberg levels with permanent electric dipoles, transiently populated during the preparation of the circular states, is a signature of spin-motion coupling.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Observation of the Galactic Center PeVatron Beyond 100 TeV with HAWC
Authors:
A. Albert,
R. Alfaro,
C. Alvarez,
A. Andrés,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
A. Bernal,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
C. de León,
D. Depaoli,
N. Di Lalla,
N. Di Lalla,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois
, et al. (78 additional authors not shown)
Abstract:
We report an observation of ultra-high energy (UHE) gamma rays from the Galactic Center region, using seven years of data collected by the High-Altitude Water Cherenkov (HAWC) Observatory. The HAWC data are best described as a point-like source (HAWC J1746-2856) with a power-law spectrum ($\mathrm{d}N/\mathrm{d}E=φ(E/26 \,\text{TeV})^γ$), where $γ=-2.88 \pm 0.15_{\text{stat}} - 0.1_{\text{sys}} $…
▽ More
We report an observation of ultra-high energy (UHE) gamma rays from the Galactic Center region, using seven years of data collected by the High-Altitude Water Cherenkov (HAWC) Observatory. The HAWC data are best described as a point-like source (HAWC J1746-2856) with a power-law spectrum ($\mathrm{d}N/\mathrm{d}E=φ(E/26 \,\text{TeV})^γ$), where $γ=-2.88 \pm 0.15_{\text{stat}} - 0.1_{\text{sys}} $ and $φ=1.5 \times 10^{-15}$ (TeV cm$^{2}$s)$^{-1}$ $\pm\, 0.3_{\text{stat}}\,^{+0.08_{\text{sys}}}_{-0.13_{\text{sys}}}$ extending from 6 to 114 TeV. We find no evidence of a spectral cutoff up to $100$ TeV using HAWC data. Two known point-like gamma-ray sources are spatially coincident with the HAWC gamma-ray excess: Sgr A$^{*}$ (HESS J1745-290) and the Arc (HESS J1746-285). We subtract the known flux contribution of these point sources from the measured flux of HAWC J1746-2856 to exclude their contamination and show that the excess observed by HAWC remains significant ($>$5$σ$) with the spectrum extending to $>$100 TeV. Our result supports that these detected UHE gamma rays can originate via hadronic interaction of PeV cosmic-ray protons with the dense ambient gas and confirms the presence of a proton PeVatron at the Galactic Center.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Understanding the Emission and Morphology of the Unidentified Gamma-Ray Source TeV J2032+4130
Authors:
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
C. de León,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois,
J. C. Díaz-Vélez,
K. Engel,
T. Ergin,
C. Espinoza
, et al. (56 additional authors not shown)
Abstract:
The first TeV gamma-ray source with no lower energy counterparts, TeV J2032+4130, was discovered by HEGRA. It appears in the third HAWC catalog as 3HWC J2031+415 and it is a bright TeV gamma-ray source whose emission has previously been resolved as 2 sources: HAWC J2031+415 and HAWC J2030+409. While HAWC J2030+409 has since been associated with the \emph{Fermi-LAT} Cygnus Cocoon, no such associati…
▽ More
The first TeV gamma-ray source with no lower energy counterparts, TeV J2032+4130, was discovered by HEGRA. It appears in the third HAWC catalog as 3HWC J2031+415 and it is a bright TeV gamma-ray source whose emission has previously been resolved as 2 sources: HAWC J2031+415 and HAWC J2030+409. While HAWC J2030+409 has since been associated with the \emph{Fermi-LAT} Cygnus Cocoon, no such association for HAWC J2031+415 has yet been found. In this work, we investigate the spectrum and energy-dependent morphology of HAWC J2031+415. We associate HAWC J2031+415 with the pulsar PSR J2032+4127 and perform a combined multi-wavelength analysis using radio, X-ray, and $γ$-ray emission. We conclude that HAWC J2031+415 and, by extension, TeV J2032+4130 are most probably a pulsar wind nebula (PWN) powered by PSR J2032+4127.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Deep Learning Based Apparent Diffusion Coefficient Map Generation from Multi-parametric MR Images for Patients with Diffuse Gliomas
Authors:
Zach Eidex,
Mojtaba Safari,
Jacob Wynne,
Richard L. J. Qiu,
Tonghe Wang,
David Viar Hernandez,
Hui-Kuo Shu,
Hui Mao,
Xiaofeng Yang
Abstract:
Purpose: Apparent diffusion coefficient (ADC) maps derived from diffusion weighted (DWI) MRI provides functional measurements about the water molecules in tissues. However, DWI is time consuming and very susceptible to image artifacts, leading to inaccurate ADC measurements. This study aims to develop a deep learning framework to synthesize ADC maps from multi-parametric MR images. Methods: We pro…
▽ More
Purpose: Apparent diffusion coefficient (ADC) maps derived from diffusion weighted (DWI) MRI provides functional measurements about the water molecules in tissues. However, DWI is time consuming and very susceptible to image artifacts, leading to inaccurate ADC measurements. This study aims to develop a deep learning framework to synthesize ADC maps from multi-parametric MR images. Methods: We proposed the multiparametric residual vision transformer model (MPR-ViT) that leverages the long-range context of ViT layers along with the precision of convolutional operators. Residual blocks throughout the network significantly increasing the representational power of the model. The MPR-ViT model was applied to T1w and T2- fluid attenuated inversion recovery images of 501 glioma cases from a publicly available dataset including preprocessed ADC maps. Selected patients were divided into training (N=400), validation (N=50) and test (N=51) sets, respectively. Using the preprocessed ADC maps as ground truth, model performance was evaluated and compared against the Vision Convolutional Transformer (VCT) and residual vision transformer (ResViT) models. Results: The results are as follows using T1w + T2-FLAIR MRI as inputs: MPR-ViT - PSNR: 31.0 +/- 2.1, MSE: 0.009 +/- 0.0005, SSIM: 0.950 +/- 0.015. In addition, ablation studies showed the relative impact on performance of each input sequence. Both qualitative and quantitative results indicate that the proposed MR- ViT model performs favorably against the ground truth data. Conclusion: We show that high-quality ADC maps can be synthesized from structural MRI using a MPR- VCT model. Our predicted images show better conformality to the ground truth volume than ResViT and VCT predictions. These high-quality synthetic ADC maps would be particularly useful for disease diagnosis and intervention, especially when ADC maps have artifacts or are unavailable.
△ Less
Submitted 4 July, 2024; v1 submitted 2 July, 2024;
originally announced July 2024.
-
The new Herschel/PACS Point Source Catalogue
Authors:
Gábor Marton,
Ilknur Gezer,
Máté Madarász,
Odysseas Dionatos,
Marc Audard,
Julia Roquette,
David Hernandez,
Roberta Paladini,
Bruno Altieri
Abstract:
Herschel operated as an observatory, therefore it did not cover the whole sky, but still observed ~8% of it. The first version of an overall Herschel/PACS Point Source Catalogue was released in 2017. The data are still unique and are very important for research, especially because no new far-infrared mission is foreseen for at least the next decade. In the framework of the NEMESIS project, we revi…
▽ More
Herschel operated as an observatory, therefore it did not cover the whole sky, but still observed ~8% of it. The first version of an overall Herschel/PACS Point Source Catalogue was released in 2017. The data are still unique and are very important for research, especially because no new far-infrared mission is foreseen for at least the next decade. In the framework of the NEMESIS project, we revisited all the photometric observations obtained by the PACS instrument on-board of the Herschel space observatory.
We aimed to build the most complete and most accurate Herschel/PACS catalogue to date. Our primary goal was to increase the number of real sources, and decrease the number of spurious sources identified on a strongly variable background. Our goal was to build a blind catalogue, meaning that source extraction is conducted without relying on prior detections at various wavelengths, allowing us to detect sources never catalogued before.
We define a hybrid strategy that includes classical and ML source identification and characterisation methods, providing catalogues at much higher completeness levels than before. Quality assessment also involves ML techniques. Our source extraction methodology facilitates a systematic and impartial comparison of sensitivity levels across various Herschel fields, a task that was typically beyond the scope of individual programs.
We created a high-reliability and a rejected source catalogue for each PACS passband, i.e., at 70, 100, and 160 μm. With the high-reliability catalogue, we managed to significantly increase the completeness in all bands. At the same time, while the number of high-reliability detections decreased, the number of sources matching with existing catalogues increased, suggesting that the purity is also higher than before. The photometric accuracy of our pipeline is ~1% based on comparison with the standard star models.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Performance of the HAWC Observatory and TeV Gamma-Ray Measurements of the Crab Nebula with Improved Extensive Air Shower Reconstruction Algorithms
Authors:
A . Albert,
R. Alfaro,
C. Alvarez,
A . Andrés,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
C. de León,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
B. L . Dingus,
M. A. DuVernois,
K. Engel,
T. Ergin
, et al. (68 additional authors not shown)
Abstract:
The High-Altitude Water Cherenkov (HAWC) Gamma-Ray Observatory located on the side of the Sierra Negra volcano in Mexico, has been fully operational since 2015. The HAWC collaboration has recently significantly improved their extensive-air-shower reconstruction algorithms, which has notably advanced the observatory performance. The energy resolution for primary gamma rays with energies below 1~TeV…
▽ More
The High-Altitude Water Cherenkov (HAWC) Gamma-Ray Observatory located on the side of the Sierra Negra volcano in Mexico, has been fully operational since 2015. The HAWC collaboration has recently significantly improved their extensive-air-shower reconstruction algorithms, which has notably advanced the observatory performance. The energy resolution for primary gamma rays with energies below 1~TeV was improved by including a noise-suppression algorithm. Corrections have also been made to systematic errors in direction fitting related to the detector and shower plane inclinations, $\mathcal{O}(0.1^{\circ})$ biases in highly inclined showers, as well as enhancements to the core reconstruction. The angular resolution for gamma rays approaching the HAWC array from large zenith angles ($> 37^{\circ}$) has improved by a factor of four at the highest energies ($> 70$~TeV) as compared to previous reconstructions. The inclusion of a lateral distribution function fit to the extensive air shower footprint on the array to separate gamma-ray primaries from cosmic-ray ones, based on the resulting $χ^{2}$ values, improved the background rejection performance at all inclinations. At large zenith angles, the improvement in significance is a factor of four compared to previous HAWC publications. These enhancements have been verified by observing the Crab Nebula, which is an overhead source for the HAWC Observatory. We show that the sensitivity to Crab-like point sources ($E^{-2.63}$) with locations overhead to 30$^{\circ}$ zenith is comparable or less than 10\% of the Crab Nebula's flux between 2 and 50~TeV. Thanks to these improvements, HAWC can now detect more sources, including the Galactic Center.
△ Less
Submitted 1 July, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Search for joint multimessenger signals from potential Galactic PeVatrons with HAWC and IceCube
Authors:
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
J. C. Díaz-Vélez,
K. Engel,
T. Ergin,
K. L. Fan,
K. Fang,
N. Fraija,
S. Fraija
, et al. (469 additional authors not shown)
Abstract:
Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis…
▽ More
Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis that combines gamma rays and neutrinos is required. In this study, we use the Multi-Mission Maximum Likelihood framework (3ML) with IceCube Maximum Likelihood Analysis software (i3mla) and HAWC Accelerated Likelihood (HAL) to search for a correlation between 22 known gamma-ray sources from the third HAWC gamma-ray catalog and 14 years of IceCube track-like data. No significant neutrino emission from the direction of the HAWC sources was found. We report the best-fit gamma-ray model and 90% CL neutrino flux limit from the 22 sources. From the neutrino flux limit, we conclude that the gamma-ray emission from five of the sources can not be produced purely from hadronic interactions. We report the limit for the fraction of gamma rays produced by hadronic interactions for these five sources.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
TRACE: a Time-Reversible Algorithm for Close Encounters
Authors:
Tiger Lu,
David M. Hernandez,
Hanno Rein
Abstract:
We present TRACE, a time-reversible hybrid integrator for the planetary N-body problem. Like hybrid symplectic integrators, TRACE can resolve close encounters between particles while retaining many of the accuracy and speed advantages of a fixed timestep symplectic method such the Wisdom-Holman map. TRACE switches methods time-reversibly during close encounters following the prescription of Hernan…
▽ More
We present TRACE, a time-reversible hybrid integrator for the planetary N-body problem. Like hybrid symplectic integrators, TRACE can resolve close encounters between particles while retaining many of the accuracy and speed advantages of a fixed timestep symplectic method such the Wisdom-Holman map. TRACE switches methods time-reversibly during close encounters following the prescription of Hernandez and Dehnen (2023). In this paper we describe the derivation and implementation of TRACE and study its performance for a variety of astrophysical systems. In all our test cases TRACE is at least as accurate and fast as the hybrid symplectic integrator MERCURIUS. In many cases TRACE's performance is vastly superior to that of MERCURIUS. In test cases with planet-planet close encounters, TRACE is as accurate as MECURIUS with a 13x speedup. If close encounters with the central star are considered, TRACE achieves good error performance while MERCURIUS fails to give qualitatively correct results. In ensemble tests of violent scattering systems, TRACE matches the high-accuracy IAS15 while providing a 20x speed-up. In large N systems simulating lunar accretion, TRACE qualitatively gives the same results as IAS15 but at a 47x speedup. We also discuss some cases such as von Zeipel-Lidov-Kozai cycles where hybrid integrators perform poorly and provide some guidance on which integrator to use for which system. TRACE is freely available within the REBOUND package.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Bayesian and Convolutional Networks for Hierarchical Morphological Classification of Galaxies
Authors:
Jonathan Serrano-Pérez,
Raquel Díaz Hernández,
L. Enrique Sucar
Abstract:
This work is focused on the morphological classification of galaxies following the Hubble sequence in which the different classes are arranged in a hierarchy. The proposed method, BCNN, is composed of two main modules. First, a convolutional neural network (CNN) is trained with images of the different classes of galaxies (image augmentation is carried out to balance some classes); the CNN outputs…
▽ More
This work is focused on the morphological classification of galaxies following the Hubble sequence in which the different classes are arranged in a hierarchy. The proposed method, BCNN, is composed of two main modules. First, a convolutional neural network (CNN) is trained with images of the different classes of galaxies (image augmentation is carried out to balance some classes); the CNN outputs the probability for each class of the hierarchy, and its outputs/predictions feed the second module. The second module consists of a Bayesian network that represents the hierarchy and helps to improve the prediction accuracy by combining the predictions of the first phase while maintaining the hierarchical constraint (in a hierarchy, an instance associated with a node must be associated to all its ancestors), through probabilistic inference over the Bayesian network so that a consistent prediction is obtained. Different images from the Hubble telescope have been collected and labeled by experts, which are used to perform the experiments. The results show that BCNN performed better than several CNNs in multiple evaluation measures, reaching the next scores: 67% in exact match, 78% in accuracy, and 83% in hierarchical F-measure.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Monoidal Jantzen filtrations
Authors:
Ryo Fujita,
David Hernandez
Abstract:
We introduce a monoidal analogue of Jantzen filtrations in the framework of monoidal abelian categories with generic braidings. It leads to a deformation of the multiplication of the Grothendieck ring. We conjecture, and we prove in many remarkable situations, that this deformation is associative so that our construction yields a quantization of the Grothendieck ring as well as analogs of Kazhdan-…
▽ More
We introduce a monoidal analogue of Jantzen filtrations in the framework of monoidal abelian categories with generic braidings. It leads to a deformation of the multiplication of the Grothendieck ring. We conjecture, and we prove in many remarkable situations, that this deformation is associative so that our construction yields a quantization of the Grothendieck ring as well as analogs of Kazhdan-Lusztig polynomials. As a first main example, for finite-dimensional representations of simply-laced quantum loop algebras, we prove the associativity and we establish that the resulting quantization coincides with the quantum Grothendieck ring constructed by Nakajima and Varagnolo-Vasserot in a geometric manner. Hence, it yields a unified representation-theoretic interpretation of the quantum Grothendieck ring. As a second main example, we establish an analogous result for a monoidal category of finite-dimensional modules over symmetric quiver Hecke algebras categorifying the coordinate ring of a unipotent group associated with a Weyl group element.
△ Less
Submitted 6 March, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
From Shapes to Shapes: Inferring SHACL Shapes for Results of SPARQL CONSTRUCT Queries (Extended Version)
Authors:
Philipp Seifer,
Daniel Hernández,
Ralf Lämmel,
Steffen Staab
Abstract:
SPARQL CONSTRUCT queries allow for the specification of data processing pipelines that transform given input graphs into new output graphs. It is now common to constrain graphs through SHACL shapes allowing users to understand which data they can expect and which not. However, it becomes challenging to understand what graph data can be expected at the end of a data processing pipeline without know…
▽ More
SPARQL CONSTRUCT queries allow for the specification of data processing pipelines that transform given input graphs into new output graphs. It is now common to constrain graphs through SHACL shapes allowing users to understand which data they can expect and which not. However, it becomes challenging to understand what graph data can be expected at the end of a data processing pipeline without knowing the particular input data: Shape constraints on the input graph may affect the output graph, but may no longer apply literally, and new shapes may be imposed by the query template. In this paper, we study the derivation of shape constraints that hold on all possible output graphs of a given SPARQL CONSTRUCT query. We assume that the SPARQL CONSTRUCT query is fixed, e.g., being part of a program, whereas the input graphs adhere to input shape constraints but may otherwise vary over time and, thus, are mostly unknown. We study a fragment of SPARQL CONSTRUCT queries (SCCQ) and a fragment of SHACL (Simple SHACL). We formally define the problem of deriving the most restrictive set of Simple SHACL shapes that constrain the results from evaluating a SCCQ over any input graph restricted by a given set of Simple SHACL shapes. We propose and implement an algorithm that statically analyses input SHACL shapes and CONSTRUCT queries and prove its soundness and complexity.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
New approaches and error assessment to snow cover thickness and density using air temperature data at different heights
Authors:
Diego García-Maroto,
Luis Durán,
Miguel Ángel de Pablo Hernández
Abstract:
Snow poles are inexpensive systems composed of a wooden mast with temperature sensors affixed at varying heights with the purpose of estimating the snow depth. They are frequently utilised in cold, remote regions where the maintenance of complex monitoring instruments becomes impractical. In this study, snow cover thickness is determined using different methods, based on the thermal behaviour of a…
▽ More
Snow poles are inexpensive systems composed of a wooden mast with temperature sensors affixed at varying heights with the purpose of estimating the snow depth. They are frequently utilised in cold, remote regions where the maintenance of complex monitoring instruments becomes impractical. In this study, snow cover thickness is determined using different methods, based on the thermal behaviour of air temperature measured by a snow pole on Deception Island, Antarctica. The methods are compared to high-resolution measurements of snow depth obtained using an ultrasonic sensor at the same site. A new modified method is proposed and shown to give the best results. Errors and sensitivity to chosen thresholds of the various methods have been compared. Sensitivity tests have been also conducted to evaluate the impact of missing data from some of the sensors. Finally, the insulating effect on the thermal signal produced by the snow is used to obtain information on the snowpack density. Promising results have been found from this effort, opening new possibilities for the usage of snow poles and may lead to future studies.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
AI can identify Solar System instability billions of years in advance
Authors:
Dorian S. Abbot,
J. D. Laurence-Chasen,
Robert J. Webber,
David M. Hernandez,
Jonathan Weare
Abstract:
Rare event schemes require an approximation of the probability of the rare event as a function of system state. Finding an appropriate reaction coordinate is typically the most challenging aspect of applying a rare event scheme. Here we develop an artificial intelligence (AI) based reaction coordinate that effectively predicts which of a limited number of simulations of the Solar System will go un…
▽ More
Rare event schemes require an approximation of the probability of the rare event as a function of system state. Finding an appropriate reaction coordinate is typically the most challenging aspect of applying a rare event scheme. Here we develop an artificial intelligence (AI) based reaction coordinate that effectively predicts which of a limited number of simulations of the Solar System will go unstable using a convolutional neural network classifier. The performance of the algorithm does not degrade significantly even 3.5 billion years before the instability. We overcome the class imbalance intrinsic to rare event problems using a combination of minority class oversampling, increased minority class weighting, and pulling multiple non-overlapping training sequences from simulations. Our success suggests that AI may provide a promising avenue for developing reaction coordinates without detailed theoretical knowledge of the system.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Multiple timestep reversible $N$-body integrators for close encounters in planetary systems
Authors:
David M. Hernandez,
Walter Dehnen
Abstract:
We present new almost time-reversible integrators for solution of planetary systems consisting of "planets" and a dominant mass ("star"). The algorithms can be considered adaptive generalizations of the Wisdom--Holman method, in which all pairs of planets can be assigned timesteps. These timesteps, along with the global timestep, can be adapted time-reversibly, often at no appreciable additional c…
▽ More
We present new almost time-reversible integrators for solution of planetary systems consisting of "planets" and a dominant mass ("star"). The algorithms can be considered adaptive generalizations of the Wisdom--Holman method, in which all pairs of planets can be assigned timesteps. These timesteps, along with the global timestep, can be adapted time-reversibly, often at no appreciable additional compute cost, without sacrificing any of the long-term error benefits of the Wisdom--Holman method. The method can also be considered a simpler and more flexible version of the \texttt{SYMBA} symplectic code. We perform tests on several challenging problems with close encounters and find the reversible algorithms are up to $2.6$ times faster than a code based on \texttt{SYMBA}. The codes presented here are available on Github. We also find adapting a global timestep reversibly and discretely must be done in block-synchronized manner or similar.
△ Less
Submitted 5 April, 2024; v1 submitted 13 January, 2024;
originally announced January 2024.
-
Representations of shifted quantum affine algebras and cluster algebras I. The simply-laced case
Authors:
Christof Geiss,
David Hernandez,
Bernard Leclerc
Abstract:
We introduce a family of cluster algebras of infinite rank associated with root systems of type $A$, $D$, $E$. We show that suitable completions of these cluster algebras are isomorphic to the Grothendieck rings of the categories $\mathcal{O}_\mathbb{Z}$ of the corresponding shifted quantum affine algebras. The cluster variables of a class of distinguished initial seeds are certain formal power se…
▽ More
We introduce a family of cluster algebras of infinite rank associated with root systems of type $A$, $D$, $E$. We show that suitable completions of these cluster algebras are isomorphic to the Grothendieck rings of the categories $\mathcal{O}_\mathbb{Z}$ of the corresponding shifted quantum affine algebras. The cluster variables of a class of distinguished initial seeds are certain formal power series defined by E. Frenkel and the second author, which satisfy a system of functional relations called $QQ$-system. We conjecture that all cluster monomials are classes of simple objects of $\mathcal{O}_\mathbb{Z}$. In the final section, we show that these cluster algebras contain infinitely many cluster subalgebras isomorphic to the coordinate ring of the open double Bruhat cell of the corresponding simple simply-connected algebraic group. This explains the similarity between $QQ$-system relations and certain generalized minor identities discovered by Fomin and Zelevinsky.
△ Less
Submitted 26 January, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
-
Extended Baxter relations and QQ-systems for quantum affine algebras
Authors:
Edward Frenkel,
David Hernandez
Abstract:
Generalized Baxter's TQ-relations and the QQ-system are systems of algebraic relations in the category O of representations of the Borel subalgebra of the quantum affine algebra U_q(g^), which we established in our earlier works arXiv:1308.3444 and arXiv:1606.05301. In the present paper, we conjecture a family of analogous relations labeled by elements of the Weyl group W of g, so that the origina…
▽ More
Generalized Baxter's TQ-relations and the QQ-system are systems of algebraic relations in the category O of representations of the Borel subalgebra of the quantum affine algebra U_q(g^), which we established in our earlier works arXiv:1308.3444 and arXiv:1606.05301. In the present paper, we conjecture a family of analogous relations labeled by elements of the Weyl group W of g, so that the original relations correspond to the identity element. These relations are closely connected to the W-symmetry of q-characters established in arXiv:2211.09779. We prove these relations for all w in W if g has rank two, and we prove the extended TQ-relations if w is a simple reflection. We also generalize our results and conjectures to the shifted quantum affine algebras.
△ Less
Submitted 29 May, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Learning to bag with a simulation-free reinforcement learning framework for robots
Authors:
Francisco Munguia-Galeano,
Jihong Zhu,
Juan David Hernández,
Ze Ji
Abstract:
Bagging is an essential skill that humans perform in their daily activities. However, deformable objects, such as bags, are complex for robots to manipulate. This paper presents an efficient learning-based framework that enables robots to learn bagging. The novelty of this framework is its ability to perform bagging without relying on simulations. The learning process is accomplished through a rei…
▽ More
Bagging is an essential skill that humans perform in their daily activities. However, deformable objects, such as bags, are complex for robots to manipulate. This paper presents an efficient learning-based framework that enables robots to learn bagging. The novelty of this framework is its ability to perform bagging without relying on simulations. The learning process is accomplished through a reinforcement learning algorithm introduced in this work, designed to find the best grasping points of the bag based on a set of compact state representations. The framework utilizes a set of primitive actions and represents the task in five states. In our experiments, the framework reaches a 60 % and 80 % of success rate after around three hours of training in the real world when starting the bagging task from folded and unfolded, respectively. Finally, we test the trained model with two more bags of different sizes to evaluate its generalizability.
△ Less
Submitted 22 October, 2023;
originally announced October 2023.
-
Galactic Gamma-Ray Diffuse Emission at TeV energies with HAWC Data
Authors:
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velazquez,
K. P. Arunbabu,
D. Avila Rojas,
R. Babu,
V. Baghmanyan,
E. Belmont-Moreno,
C. Brisbois,
K. S. Caballero-Mora,
T. Capistran,
A. Carraminana,
S. Casanova,
O. Chaparro-Amaro,
U. Cotti,
J. Cotzomi,
S. Coutino de Leon,
E. De la Fuente,
R. Diaz Hernandez,
M. A. DuVernois,
M. Durocher,
J. C. Dıaz-Velez,
K. Engel,
C. Espinoza,
K. L. Fan
, et al. (55 additional authors not shown)
Abstract:
The Galactic gamma-ray diffuse emission (GDE) is emitted by cosmic rays (CRs), ultra-relativistic protons and electrons, interacting with gas and electromagnetic radiation fields in the interstellar medium. Here we present the analysis of TeV diffuse emission from a region of the Galactic Plane over the range in longitude of $l\in[43^\circ,73^\circ]$, using data collected with the High Altitude Wa…
▽ More
The Galactic gamma-ray diffuse emission (GDE) is emitted by cosmic rays (CRs), ultra-relativistic protons and electrons, interacting with gas and electromagnetic radiation fields in the interstellar medium. Here we present the analysis of TeV diffuse emission from a region of the Galactic Plane over the range in longitude of $l\in[43^\circ,73^\circ]$, using data collected with the High Altitude Water Cherenkov (HAWC) detector. Spectral, longitudinal and latitudinal distributions of the TeV diffuse emission are shown. The radiation spectrum is compatible with the spectrum of the emission arising from a CR population with an "index" similar to that of the observed CRs. When comparing with the \texttt{DRAGON} \textit{base model}, the HAWC GDE flux is higher by about a factor of two. Unresolved sources such as pulsar wind nebulae and TeV halos could explain the excess emission. Finally, deviations of the Galactic CR flux from the locally measured CR flux may additionally explain the difference between the predicted and measured diffuse fluxes.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
HAWC Study of Very-High-Energy $γ$-ray Spectrum of HAWC J1844-034
Authors:
HAWC Collaboration,
A. Albert,
C. Alvarez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
M. Breuhaus,
T. Capistrán,
A. Carramiñana,
S. Casanova,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
D. Depaoli,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois,
M. Durocher,
K. Engel,
C. Espinoza,
K. L. Fan,
K. Fang,
N. Fraija,
J. A. García-González
, et al. (52 additional authors not shown)
Abstract:
Recently, the region surrounding eHWC J1842-035 has been studied extensively by gamma-ray observatories due to its extended emission reaching up to a few hundred TeV and potential as a hadronic accelerator. In this work, we use 1,910 days of cumulative data from the High Altitude Water Cherenkov (HAWC) observatory to carry out a dedicated systematic source search of the eHWC J1842-035 region. Duri…
▽ More
Recently, the region surrounding eHWC J1842-035 has been studied extensively by gamma-ray observatories due to its extended emission reaching up to a few hundred TeV and potential as a hadronic accelerator. In this work, we use 1,910 days of cumulative data from the High Altitude Water Cherenkov (HAWC) observatory to carry out a dedicated systematic source search of the eHWC J1842-035 region. During the search we have found three sources in the region, namely, HAWC J1844-034, HAWC J1843-032, and HAWC J1846-025. We have identified HAWC J1844-034 as the extended source that emits photons with energies up to 175 TeV. We compute the spectrum for HAWC J1844-034 and by comparing with the observational results from other experiments, we have identified HESS J1843-033, LHAASO J1843-0338, and TASG J1844-038 as very-high-energy gamma-ray sources with a matching origin. Also, we present and use the multi-wavelength data to fit the hadronic and leptonic particle spectra. We have identified four pulsar candidates in the nearby region from which PSR J1844-0346 is found to be the most likely candidate due to its proximity to HAWC J1844-034 and the computed energy budget. We have also found SNR G28.6-0.1 as a potential counterpart source of HAWC J1844-034 for which both leptonic and hadronic scenarios are feasible.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
Search for Decaying Dark Matter in the Virgo Cluster of Galaxies with HAWC
Authors:
A. Albert,
R. Alfaro,
J. C. Arteaga-Velázquez,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
J. Cotzomi,
S. Coutiño de León,
D. Depaoli,
R. Diaz Hernandez,
M. A. DuVernois,
M. Durocher,
N. Fraija,
J. A. García-González,
M. M. González,
J. A. Goodman,
J. P. Harding,
S. Hernández-Cadena,
I. Herzog,
D. Huang,
F. Hueyotl-Zahuantitla
, et al. (33 additional authors not shown)
Abstract:
The decay or annihilation of dark matter particles may produce a steady flux of very-high-energy gamma rays detectable above the diffuse background. Nearby clusters of galaxies provide excellent targets to search for the signatures of particle dark matter interactions. In particular, the Virgo cluster spans several degrees across the sky and can be efficiently probed with a wide field-of-view inst…
▽ More
The decay or annihilation of dark matter particles may produce a steady flux of very-high-energy gamma rays detectable above the diffuse background. Nearby clusters of galaxies provide excellent targets to search for the signatures of particle dark matter interactions. In particular, the Virgo cluster spans several degrees across the sky and can be efficiently probed with a wide field-of-view instrument. The High Altitude Water Cherenkov (HAWC) observatory, due to its wide field of view and sensitivity to gamma rays at an energy scale of 300 GeV--100 TeV is well-suited for this search. Using 2141 days of data, we search for gamma-ray emission from the Virgo cluster, assuming well-motivated dark matter sub-structure models. Our results provide some of the strongest constraints on the decay lifetime of dark matter for masses above 10 TeV.
△ Less
Submitted 10 January, 2024; v1 submitted 7 September, 2023;
originally announced September 2023.
-
Isometries of Almost-Riemannian structures on nonnilpotent, solvable 3D Lie groups
Authors:
Victor Ayala,
Adriano Da Silva,
Danilo A. Garcia Hernández
Abstract:
In this paper we prove that automorphisms are the only isometries between rank two Almost-Riemannian Structures on the class of nonnilpotent, solvable, connected 3D Lie groups. As a consequence, a classification result for rank two ARSs on the groups in question is obtained.
In this paper we prove that automorphisms are the only isometries between rank two Almost-Riemannian Structures on the class of nonnilpotent, solvable, connected 3D Lie groups. As a consequence, a classification result for rank two ARSs on the groups in question is obtained.
△ Less
Submitted 4 September, 2023;
originally announced September 2023.
-
ETHER: Aligning Emergent Communication for Hindsight Experience Replay
Authors:
Kevin Denamganaï,
Daniel Hernandez,
Ozan Vardal,
Sondess Missaoui,
James Alfred Walker
Abstract:
Natural language instruction following is paramount to enable collaboration between artificial agents and human beings. Natural language-conditioned reinforcement learning (RL) agents have shown how natural languages' properties, such as compositionality, can provide a strong inductive bias to learn complex policies. Previous architectures like HIGhER combine the benefit of language-conditioning w…
▽ More
Natural language instruction following is paramount to enable collaboration between artificial agents and human beings. Natural language-conditioned reinforcement learning (RL) agents have shown how natural languages' properties, such as compositionality, can provide a strong inductive bias to learn complex policies. Previous architectures like HIGhER combine the benefit of language-conditioning with Hindsight Experience Replay (HER) to deal with sparse rewards environments. Yet, like HER, HIGhER relies on an oracle predicate function to provide a feedback signal highlighting which linguistic description is valid for which state. This reliance on an oracle limits its application. Additionally, HIGhER only leverages the linguistic information contained in successful RL trajectories, thus hurting its final performance and data-efficiency. Without early successful trajectories, HIGhER is no better than DQN upon which it is built. In this paper, we propose the Emergent Textual Hindsight Experience Replay (ETHER) agent, which builds on HIGhER and addresses both of its limitations by means of (i) a discriminative visual referential game, commonly studied in the subfield of Emergent Communication (EC), used here as an unsupervised auxiliary task and (ii) a semantic grounding scheme to align the emergent language with the natural language of the instruction-following benchmark. We show that the referential game's agents make an artificial language emerge that is aligned with the natural-like language used to describe goals in the BabyAI benchmark and that it is expressive enough so as to also describe unsuccessful RL trajectories and thus provide feedback to the RL agent to leverage the linguistic, structured information contained in all trajectories. Our work shows that EC is a viable unsupervised auxiliary task for RL and provides missing pieces to make HER more widely applicable.
△ Less
Submitted 17 December, 2023; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Measuring Faithfulness in Chain-of-Thought Reasoning
Authors:
Tamera Lanham,
Anna Chen,
Ansh Radhakrishnan,
Benoit Steiner,
Carson Denison,
Danny Hernandez,
Dustin Li,
Esin Durmus,
Evan Hubinger,
Jackson Kernion,
Kamilė Lukošiūtė,
Karina Nguyen,
Newton Cheng,
Nicholas Joseph,
Nicholas Schiefer,
Oliver Rausch,
Robin Larson,
Sam McCandlish,
Sandipan Kundu,
Saurav Kadavath,
Shannon Yang,
Thomas Henighan,
Timothy Maxwell,
Timothy Telleen-Lawton,
Tristan Hume
, et al. (5 additional authors not shown)
Abstract:
Large language models (LLMs) perform better when they produce step-by-step, "Chain-of-Thought" (CoT) reasoning before answering a question, but it is unclear if the stated reasoning is a faithful explanation of the model's actual reasoning (i.e., its process for answering the question). We investigate hypotheses for how CoT reasoning may be unfaithful, by examining how the model predictions change…
▽ More
Large language models (LLMs) perform better when they produce step-by-step, "Chain-of-Thought" (CoT) reasoning before answering a question, but it is unclear if the stated reasoning is a faithful explanation of the model's actual reasoning (i.e., its process for answering the question). We investigate hypotheses for how CoT reasoning may be unfaithful, by examining how the model predictions change when we intervene on the CoT (e.g., by adding mistakes or paraphrasing it). Models show large variation across tasks in how strongly they condition on the CoT when predicting their answer, sometimes relying heavily on the CoT and other times primarily ignoring it. CoT's performance boost does not seem to come from CoT's added test-time compute alone or from information encoded via the particular phrasing of the CoT. As models become larger and more capable, they produce less faithful reasoning on most tasks we study. Overall, our results suggest that CoT can be faithful if the circumstances such as the model size and task are carefully chosen.
△ Less
Submitted 16 July, 2023;
originally announced July 2023.
-
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Authors:
Ansh Radhakrishnan,
Karina Nguyen,
Anna Chen,
Carol Chen,
Carson Denison,
Danny Hernandez,
Esin Durmus,
Evan Hubinger,
Jackson Kernion,
Kamilė Lukošiūtė,
Newton Cheng,
Nicholas Joseph,
Nicholas Schiefer,
Oliver Rausch,
Sam McCandlish,
Sheer El Showk,
Tamera Lanham,
Tim Maxwell,
Venkatesa Chandrasekaran,
Zac Hatfield-Dodds,
Jared Kaplan,
Jan Brauner,
Samuel R. Bowman,
Ethan Perez
Abstract:
As large language models (LLMs) perform more difficult tasks, it becomes harder to verify the correctness and safety of their behavior. One approach to help with this issue is to prompt LLMs to externalize their reasoning, e.g., by having them generate step-by-step reasoning as they answer a question (Chain-of-Thought; CoT). The reasoning may enable us to check the process that models use to perfo…
▽ More
As large language models (LLMs) perform more difficult tasks, it becomes harder to verify the correctness and safety of their behavior. One approach to help with this issue is to prompt LLMs to externalize their reasoning, e.g., by having them generate step-by-step reasoning as they answer a question (Chain-of-Thought; CoT). The reasoning may enable us to check the process that models use to perform tasks. However, this approach relies on the stated reasoning faithfully reflecting the model's actual reasoning, which is not always the case. To improve over the faithfulness of CoT reasoning, we have models generate reasoning by decomposing questions into subquestions. Decomposition-based methods achieve strong performance on question-answering tasks, sometimes approaching that of CoT while improving the faithfulness of the model's stated reasoning on several recently-proposed metrics. By forcing the model to answer simpler subquestions in separate contexts, we greatly increase the faithfulness of model-generated reasoning over CoT, while still achieving some of the performance gains of CoT. Our results show it is possible to improve the faithfulness of model-generated reasoning; continued improvements may lead to reasoning that enables us to verify the correctness and safety of LLM behavior.
△ Less
Submitted 25 July, 2023; v1 submitted 16 July, 2023;
originally announced July 2023.
-
Co-creating a Transdisciplinary Map of Technology-mediated Harms, Risks and Vulnerabilities: Challenges, Ambivalences and Opportunities
Authors:
Andrés Domínguez Hernández,
Kopo M. Ramokapane,
Partha Das Chowdhury,
Ola Michalec,
Emily Johnstone,
Emily Godwin,
Alicia G Cork,
Awais Rashid
Abstract:
The phrase "online harms" has emerged in recent years out of a growing political willingness to address the ethical and social issues associated with the use of the Internet and digital technology at large. The broad landscape that surrounds online harms gathers a multitude of disciplinary, sectoral and organizational efforts while raising myriad challenges and opportunities for the crossing entre…
▽ More
The phrase "online harms" has emerged in recent years out of a growing political willingness to address the ethical and social issues associated with the use of the Internet and digital technology at large. The broad landscape that surrounds online harms gathers a multitude of disciplinary, sectoral and organizational efforts while raising myriad challenges and opportunities for the crossing entrenched boundaries. In this paper we draw lessons from a journey of co-creating a transdisciplinary knowledge infrastructure within a large research initiative animated by the online harms agenda. We begin with a reflection of the implications of mapping, taxonomizing and constructing knowledge infrastructures and a brief review of how online harm and adjacent themes have been theorized and classified in the literature to date. Grounded on our own experience of co-creating a map of online harms, we then argue that the map -- and the process of mapping -- perform three mutually constitutive functions, acting simultaneously as method, medium and provocation. We draw lessons from how an open-ended approach to mapping, despite not guaranteeing consensus, can foster productive debate and collaboration in ethically and politically fraught areas of research. We end with a call for CSCW research to surface and engage with the multiple temporalities, social lives and political sensibilities of knowledge infrastructures.
△ Less
Submitted 19 July, 2023; v1 submitted 5 July, 2023;
originally announced July 2023.
-
Towards Measuring the Representation of Subjective Global Opinions in Language Models
Authors:
Esin Durmus,
Karina Nguyen,
Thomas I. Liao,
Nicholas Schiefer,
Amanda Askell,
Anton Bakhtin,
Carol Chen,
Zac Hatfield-Dodds,
Danny Hernandez,
Nicholas Joseph,
Liane Lovitt,
Sam McCandlish,
Orowa Sikder,
Alex Tamkin,
Janel Thamkul,
Jared Kaplan,
Jack Clark,
Deep Ganguli
Abstract:
Large language models (LLMs) may not equitably represent diverse global perspectives on societal issues. In this paper, we develop a quantitative framework to evaluate whose opinions model-generated responses are more similar to. We first build a dataset, GlobalOpinionQA, comprised of questions and answers from cross-national surveys designed to capture diverse opinions on global issues across dif…
▽ More
Large language models (LLMs) may not equitably represent diverse global perspectives on societal issues. In this paper, we develop a quantitative framework to evaluate whose opinions model-generated responses are more similar to. We first build a dataset, GlobalOpinionQA, comprised of questions and answers from cross-national surveys designed to capture diverse opinions on global issues across different countries. Next, we define a metric that quantifies the similarity between LLM-generated survey responses and human responses, conditioned on country. With our framework, we run three experiments on an LLM trained to be helpful, honest, and harmless with Constitutional AI. By default, LLM responses tend to be more similar to the opinions of certain populations, such as those from the USA, and some European and South American countries, highlighting the potential for biases. When we prompt the model to consider a particular country's perspective, responses shift to be more similar to the opinions of the prompted populations, but can reflect harmful cultural stereotypes. When we translate GlobalOpinionQA questions to a target language, the model's responses do not necessarily become the most similar to the opinions of speakers of those languages. We release our dataset for others to use and build on. Our data is at https://huggingface.co/datasets/Anthropic/llm_global_opinions. We also provide an interactive visualization at https://llmglobalvalues.anthropic.com.
△ Less
Submitted 11 April, 2024; v1 submitted 28 June, 2023;
originally announced June 2023.
-
A New View on Density Corrected DFT: Can One Get a Better Answer for a Good Reason?
Authors:
Devin J. Hernandez,
Adam Rettig,
Martin Head-Gordon
Abstract:
Despite its widespread use, density functional theory (DFT) has several notable areas of failure; perhaps the most well-studied of these failures is self-interaction error (SIE). Density corrected DFT (DC-DFT) was proposed as a potential solution to systems where SIE causes traditional DFT to fail. The Hartree-Fock (HF) density is then used for cases where the DFT energy is suitable but the self-c…
▽ More
Despite its widespread use, density functional theory (DFT) has several notable areas of failure; perhaps the most well-studied of these failures is self-interaction error (SIE). Density corrected DFT (DC-DFT) was proposed as a potential solution to systems where SIE causes traditional DFT to fail. The Hartree-Fock (HF) density is then used for cases where the DFT energy is suitable but the self-consistent density is erroneous. In this study, we investigate the utility of the higher quality orbital optimized MP2 densities in DC-DFT for barrier heights and halogen bonded complexes. For functionals such as PBE and r$^2$SCAN, find that these densities yield worse results than the HF density due to favorable cancellation between the density-driven and functional-driven errors, confirming a recent study. Error decomposition reveals functional driven error, not density driven error, to be the primary cause of inaccuracy in DFT calculations where SIE is prominent. We therefore advise caution when using HF-DFT, because the only rigorous way to remove large functional-driven errors in lower rungs of Jacob's ladder is by climbing to higher rungs that include exact exchange. We recommend that better functionals be improved by using a better density in SIE-sensitive cases. Examples support the value of this variant of DC-DFT. We also emphasize that DC-DFT potential energy surfaces have first derivative discontinuities at Coulson-Fischer points, in contrast to the second derivative discontinuities in SCF solutions.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Mercury's chaotic secular evolution as a subdiffusive process
Authors:
Dorian S. Abbot,
Robert J. Webber,
David M. Hernandez,
Sam Hadden,
Jonathan Weare
Abstract:
Mercury's orbit can destabilize, generally resulting in a collision with either Venus or the Sun. Chaotic evolution can cause g1 to decrease to the approximately constant value of g5 and create a resonance. Previous work has approximated the variation in g1 as stochastic diffusion, which leads to a phenomological model that can reproduce the Mercury instability statistics of secular and N-body mod…
▽ More
Mercury's orbit can destabilize, generally resulting in a collision with either Venus or the Sun. Chaotic evolution can cause g1 to decrease to the approximately constant value of g5 and create a resonance. Previous work has approximated the variation in g1 as stochastic diffusion, which leads to a phenomological model that can reproduce the Mercury instability statistics of secular and N-body models on timescales longer than 10 Gyr. Here we show that the diffusive model underpredicts the Mercury instability probability by a factor of 3-10,000 on timescales less than 5 Gyr, the remaining lifespan of the Solar System. This is because g1 exhibits larger variations on short timescales than the diffusive model would suggest. To better model the variations on short timescales, we build a new subdiffusive phenomological model for g1. Subdiffusion is similar to diffusion but exhibits larger displacements on short timescales and smaller displacements on long timescales. We choose model parameters based on the behavior of the g1 trajectories in the N-body simulations, leading to a tuned model that can reproduce Mercury instability statistics from 1-40 Gyr. This work motivates fundamental questions in Solar System dynamics: Why does subdiffusion better approximate the variation in g1 than standard diffusion? Why is there an upper bound on g1, but not a lower bound that would prevent it from reaching g5?
△ Less
Submitted 12 April, 2024; v1 submitted 20 June, 2023;
originally announced June 2023.
-
Composing Efficient, Robust Tests for Policy Selection
Authors:
Dustin Morrill,
Thomas J. Walsh,
Daniel Hernandez,
Peter R. Wurman,
Peter Stone
Abstract:
Modern reinforcement learning systems produce many high-quality policies throughout the learning process. However, to choose which policy to actually deploy in the real world, they must be tested under an intractable number of environmental conditions. We introduce RPOSST, an algorithm to select a small set of test cases from a larger pool based on a relatively small number of sample evaluations.…
▽ More
Modern reinforcement learning systems produce many high-quality policies throughout the learning process. However, to choose which policy to actually deploy in the real world, they must be tested under an intractable number of environmental conditions. We introduce RPOSST, an algorithm to select a small set of test cases from a larger pool based on a relatively small number of sample evaluations. RPOSST treats the test case selection problem as a two-player game and optimizes a solution with provable $k$-of-$N$ robustness, bounding the error relative to a test that used all the test cases in the pool. Empirical results demonstrate that RPOSST finds a small set of test cases that identify high quality policies in a toy one-shot game, poker datasets, and a high-fidelity racing simulator.
△ Less
Submitted 12 June, 2023;
originally announced June 2023.
-
An optimized search for dark matter in the galactic halo with HAWC
Authors:
A. Albert,
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velazquez,
D. Avila Rojas,
H. A. Ayala Solares,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistran,
A. Carraminana,
S. Casanova,
O. Chaparro-Amaro,
U. Cotti,
J. Cotzomi,
E. De la Fuente,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois,
M. Durocher,
J. C. Dıaz-Velez,
C. Espinoza,
K. L. Fan,
N. Fraija,
J. A. Garcıa-Gonzalez,
F. Garfias
, et al. (41 additional authors not shown)
Abstract:
The Galactic Halo is a key target for indirect dark matter detection. The High Altitude Water Cherenkov (HAWC) observatory is a high-energy (~300 GeV to >100 TeV) gamma-ray detector located in central Mexico. HAWC operates via the water Cherenkov technique and has both a wide field of view of 2 sr and a >95% duty cycle, making it ideal for analyses of highly extended sources. We made use of these…
▽ More
The Galactic Halo is a key target for indirect dark matter detection. The High Altitude Water Cherenkov (HAWC) observatory is a high-energy (~300 GeV to >100 TeV) gamma-ray detector located in central Mexico. HAWC operates via the water Cherenkov technique and has both a wide field of view of 2 sr and a >95% duty cycle, making it ideal for analyses of highly extended sources. We made use of these properties of HAWC and a new background-estimation technique optimized for extended sources to probe a large region of the Galactic Halo for dark matter signals. With this approach, we set improved constraints on dark matter annihilation and decay between masses of 10 and 100 TeV. Due to the large spatial extent of the HAWC field of view, these constraints are robust against uncertainties in the Galactic dark matter spatial profile.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
A Contribution of the HAWC Observatory to the TeV era in the High Energy Gamma-Ray Astrophysics: The case of the TeV-Halos
Authors:
Ramiro Torres-Escobedo,
Hao Zhou,
Eduardo de la Fuente,
A. U. Abeysekara,
A. Albert,
R. Alfaro,
C. Alvarez,
J. D. Álvarez,
J. R. Angeles Camacho,
J. C. Arteaga-Velázquez,
K. P. Arunbabu,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
V. Baghmanyan,
A. S. Barber,
J. Becerra Gonzalez,
E. Belmont-Moreno,
S. Y. BenZvi,
D. Berley,
C. Brisbois,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova
, et al. (108 additional authors not shown)
Abstract:
We present a short overview of the TeV-Halos objects as a discovery and a relevant contribution of the High Altitude Water Čerenkov (HAWC) observatory to TeV astrophysics. We discuss history, discovery, knowledge, and the next step through a new and more detailed analysis than the original study in 2017. TeV-Halos will contribute to resolving the problem of the local positron excess observed on th…
▽ More
We present a short overview of the TeV-Halos objects as a discovery and a relevant contribution of the High Altitude Water Čerenkov (HAWC) observatory to TeV astrophysics. We discuss history, discovery, knowledge, and the next step through a new and more detailed analysis than the original study in 2017. TeV-Halos will contribute to resolving the problem of the local positron excess observed on the Earth. To clarify the latter, understanding the diffusion process is mandatory.
△ Less
Submitted 13 April, 2023;
originally announced April 2023.
-
Array of Individual Circular Rydberg Atoms Trapped in Optical Tweezers
Authors:
Brice Ravon,
Paul Méhaignerie,
Yohann Machu,
Andrés Durán Hernández,
Maxime Favier,
Jean-Michel Raimond,
Michel Brune,
Clément Sayrin
Abstract:
Circular Rydberg atoms (CRAs), i.e., Rydberg atoms with maximal orbital momentum, are highly promising for quantum computation, simulation and sensing. They combine long natural lifetimes with strong inter-atomic interactions and coupling to electromagnetic fields. Trapping individual CRAs is essential to harness these unique features. We report the first demonstration of CRAs laser-trapping in a…
▽ More
Circular Rydberg atoms (CRAs), i.e., Rydberg atoms with maximal orbital momentum, are highly promising for quantum computation, simulation and sensing. They combine long natural lifetimes with strong inter-atomic interactions and coupling to electromagnetic fields. Trapping individual CRAs is essential to harness these unique features. We report the first demonstration of CRAs laser-trapping in a programmable array of optical bottle beams. We observe the decay of a trapped Rubidium circular level over 5ms using a novel optical detection method. This first optical detection of alkali CRAs is both spatially- and level selective. We finally observe the mechanical oscillations of the CRAs in the traps. This work opens the route to the use of circular levels in quantum devices. It is also promising for quantum simulation and information processing using the full extent of Rydberg manifolds.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
Isomorphisms among quantum Grothendieck rings and cluster algebras
Authors:
Ryo Fujita,
David Hernandez,
Se-jin Oh,
Hironori Oya
Abstract:
We establish a cluster theoretical interpretation of the isomorphisms of [F.-H.-O.-O., J. Reine Angew. Math., 2022] among quantum Grothendieck rings of representations of quantum loop algebras. Consequently, we obtain a quantization of the monoidal categorification theorem of [Kashiwara-Kim-Oh-Park, arXiv:2103.10067]. We establish applications of these new ingredients. First we solve long-standing…
▽ More
We establish a cluster theoretical interpretation of the isomorphisms of [F.-H.-O.-O., J. Reine Angew. Math., 2022] among quantum Grothendieck rings of representations of quantum loop algebras. Consequently, we obtain a quantization of the monoidal categorification theorem of [Kashiwara-Kim-Oh-Park, arXiv:2103.10067]. We establish applications of these new ingredients. First we solve long-standing problems for any non-simply-laced quantum loop algebras: the positivity of $(q,t)$-characters of all simple modules, and the analog of Kazhdan-Lusztig conjecture for all reachable modules (in the cluster monoidal categorification). We also establish the conjectural quantum $T$-systems for the $(q,t)$-characters of Kirillov-Reshetikhin modules. Eventually, we show that our isomorphisms arise from explicit birational transformations of variables, which we call substitution formulas. This reveals new non-trivial relations among $(q, t)$-characters of simple modules.
△ Less
Submitted 6 May, 2023; v1 submitted 5 April, 2023;
originally announced April 2023.
-
The High-Altitude Water Cherenkov (HAWC) Observatory in México: The Primary Detector
Authors:
A. U. Abeysekara,
A. Albert,
R. Alfaro,
C. Álvarez,
J. D. Álvarez,
M. Araya,
J. C. Arteaga-Velázquez,
K. P. Arunbabu,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
A. S. Barber,
A. Becerril,
E. Belmont-Moreno,
S. Y. BenZvi,
O. Blanco,
J. Braun,
C. Brisbois,
K. S. Caballero-Mora,
J. I. Cabrera Martínez,
T. Capistrán,
A. Carramiñana,
S. Casanova,
M. Castillo,
O. Chaparro-Amaro
, et al. (118 additional authors not shown)
Abstract:
The High-Altitude Water Cherenkov (HAWC) observatory is a second-generation continuously operated, wide field-of-view, TeV gamma-ray observatory. The HAWC observatory and its analysis techniques build on experience of the Milagro experiment in using ground-based water Cherenkov detectors for gamma-ray astronomy. HAWC is located on the Sierra Negra volcano in México at an elevation of 4100 meters a…
▽ More
The High-Altitude Water Cherenkov (HAWC) observatory is a second-generation continuously operated, wide field-of-view, TeV gamma-ray observatory. The HAWC observatory and its analysis techniques build on experience of the Milagro experiment in using ground-based water Cherenkov detectors for gamma-ray astronomy. HAWC is located on the Sierra Negra volcano in México at an elevation of 4100 meters above sea level. The completed HAWC observatory principal detector (HAWC) consists of 300 closely spaced water Cherenkov detectors, each equipped with four photomultiplier tubes to provide timing and charge information to reconstruct the extensive air shower energy and arrival direction. The HAWC observatory has been optimized to observe transient and steady emission from sources of gamma rays within an energy range from several hundred GeV to several hundred TeV. However, most of the air showers detected are initiated by cosmic rays, allowing studies of cosmic rays also to be performed. This paper describes the characteristics of the HAWC main array and its hardware.
△ Less
Submitted 10 April, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
ASSIST: An Ephemeris-Quality Test Particle Integrator
Authors:
Matthew J. Holman,
Arya Akmal,
Davide Farnocchia,
Hanno Rein,
Matthew J. Payne,
Robert Weryk,
Daniel Tamayo,
David M. Hernandez
Abstract:
We introduce ASSIST, a software package for ephemeris-quality integrations of test particles. ASSIST is an extension of the REBOUND framework and makes use of its IAS15 integrator to integrate test particle trajectories in the field of the Sun, Moon, planets, and 16 massive asteroids, with the positions of the masses coming from the JPL DE441 ephemeris and its associated asteroid perturber file. T…
▽ More
We introduce ASSIST, a software package for ephemeris-quality integrations of test particles. ASSIST is an extension of the REBOUND framework and makes use of its IAS15 integrator to integrate test particle trajectories in the field of the Sun, Moon, planets, and 16 massive asteroids, with the positions of the masses coming from the JPL DE441 ephemeris and its associated asteroid perturber file. The package incorporates the most significant gravitational harmonics and general relativistic corrections. ASSIST also accounts for position- and velocity-dependent non-gravitational effects. The first order variational equations are included for all terms to support orbit fitting and covariance mapping. This new framework is meant to provide an open-source package written in a modern language to enable high-precision orbital analysis and science by the small body community. ASSIST is open source, freely distributed under the GNU General Public license, version 3.
△ Less
Submitted 28 March, 2023;
originally announced March 2023.
-
VISIONS: The VISTA Star Formation Atlas -- II. The data processing pipeline
Authors:
Stefan Meingast,
Hervé Bouy,
Verena Fürnkranz,
David Hernandez,
Alena Rottensteiner,
Erik Brändli
Abstract:
The VISIONS public survey provides large-scale, multiepoch imaging of five nearby star-forming regions at subarcsecond resolution in the near-infrared. All data collected within the program and provided by the European Southern Observatory (ESO) science archive are processed with a custom end-to-end pipeline infrastructure to provide science-ready images and source catalogs. The data reduction env…
▽ More
The VISIONS public survey provides large-scale, multiepoch imaging of five nearby star-forming regions at subarcsecond resolution in the near-infrared. All data collected within the program and provided by the European Southern Observatory (ESO) science archive are processed with a custom end-to-end pipeline infrastructure to provide science-ready images and source catalogs. The data reduction environment has been specifically developed for the purpose of mitigating several shortcomings of the bona fide data products processed with software provided by the Cambridge Astronomical Survey Unit (CASU), such as spatially variable astrometric and photometric biases of up to 100 mas and 0.1 mag, respectively. At the same time, the resolution of coadded images is up to 20% higher compared to the same products from the CASU processing environment. Most pipeline modules are written in Python and make extensive use of C extension libraries for numeric computations, thereby simultaneously providing accessibility, robustness, and high performance. The astrometric calibration is performed relative to the Gaia reference frame, and fluxes are calibrated with respect to the source magnitudes provided in the Two Micron All Sky Survey (2MASS). For bright sources, absolute astrometric errors are typically on the order of 10 to 15 mas and fluxes are determined with subpercent precision. Moreover, the calibration with respect to 2MASS photometry is largely free of color terms. The pipeline produces data that are compliant with the ESO Phase 3 regulations and furthermore provides curated source catalogs that are structured similarly to those provided by the 2MASS survey.
△ Less
Submitted 15 March, 2023;
originally announced March 2023.
-
VISIONS: The VISTA Star Formation Atlas -- I. Survey overview
Authors:
Stefan Meingast,
João Alves,
Hervé Bouy,
Monika G. Petr-Gotzens,
Verena Fürnkranz,
Josefa E. Großschedl,
David Hernandez,
Alena Rottensteiner,
Magda Arnaboldi,
Joana Ascenso,
Amelia Bayo,
Erik Brändli,
Anthony G. A. Brown,
Jan Forbrich,
Alyssa Goodman,
Alvaro Hacar,
Birgit Hasenberger,
Rainer Köhler,
Karolina Kubiak,
Michael Kuhn,
Charles Lada,
Kieran Leschinski,
Marco Lombardi,
Diego Mardones,
Laura Mascetti
, et al. (15 additional authors not shown)
Abstract:
VISIONS is an ESO public survey of five nearby (d < 500 pc) star-forming molecular cloud complexes that are canonically associated with the constellations of Chamaeleon, Corona Australis, Lupus, Ophiuchus, and Orion. The survey was carried out with VISTA, using VIRCAM, and collected data in the near-infrared passbands J, H, and Ks. With a total on-sky exposure time of 49.4 h VISIONS covers an area…
▽ More
VISIONS is an ESO public survey of five nearby (d < 500 pc) star-forming molecular cloud complexes that are canonically associated with the constellations of Chamaeleon, Corona Australis, Lupus, Ophiuchus, and Orion. The survey was carried out with VISTA, using VIRCAM, and collected data in the near-infrared passbands J, H, and Ks. With a total on-sky exposure time of 49.4 h VISIONS covers an area of 650 deg$^2$, and it was designed to build an infrared legacy archive similar to that of 2MASS. Taking place between April 2017 and March 2022, the observations yielded approximately 1.15 million images, which comprise 19 TB of raw data. The observations are grouped into three different subsurveys: The wide subsurvey comprises shallow, large-scale observations and has visited the star-forming complexes six times over the course of its execution. The deep subsurvey of dedicated high-sensitivity observations has collected data on the areas with the largest amounts of dust extinction. The control subsurvey includes observations of areas of low-to-negligible dust extinction. Using this strategy, the VISIONS survey offers multi-epoch position measurements, is able to access deeply embedded objects, and provides a baseline for statistical comparisons and sample completeness. In particular, VISIONS is designed to measure the proper motions of point sources with a precision of 1 mas/yr or better, when complemented with data from VHS. Hence, VISIONS can provide proper motions for sources inaccessible to Gaia. VISIONS will enable addressing a range of topics, including the 3D distribution and motion of embedded stars and the nearby interstellar medium, the identification and characterization of young stellar objects, the formation and evolution of embedded stellar clusters and their initial mass function, as well as the characteristics of interstellar dust and the reddening law.
△ Less
Submitted 15 March, 2023;
originally announced March 2023.
-
A toolkit of dilemmas: Beyond debiasing and fairness formulas for responsible AI/ML
Authors:
Andrés Domínguez Hernández,
Vassilis Galanos
Abstract:
Approaches to fair and ethical AI have recently fell under the scrutiny of the emerging, chiefly qualitative, field of critical data studies, placing emphasis on the lack of sensitivity to context and complex social phenomena of such interventions. We employ some of these lessons to introduce a tripartite decision-making toolkit, informed by dilemmas encountered in the pursuit of responsible AI/ML…
▽ More
Approaches to fair and ethical AI have recently fell under the scrutiny of the emerging, chiefly qualitative, field of critical data studies, placing emphasis on the lack of sensitivity to context and complex social phenomena of such interventions. We employ some of these lessons to introduce a tripartite decision-making toolkit, informed by dilemmas encountered in the pursuit of responsible AI/ML. These are: (a) the opportunity dilemma between the availability of data shaping problem statements vs problem statements shaping data; (b) the trade-off between scalability and contextualizability (too much data versus too specific data); and (c) the epistemic positioning between the pragmatic technical objectivism and the reflexive relativism in acknowledging the social. This paper advocates for a situated reasoning and creative engagement with the dilemmas surrounding responsible algorithmic/data-driven systems, and going beyond the formulaic bias elimination and ethics operationalization narratives found in the fair-AI literature.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Detailed Analysis of the TeV γ-Ray Sources 3HWC J1928+178, 3HWC J1930+188, and the New Source HAWC J1932+192
Authors:
A. Albert,
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
C. Brisbois,
K. S. Caballero-Mora,
T. Capistrń,
A. Carramiñana,
S. Casanova,
O. Chaparro-Amaro,
U. Cotti,
J. Cotzomi,
S. CoutiñodeLeón,
E. De la Fuente,
C. de León,
R. Diaz Hernandez,
J. C. Díaz-Vélez,
B. L. Dingus,
M. A. DuVernois,
M. Durocher,
K. Engel
, et al. (69 additional authors not shown)
Abstract:
The latest High Altitude Water Cherenkov (HAWC) point-like source catalog up to 56 TeV reported the detection of two sources in the region of the Galactic plane at galactic longitude 52°< l < 55°, 3HWC J1930+188 and 3HWC J1928+178. The first one is associated with a known TeV source, the supernova remnant SNR G054.1+00.3. It was discovered by one of the currently operating Imaging Atmospheric Cher…
▽ More
The latest High Altitude Water Cherenkov (HAWC) point-like source catalog up to 56 TeV reported the detection of two sources in the region of the Galactic plane at galactic longitude 52°< l < 55°, 3HWC J1930+188 and 3HWC J1928+178. The first one is associated with a known TeV source, the supernova remnant SNR G054.1+00.3. It was discovered by one of the currently operating Imaging Atmospheric Cherenkov Telescope (IACT), the Very Energetic Radiation Imaging Telescope Array System (VERITAS), detected by the High Energy Stereoscopic System (H.E.S.S.), and identified as a composite SNR. However, the source 3HWC J1928+178, discovered by HAWC and coincident with the pulsar PSR J1928+1746, was not detected by any IACT despite their long exposure on the region, until a recent new analysis of H.E.S.S. data was able to confirm it. Moreover, no X-ray counterpart has been detected from this pulsar. We present a multicomponent fit of this region using the latest HAWC data. This reveals an additional new source, HAWC J1932+192, which is potentially associated with the pulsar PSR J1932+1916, whose gamma-ray emission could come from the acceleration of particles in its pulsar wind nebula. In the case of 3HWC J1928+178, several possible explanations are explored, in a attempt to unveil the origins of the very-high-energy gamma-ray emission.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
Searching for TeV Dark Matter in Irregular dwarf galaxies with HAWC Observatory
Authors:
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
O. Chaparro-Amaro,
U. Cotti,
J. Cotzomi,
E. De la Fuente,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois,
M. Durocher,
J. C. Díaz-Vélez,
C. Espinoza,
K. L. Fan,
N. Fraija,
J. A. García-González,
F. Garfias
, et al. (47 additional authors not shown)
Abstract:
We present the results of dark matter (DM) searches in a sample of 31 dwarf irregular (dIrr) galaxies within the field of view of the HAWC Observatory. dIrr galaxies are DM dominated objects, which astrophysical gamma-ray emission is estimated to be negligible with respect to the secondary gamma-ray flux expected by annihilation or decay of Weakly Interacting Massive Particles (WIMPs). While we do…
▽ More
We present the results of dark matter (DM) searches in a sample of 31 dwarf irregular (dIrr) galaxies within the field of view of the HAWC Observatory. dIrr galaxies are DM dominated objects, which astrophysical gamma-ray emission is estimated to be negligible with respect to the secondary gamma-ray flux expected by annihilation or decay of Weakly Interacting Massive Particles (WIMPs). While we do not see any statistically significant DM signal in dIrr galaxies, we present the exclusion limits ($95\%~\text{C.L.}$) for annihilation cross-section and decay lifetime for WIMP candidates with masses between $1$ and $100~\text{TeV}$. Exclusion limits from dIrr galaxies are relevant and complementary to benchmark dwarf Spheroidal (dSph) galaxies. In fact, dIrr galaxies are targets kinematically different from benchmark dSph, preserving the footprints of different evolution histories. We compare the limits from dIrr galaxies to those from ultrafaint and classical dSph galaxies previously observed with HAWC. We find that the contraints are comparable to the limits from classical dSph galaxies and $\thicksim2$ orders of magnitude weaker than the ultrafaint dSph limits.
△ Less
Submitted 15 February, 2023;
originally announced February 2023.
-
The Capacity for Moral Self-Correction in Large Language Models
Authors:
Deep Ganguli,
Amanda Askell,
Nicholas Schiefer,
Thomas I. Liao,
Kamilė Lukošiūtė,
Anna Chen,
Anna Goldie,
Azalia Mirhoseini,
Catherine Olsson,
Danny Hernandez,
Dawn Drain,
Dustin Li,
Eli Tran-Johnson,
Ethan Perez,
Jackson Kernion,
Jamie Kerr,
Jared Mueller,
Joshua Landau,
Kamal Ndousse,
Karina Nguyen,
Liane Lovitt,
Michael Sellitto,
Nelson Elhage,
Noemi Mercado,
Nova DasSarma
, et al. (24 additional authors not shown)
Abstract:
We test the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to "morally self-correct" -- to avoid producing harmful outputs -- if instructed to do so. We find strong evidence in support of this hypothesis across three different experiments, each of which reveal different facets of moral self-correction. We find that the capability…
▽ More
We test the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to "morally self-correct" -- to avoid producing harmful outputs -- if instructed to do so. We find strong evidence in support of this hypothesis across three different experiments, each of which reveal different facets of moral self-correction. We find that the capability for moral self-correction emerges at 22B model parameters, and typically improves with increasing model size and RLHF training. We believe that at this level of scale, language models obtain two capabilities that they can use for moral self-correction: (1) they can follow instructions and (2) they can learn complex normative concepts of harm like stereotyping, bias, and discrimination. As such, they can follow instructions to avoid certain kinds of morally harmful outputs. We believe our results are cause for cautious optimism regarding the ability to train language models to abide by ethical principles.
△ Less
Submitted 18 February, 2023; v1 submitted 14 February, 2023;
originally announced February 2023.
-
Link Prediction with Attention Applied on Multiple Knowledge Graph Embedding Models
Authors:
Cosimo Gregucci,
Mojtaba Nayyeri,
Daniel Hernández,
Steffen Staab
Abstract:
Predicting missing links between entities in a knowledge graph is a fundamental task to deal with the incompleteness of data on the Web. Knowledge graph embeddings map nodes into a vector space to predict new links, scoring them according to geometric criteria. Relations in the graph may follow patterns that can be learned, e.g., some relations might be symmetric and others might be hierarchical.…
▽ More
Predicting missing links between entities in a knowledge graph is a fundamental task to deal with the incompleteness of data on the Web. Knowledge graph embeddings map nodes into a vector space to predict new links, scoring them according to geometric criteria. Relations in the graph may follow patterns that can be learned, e.g., some relations might be symmetric and others might be hierarchical. However, the learning capability of different embedding models varies for each pattern and, so far, no single model can learn all patterns equally well. In this paper, we combine the query representations from several models in a unified one to incorporate patterns that are independently captured by each model. Our combination uses attention to select the most suitable model to answer each query. The models are also mapped onto a non-Euclidean manifold, the Poincaré ball, to capture structural patterns, such as hierarchies, besides relational patterns, such as symmetry. We prove that our combination provides a higher expressiveness and inference power than each model on its own. As a result, the combined model can learn relational and structural patterns. We conduct extensive experimental analysis with various link prediction benchmarks showing that the combined model outperforms individual models, including state-of-the-art approaches.
△ Less
Submitted 13 February, 2023;
originally announced February 2023.
-
Switching integrators reversibly in the astrophysical $N$-body problem
Authors:
David M. Hernandez,
Walter Dehnen
Abstract:
We present a simple algorithm to switch between $N$-body time integrators in a reversible way. We apply it to planetary systems undergoing arbitrarily close encounters and highly eccentric orbits, but the potential applications are broader. Upgrading an ordinary non-reversible switching integrator to a reversible one is straightforward and introduces no appreciable computational burden in our test…
▽ More
We present a simple algorithm to switch between $N$-body time integrators in a reversible way. We apply it to planetary systems undergoing arbitrarily close encounters and highly eccentric orbits, but the potential applications are broader. Upgrading an ordinary non-reversible switching integrator to a reversible one is straightforward and introduces no appreciable computational burden in our tests. Our method checks if the integrator during the time step violates a time-symmetric selection condition and redoes the step if necessary. In our experiments a few percent of steps would have violated the condition without our corrections. By eliminating them the algorithm avoids long-term error accumulation, of several orders magnitude in some cases.
△ Less
Submitted 28 February, 2023; v1 submitted 15 January, 2023;
originally announced January 2023.
-
HAWC Detection of a TeV Halo Candidate Surrounding a Radio-quiet pulsar
Authors:
A. Albert,
R. Alfaro,
J. C. Arteaga-Velázquez,
E. Belmont-Moreno,
T. Capistrán,
A. Carramiñana,
S. Casanova,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
R. Diaz Hernandez,
M. A. DuVernois,
J. C. Díaz-Vélez,
C. Espinoza,
K. L. Fan,
N. Fraija,
K. Fang,
J. A. García-González,
F. Garfias,
Armelle Jardin-Blicq,
M. M. González,
J. A. Goodman,
J. P. Harding,
S. Hernandez,
D. Huang
, et al. (37 additional authors not shown)
Abstract:
Extended very-high-energy (VHE; 0.1-100 TeV) $γ$-ray emission has been observed around several middle-aged pulsars and referred to as ``TeV halos". Their formation mechanism remains under debate. It is also unknown whether they are ubiquitous or related to certain subgroup of pulsars. With 2321 days of observation, the High Altitude Water Cherenkov (HAWC) Gamma-Ray Observatory detected VHE $γ$-ray…
▽ More
Extended very-high-energy (VHE; 0.1-100 TeV) $γ$-ray emission has been observed around several middle-aged pulsars and referred to as ``TeV halos". Their formation mechanism remains under debate. It is also unknown whether they are ubiquitous or related to certain subgroup of pulsars. With 2321 days of observation, the High Altitude Water Cherenkov (HAWC) Gamma-Ray Observatory detected VHE $γ$-ray emission at the location of the radio-quiet pulsar PSR J0359+5414 with $>6σ$ significance. By performing likelihood tests with different spectral and spatial models and comparing the TeV spectrum with multi-wavelength observations of nearby sources, we show that this excess is consistent with a TeV halo associated with PSR J0359+5414, though future observation of HAWC and multi-wavelength follow-ups are needed to confirm this nature. This new halo candidate is located in a non-crowded region in the outer Galaxy. It shares similar properties to the other halos but its pulsar is younger and radio-quiet. Our observation implies that TeV halos could commonly exist around pulsars and their formation does not depend on the configuration of the pulsar magnetosphere.
△ Less
Submitted 11 January, 2023; v1 submitted 11 January, 2023;
originally announced January 2023.
-
Simple physics and integrators accurately reproduce Mercury instability statistics
Authors:
Dorian S. Abbot,
David M. Hernandez,
Sam Hadden,
Robert J. Webber,
Georgios P. Afentakis,
Jonathan Weare
Abstract:
The long-term stability of the Solar System is an issue of significant scientific and philosophical interest. The mechanism leading to instability is Mercury's eccentricity being pumped up so high that Mercury either collides with Venus or is scattered into the Sun. Previously, only three five-billion-year $N$-body ensembles of the Solar System with thousands of simulations have been run to assess…
▽ More
The long-term stability of the Solar System is an issue of significant scientific and philosophical interest. The mechanism leading to instability is Mercury's eccentricity being pumped up so high that Mercury either collides with Venus or is scattered into the Sun. Previously, only three five-billion-year $N$-body ensembles of the Solar System with thousands of simulations have been run to assess long-term stability. We generate two additional ensembles, each with 2750 members, and make them publicly available at \texttt{https://archive.org/details/@dorianabbot}. We find that accurate Mercury instability statistics can be obtained by (1) including only the Sun and the 8 planets, (2) using a simple Wisdom-Holman scheme without correctors, (3) using a basic representation of general relativity, and (4) using a time step of 3.16 days. By combining our Solar System ensembles with previous ensembles we form a 9,601-member ensemble of ensembles. In this ensemble of ensembles, the logarithm of the frequency of a Mercury instability event increases linearly with time between 1.3 and 5 Gyr, suggesting that a single mechanism is responsible for Mercury instabilities in this time range and that this mechanism becomes more active as time progresses. Our work provides a robust estimate of Mercury instability statistics over the next five billion years, outlines methodologies that may be useful for exoplanet system investigations, and provides two large ensembles of publicly available Solar System integrations that can serve as testbeds for theoretical ideas as well as training sets for artificial intelligence schemes.
△ Less
Submitted 21 February, 2023; v1 submitted 30 December, 2022;
originally announced December 2022.
-
Discovering Language Model Behaviors with Model-Written Evaluations
Authors:
Ethan Perez,
Sam Ringer,
Kamilė Lukošiūtė,
Karina Nguyen,
Edwin Chen,
Scott Heiner,
Craig Pettit,
Catherine Olsson,
Sandipan Kundu,
Saurav Kadavath,
Andy Jones,
Anna Chen,
Ben Mann,
Brian Israel,
Bryan Seethor,
Cameron McKinnon,
Christopher Olah,
Da Yan,
Daniela Amodei,
Dario Amodei,
Dawn Drain,
Dustin Li,
Eli Tran-Johnson,
Guro Khundadze,
Jackson Kernion
, et al. (38 additional authors not shown)
Abstract:
As language models (LMs) scale, they develop many novel behaviors, good and bad, exacerbating the need to evaluate how they behave. Prior work creates evaluations with crowdwork (which is time-consuming and expensive) or existing data sources (which are not always available). Here, we automatically generate evaluations with LMs. We explore approaches with varying amounts of human effort, from inst…
▽ More
As language models (LMs) scale, they develop many novel behaviors, good and bad, exacerbating the need to evaluate how they behave. Prior work creates evaluations with crowdwork (which is time-consuming and expensive) or existing data sources (which are not always available). Here, we automatically generate evaluations with LMs. We explore approaches with varying amounts of human effort, from instructing LMs to write yes/no questions to making complex Winogender schemas with multiple stages of LM-based generation and filtering. Crowdworkers rate the examples as highly relevant and agree with 90-100% of labels, sometimes more so than corresponding human-written datasets. We generate 154 datasets and discover new cases of inverse scaling where LMs get worse with size. Larger LMs repeat back a dialog user's preferred answer ("sycophancy") and express greater desire to pursue concerning goals like resource acquisition and goal preservation. We also find some of the first examples of inverse scaling in RL from Human Feedback (RLHF), where more RLHF makes LMs worse. For example, RLHF makes LMs express stronger political views (on gun rights and immigration) and a greater desire to avoid shut down. Overall, LM-written evaluations are high-quality and let us quickly discover many novel LM behaviors.
△ Less
Submitted 19 December, 2022;
originally announced December 2022.
-
Constitutional AI: Harmlessness from AI Feedback
Authors:
Yuntao Bai,
Saurav Kadavath,
Sandipan Kundu,
Amanda Askell,
Jackson Kernion,
Andy Jones,
Anna Chen,
Anna Goldie,
Azalia Mirhoseini,
Cameron McKinnon,
Carol Chen,
Catherine Olsson,
Christopher Olah,
Danny Hernandez,
Dawn Drain,
Deep Ganguli,
Dustin Li,
Eli Tran-Johnson,
Ethan Perez,
Jamie Kerr,
Jared Mueller,
Jeffrey Ladish,
Joshua Landau,
Kamal Ndousse,
Kamile Lukosuite
, et al. (26 additional authors not shown)
Abstract:
As AI systems become more capable, we would like to enlist their help to supervise other AIs. We experiment with methods for training a harmless AI assistant through self-improvement, without any human labels identifying harmful outputs. The only human oversight is provided through a list of rules or principles, and so we refer to the method as 'Constitutional AI'. The process involves both a supe…
▽ More
As AI systems become more capable, we would like to enlist their help to supervise other AIs. We experiment with methods for training a harmless AI assistant through self-improvement, without any human labels identifying harmful outputs. The only human oversight is provided through a list of rules or principles, and so we refer to the method as 'Constitutional AI'. The process involves both a supervised learning and a reinforcement learning phase. In the supervised phase we sample from an initial model, then generate self-critiques and revisions, and then finetune the original model on revised responses. In the RL phase, we sample from the finetuned model, use a model to evaluate which of the two samples is better, and then train a preference model from this dataset of AI preferences. We then train with RL using the preference model as the reward signal, i.e. we use 'RL from AI Feedback' (RLAIF). As a result we are able to train a harmless but non-evasive AI assistant that engages with harmful queries by explaining its objections to them. Both the SL and RL methods can leverage chain-of-thought style reasoning to improve the human-judged performance and transparency of AI decision making. These methods make it possible to control AI behavior more precisely and with far fewer human labels.
△ Less
Submitted 15 December, 2022;
originally announced December 2022.
-
The TeV Sun Rises: Discovery of Gamma rays from the Quiescent Sun with HAWC
Authors:
A. Albert,
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velazquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
C. Brisbois,
K. S. Caballero-Mora,
T. Capistran,
A. Carraminana,
S. Casanova,
O. Chaparro-Amaro,
U. Cotti,
J. Cotzomi,
S. Coutino de Leon,
E. De la Fuente,
R. Diaz Hernandez,
B. L. Dingus,
M. A. DuVernois,
M. Durocher,
J. C. Diaz-Velez,
R. W. Ellsworth,
K. Engel
, et al. (67 additional authors not shown)
Abstract:
We report the first detection of a TeV gamma-ray flux from the solar disk (6.3$σ$), based on 6.1 years of data from the High Altitude Water Cherenkov (HAWC) observatory. The 0.5--2.6 TeV spectrum is well fit by a power law, dN/dE = $A (E/1 \text{ TeV})^{-γ}$, with $A = (1.6 \pm 0.3) \times 10^{-12}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ and $γ= -3.62 \pm 0.14$. The flux shows a strong indication of antico…
▽ More
We report the first detection of a TeV gamma-ray flux from the solar disk (6.3$σ$), based on 6.1 years of data from the High Altitude Water Cherenkov (HAWC) observatory. The 0.5--2.6 TeV spectrum is well fit by a power law, dN/dE = $A (E/1 \text{ TeV})^{-γ}$, with $A = (1.6 \pm 0.3) \times 10^{-12}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ and $γ= -3.62 \pm 0.14$. The flux shows a strong indication of anticorrelation with solar activity. These results extend the bright, hard GeV emission from the disk observed with Fermi-LAT, seemingly due to hadronic Galactic cosmic rays showering on nuclei in the solar atmosphere. However, current theoretical models are unable to explain the details of how solar magnetic fields shape these interactions. HAWC's TeV detection thus deepens the mysteries of the solar-disk emission.
△ Less
Submitted 10 July, 2023; v1 submitted 1 December, 2022;
originally announced December 2022.