Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning

Authors: Eric Pasewark, Kyle Montgomery, Kefei Duan, Dawn Song, Chenguang Wang

Abstract: We present a new method for large language models to solve compositional tasks. Although they have shown strong performance on traditional language understanding tasks, large language models struggle to solve compositional tasks, where the solution depends on solving smaller instances of the same problem. We propose a natural approach to solve compositional tasks recursively. Our method, Re-Tuning, tunes models to break down a problem into subproblems, solve those subproblems, and combine the results. We show that our method significantly improves model performance on three representative compositional tasks: integer addition, dynamic programming, and parity. Compared to state-of-the-art methods that keep intermediate steps towards solving the problems, Re-Tuning achieves significantly higher accuracy and is more GPU memory efficient.

Submitted 5 July, 2024; originally announced July 2024.

Comments: Accepted to ACL 2024
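
Note: The recursive pattern named in the abstract above (break a problem into a smaller instance, solve it, combine the results) can be illustrated on the parity task with ordinary code. This is only a sketch of the decomposition idea, not the paper's tuning procedure or prompt format; the function name `parity` and the choice to peel off the last bit are assumptions made for illustration.

```python
def parity(bits: str) -> int:
    """Return 1 if `bits` holds an odd number of '1' characters, else 0."""
    # Base case: an empty or single-character string is answered directly.
    if len(bits) <= 1:
        return int(bits == "1")
    # Subproblem: the parity of everything except the last bit.
    sub_result = parity(bits[:-1])
    # Combine: XOR the subproblem's answer with the last bit.
    return sub_result ^ int(bits[-1] == "1")


if __name__ == "__main__":
    print(parity("1011"))
```

Each call only needs the answer of the smaller instance, not its intermediate steps, which is the property the abstract contrasts with methods that keep all intermediate steps.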

arXiv:2407.00262 [pdf, other]

Prospects for the detection of very-high-energy pulsars with LHAASO and SWGO

Authors: Quan Hu, Yi Zhang, Kaikai Duan, Houdun Zeng

Abstract: Pulsations from the Crab pulsar have been detected by the MAGIC telescopes at energies up to 1.5 TeV, and the pulsed emission from the Vela pulsar was detected by H.E.S.S., reaching tens of TeV. These discoveries, along with the proposed additional emission due to inverse Compton scattering at TeV energies, lead us to consider suitable candidates for detection with current and future extensive air shower (EAS) experiments at very-high-energy (VHE; 0.1$-$100 TeV) ranges. Leveraging energy spectrum data from pulsars as observed by Fermi and Imaging Atmospheric Cherenkov Telescopes (IACTs) and considering the sensitivities of both LHAASO and SWGO, this study evaluates their detectability and estimates the time required for their significant detection. Our results indicate that LHAASO could detect the Crab's pulsed signal within six years, while SWGO might detect Vela's signal within one year. Observations of the most energetic Fermi pulsars with EAS experiments will provide insight into the nature of VHE pulsar emissions, helping to clarify the primary characteristics of VHE pulsars.

Submitted 28 June, 2024; originally announced July 2024.

Comments: 6 pages, 3 figures, 2 Tables and accepted for publication in MNRAS

arXiv:2406.08698 [pdf, other]

Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: In this work we search for signals generated by ultra-heavy dark matter in data from the Large High Altitude Air Shower Observatory (LHAASO). We look for possible gamma rays from dark matter annihilation or decay in 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter, since they have low fluxes of astrophysical $γ$-ray background and large amounts of dark matter. By analyzing more than 700 days of observational data at LHAASO, no significant dark matter signal from 1 TeV to 1 EeV is detected. Accordingly we derive the most stringent constraints on the ultra-heavy dark matter annihilation cross-section up to EeV. The constraints on the lifetime of dark matter in decay mode are also derived.

Submitted 12 June, 2024; originally announced June 2024.

Comments: 17 pages, 12 figures, accepted by PRL

arXiv:2406.08310 [pdf, other]

GraphFM: A Comprehensive Benchmark for Graph Foundation Model

Authors: Yuhao Xu, Xinqi Liu, Keyu Duan, Yi Fang, Yu-Neng Chuang, Daochen Zha, Qiaoyu Tan

Abstract: Foundation Models (FMs) serve as a general class for the development of artificial intelligence systems, offering broad potential for generalization across a spectrum of downstream tasks. Despite extensive research into self-supervised learning as the cornerstone of FMs, several outstanding issues persist in Graph Foundation Models that rely on graph self-supervised learning, namely: 1) Homogenization. The extent of generalization capability on downstream tasks remains unclear. 2) Scalability. It is unknown how effectively these models can scale to large datasets. 3) Efficiency. The training time and memory usage of these models require evaluation. 4) Training Stop Criteria. Determining the optimal stopping strategy for pre-training across multiple tasks to maximize performance on downstream tasks. To address these questions, we have constructed a rigorous benchmark that thoroughly analyzes and studies the generalization and scalability of self-supervised Graph Neural Network (GNN) models. Regarding generalization, we have implemented and compared the performance of various self-supervised GNN models, trained to generate node representations, across tasks such as node classification, link prediction, and node clustering. For scalability, we have compared the performance of various models after training using full-batch and mini-batch strategies. Additionally, we have assessed the training efficiency of these models by conducting experiments to test their GPU memory usage and throughput. Through these experiments, we aim to provide insights to motivate future research. The code for this benchmark is publicly available at https://github.com/NYUSHCS/GraphFM.

Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.01140 [pdf, other]

Logical Reasoning with Relation Network for Inductive Knowledge Graph Completion

Authors: Qinggang Zhang, Keyu Duan, Junnan Dong, Pai Zheng, Xiao Huang

Abstract: Inductive knowledge graph completion (KGC) aims to infer the missing relation for a set of newly-coming entities that never appeared in the training set. Such a setting is more in line with reality, as real-world KGs are constantly evolving and introducing new knowledge. Recent studies have shown promising results using message passing over subgraphs to embed newly-coming entities for inductive KGC. However, the inductive capability of these methods is usually limited by two key issues. (i) KGC always suffers from data sparsity, and the situation is exacerbated in inductive KGC, where new entities often have few or no connections to the original KG. (ii) Cold-start problem. Generating representations for new entities by gathering local information from a few neighbors is too coarse-grained for accurate KG reasoning. To this end, we propose a novel iNfOmax RelAtion Network, namely NORAN, for inductive KG completion. It aims to mine latent relation patterns for inductive KG completion. Specifically, by centering on relations, NORAN provides a hyper view towards KG modeling, where the correlations between relations can be naturally captured as entity-independent logical evidence to conduct inductive KGC. Extensive experimental results on five benchmarks show that our framework substantially outperforms the state-of-the-art KGC methods.

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.11826 [pdf, other]

Data quality control system and long-term performance monitor of the LHAASO-KM2A

Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To ensure the reliability of the LHAASO-KM2A data, a three-level quality control system has been established. It is used to monitor the status of detector units, stability of reconstructed parameters and the performance of the array based on observations of the Crab Nebula and Moon shadow. This paper will introduce the control system and its application on the LHAASO-KM2A data collected from August 2021 to July 2023. During this period, the pointing and angular resolution of the array were stable. From the observations of the Moon shadow and Crab Nebula, the results achieved using the two methods are consistent with each other. According to the observation of the Crab Nebula at energies from 25 TeV to 100 TeV, the time averaged pointing errors are estimated to be $-0.003^{\circ} \pm 0.005^{\circ}$ and $0.001^{\circ} \pm 0.006^{\circ}$ in the R.A. and Dec directions, respectively.

Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: 15 pages, 9 figures

arXiv:2405.07691 [pdf, other]

Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: The first source catalog of the Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma-ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source has been carried out. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) is compatible with NGC 4278 within $\sim0.03$ degree. Variation analysis shows an indication of variability at the level of a few months in the TeV band, which is consistent with low frequency observations. Based on these observations, we report the detection of TeV $γ$-ray emissions from this low-luminosity AGN NGC 4278. The observation by LHAASO-WCDA during the active period has a significance level of 8.8\,$σ$ with best-fit photon spectral index $\varGamma=2.56\pm0.14$ and a flux $f_{1-10\,\rm{TeV}}=(7.0\pm1.1_{\rm{sta}}\pm0.35_{\rm{syst}})\times10^{-13}\,\rm{photons\,cm^{-2}\,s^{-1}}$, or approximately $5\%$ of the Crab Nebula. The discovery of VHE emission from NGC 4278 indicates that the compact, weak radio jet can efficiently accelerate particles and emit TeV photons.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 11 pages, 5 figures

arXiv:2404.04801 [pdf, ps, other]

doi 10.1007/s41605-024-00467-8

LHAASO-KM2A detector simulation using Geant4

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is an important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with large altitude difference (30 m) and huge coverage (1.3 km^2). In this paper, the design of the KM2A simulation code G4KM2A based on Geant4 is introduced. The process of G4KM2A is optimized mainly in memory consumption to avoid memory overflow. Some simplifications are used to significantly speed up the execution of G4KM2A. The running time is reduced by at least 30 times compared to full detector simulation. The particle distributions and the core/angle resolution comparison between simulation and experimental data of the full KM2A array are also presented, which show good agreement.

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2403.10010 [pdf, other]

doi 10.1103/PhysRevLett.132.131002

Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen , et al. (256 additional authors not shown)

Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at $3.67 \pm 0.05 \pm 0.15$ PeV. Below the knee, the spectral index is found to be -$2.7413 \pm 0.0004 \pm 0.0050$, while above the knee, it is -$3.128 \pm 0.005 \pm 0.027$, with the sharpness of the transition measured with a statistical error of 2%. The mean logarithmic mass of cosmic rays is almost heavier than helium in the whole measured energy range. It decreases from 1.7 at 0.3 PeV to 1.3 at 3 PeV, representing a 24% decline following a power law with an index of -$0.1200 \pm 0.0003 \pm 0.0341$. This is equivalent to an increase in abundance of light components. Above the knee, the mean logarithmic mass exhibits a power law trend towards heavier components, which is the reverse of the behavior observed in the all-particle energy spectrum. Additionally, the knee position and the change in power-law index are approximately the same. These findings suggest that the knee observed in the all-particle spectrum corresponds to the knee of the light component, rather than the medium-heavy components.

Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

Comments: 8 pages, 3 figures

Journal ref: Physical Review Letters 132, 131002 (2024)
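
Note: The 24% decline quoted in the abstract above can be checked against the stated power-law index with a short calculation. This is only a consistency sketch; it assumes the power law is in energy, with the mean logarithmic mass scaling as $E^{-0.12}$ between 0.3 PeV and 3 PeV, and it uses the central value of the index without its uncertainties.

$$
\frac{1.7 - 1.3}{1.7} \approx 0.235 \approx 24\%, \qquad 1.7 \times \left(\frac{3\ \mathrm{PeV}}{0.3\ \mathrm{PeV}}\right)^{-0.12} = 1.7 \times 10^{-0.12} \approx 1.7 \times 0.759 \approx 1.29
$$

Under that assumption, both the quoted endpoint (1.3 at 3 PeV) and the quoted decline agree with the index to within rounding.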

arXiv:2402.02935 [pdf, other]

doi 10.1016/j.adt.2024.101661

Nuclear mass table in deformed relativistic Hartree-Bogoliubov theory in continuum, II: Even-$Z$ nuclei

Authors: DRHBc Mass Table Collaboration, Peng Guo, Xiaojie Cao, Kangmin Chen, Zhihui Chen, Myung-Ki Cheoun, Yong-Beom Choi, Pak Chung Lam, Wenmin Deng, Jianmin Dong, Pengxiang Du, Xiaokai Du, Kangda Duan, Xiaohua Fan, Wei Gao, Lisheng Geng, Eunja Ha, Xiao-Tao He, Jinniu Hu, Jingke Huang, Kun Huang, Yanan Huang, Zidan Huang, Kim Da Hyung, Hoi Yat Chan , et al. (58 additional authors not shown)

Abstract: The mass table in the deformed relativistic Hartree-Bogoliubov theory in continuum (DRHBc) with the PC-PK1 density functional has been established for even-$Z$ nuclei with $8\le Z\le120$, extended from the previous work for even-even nuclei [Zhang $\it{et.~al.}$ (DRHBc Mass Table Collaboration), At. Data Nucl. Data Tables 144, 101488 (2022)]. The calculated binding energies, two-nucleon and one-neutron separation energies, root-mean-square (rms) radii of neutron, proton, matter, and charge distributions, quadrupole deformations, and neutron and proton Fermi surfaces are tabulated and compared with available experimental data. A total of 4829 even-$Z$ nuclei are predicted to be bound, with an rms deviation of 1.477 MeV from the 1244 mass data. Good agreement with the available experimental odd-even mass differences, $α$ decay energies, and charge radii is also achieved. The description accuracy for nuclear masses and nucleon separation energies as well as the prediction for drip lines is compared with the results obtained from other relativistic and nonrelativistic density functionals. The comparison shows that the DRHBc theory with PC-PK1 provides an excellent microscopic description for the masses of even-$Z$ nuclei. The systematics of the nucleon separation energies, odd-even mass differences, pairing energies, two-nucleon gaps, $α$ decay energies, rms radii, quadrupole deformations, potential energy curves, neutron density distributions, and neutron mean-field potentials are discussed.

Submitted 10 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

Comments: 394 pages, 17 figures, 2 tables, published in Atomic Data and Nuclear Data Tables, data file in the TXT form is available for download under "Ancillary files"

Journal ref: Peng Guo, et. al. (DRHBc Mass Table Collaboration), Atomic Data and Nuclear Data Tables 158 (2024) 101661

arXiv:2310.17082 [pdf, ps, other]

Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE; $E_γ\geq 100$~TeV) $γ$-rays. In this context, the historical SNR Cassiopeia A (Cas A) is considered one of the most promising targets for UHE observations. This paper presents the observation of Cas A and its vicinity by the LHAASO KM2A detector. The exceptional sensitivity of LHAASO KM2A in the UHE band, combined with the young age of Cas A, enabled us to derive stringent model-independent limits on the energy budget of UHE protons and nuclei accelerated by Cas A at any epoch after the explosion. The results challenge the prevailing paradigm that Cas A-type SNRs are major suppliers of PeV CRs in the Milky Way.

Submitted 25 October, 2023; originally announced October 2023.

Comments: 11 pages, 3 figures, Accepted by the APJL

arXiv:2310.08845 [pdf, other]

doi 10.1126/sciadv.adj2778

Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900 s after the trigger. The intrinsic energy spectrum of gamma-rays can be described by a power-law after correcting for extragalactic background light (EBL) absorption. Such a hard spectrum challenges the synchrotron self-Compton (SSC) scenario of relativistic electrons for the afterglow emission above several TeV. Observations of gamma-rays up to 13 TeV from a source with a measured redshift of z=0.151 hint at more transparency in intergalactic space than previously expected. Alternatively, one may invoke new physics such as Lorentz Invariance Violation (LIV) or an axion origin of very high energy (VHE) signals.

Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

Comments: 49 pages, 11 figures

Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

arXiv:2308.02565 [pdf, other]

SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning

Authors: Keyu Duan, Qian Liu, Tat-Seng Chua, Shuicheng Yan, Wei Tsang Ooi, Qizhe Xie, Junxian He

Abstract: Textual graphs (TGs) are graphs whose nodes correspond to text (sentences or documents), which are widely prevalent. The representation learning of TGs involves two stages: (i) unsupervised feature extraction and (ii) supervised graph representation learning. In recent years, extensive efforts have been devoted to the latter stage, where Graph Neural Networks (GNNs) have dominated. However, the former stage for most existing graph benchmarks still relies on traditional feature engineering techniques. More recently, with the rapid development of language models (LMs), researchers have focused on leveraging LMs to facilitate the learning of TGs, either by jointly training them in a computationally intensive framework (merging the two stages), or designing complex self-supervised training tasks for feature extraction (enhancing the first stage). In this work, we present SimTeG, a frustratingly Simple approach for Textual Graph learning that does not innovate in frameworks, models, and tasks. Instead, we first perform supervised parameter-efficient fine-tuning (PEFT) on a pre-trained LM on the downstream task, such as node classification. We then generate node embeddings using the last hidden states of the fine-tuned LM. These derived features can be further utilized by any GNN for training on the same task. We evaluate our approach on two fundamental graph representation learning tasks: node classification and link prediction. Through extensive experiments, we show that our approach significantly improves the performance of various GNNs on multiple graph benchmarks.

Submitted 3 August, 2023; originally announced August 2023.

Comments: 9 pages, 3 figures

arXiv:2308.00120

The Giant Radio Array for Neutrino Detection (GRAND) Collaboration -- Contributions to the 38th International Cosmic Ray Conference (ICRC 2023)

Authors: GRAND Collaboration, Rafael Alves Batista, Aurélien Benoit-Lévy, Teresa Bister, Mauricio Bustamante, Yiren Chen, LingMei Cheng, Simon Chiche, Jean-Marc Colley, Pablo Correa, Nicoleta Cucu Laurenciu, Zigao Dai, Beatriz de Errico, Sijbrand de Jong, João R. T. de Mello Neto, Krijn D. de Vries, Peter B. Denton, Valentin Deocoene, Kaikai Duan, Bohao Duan, Ralph Engel, Yizhong Fan, Arsène Ferrière, QuanBu Gou, Junhua Gu , et al. (74 additional authors not shown)

Abstract: The Giant Radio Array for Neutrino Detection (GRAND) is an envisioned observatory of ultra-high-energy particles of cosmic origin, with energies in excess of 100 PeV. GRAND uses large surface arrays of autonomous radio-detection units to look for the radio emission from extensive air showers that are triggered by the interaction of ultra-high-energy cosmic rays, gamma rays, and neutrinos in the atmosphere or underground. In particular, for ultra-high-energy neutrinos, the future final phase of GRAND aims to be sensitive enough to discover them in spite of their plausibly tiny flux. Presently, three prototype GRAND radio arrays are in operation: GRANDProto300, in China, GRAND@Auger, in Argentina, and GRAND@Nancay, in France. Their goals are to field-test the design of the radio-detection units, understand the radio background to which they are exposed, and develop tools for diagnostic, data gathering, and data analysis. This list of contributions to the 38th International Cosmic Ray Conference (ICRC 2023) presents an overview of GRAND, in its present and future incarnations, and a look at the first data collected by GRANDProto13, the first phase of GRANDProto300.

Submitted 27 July, 2023; originally announced August 2023.

Comments: Note: To access the list of contributions, please follow the "HTML" link that can be found on the arXiv page

arXiv:2307.13234 [pdf, other]

The simulated performance of GRANDProto300

Authors: Kai-Kai Duan, Peng-Xiong Ma, Ke-Wen Zhang, Xiao-Yuan Huang, Yi Zhang

Abstract: GRANDProto300 is a 300-antenna prototype array of the envisioned GRAND (Giant Radio Array for Neutrino Detection) project. The goal of GRANDProto300 is to detect radio signals emitted by cosmic ray-induced air showers, with energies ranging from $10^{16.5}$~eV to $10^{18.5}$~eV, which covers the transition region between Galactic and extragalactic sources. We use simulations to optimize the layout of GRANDProto300 and develop a shower reconstruction method. Based on them, we present the performance of GRANDProto300 for cosmic-ray detection, by means of its effective area, angular resolution, and energy resolution.

Submitted 24 July, 2023; originally announced July 2023.

Comments: Presented at the 38th International Cosmic Ray Conference (ICRC 2023), 8 pages, 9 figures

Report number: PoS(ICRC2023)298

arXiv:2307.12769 [pdf, other]

First look at data from the 13-antenna setup of GRANDProto300 in northwest China

Authors: Peng-Xiong Ma, Bo-Hao Duan, Xin Xu, Ke-Wen Zhang, Kai-Kai Duan, Shen Wang, Yi Zhang, Peng-Fei Zhang

Abstract: The Giant Radio Array for Neutrino Detection (GRAND) is an envisioned observatory of ultra-high-energy neutrinos, cosmic rays, and gamma rays, with energies above 100 PeV. GRAND targets the radio signals emitted by extensive air showers induced by the interaction of ultra-high-energy particles in the atmosphere, using an array of 200,000 radio antennas split into sub-arrays deployed worldwide. GRANDProto13 (GP13) is a 13-antenna demonstrator array deployed in February 2023 in the Gansu province of China, as a precursor for GRANDProto300, which will validate the detection principle of the GRAND experiment. Its goal is to measure the radio background present at the site, validate the design of the detection units and develop an autonomous radio trigger for air showers. We will describe GP13 and its operation, and show preliminary results on noise monitoring.

Submitted 24 July, 2023; originally announced July 2023.

Comments: Proceedings of the 38th International Cosmic Ray Conference (ICRC2023)

arXiv:2306.07952 [pdf, other]

MOFI: Learning Image Representations from Noisy Entity Annotated Images

Authors: Wentao Wu, Aleksei Timofeev, Chen Chen, Bowen Zhang, Kun Duan, Shuangning Liu, Yantao Zheng, Jonathon Shlens, Xianzhi Du, Zhe Gan, Yinfei Yang

Abstract: We present MOFI, Manifold OF Images, a new vision foundation model designed to learn image representations from noisy entity annotated images. MOFI differs from previous work in two key aspects: (i) pre-training data, and (ii) training recipe. Regarding data, we introduce a new approach to automatically assign entity labels to images from noisy image-text pairs. Our approach involves employing a named entity recognition model to extract entities from the alt-text, and then using a CLIP model to select the correct entities as labels of the paired image. It's a simple, cost-effective method that can scale to handle billions of web-mined image-text pairs. Through this method, we have created Image-to-Entities (I2E), a new dataset with 1 billion images and 2 million distinct entities, covering rich visual concepts in the wild. Building upon the I2E dataset, we study different training recipes like supervised pre-training, contrastive pre-training, and multi-task learning. For contrastive pre-training, we treat entity names as free-form text, and further enrich them with entity descriptions. Experiments show that supervised pre-training with large-scale fine-grained entity labels is highly effective for image retrieval tasks, and multi-task training further improves the performance. The final MOFI model achieves 86.66% mAP on the challenging GPR1200 dataset, surpassing the previous state-of-the-art performance of 72.19% from OpenAI's CLIP model. Further experiments on zero-shot and linear probe image classification also show that MOFI outperforms a CLIP model trained on the original image-text data, demonstrating the effectiveness of the I2E dataset in learning strong image representations. We release our code and model weights at https://github.com/apple/ml-mofi.

Submitted 17 March, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

Comments: Accepted to ICLR 2024

arXiv:2305.17030 [pdf, other]

doi 10.3847/1538-4365/acfd29

The First LHAASO Catalog of Gamma-Ray Sources

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

Abstract: We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022. This catalog represents the main result from the most sensitive large coverage gamma-ray survey of the sky above 1 TeV, covering declination from $-$20$^{\circ}$ to 80$^{\circ}$. In total, the catalog contains 90 sources with an extended size smaller than $2^\circ$ and a significance of detection at $> 5σ$. Based on our source association criteria, 32 new TeV sources are proposed in this study. Among the 90 sources, 43 sources are detected with ultra-high energy ($E > 100$ TeV) emission at $> 4σ$ significance level. We provide the position, extension, and spectral characteristics of all the sources in this catalog.

Submitted 27 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

Comments: 40 pages, 13 figures, 4 tables

Journal ref: The Astrophysical Journal Supplement Series, 271 (2024) 25

arXiv:2305.14586 [pdf]

MARC: A multi-agent robots control framework for enhancing reinforcement learning in construction tasks

Authors: Kangkang Duan, Christine Wun Ki Suen, Zhengbo Zou

Abstract: Letting robots emulate human behavior has always posed a challenge, particularly in scenarios involving multiple robots. In this paper, we present a framework aimed at achieving multi-agent reinforcement learning for robot control in construction tasks. The construction industry often necessitates complex interactions and coordination among multiple robots, demanding a solution that enables effective collaboration and efficient task execution. Our proposed framework leverages the principles of proximal policy optimization and develops a multi-agent version to enable the robots to acquire sophisticated control policies. We evaluated the effectiveness of our framework by learning four different collaborative tasks in the construction environments. The results demonstrated the capability of our approach in enabling multiple robots to learn and adapt their behaviors in complex construction tasks while effectively preventing collisions. Results also revealed the potential of combining and exploring the advantages of reinforcement learning algorithms and inverse kinematics. The findings from this research contribute to the advancement of multi-agent reinforcement learning in the domain of construction robotics. By enabling robots to behave like human counterparts and collaborate effectively, we pave the way for more efficient, flexible, and intelligent construction processes.

Submitted 23 May, 2023; originally announced May 2023.

Comments: 19 pages

arXiv:2305.14584 [pdf]

Learning from demonstrations: An intuitive VR environment for imitation learning of construction robots

Authors: Kangkang Duan, Zhengbo Zou

Abstract: Construction robots are challenging the traditional paradigm of labor intensive and repetitive construction tasks. Present concerns regarding construction robots are focused on their abilities in performing complex tasks consisting of several subtasks and their adaptability to work in unstructured and dynamic construction environments. Imitation learning (IL) has shown advantages in training a robot to imitate expert actions in complex tasks and the policy thereafter generated by reinforcement learning (RL) is more adaptive in comparison with pre-programmed robots. In this paper, we proposed a framework composed of two modules for imitation learning of construction robots. The first module provides an intuitive expert demonstration collection Virtual Reality (VR) platform where a robot will automatically follow the position, rotation, and actions of the expert's hand in real-time, instead of requiring an expert to control the robot via controllers. The second module provides a template for imitation learning using observations and actions recorded in the first module. In the second module, Behavior Cloning (BC) is utilized for pre-training, Generative Adversarial Imitation Learning (GAIL) and Proximal Policy Optimization (PPO) are combined to achieve a trade-off between the strength of imitation vs. exploration. Results show that imitation learning, especially when combined with PPO, could significantly accelerate training in limited training steps and improve policy performance.

Submitted 23 May, 2023; originally announced May 2023.

Comments: 22 pages, 8 figures

arXiv:2305.05372 [pdf, other]

doi 10.1103/PhysRevLett.131.151001

Measurement of ultra-high-energy diffuse gamma-ray emission of the Galactic plane from 10 TeV to 1 PeV with LHAASO-KM2A

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

Abstract: The diffuse Galactic $γ$-ray emission, mainly produced via interactions between cosmic rays and the interstellar medium and/or radiation field, is a very important probe of the distribution, propagation, and interaction of cosmic rays in the Milky Way. In this work we report the measurements of diffuse $γ$-rays from the Galactic plane between 10 TeV and 1 PeV energies, with the square kilometer array of the Large High Altitude Air Shower Observatory (LHAASO). Diffuse emissions from the inner ($15^{\circ}<l<125^{\circ}$, $|b|<5^{\circ}$) and outer ($125^{\circ}<l<235^{\circ}$, $|b|<5^{\circ}$) Galactic plane are detected with $29.1σ$ and $12.7σ$ significance, respectively. The outer Galactic plane diffuse emission is detected for the first time in the very- to ultra-high-energy domain ($E>10$~TeV). The energy spectrum in the inner Galaxy regions can be described by a power-law function with an index of $-2.99\pm0.04$, which is different from the curved spectrum as expected from hadronic interactions between locally measured cosmic rays and the line-of-sight integrated gas content. Furthermore, the measured flux is higher by a factor of $\sim3$ than the prediction. A similar spectrum with an index of $-2.99\pm0.07$ is found in the outer Galaxy region, and the absolute flux for $10\lesssim E\lesssim60$ TeV is again higher than the prediction for hadronic cosmic ray interactions. The latitude distributions of the diffuse emission are consistent with the gas distribution, while the longitude distributions show clear deviation from the gas distribution. The LHAASO measurements imply that either additional emission sources exist or cosmic ray intensities have spatial variations.

Submitted 19 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

Comments: 12 pages, 8 figures, 5 tables; accepted for publication in Physical Review Letters; source mask file provided as ancillary file

Journal ref: Phys. Rev. Lett. 131, 151001 (2023)

arXiv:2211.10030 [pdf, other]

doi 10.1145/3511808.3557264

Contrastive Knowledge Graph Error Detection

Authors: Qinggang Zhang, Junnan Dong, Keyu Duan, Xiao Huang, Yezi Liu, Linchuan Xu

Abstract: Knowledge Graph (KG) errors introduce non-negligible noise, severely affecting KG-related downstream tasks. Detecting errors in KGs is challenging since the patterns of errors are unknown and diverse, while ground-truth labels are rare or even unavailable. A traditional solution is to construct logical rules to verify triples, but it is not generalizable since different KGs have distinct rules with domain knowledge involved. Recent studies focus on designing tailored detectors or ranking triples based on KG embedding loss. However, they all rely on negative samples for training, which are generated by randomly replacing the head or tail entity of existing triples. Such a negative sampling strategy is not enough for prototyping practical KG errors, e.g., (Bruce_Lee, place_of_birth, China), in which the three elements are often relevant, although mismatched. We desire a more effective unsupervised learning mechanism tailored for KG error detection. To this end, we propose a novel framework - ContrAstive knowledge Graph Error Detection (CAGED). It introduces contrastive learning into KG learning and provides a novel way of modeling KG. Instead of following the traditional setting, i.e., considering entities as nodes and relations as semantic edges, CAGED augments a KG into different hyper-views, by regarding each relational triple as a node. After joint training with KG embedding and contrastive learning loss, CAGED assesses the trustworthiness of each triple based on two learning signals, i.e., the consistency of triple representations across multi-views and the self-consistency within the triple. Extensive experiments on three real-world KGs show that CAGED outperforms state-of-the-art methods in KG error detection. Our codes and datasets are available at https://github.com/Qing145/CAGED.git.

Submitted 18 November, 2022; originally announced November 2022.

Journal ref: CIKM 2022: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

arXiv:2210.07494 [pdf, other]

A Comprehensive Study on Large-Scale Graph Training: Benchmarking and Rethinking

Authors: Keyu Duan, Zirui Liu, Peihao Wang, Wenqing Zheng, Kaixiong Zhou, Tianlong Chen, Xia Hu, Zhangyang Wang

Abstract: Large-scale graph training is a notoriously challenging problem for graph neural networks (GNNs). Due to the nature of evolving graph structures into the training process, vanilla GNNs usually fail to scale up, limited by the GPU memory space. Up to now, though numerous scalable GNN architectures have been proposed, we still lack a comprehensive survey and fair benchmark of this reservoir to find the rationale for designing scalable GNNs. To this end, we first systematically formulate the representative methods of large-scale graph training into several branches and further establish a fair and consistent benchmark for them by a greedy hyperparameter searching. In addition, regarding efficiency, we theoretically evaluate the time and space complexity of various branches and empirically compare them w.r.t. GPU memory usage, throughput, and convergence. Furthermore, we analyze the pros and cons for various branches of scalable GNNs and then present a new ensembling training manner, named EnGCN, to address the existing issues. Our code is available at https://github.com/VITA-Group/Large_Scale_GCN_Benchmarking.

Submitted 1 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

Comments: Accepted by NeurIPS 2022 Dataset and Benchmark Track

arXiv:2210.07488 [pdf, other]

MetaFill: Text Infilling for Meta-Path Generation on Heterogeneous Information Networks

Authors: Zequn Liu, Kefei Duan, Junwei Yang, Hanwen Xu, Ming Zhang, Sheng Wang

Abstract: Heterogeneous Information Network (HIN) is essential to study complicated networks containing multiple edge types and node types. Meta-path, a sequence of node types and edge types, is the core technique to embed HINs. Since manually curating meta-paths is time-consuming, there is a pressing need to develop automated meta-path generation approaches. Existing meta-path generation approaches cannot fully exploit the rich textual information in HINs, such as node names and edge type names. To address this problem, we propose MetaFill, a text-infilling-based approach for meta-path generation. The key idea of MetaFill is to formulate meta-path identification problem as a word sequence infilling problem, which can be advanced by Pretrained Language Models (PLMs). We observed the superior performance of MetaFill against existing meta-path generation methods and graph embedding methods that do not leverage meta-paths in both link prediction and node classification on two real-world HIN datasets. We further demonstrated how MetaFill can accurately classify edges in the zero-shot setting, where existing approaches cannot generate any meta-paths. MetaFill exploits PLMs to generate meta-paths for graph embedding, opening up new avenues for language model applications in graph analysis.

Submitted 13 October, 2022; originally announced October 2022.

Comments: Accepted by the main conference of EMNLP 2022

arXiv:2209.04260 [pdf, other]

doi 10.1103/PhysRevD.106.063026

Search for relativistic fractionally charged particles in space

Authors: DAMPE Collaboration, F. Alemanno, C. Altomare, Q. An, P. Azzarello, F. C. T. Barbato, P. Bernardini, X. J. Bi, M. S. Cai, E. Casilli, E. Catanzani, J. Chang, D. Y. Chen, J. L. Chen, Z. F. Chen, M. Y. Cui, T. S. Cui, Y. X. Cui, H. T. Dai, A. De-Benedittis, I. De Mitri, F. de Palma, M. Deliyergiyev, A. Di Giovanni, M. Di Santo, et al. (126 additional authors not shown)

Abstract: More than a century after the performance of the oil drop experiment, the possible existence of fractionally charged particles (FCPs) still remains unsettled. The search for FCPs is crucial for some extensions of the Standard Model in particle physics. Most of the previously conducted searches for FCPs in cosmic rays were based on experiments underground or at high altitudes. However, there have been few searches for FCPs in cosmic rays carried out in orbit other than AMS-01 flown by a space shuttle and BESS by a balloon at the top of the atmosphere. In this study, we conduct an FCP search in space based on on-orbit data obtained using the DArk Matter Particle Explorer (DAMPE) satellite over a period of five years. Unlike underground experiments, which require an FCP energy of the order of hundreds of GeV, our FCP search starts at only a few GeV. An upper limit of $6.2\times 10^{-10}~~\mathrm{cm^{-2}sr^{-1} s^{-1}}$ is obtained for the flux. Our results demonstrate that DAMPE exhibits higher sensitivity than experiments of similar types by three orders of magnitude, which more stringently restricts the conditions for the existence of FCPs in primary cosmic rays.

Submitted 9 September, 2022; originally announced September 2022.

Comments: 19 pages, 6 figures, accepted by PRD

Report number: 106, 063026

Journal ref: Physical Review D 106.6 (2022): 063026

arXiv:2207.12601 [pdf]

doi 10.1088/1674-1137/ac9371

Flux Variations of Cosmic Ray Air Showers Detected by LHAASO-KM2A During a Thunderstorm on 10 June 2021

Authors: LHAASO Collaboration, F. Aharonian, Q. An, Axikegu, L. X. Bai, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Zhe Cao, Zhen Cao, J. Chang, J. F. Chang, E. S. Chen, Liang Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, S. H. Chen, S. Z. Chen, T. L. Chen, X. J. Chen, et al. (248 additional authors not shown)

Abstract: The Large High Altitude Air Shower Observatory (LHAASO) has three sub-arrays, KM2A, WCDA and WFCTA. The flux variations of cosmic ray air showers were studied by analyzing the KM2A data during the thunderstorm on 10 June 2021. The number of shower events that meet the trigger conditions increases significantly in atmospheric electric fields, with a maximum fractional increase of 20%. The variations of trigger rates (increases or decreases) are found to be strongly dependent on the primary zenith angle. The flux of secondary particles increases significantly, following a similar trend to that of the shower events. To better understand the observed behavior, Monte Carlo simulations are performed with CORSIKA and G4KM2A (a code based on GEANT4). We find that the experimental data (in saturated negative fields) are in good agreement with simulations, assuming the presence of a uniform upward electric field of 700 V/cm with a thickness of 1500 m in the atmosphere above the observation level. Due to the acceleration/deceleration and deflection by the atmospheric electric field, the number of secondary particles with energy above the detector threshold is modified, resulting in the changes in shower detection rate.

Submitted 6 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

Comments: 18 pages, 11 figures

Journal ref: Chinese Phys. C 47 015001 (2023)

arXiv:2204.08394 [pdf, other]

CenterNet++ for Object Detection

Authors: Kaiwen Duan, Song Bai, Lingxi Xie, Honggang Qi, Qingming Huang, Qi Tian

Abstract: There are two mainstreams for object detection: top-down and bottom-up. The state-of-the-art approaches mostly belong to the first category. In this paper, we demonstrate that the bottom-up approaches are as competitive as the top-down and enjoy higher recall. Our approach, named CenterNet, detects each object as a triplet of keypoints (top-left and bottom-right corners and the center keypoint). We first group the corners by some designed cues and further confirm the objects by the center keypoints. The corner keypoints equip the approach with the ability to detect objects of various scales and shapes and the center keypoint avoids the confusion brought by a large number of false-positive proposals. Our approach is a kind of anchor-free detector because it does not need to define explicit anchor boxes. We adapt our approach to the backbones with different structures, i.e., the 'hourglass' like networks and the 'pyramid' like networks, which detect objects on a single-resolution feature map and multi-resolution feature maps, respectively. On the MS-COCO dataset, CenterNet with Res2Net-101 and Swin-Transformer achieves APs of 53.7% and 57.1%, respectively, outperforming all existing bottom-up detectors and achieving state-of-the-art. We also design a real-time CenterNet, which achieves a good trade-off between accuracy and speed with an AP of 43.6% at 30.5 FPS. https://github.com/Duankaiwen/PyCenterNet.

Submitted 18 April, 2022; originally announced April 2022.

Comments: 11 pages, 9 figures, 8 tables. arXiv admin note: substantial text overlap with arXiv:1904.08189

arXiv:2202.04266 [pdf, other]

MMLN: Leveraging Domain Knowledge for Multimodal Diagnosis

Authors: Haodi Zhang, Chenyu Xu, Peirou Liang, Ke Duan, Hao Ren, Weibin Cheng, Kaishun Wu

Abstract: Recent studies show that deep learning models achieve good performance on medical imaging tasks such as diagnosis prediction. Among the models, multimodality has been an emerging trend, integrating different forms of data such as chest X-ray (CXR) images and electronic medical records (EMRs). However, most existing methods incorporate them in a model-free manner, which lacks theoretical support and ignores the intrinsic relations between different data sources. To address this problem, we propose a knowledge-driven and data-driven framework for lung disease diagnosis. By incorporating domain knowledge, machine learning models can reduce the dependence on labeled data and improve interpretability. We formulate diagnosis rules according to authoritative clinical medicine guidelines and learn the weights of rules from text data. Finally, a multimodal fusion consisting of text and image data is designed to infer the marginal probability of lung disease. We conduct experiments on a real-world dataset collected from a hospital. The results show that the proposed method outperforms the state-of-the-art multimodal baselines in terms of accuracy and interpretability.

Submitted 8 February, 2022; originally announced February 2022.

arXiv:2112.08860 [pdf, other]

doi 10.1016/j.scib.2021.12.015

Search for gamma-ray spectral lines with the DArk Matter Particle Explorer

Authors: Francesca Alemanno, Qi An, Philipp Azzarello, Felicia Carla Tiziana Barbato, Paolo Bernardini, Xiao-Jun Bi, Ming-Sheng Cai, Elisabetta Casilli, Enrico Catanzani, Jin Chang, Deng-Yi Chen, Jun-Ling Chen, Zhan-Fang Chen, Ming-Yang Cui, Tian-Shu Cui, Yu-Xing Cui, Hao-Ting Dai, Antonio De Benedittis, Ivan De Mitri, Francesco de Palma, Maksym Deliyergiyev, Margherita Di Santo, Qi Ding, Tie-Kuang Dong, Zhen-Xing Dong, et al. (121 additional authors not shown)

Abstract: The DArk Matter Particle Explorer (DAMPE) is well suited to searching for monochromatic and sharp $γ$-ray structures in the GeV$-$TeV range thanks to its unprecedented high energy resolution. In this work, we search for $γ$-ray line structures using five years of DAMPE data. To improve the sensitivity, we develop two types of dedicated data sets (including the BgoOnly data, used here for the first time in the data analysis for calorimeter-based gamma-ray observatories) and adopt the signal-to-noise ratio optimized regions of interest (ROIs) for different DM density profiles. No line signals or candidates are found between 10 and 300 GeV in the Galaxy. The constraints on the velocity-averaged cross section for $χχ\to γγ$ and the decay lifetime for $χ\to γν$, both at 95% confidence level, have been calculated and the systematic uncertainties have been taken into account. Compared to the previous Fermi-LAT results, though DAMPE has an acceptance smaller by a factor of $\sim 10$, similar constraints on the DM parameters are achieved and below 100 GeV the lower limits on the decay lifetime are even stronger by a factor of a few. Our results demonstrate the potential of high-energy-resolution observations on dark matter detection.

Submitted 6 December, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

Comments: 14 pages, 8 figures. Update the content to keep up with the published version

Journal ref: Science Bulletin, Volume 67, Issue 7, 15 April 2022, Pages 679-684

arXiv:2111.06545 [pdf, ps, other]

doi 10.1126/science.abg5137

Peta-electron volt gamma-ray emission from the Crab Nebula

Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, Axikegu, L. X. Bai, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, H. Cai, J. T. Cai, Zhe Cao, J. Chang, J. F. Chang, B. M. Chen, E. S. Chen, J. Chen, Liang Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, et al. (250 additional authors not shown)

Abstract: The Crab pulsar and the surrounding nebula powered by the pulsar's rotational energy through the formation and termination of a relativistic electron-positron wind is a bright source of gamma-rays carrying crucial information about this complex conglomerate. We report the detection of $γ$-rays with a spectrum showing gradual steepening over three energy decades, from $5\times 10^{-4}$ to $1.1$ petaelectronvolt (PeV). The ultra-high-energy photons exhibit the presence of a PeV electron accelerator (a pevatron) with an acceleration rate exceeding 15% of the absolute theoretical limit. Assuming that unpulsed $γ$-rays are produced at the termination of the pulsar's wind, we constrain the pevatron's size, between $0.025$ and $0.1$ pc, and the magnetic field $\approx 110 μ$G. The production rate of PeV electrons, $2.5 \times 10^{36}$ erg $\rm s^{-1}$, constitutes 0.5% of the pulsar's spin-down luminosity, although we do not exclude a non-negligible contribution of PeV protons to the production of the highest energy $γ$-rays.

Submitted 11 November, 2021; originally announced November 2021.

Comments: 43 pages, 13 figures, 2 tables; Published in Science

Journal ref: Science, 2021, Vol 373, Issue 6553, pp. 425-430

arXiv:2110.00123 [pdf, other]

doi 10.3847/2041-8213/ac2de6

Observations of Forbush Decreases of cosmic ray electrons and positrons with the Dark Matter Particle Explorer

Authors: Francesca Alemanno, Qi An, Philipp Azzarello, Felicia Carla Tiziana Barbato, Paolo Bernardini, XiaoJun Bi, MingSheng Cai, Elisabetta Casilli, Enrico Catanzani, Jin Chang, DengYi Chen, JunLing Chen, ZhanFang Chen, MingYang Cui, TianShu Cui, YuXing Cui, HaoTing Dai, Antonio De Benedittis, Ivan De Mitri, Francesco de Palma, Maksym Deliyergiyev, Margherita Di Santo, Qi Ding, TieKuang Dong, ZhenXing Dong , et al. (124 additional authors not shown)

Abstract: The Forbush Decrease (FD) represents the rapid decrease of the intensities of charged particles accompanying coronal mass ejections (CMEs) or high-speed streams from coronal holes. It has been mainly explored with ground-based neutron monitor networks, which indirectly measure the integrated intensities of all species of cosmic rays by counting secondary neutrons produced from interactions between atmospheric atoms and cosmic rays. The space-based experiments can resolve the species of particles, but the energy ranges are limited by the relatively small acceptances except for the most abundant particles like protons and helium. Therefore, the FD of cosmic ray electrons and positrons has only been investigated by the PAMELA experiment in the low energy range ($<5$ GeV) with limited statistics. In this paper, we study the FD event that occurred in September 2017, with the electron and positron data recorded by the Dark Matter Particle Explorer. The evolution of the FDs from 2 GeV to 20 GeV with a time resolution of 6 hours is given. We observe two solar energetic particle events in the time profile of the intensity of cosmic rays; the earlier and weaker one has not been shown in the neutron monitor data. Furthermore, both the amplitude and recovery time of fluxes of electrons and positrons show clear energy dependence, which is important in probing the disturbances of the interplanetary environment by the coronal mass ejections.

Submitted 30 September, 2021; originally announced October 2021.

Comments: This article is dedicated to the 72nd anniversary of People's Republic of China

arXiv:2108.10521 [pdf, other]

Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive Benchmark Study

Authors: Tianlong Chen, Kaixiong Zhou, Keyu Duan, Wenqing Zheng, Peihao Wang, Xia Hu, Zhangyang Wang

Abstract: Training deep graph neural networks (GNNs) is notoriously hard. Besides the standard plights in training deep architectures such as vanishing gradients and overfitting, it also uniquely suffers from over-smoothing, information squashing, and so on, which limits their potential power for encoding the high-order neighbor structure in large-scale graphs. Although numerous efforts are proposed to address these limitations, such as various forms of skip connections, graph normalization, and random dropping, it is difficult to disentangle the advantages brought by a deep GNN architecture from those "tricks" necessary to train such an architecture. Moreover, the lack of a standardized benchmark with fair and consistent experimental settings poses an almost insurmountable obstacle to gauge the effectiveness of new mechanisms. In view of those, we present the first fair and reproducible benchmark dedicated to assessing the "tricks" of training deep GNNs. We categorize existing approaches, investigate their hyperparameter sensitivity, and unify the basic configuration. Comprehensive evaluations are then conducted on tens of representative graph datasets including the recent large-scale Open Graph Benchmark, with diverse deep GNN backbones. We demonstrate that an organic combo of initial connection, identity mapping, group and batch normalization attains the new state-of-the-art results for deep GNNs on large datasets. Codes are available: https://github.com/VITA-Group/Deep_GCN_Benchmarking.

Submitted 8 May, 2022; v1 submitted 24 August, 2021; originally announced August 2021.

Comments: Accepted by TPAMI

arXiv:2107.13208 [pdf, ps, other]

Optimal gamma-ray selections for monochromatic line searches with DAMPE

Authors: Zun-Lei Xu, Kai-Kai Duan, Wei Jiang, Shi-Jun Lei, Xiang Li, Zhao-Qiang Shen, Tao Ma, Meng Su, Qiang Yuan, Chuan Yue, Yi-Zhong Fan, Jin Chang

Abstract: The DArk Matter Particle Explorer (DAMPE) is a space high-energy cosmic-ray detector covering a wide energy band with a high energy resolution. One of the key scientific goals of DAMPE is to carry out indirect detection of dark matter by searching for high-energy gamma-ray line structure. To promote the sensitivity of gamma-ray line search with DAMPE, it is crucial to improve the acceptance and energy resolution of gamma-ray photons. In this paper, we quantitatively prove that the photon sample with the largest ratio of acceptance to energy resolution is optimal for line search. We therefore develop a line-search sample specifically optimized for the line search. Meanwhile, in order to increase the statistics, we also selected the so-called BGO-only photons that convert into $e^+e^-$ pairs only in the BGO calorimeter. The standard, the line-search, and the BGO-only photon samples are then tested for line search individually and collectively. The results show that a significantly improved limit could be obtained from an appropriate combination of the data sets, and the increase is about 20% for the highest case compared with using the standard sample only.

Submitted 11 November, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

arXiv:2105.09073 [pdf, other]

doi 10.1103/PhysRevLett.126.201102

Measurement of the cosmic ray helium energy spectrum from 70 GeV to 80 TeV with the DAMPE space mission

Authors: F. Alemanno, Q. An, P. Azzarello, F. C. T. Barbato, P. Bernardini, X. J. Bi, M. S. Cai, E. Catanzani, J. Chang, D. Y. Chen, J. L. Chen, Z. F. Chen, M. Y. Cui, T. S. Cui, Y. X. Cui, H. T. Dai, A. D'Amone, A. De Benedittis, I. De Mitri, F. de Palma, M. Deliyergiyev, M. Di Santo, T. K. Dong, Z. X. Dong, G. Donvito , et al. (120 additional authors not shown)

Abstract: The measurement of the energy spectrum of cosmic ray helium nuclei from 70 GeV to 80 TeV using 4.5 years of data recorded by the DArk Matter Particle Explorer (DAMPE) is reported in this work. A hardening of the spectrum is observed at an energy of about 1.3 TeV, similar to previous observations. In addition, a spectral softening at about 34 TeV is revealed for the first time with large statistics and well controlled systematic uncertainties, with an overall significance of $4.3σ$. The DAMPE spectral measurements of both cosmic protons and helium nuclei suggest a particle charge dependent softening energy, although with current uncertainties a dependence on the number of nucleons cannot be ruled out.

Submitted 21 May, 2021; v1 submitted 19 May, 2021; originally announced May 2021.

Comments: 11 pages, 13 figures, published in Phys. Rev. Lett. Add one more digit for first three columns in Table S2

Journal ref: Phys. Rev. Lett. 126, 201102 (2021)

arXiv:2104.04899 [pdf, other]

Location-Sensitive Visual Recognition with Cross-IOU Loss

Authors: Kaiwen Duan, Lingxi Xie, Honggang Qi, Song Bai, Qingming Huang, Qi Tian

Abstract: Object detection, instance segmentation, and pose estimation are popular visual recognition tasks which require localizing the object by internal or boundary landmarks. This paper summarizes these tasks as location-sensitive visual recognition and proposes a unified solution named location-sensitive network (LSNet). Based on a deep neural network as the backbone, LSNet predicts an anchor point and a set of landmarks which together define the shape of the target object. The key to optimizing the LSNet lies in the ability of fitting various scales, for which we design a novel loss function named cross-IOU loss that computes the cross-IOU of each anchor point-landmark pair to approximate the global IOU between the prediction and ground-truth. The flexibly located and accurately predicted landmarks also enable LSNet to incorporate richer contextual information for visual recognition. Evaluated on the MS-COCO dataset, LSNet sets the new state-of-the-art accuracy for anchor-free object detection (a 53.5% box AP) and instance segmentation (a 40.2% mask AP), and shows promising performance in detecting multi-scale human poses. Code is available at https://github.com/Duankaiwen/LSNet

Submitted 10 April, 2021; originally announced April 2021.

Comments: 13 pages, 7 figures and 5 tables

arXiv:2103.10910 [pdf]

Local fluid pressure perturbations inside faults matter in unexpectedly large injection-triggered earthquakes

Authors: Yinlin Ji, Hannes Hofmann, Kang Duan, Arno Zang

Abstract: Anticipating the maximum magnitude of injection-triggered earthquakes is highly valuable for the safe and efficient exploitation of geoenergies. The recent work by Li et al. (2021) reached the conclusion that unexpectedly large injection-triggered earthquakes are primarily caused by large pre-existing critical shear stresses on seismogenic faults. Also of great interest is the proposal of the ratio of fault slip to dilation as an index to anticipate the fault rupture energetics. However, their fluid injection experiments were conducted under fully drained conditions, where the fluid pressure distribution on the fault plane is always uniform. As already pointed out by the authors, local fluid pressure perturbations inside faults under locally undrained conditions also have the potential to cause unexpectedly large seismic events. Here we add more datasets and possible explanations for this viewpoint, which has not been explored in such detail by Li et al. (2021).

Submitted 29 March, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

arXiv:2008.07940 [pdf, other]

doi 10.1103/PhysRevApplied.15.024061

Mechanical dissipation below 1$μ$Hz with a cryogenic diamagnetic-levitated micro-oscillator

Authors: Yingchun Leng, Rui Li, Xi Kong, Han Xie, Di Zheng, Peiran Yin, Fang Xiong, Tong Wu, Chang Kui Duan, Youwei Du, Zhang qi Yin, Pu Huang, Jiangfeng Du

Abstract: Ultralow dissipation plays an important role in sensing applications and exploring macroscopic quantum phenomena using micro- and nano-mechanical systems. We report a diamagnetic-levitated micro-mechanical oscillator operating at a low temperature of 3 K with measured dissipation as low as 0.59 $μ$Hz and a quality factor as high as $2 \times 10^7$. To the best of our knowledge the achieved dissipation is the lowest in micro- and nano-mechanical systems to date, orders of magnitude improvement over the reported state-of-the-art systems based on different principles. The cryogenic diamagnetic-levitated oscillator described here is applicable to a wide range of mass, making it a good candidate for measuring both force and acceleration with ultra-high sensitivity. By virtue of the naturally existing strong magnetic gradient, this system has great potential in quantum spin mechanics study.

Submitted 18 August, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

Comments: 6 pages, 3 figures

Journal ref: Phys. Rev. Applied 15, 024061 (2021)

arXiv:2007.13816 [pdf, other]

Corner Proposal Network for Anchor-free, Two-stage Object Detection

Authors: Kaiwen Duan, Lingxi Xie, Honggang Qi, Song Bai, Qingming Huang, Qi Tian

Abstract: The goal of object detection is to determine the class and location of objects in an image. This paper proposes a novel anchor-free, two-stage framework which first extracts a number of object proposals by finding potential corner keypoint combinations and then assigns a class label to each proposal by a standalone classification stage. We demonstrate that these two stages are effective solutions for improving recall and precision, respectively, and they can be integrated into an end-to-end network. Our approach, dubbed Corner Proposal Network (CPN), enjoys the ability to detect objects of various scales and also avoids being confused by a large number of false-positive proposals. On the MS-COCO dataset, CPN achieves an AP of 49.2% which is competitive among state-of-the-art object detection methods. CPN also fits the scenario of computational efficiency, which achieves an AP of 41.6%/39.7% at 26.2/43.3 FPS, surpassing most competitors with the same inference speed. Code is available at https://github.com/Duankaiwen/CPNDet

Submitted 27 July, 2020; originally announced July 2020.

Comments: 18 pages (including 3 pages of References), 3 figures, 7 tables, accepted by ECCV 2020

arXiv:2001.01804 [pdf, other]

doi 10.1088/1674-4527/20/6/92

Boresight Alignment of DArk Matter Particle Explorer

Authors: Wei Jiang, Xiang Li, Kai-Kai Duan, Zhao-Qiang Shen, Zun-Lei Xu, Jing-Jing Zang, Shi-Jun Lei, Qiang Yuan

Abstract: The DArk Matter Particle Explorer (DAMPE) can measure $γ$-rays in the energy range from a few GeV to about 10 TeV. The direction of each $γ$-ray is reconstructed with respect to the reference system of the DAMPE payload. In this paper, we adopt a maximum likelihood method and use the $γ$-ray data centered around several bright point-like sources to measure and correct the angular deviation from the real celestial coordinate system, the so-called "boresight alignment" of the DAMPE payload. As a check, we also estimate the boresight alignment for some sets of simulation data with artificial orientation and obtain consistent results. The time-dependent boresight alignment analysis does not show evidence for significant variation of the parameters.

Submitted 6 January, 2020; originally announced January 2020.

arXiv:1911.08967 [pdf, ps, other]

Transfer Learning Toolkit: Primers and Benchmarks

Authors: Fuzhen Zhuang, Keyu Duan, Tongjia Guo, Yongchun Zhu, Dongbo Xi, Zhiyuan Qi, Qing He

Abstract: The transfer learning toolkit wraps the codes of 17 transfer learning models and provides integrated interfaces, allowing users to use those models by calling a simple function. It is easy for primary researchers to use this toolkit and to choose proper models for real-world applications. The toolkit is written in Python and distributed under the MIT open source license. In this paper, the current state of this toolkit is described and the necessary environment setting and usage are introduced.

Submitted 20 November, 2019; originally announced November 2019.

Comments: A Transfer Learning Toolkit

arXiv:1911.02685 [pdf, ps, other]

A Comprehensive Survey on Transfer Learning

Authors: Fuzhen Zhuang, Zhiyuan Qi, Keyu Duan, Dongbo Xi, Yongchun Zhu, Hengshu Zhu, Hui Xiong, Qing He

Abstract: Transfer learning aims at improving the performance of target learners on target domains by transferring the knowledge contained in different but related source domains. In this way, the dependence on a large number of target domain data can be reduced for constructing target learners. Due to the wide application prospects, transfer learning has become a popular and promising area in machine learning. Although there are already some valuable and impressive surveys on transfer learning, these surveys introduce approaches in a relatively isolated way and lack the recent advances in transfer learning. Due to the rapid expansion of the transfer learning area, it is both necessary and challenging to comprehensively review the relevant studies. This survey attempts to connect and systematize the existing transfer learning research, as well as to summarize and interpret the mechanisms and the strategies of transfer learning in a comprehensive way, which may help readers have a better understanding of the current research status and ideas. Unlike previous surveys, this survey paper reviews more than forty representative transfer learning approaches, especially homogeneous transfer learning approaches, from the perspectives of data and model. The applications of transfer learning are also briefly introduced. In order to show the performance of different transfer learning models, over twenty representative transfer learning models are used for experiments. The models are performed on three different datasets, i.e., Amazon Reviews, Reuters-21578, and Office-31. The experimental results demonstrate the importance of selecting appropriate transfer learning models for different applications in practice.

Submitted 23 June, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

Comments: 31 pages, 7 figures

arXiv:1909.12860 [pdf, other]

doi 10.1126/sciadv.aax3793

Measurement of the cosmic-ray proton spectrum from 40 GeV to 100 TeV with the DAMPE satellite

Authors: Q. An, R. Asfandiyarov, P. Azzarello, P. Bernardini, X. J. Bi, M. S. Cai, J. Chang, D. Y. Chen, H. F. Chen, J. L. Chen, W. Chen, M. Y. Cui, T. S. Cui, H. T. Dai, A. D'Amone, A. De Benedittis, I. De Mitri, M. Di Santo, M. Ding, T. K. Dong, Y. F. Dong, Z. X. Dong, G. Donvito, D. Droz, J. L. Duan , et al. (129 additional authors not shown)

Abstract: The precise measurement of the spectrum of protons, the most abundant component of the cosmic radiation, is necessary to understand the source and acceleration of cosmic rays in the Milky Way. This work reports the measurement of the cosmic ray proton fluxes with kinetic energies from 40 GeV to 100 TeV, with two and a half years of data recorded by the DArk Matter Particle Explorer (DAMPE). This is the first time an experiment directly measures the cosmic ray protons up to ~100 TeV with high statistics. The measured spectrum confirms the spectral hardening found by previous experiments and reveals a softening at ~13.6 TeV, with the spectral index changing from ~2.60 to ~2.85. Our result suggests the existence of a new spectral feature of cosmic rays at energies lower than the so-called knee, and sheds new light on the origin of Galactic cosmic rays.

Submitted 30 September, 2019; v1 submitted 27 September, 2019; originally announced September 2019.

Comments: 37 pages, 5 figures, published in Science Advances

Journal ref: Science Advances, Vol. 5, no. 9, eaax3793 (2019)

arXiv:1907.02173 [pdf, other]

doi 10.1016/j.astropartphys.2018.10.006

The on-orbit calibration of DArk Matter Particle Explorer

Authors: G. Ambrosi, Q. An, R. Asfandiyarov, P. Azzarello, P. Bernardini, M. S. Cai, M. Caragiulo, J. Chang, D. Y. Chen, H. F. Chen, J. L. Chen, W. Chen, M. Y. Cui, T. S. Cui, H. T. Dai, A. D'Amone, A. De Benedittis, I. De Mitri, M. Ding, M. Di Santo, J. N. Dong, T. K. Dong, Y. F. Dong, Z. X. Dong, D. Droz , et al. (133 additional authors not shown)

Abstract: The DArk Matter Particle Explorer (DAMPE), a satellite-based cosmic ray and gamma-ray detector, was launched on December 17, 2015, and began its on-orbit operation on December 24, 2015. In this work we document the on-orbit calibration procedures used by DAMPE and report the calibration results of the Plastic Scintillator strip Detector (PSD), the Silicon-Tungsten tracKer-converter (STK), the BGO imaging calorimeter (BGO), and the Neutron Detector (NUD). The results are obtained using Galactic cosmic rays, bright known GeV gamma-ray sources, and charge injection into the front-end electronics of each sub-detector. The determination of the boundary of the South Atlantic Anomaly (SAA), the measurement of the live time, and the alignments of the detectors are also introduced. The calibration results demonstrate the stability of the detectors in almost two years of the on-orbit operation.

Submitted 3 July, 2019; originally announced July 2019.

Journal ref: Astroparticle Physics, Volume 106, p. 18-34 (2019)

arXiv:1904.13098 [pdf, other]

doi 10.1088/1674-4527/19/9/132

DmpIRFs and DmpST: DAMPE Instrument Response Functions and Science Tools for Gamma-Ray Data Analysis

Authors: Kai-Kai Duan, Wei Jiang, Yun-Feng Liang, Zhao-Qiang Shen, Zun-Lei Xu, Yi-Zhong Fan, Fabio Gargano, Simone Garrappa, Dong-Ya Guo, Shi-Jun Lei, Xiang Li, Mario Nicola Mazziotta, Maria Fernanda Munoz Salinas, Meng Su, Valerio Vagelli, Qiang Yuan, Chuan Yue, Stephan Zimmer

Abstract: GeV gamma ray is an important observation target of DArk Matter Particle Explorer (DAMPE) for indirect dark matter searching and high energy astrophysics. We present in this work a set of accurate instrument response functions of DAMPE (DmpIRFs) including the effective area, point-spread function and energy dispersion that are crucial for the gamma-ray data analysis based on the high statistics simulation data. A dedicated software named DmpST is developed to facilitate the scientific analyses of DAMPE gamma-ray data. Considering the limited number of photons and the angular resolution of DAMPE, the maximum likelihood method is adopted in the DmpST to better disentangle different source components. The basic mathematics and the framework regarding this software are also introduced in this paper.

Submitted 30 April, 2019; originally announced April 2019.

Comments: 12 pages, 9 figures, Accepted by RAA

arXiv:1904.08189 [pdf, other]

CenterNet: Keypoint Triplets for Object Detection

Authors: Kaiwen Duan, Song Bai, Lingxi Xie, Honggang Qi, Qingming Huang, Qi Tian

Abstract: In object detection, keypoint-based approaches often suffer a large number of incorrect object bounding boxes, arguably due to the lack of an additional look into the cropped regions. This paper presents an efficient solution which explores the visual patterns within each cropped region with minimal costs. We build our framework upon a representative one-stage keypoint-based detector named CornerNet. Our approach, named CenterNet, detects each object as a triplet, rather than a pair, of keypoints, which improves both precision and recall. Accordingly, we design two customized modules named cascade corner pooling and center pooling, which play the roles of enriching information collected by both top-left and bottom-right corners and providing more recognizable information at the central regions, respectively. On the MS-COCO dataset, CenterNet achieves an AP of 47.0%, which outperforms all existing one-stage detectors by at least 4.9%. Meanwhile, with a faster inference speed, CenterNet demonstrates quite comparable performance to the top-ranked two-stage detectors. Code is available at https://github.com/Duankaiwen/CenterNet.

Submitted 18 April, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

Comments: 10 pages (including 2 pages of References), 7 figures, 5 tables

arXiv:1902.10402 [pdf]

Social Credibility Incorporating Semantic Analysis and Machine Learning: A Survey of the State-of-the-Art and Future Research Directions

Authors: Bilal Abu-Salih, Bushra Bremie, Pornpit Wongthongtham, Kevin Duan, Tomayess Issa, Kit Yan Chan, Mohammad Alhabashneh, Teshreen Albtoush, Sulaiman Alqahtani, Abdullah Alqahtani, Muteeb Alahmari, Naser Alshareef, Abdulaziz Albahlal

Abstract: The wealth of Social Big Data (SBD) represents a unique opportunity for organisations to obtain the excessive use of such data abundance to increase their revenues. Hence, there is an imperative need to capture, load, store, process, analyse, transform, interpret, and visualise such manifold social datasets to develop meaningful insights that are specific to an application domain. This paper lays the theoretical background by introducing the state-of-the-art literature review of the research topic. This is associated with a critical evaluation of the current approaches, and fortified with certain recommendations indicated to bridge the research gap.

Submitted 27 February, 2019; originally announced February 2019.

arXiv:1901.01521 [pdf, other]

doi 10.3847/2041-8213/ab1c64

Late afterglow emission statistics: a clear link between GW170817 and bright short GRBs

Authors: Kai-Kai Duan, Zhi-Ping Jin, Fu-Wen Zhang, Yi-Ming Zhu, Xiang Li, Yi-Zhong Fan, Da-Ming Wei

Abstract: GW170817, the first neutron star merger event detected by advanced LIGO/Virgo detectors, was associated with an underluminous short duration GRB 170817A. In this work we compare the forward shock afterglow emission of GW170817/GRB 170817A to other luminous short GRBs (sGRBs) with both a known redshift and an afterglow emission lasting at least one day after the burst. In the rapid decay phase, the afterglow emission of the bright sGRBs and GW170817/GRB 170817A form a natural and continuous sequence, though separated by an observation time gap. If viewed on-axis, the forward shock afterglow emission of GW170817/GRB 170817A would be among the brightest ones detected so far. This provides strong evidence for the GW170817-like merger origin of bright sGRBs, and suggests that the detection of the forward shock afterglow emission of most neutron star merger events is more challenging than the case of GW170817 since usually the mergers will be more distant and the viewing angles are plausibly higher.

Submitted 6 January, 2019; originally announced January 2019.

Comments: 7 pages, 3 figures

arXiv:1808.00216 [pdf]

An AI Based Super Nodes Selection Algorithm in BlockChain Networks

Authors: Jianwen Chen, Kai Duan, Rumin Zhang, Liaoyuan Zeng, Wenyi Wang

Abstract: In blockchain systems, especially cryptographic currencies such as Bitcoin, the double-spending and Byzantine-general-like problems are solved by consensus protocols among all nodes. The state-of-the-art protocols include Proof-of-Work, Proof-of-Stake and Delegated-Proof-of-Stake. Proof-of-Work requires nodes to prove their computing power, measured in hash rate, in a crypto-puzzle solving competition. The other two take into account the amount of stake of each node, and Delegated-Proof-of-Stake additionally introduces a vote. However, these frameworks have several drawbacks, such as consuming a large amount of electricity and driving the whole blockchain towards a centralized system. In this paper, we propose a conceptual framework, fundamental theory and research methodology, based on artificial intelligence technology, that exploits the nearly complementary information of each node. We also design a particular convolutional neural network and a dynamic threshold, which yield the super nodes and the random nodes used to reach consensus. Experimental results demonstrate that our framework combines the advantages of Proof-of-Work, Proof-of-Stake and Delegated-Proof-of-Stake while avoiding complicated hash operations and monopoly. Furthermore, it compares favorably to the three state-of-the-art consensus frameworks in terms of security and the speed of transaction confirmation.

Submitted 1 August, 2018; originally announced August 2018.

arXiv:1806.00733 [pdf, other]

doi 10.1103/PhysRevD.99.123519

Search for line signal candidates in the Fermi-LAT data

Authors: Shang Li, Zi-Qing Xia, Yun-Feng Liang, Kai-Kai Duan, Zhao-Qiang Shen, Xiang Li, Lei Feng, Qiang Yuan, Yi-Zhong Fan, Jin Chang

Abstract: In order to search for line-like signals in the Fermi-LAT data, we have analyzed a total of 49152 regions of interest (ROIs) that cover the whole sky. No ROI displays a line signal with a test statistic (TS) value above 25, while 50 ROIs show weak line-like excesses with ${\rm TS}>16$. The intrinsic significances of these potential signals are further reduced by the large trial factor introduced in this kind of analysis. For the largest TS value of 24.3 derived in our analysis, the corresponding global significance is only $0.54σ$. We thus do not find any significant line-like signal and set constraints on the cross section of dark matter annihilating to gamma-ray lines, $\left<σ{v}\right>_{γγ}$.

Submitted 3 June, 2019; v1 submitted 2 June, 2018; originally announced June 2018.

Comments: 10 pages, 6 figures, accepted by PRD

Journal ref: Phys. Rev. D 99, 123519 (2019)

arXiv:1805.06612 [pdf, other]

doi 10.1103/PhysRevD.97.122001

Search for gamma-ray emission from the nearby dwarf spheroidal galaxies with 9 years of Fermi-LAT data

Authors: Shang Li, Kai-Kai Duan, Yun-Feng Liang, Zi-Qing Xia, Zhao-Qiang Shen, Xiang Li, Neng-Hui Liao, Lei Feng, Qiang Yuan, Yi-Zhong Fan, Jin Chang

Abstract: In this work, we search for $γ$-ray emission from the directions of some nearby Milky Way dwarf spheroidal galaxies (dSphs) and candidates with the publicly-available Pass 8 data of Fermi-LAT. Our sample includes 12 sources at distances $<50$ kpc. Very weak $γ$-ray excesses ($\sim 2σ$) are found in some dSphs/candidates, consistent with those reported in the previous literature. Intriguingly, the peak test statistic (TS) value of the weak emission from the direction of Reticulum II rises continually. If interpreted as dark matter (DM) annihilation, the peak TS value is 13.5 for the annihilation channel of $χχ\rightarrow τ^{+}τ^{-}$ and the DM mass of $m_χ\sim 16$ GeV. The combination of all these nearby sources yields a more significant (with local significance $> 4σ$) $γ$-ray signal.

Submitted 17 May, 2018; originally announced May 2018.

Comments: 7 pages, 4 figures, accepted for publication in PRD

Showing 1–50 of 63 results for author: Duan, K