subscribe to arXiv mailings

Observation of Klein bottle quadrupole topological insulators in electric circuits

Authors: Xizhou Shen, Keyu Pan, Xiumei Wang, Xingping Zhou

Abstract: The Klein bottle Benalcazar-Bernevig-Hughes (BBH) insulator phase plays a pivotal role in understanding higher-order topological phases. The insulator phase is characterized by a unique feature: a nonsymmorphic glide symmetry that exists within momentum space, rather than real space. This characteristic transforms the Brillouin zone's fundamental domain into a structure of Klein bottle. Here, we r… ▽ More The Klein bottle Benalcazar-Bernevig-Hughes (BBH) insulator phase plays a pivotal role in understanding higher-order topological phases. The insulator phase is characterized by a unique feature: a nonsymmorphic glide symmetry that exists within momentum space, rather than real space. This characteristic transforms the Brillouin zone's fundamental domain into a structure of Klein bottle. Here, we report an observation of a Klein bottle topoelectrical model under gauge fields. To provide a comprehensive understanding of the different corner distributions of odd and even unit cells, we present theoretical calculations and demonstrate that the symmetry properties significantly affect the topological nature. These theoretical predictions are confirmed by experimental results, which demonstrate the practical feasibility of such topological configurations in electronic circuits. Our work establishes a vital connection between the realms of condensed matter physics and circuit systems, thereby paving a pathway for investigating exotic condensed matter physics. △ Less

Submitted 10 July, 2024; originally announced July 2024.

arXiv:2407.02964 [pdf, other]

FSM: A Finite State Machine Based Zero-Shot Prompting Paradigm for Multi-Hop Question Answering

Authors: Xiaochen Wang, Junqing He, Zhe yang, Yiru Wang, Xiangdi Meng, Kunhao Pan, Zhifang Sui

Abstract: Large Language Models (LLMs) with chain-of-thought (COT) prompting have demonstrated impressive abilities on simple nature language inference tasks. However, they tend to perform poorly on Multi-hop Question Answering (MHQA) tasks due to several challenges, including hallucination, error propagation and limited context length. We propose a prompting method, Finite State Machine (FSM) to enhance th… ▽ More Large Language Models (LLMs) with chain-of-thought (COT) prompting have demonstrated impressive abilities on simple nature language inference tasks. However, they tend to perform poorly on Multi-hop Question Answering (MHQA) tasks due to several challenges, including hallucination, error propagation and limited context length. We propose a prompting method, Finite State Machine (FSM) to enhance the reasoning capabilities of LLM for complex tasks in addition to improved effectiveness and trustworthiness. Different from COT methods, FSM addresses MHQA by iteratively decomposing a question into multi-turn sub-questions, and self-correcting in time, improving the accuracy of answers in each step. Specifically, FSM addresses one sub-question at a time and decides on the next step based on its current result and state, in an automaton-like format. Experiments on benchmarks show the effectiveness of our method. Although our method performs on par with the baseline on relatively simpler datasets, it excels on challenging datasets like Musique. Moreover, this approach mitigates the hallucination phenomenon, wherein the correct final answer can be recovered despite errors in intermediate reasoning. Furthermore, our method improves LLMs' ability to follow specified output format requirements, significantly reducing the difficulty of answer interpretation and the need for reformatting. △ Less

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2406.19593 [pdf, other]

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

Authors: Xin Su, Man Luo, Kris W Pan, Tien Pei Chou, Vasudev Lal, Phillip Howard

Abstract: Synthetic data generation has gained significant attention recently for its utility in training large vision and language models. However, the application of synthetic data to the training of multimodal context-augmented generation systems has been relatively unexplored. This gap in existing work is important because existing vision and language models (VLMs) are not trained specifically for conte… ▽ More Synthetic data generation has gained significant attention recently for its utility in training large vision and language models. However, the application of synthetic data to the training of multimodal context-augmented generation systems has been relatively unexplored. This gap in existing work is important because existing vision and language models (VLMs) are not trained specifically for context-augmented generation. Resources for adapting such models are therefore crucial for enabling their use in retrieval-augmented generation (RAG) settings, where a retriever is used to gather relevant information that is then subsequently provided to a generative model via context augmentation. To address this challenging problem, we generate SK-VQA: a large synthetic multimodal dataset containing over 2 million question-answer pairs which require external knowledge to determine the final answer. Our dataset is both larger and significantly more diverse than existing resources of its kind, possessing over 11x more unique questions and containing images from a greater variety of sources than previously-proposed datasets. Through extensive experiments, we demonstrate that our synthetic dataset can not only serve as a challenging benchmark, but is also highly effective for adapting existing generative multimodal models for context-augmented generation. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.18070 [pdf, other]

EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation

Authors: Baoqi Pei, Guo Chen, Jilan Xu, Yuping He, Yicheng Liu, Kanghua Pan, Yifei Huang, Yali Wang, Tong Lu, Limin Wang, Yu Qiao

Abstract: In this report, we present our solutions to the EgoVis Challenges in CVPR 2024, including five tracks in the Ego4D challenge and three tracks in the EPIC-Kitchens challenge. Building upon the video-language two-tower model and leveraging our meticulously organized egocentric video data, we introduce a novel foundation model called EgoVideo. This model is specifically designed to cater to the uniqu… ▽ More In this report, we present our solutions to the EgoVis Challenges in CVPR 2024, including five tracks in the Ego4D challenge and three tracks in the EPIC-Kitchens challenge. Building upon the video-language two-tower model and leveraging our meticulously organized egocentric video data, we introduce a novel foundation model called EgoVideo. This model is specifically designed to cater to the unique characteristics of egocentric videos and provides strong support for our competition submissions. In the Ego4D challenges, we tackle various tasks including Natural Language Queries, Step Grounding, Moment Queries, Short-term Object Interaction Anticipation, and Long-term Action Anticipation. In addition, we also participate in the EPIC-Kitchens challenge, where we engage in the Action Recognition, Multiple Instance Retrieval, and Domain Adaptation for Action Recognition tracks. By adapting EgoVideo to these diverse tasks, we showcase its versatility and effectiveness in different egocentric video analysis scenarios, demonstrating the powerful representation ability of EgoVideo as an egocentric foundation model. Our codebase and pretrained models are publicly available at https://github.com/OpenGVLab/EgoVideo. △ Less

Submitted 30 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

Comments: Champion solutions in the EgoVis CVPR 2024 workshop

arXiv:2406.17555 [pdf, ps, other]

A response to commenter Ke Lan's comment on our paper published in Nature Communications (2023)14:5782 by J. Yan et al

Authors: Ji Yan, Jiwei Li, X. T. He, Lifeng Wang, Yaohua Chen, Feng Wang, Xiaoying Han, Kaiqiang Pan, Juxi Liang, Yulong Li, Zanyang Guan, Xiangming Liu, Xingsen Che, Zhongjing Chen, Xing Zhang, Yan Xu, Bin Li, Minging He, Hongbo Cai, Liang. Hao, Zhanjun Liu, Chunyang Zheng, Zhensheng Dai, Zhengfeng Fan, Bin Qiao , et al. (4 additional authors not shown)

Abstract: A response to commenter Ke Lan's comment on our paper published in Nature Communications (2023)14:5782 by J. Yan et al A response to commenter Ke Lan's comment on our paper published in Nature Communications (2023)14:5782 by J. Yan et al △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.16095 [pdf, other]

Constrained Measurement Incompatibility from Generalised Contextuality of Steered Preparation

Authors: Sumit Mukherjee, A. K. Pan

Abstract: In a bipartite Bell scenario involving two local measurements per party and two outcome per measurement, the measurement incompatibility in one wing is both necessary and sufficient to reveal the nonlocality. However, such a one-to-one correspondence fails when one of the observers performs more than two measurements. In such a scenario, the measurement incompatibility is necessary but not suffici… ▽ More In a bipartite Bell scenario involving two local measurements per party and two outcome per measurement, the measurement incompatibility in one wing is both necessary and sufficient to reveal the nonlocality. However, such a one-to-one correspondence fails when one of the observers performs more than two measurements. In such a scenario, the measurement incompatibility is necessary but not sufficient to reveal the nonlocality. In this work, within the formalism of general probabilistic theory (GPT), we demonstrate that unlike the nonlocality, the incompatibility of N arbitrary measurements in one wing is both necessary and sufficient for revealing the generalised contextuality for the sub-system in the other wing. Further, we formulate a novel form of inequality for any GPT that are necessary for N-wise compatibility of N arbitrary observables. Moreover, we argue that any theory that violates the proposed inequality possess a degree of incompatibility that can be quantified through the amount of violation. Finally, we claim that it is the generalised contextuality that provides a restriction to the allowed degree of measurement incompatibility of any viable theory of nature and thereby super-select the the quantum theory. △ Less

Submitted 10 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

Comments: three figures, comments are welcome

arXiv:2406.10496 [pdf, other]

Radiation RMHD accretion flows around spinning AGNs: a comparative study of MAD and SANE state

Authors: Ramiz Aktar, Kuo-Chuan Pan, Toru Okuda

Abstract: In our study, we examine a 2D radiation, relativistic, magnetohydrodynamics (Rad-RMHD) accretion flows around a spinning supermassive black hole. We begin by setting an initial equilibrium torus around the black hole, with an embedded initial magnetic field inside the torus. The strength of the initial magnetic field is determined by the plasma beta parameter, which is the ratio of the gas pressur… ▽ More In our study, we examine a 2D radiation, relativistic, magnetohydrodynamics (Rad-RMHD) accretion flows around a spinning supermassive black hole. We begin by setting an initial equilibrium torus around the black hole, with an embedded initial magnetic field inside the torus. The strength of the initial magnetic field is determined by the plasma beta parameter, which is the ratio of the gas pressure to the magnetic pressure. In this paper, we perform a comparative study of the `magnetically arrested disc (MAD)' and `standard and normal evolution (SANE)' states. We observe that MAD state is possible for comparatively high initial magnetic field strength flow. Additionally, we also adopt a self-consistent two-temperature model to evaluate the luminosity and energy spectrum for our model. We observe that the total luminosity is mostly dominated by bremsstrahlung luminosity compared to the synchrotron luminosity due to the presence of highly dense torus. We also identify similar quasi-periodic oscillations (QPOs) for both MAD and SANE states based on power density spectrum analysis. Furthermore, our comparative study of the energy spectrum does not reveal any characteristic differences between MAD and SANE states. Lastly, we note that the MAD state is possible for both prograde and retrograde accretion flow. △ Less

Submitted 15 June, 2024; originally announced June 2024.

Comments: 21 pages, 12 figures, Accepted for publication in ApJ

arXiv:2406.08622 [pdf, other]

doi 10.3847/1538-4357/ad429e

The SDSS-V Black Hole Mapper Reverberation Mapping Project: CIV BAL Acceleration in the Quasar SBS 1408+544

Authors: Robert Wheatley, Catherine J. Grier, Patrick B. Hall, W. N. Brandt, Jonah Lotz, D. P. Schneider, Jonathan R. Trump, Yue Shen, Lucas M. Seaton, Scott F. Anderson, Matthew J. Temple, Roberto Assef, Logan B. Fries, Y. Homayouni, Darshan Kakkad, Anton M. Koekemoer, Mary Loli Martınez-Aldama, C. Alenka Negrete, Claudio Ricci, Dmitry Bizyaev, Joel R. Brownstein, Sean Morrison, Kaike Pan

Abstract: We present the results of an investigation of a highly variable CIV broad absorption-line feature in the quasar SBS 1408+544 (z=2.337) that shows a significant shift in velocity over time. This source was observed as a part of the Sloan Digital Sky Survey Reverberation Mapping Project and the SDSS-V Black Hole Mapper Reverberation Mapping Project, and has been included in two previous studies, bot… ▽ More We present the results of an investigation of a highly variable CIV broad absorption-line feature in the quasar SBS 1408+544 (z=2.337) that shows a significant shift in velocity over time. This source was observed as a part of the Sloan Digital Sky Survey Reverberation Mapping Project and the SDSS-V Black Hole Mapper Reverberation Mapping Project, and has been included in two previous studies, both of which identified significant variability in a high-velocity CIV broad absorption line (BAL) on timescales of just a few days in the quasar rest frame. Using ~130 spectra acquired over eight years of spectroscopic monitoring with SDSS, we have determined that this BAL is not only varying in strength, but is also systematically shifting to higher velocities. Using cross-correlation methods, we measure the velocity shifts (and corresponding acceleration) of the BAL on a wide range of timescales, measuring an overall velocity shift of delta v = -683 (+89, -84) km s-1 over the 8-year monitoring period. This corresponds to an average rest-frame acceleration of a=1.04 (+0.14, -0.13) cm s-2, though the magnitude of the acceleration on shorter timescales is not constant throughout. We place our measurements in the context of BAL-acceleration models and examine various possible causes of the observed velocity shift. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: Published in the Astrophysical Journal

Journal ref: ApJ, Volume 968, Issue 2, Article #9 (2024)

arXiv:2405.11594 [pdf, other]

Extreme Magnetic Field Modulus Variability of the Bp star HD 57372

Authors: S. Hubrig, S. D. Chojnowski, S. P. Jarvinen, I. Ilyin, K. Pan

Abstract: Context. In chemically peculiar Ap/Bp stars with large-scale organised magnetic fields with a simple centred dipole configuration, the ratio between the maximum and the minimum of the mean magnetic field modulus is of the order of 1.25. Values of 2 or more are observed only for very few Ap/Bp stars and are indicative of a very unusual magnetic field geometry. Aims. Determining the magnetic field s… ▽ More Context. In chemically peculiar Ap/Bp stars with large-scale organised magnetic fields with a simple centred dipole configuration, the ratio between the maximum and the minimum of the mean magnetic field modulus is of the order of 1.25. Values of 2 or more are observed only for very few Ap/Bp stars and are indicative of a very unusual magnetic field geometry. Aims. Determining the magnetic field structure of Ap/Bp stars is bound to provide a different insight into the physics and the origin of the magnetic fields in early-type stars. In this respect, the Bp star HD 57372 is of particular interest because strongly variable magnetically split lines are observed in HARPS and APOGEE spectra. Methods. We obtained and analysed measurements of the mean magnetic field modulus and of the mean longitudinal magnetic field using near-infrared spectra and optical polarimetric spectra distributed over the stellar rotation period. Results. The mean magnetic field modulus <B> of HD 57372, as estimated from absorption lines that are split via the Zeeman effect and resolved in both optical and near-infrared spectra, is found to vary by an extraordinary amount of about 10 kG. The exceptional value of 3 for the ratio between the maximum and the minimum of the field modulus is indicative of a very unusual geometry of HD 57372's magnetic field. All observable quantities are found to vary in phase with the photometric period of 7.889 days. This includes the longitudinal magnetic field <Bz>, which varies from -6 kG up to 1.7 kG in FORS2 spectra as well as the metal line strengths, whose equivalent widths change by up to 50% of their mean values over the course of the rotation period. The B8 temperature class of HD 57372 also places it among the hottest stars known to exhibit resolved, magnetically split lines. △ Less

Submitted 19 May, 2024; originally announced May 2024.

arXiv:2405.01926 [pdf, other]

Auto-Encoding Morph-Tokens for Multimodal LLM

Authors: Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang

Abstract: For multimodal LLMs, the synergy of visual comprehension (textual output) and generation (visual output) presents an ongoing challenge. This is due to a conflicting objective: for comprehension, an MLLM needs to abstract the visuals; for generation, it needs to preserve the visuals as much as possible. Thus, the objective is a dilemma for visual-tokens. To resolve the conflict, we propose encoding… ▽ More For multimodal LLMs, the synergy of visual comprehension (textual output) and generation (visual output) presents an ongoing challenge. This is due to a conflicting objective: for comprehension, an MLLM needs to abstract the visuals; for generation, it needs to preserve the visuals as much as possible. Thus, the objective is a dilemma for visual-tokens. To resolve the conflict, we propose encoding images into morph-tokens to serve a dual purpose: for comprehension, they act as visual prompts instructing MLLM to generate texts; for generation, they take on a different, non-conflicting role as complete visual-tokens for image reconstruction, where the missing visual cues are recovered by the MLLM. Extensive experiments show that morph-tokens can achieve a new SOTA for multimodal comprehension and generation simultaneously. Our project is available at https://github.com/DCDmllm/MorphTokens. △ Less

Submitted 3 May, 2024; originally announced May 2024.

Comments: Accepted by ICML 2024

arXiv:2404.11084 [pdf]

Observation of Young's double-slit phenomenon in anti-PT-symmetric electrical circuits

Authors: Keyu Pan, Xiumei Wang, Xizhou Shen, Haoyi Zhou, Xingping Zhou

Abstract: In the last few decades, interference has been extensively studied in both the quantum and classical fields, which reveals light volatility and is widely used for high-precision measurements. We have put forward the phenomenon in which the discrete diffraction and interference phenomena, presented by the time-varying voltage of a Su-Schrieffer-Heeger (SSH) circuit model with an anti-PT (APT) symme… ▽ More In the last few decades, interference has been extensively studied in both the quantum and classical fields, which reveals light volatility and is widely used for high-precision measurements. We have put forward the phenomenon in which the discrete diffraction and interference phenomena, presented by the time-varying voltage of a Su-Schrieffer-Heeger (SSH) circuit model with an anti-PT (APT) symmetry. To demonstrate Young's double-slit phenomenon in an APT circuit, we initially explore the coupled mode theory (CMT) of voltage in the broken phase, observe discrete diffraction under single excitation and interference under double excitations. Furthermore, we design a phase-shifting circuit to observe the effects of phase difference and distance on discrete interference. Our work combines the effects in optics with condensed matter physics, show the Young's double-slit phenomenon in electrical circuits theoretically and experimentally. △ Less

Submitted 21 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

arXiv:2403.07864 [pdf]

Unraveling the nature of quasi van der Waals Epitaxy of magnetic topological insulators Cr: (BixSb1-x)2Te3 on a GaAs (111) substrate through coherently strained interface

Authors: Yuxing Ren, Lixuan Tai, Kaicheng Pan, Yueyun Chen, Benjamin Z. Gregory, Jin Ho Kang, Malcolm Jackson, Michael Liao, Yifei Sun, Noah Bodzin, Kin Wong, Suchismita Sarker, B. C. Regan, Chee-Wei Wong, Mark Goorsky, Andrej Singer, Kang L. Wang

Abstract: Quasi van der Waals Epitaxy (qvdWE) has been realized for decades at the interfaces between 3D and 2D materials or van der Waals materials. The growth of magnetic topological insulators (MTI) Cr: (BixSb1-x)2Te3 (CBST) on GaAs (111) substrates for Quantum Anomalous Hall Effect (QAH) is actually one of the examples of qvdWE, which is not well noticed despite the fact that its advantages have been us… ▽ More Quasi van der Waals Epitaxy (qvdWE) has been realized for decades at the interfaces between 3D and 2D materials or van der Waals materials. The growth of magnetic topological insulators (MTI) Cr: (BixSb1-x)2Te3 (CBST) on GaAs (111) substrates for Quantum Anomalous Hall Effect (QAH) is actually one of the examples of qvdWE, which is not well noticed despite the fact that its advantages have been used in growth of various MTI materials. This is distinguished from the growth of MTIs on other substrates. Although the qvdWE mode has been used in many 2D growth on III-V substrates, the specific features and mechanisms are not well demonstrated and summarized yet. Here in this work, we have for the first time shown the features of both coherent interfaces and the existence of strain originating from qvdWE at the same time. △ Less

Submitted 12 March, 2024; originally announced March 2024.

Comments: 5 figures, 1 table. Already shown in APS March Meeting 2023 and 2024

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2402.14188 [pdf, ps, other]

On the cohomology of Lie algebras associated with graphs

Authors: Marco Aldi, Andrew Butler, Jordan Gardiner, Daniele Grandini, Monica Lichtenwalner, Kevin Pan

Abstract: We describe a canonical decomposition of the cohomology of the Dani-Mainkar metabelian Lie algebras associated with graphs. As applications, we obtain explicit formulas for the third cohomology of any Dani-Mainkar Lie algebra and for the cohomology in all degrees of Lie algebras associated with arbitrary star graphs. We also describe a procedure to reduce the calculation of the cohomology of solva… ▽ More We describe a canonical decomposition of the cohomology of the Dani-Mainkar metabelian Lie algebras associated with graphs. As applications, we obtain explicit formulas for the third cohomology of any Dani-Mainkar Lie algebra and for the cohomology in all degrees of Lie algebras associated with arbitrary star graphs. We also describe a procedure to reduce the calculation of the cohomology of solvable Lie algebras associated with graphs through the Grantcharov-Grantcharov-Iliev construction to the cohomology of Dani-Mainkar Lie algebras. △ Less

Submitted 13 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

Comments: 17 pages

arXiv:2402.12934 [pdf, ps, other]

Qualitative analysis to an eigenvalue problem of the Hartree type Brézis-Nirenberg problem

Authors: Kefan Pan, Shixin Wen, Jing Yang

Abstract: In this paper, we are concerned with the critical Hartree equation \begin{equation*} \begin{cases} -Δu=\left(\displaystyle{\displaystyle{\int_Ω}}\frac{u^{2^{*}_μ}(y)}{|x-y|^μ}dy\right)u^{2^{*}_μ-1}+\varepsilon u,\quad u>0,\quad &\text{in $Ω$,}\\ u=0,\quad &\text{on $\partialΩ$,} \end{cases} \end{equation*} where $Ω\subset \mathbb{R}^N$ ($N\geq 5$) is a smooth bounded domain, $μ\in (0,4)$ and… ▽ More In this paper, we are concerned with the critical Hartree equation \begin{equation*} \begin{cases} -Δu=\left(\displaystyle{\displaystyle{\int_Ω}}\frac{u^{2^{*}_μ}(y)}{|x-y|^μ}dy\right)u^{2^{*}_μ-1}+\varepsilon u,\quad u>0,\quad &\text{in $Ω$,}\\ u=0,\quad &\text{on $\partialΩ$,} \end{cases} \end{equation*} where $Ω\subset \mathbb{R}^N$ ($N\geq 5$) is a smooth bounded domain, $μ\in (0,4)$ and $2^{*}_μ=\frac{2N-μ}{N-2}$ is the upper critical exponent in the sense of the Hardy-Littlewood-Sobolev inequality. Under a non-degeneracy condition on the critical point $x_0\inΩ$ of the Robin function $R(x)$, we perform that for $\varepsilon>0$ sufficiently small, the Morse index of the blow-up solutions $u_\varepsilon$ concentrating at $x_0$ can be computed in terms of the negative eigenvalues of the Hessian matrix $D^{2}R(x)$ at $x_0$. Compared with the usual local cases, our problem is non-local due to the nonlinearity with Hartree-type, and several difficulties arise and new estimates of the eigenpairs $\{\left(λ_{i,\varepsilon},v_{i,\varepsilon}\right)\}$ to the associated linearized problem at $u_{\varepsilon}$ should be introduced. To our knowledge, this seems to be the first paper to consider the qualitative analysis of a Hartree type Brézis-Nirenberg problem and our results extend the works established by M. Grossi et al in \cite{GP} and F. Takahashi in \cite{Ta3} to the non-local case. △ Less

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2401.08985 [pdf, other]

The Influence of Stellar Rotation in Binary Systems on Core-Collapse Supernova Progenitors and Multi-messenger Signals

Authors: Hao-Sheng Wang, Kuo-Chuan Pan

Abstract: The detailed structure of core-collapse supernova progenitors is crucial for studying supernova explosion engines and the corresponding multimessenger signals. In this paper, we investigate the influence of stellar rotation on binary systems consisting of a 30 solar mass donor star and a 20 solar mass accretor using the MESA stellar evolution code. We find that through mass transfer in binary syst… ▽ More The detailed structure of core-collapse supernova progenitors is crucial for studying supernova explosion engines and the corresponding multimessenger signals. In this paper, we investigate the influence of stellar rotation on binary systems consisting of a 30 solar mass donor star and a 20 solar mass accretor using the MESA stellar evolution code. We find that through mass transfer in binary systems, fast-rotating red- and blue-supergiant progenitors can be formed within a certain range of initial orbital periods, albeit the correlation is not linear. We also find that even with the same initial mass ratio of the binary system, the resulting final masses of the collapsars, the iron core masses, the compactness parameters, and the final rotational rates can vary widely and are sensitive to the initial orbital periods. For instance, the progenitors with strong convection form a thinner Si-shell and a wider O-shell compared to those in single-star systems. In addition, we conduct two-dimensional self-consistent core-collapse supernova simulations with neutrino transport for these rotating progenitors derived from binary stellar evolution. We find that the neutrino and gravitational-wave signatures of these binary progenitors could exhibit significant variations. Progenitors with larger compactness parameters produce more massive proto-neutron stars, have higher mass-accretion rates, and emit brighter neutrino luminosity and louder gravitational emissions. Finally, we observe stellar-mass black hole formation in some of our failed exploding models. △ Less

Submitted 17 January, 2024; originally announced January 2024.

Comments: 19 pages, 16 figures, accepted by ApJ

arXiv:2401.02484 [pdf, other]

Spectacular nucleosynthesis from early massive stars

Authors: Alexander P. Ji, Sanjana Curtis, Nicholas Storm, Vedant Chandra, Kevin C. Schlaufman, Keivan G. Stassun, Alexander Heger, Marco Pignatari, Adrian M. Price-Whelan, Maria Bergemann, Guy S. Stringfellow, Carla Frohlich, Henrique Reggiani, Erika M. Holmbeck, Jamie Tayar, Shivani P. Shah, Emily J. Griffith, Chervin F. P. Laporte, Andrew R. Casey, Keith Hawkins, Danny Horta, William Cerny, Pierre Thibodeaux, Sam A. Usman, Joao A. S. Amarante , et al. (17 additional authors not shown)

Abstract: Stars formed with initial mass over 50 Msun are very rare today, but they are thought to be more common in the early universe. The fates of those early, metal-poor, massive stars are highly uncertain. Most are expected to directly collapse to black holes, while some may explode as a result of rotationally powered engines or the pair-creation instability. We present the chemical abundances of J0931… ▽ More Stars formed with initial mass over 50 Msun are very rare today, but they are thought to be more common in the early universe. The fates of those early, metal-poor, massive stars are highly uncertain. Most are expected to directly collapse to black holes, while some may explode as a result of rotationally powered engines or the pair-creation instability. We present the chemical abundances of J0931+0038, a nearby low-mass star identified in early followup of SDSS-V Milky Way Mapper, which preserves the signature of unusual nucleosynthesis from a massive star in the early universe. J0931+0038 has relatively high metallicity ([Fe/H] = -1.76 +/- 0.13) but an extreme odd-even abundance pattern, with some of the lowest known abundance ratios of [N/Fe], [Na/Fe], [K/Fe], [Sc/Fe], and [Ba/Fe]. The implication is that a majority of its metals originated in a single extremely metal-poor nucleosynthetic source. An extensive search through nucleosynthesis predictions finds a clear preference for progenitors with initial mass > 50 Msun, making J0931+0038 one of the first observational constraints on nucleosynthesis in this mass range. However the full abundance pattern is not matched by any models in the literature. J0931+0038 thus presents a challenge for the next generation of nucleosynthesis models and motivates study of high-mass progenitor stars impacted by convection, rotation, jets, and/or binary companions. Though rare, more examples of unusual early nucleosynthesis in metal-poor stars should be found in upcoming large spectroscopic surveys. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Comments: 11 pages + 22 page appendix, accepted to ApJL

arXiv:2401.01933 [pdf, other]

doi 10.3847/1538-4357/ad2f30

Exploring Changing-look Active Galactic Nuclei with the Sloan Digital Sky Survey V: First Year Results

Authors: Grisha Zeltyn, Benny Trakhtenbrot, Michael Eracleous, Qian Yang, Paul Green, Scott F. Anderson, Stephanie LaMassa, Jessie Runnoe, Roberto J. Assef, Franz E. Bauer, W. N. Brandt, Megan C. Davis, Sara E. Frederick, Logan B. Fries, Matthew J. Graham, Norman A. Grogin, Muryel Guolo, Lorena Hernández-García, Anton M. Koekemoer, Mirko Krumpe, Xin Liu, Mary Loli Martínez-Aldama, Claudio Ricci, Donald P. Schneider, Yue Shen , et al. (10 additional authors not shown)

Abstract: "Changing-look" active galactic nuclei (CL-AGNs) challenge our basic ideas about the physics of accretion flows and circumnuclear gas around supermassive black holes. Using first-year Sloan Digital Sky Survey V (SDSS-V) repeated spectroscopy of nearly 29,000 previously known AGNs, combined with dedicated follow-up spectroscopy, and publicly available optical light curves, we have identified 116 CL… ▽ More "Changing-look" active galactic nuclei (CL-AGNs) challenge our basic ideas about the physics of accretion flows and circumnuclear gas around supermassive black holes. Using first-year Sloan Digital Sky Survey V (SDSS-V) repeated spectroscopy of nearly 29,000 previously known AGNs, combined with dedicated follow-up spectroscopy, and publicly available optical light curves, we have identified 116 CL-AGNs where (at least) one broad emission line has essentially (dis-)appeared, as well as 88 other extremely variable systems. Our CL-AGN sample, with 107 newly identified cases, is the largest reported to date, and includes $\sim0.4\%$ of the AGNs reobserved in first-year SDSS-V operations. Among our CL-AGNs, 67% exhibit dimming while 33% exhibit brightening. Our sample probes extreme AGN spectral variability on months to decades timescales, including some cases of recurring transitions on surprisingly short timescales ($\lesssim 2$ months in the rest frame). We find that CL events are preferentially found in lower-Eddington-ratio ($f_{Edd}$) systems: Our CL-AGNs have a $f_{Edd}$ distribution that significantly differs from that of a carefully constructed, redshift- and luminosity-matched control sample (Anderson-Darling test yielding $p_{\rm AD}\approx 6\times10^{-5}$; median $f_{Edd}\approx0.025$ vs. $0.043$). This preference for low $f_{Edd}$ strengthens previous findings of higher CL-AGN incidence at lower $f_{Edd}$, found in smaller samples. Finally, we show that the broad MgII emission line in our CL-AGN sample tends to vary significantly less than the broad H$β$ emission line. Our large CL-AGN sample demonstrates the advantages and challenges in using multi-epoch spectroscopy from large surveys to study extreme AGN variability and physics. △ Less

Submitted 1 May, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

Comments: Submitted to ApJ. Full tables and figure-sets will be published upon acceptance, and can be made available upon request$.$

Journal ref: ApJ 966 85 (2024)

arXiv:2312.07226 [pdf, other]

Super-Resolution on Rotationally Scanned Photoacoustic Microscopy Images Incorporating Scanning Prior

Authors: Kai Pan, Linyang Li, Li Lin, Pujin Cheng, Junyan Lyu, Lei Xi, Xiaoyin Tang

Abstract: Photoacoustic Microscopy (PAM) images integrating the advantages of optical contrast and acoustic resolution have been widely used in brain studies. However, there exists a trade-off between scanning speed and image resolution. Compared with traditional raster scanning, rotational scanning provides good opportunities for fast PAM imaging by optimizing the scanning mechanism. Recently, there is a t… ▽ More Photoacoustic Microscopy (PAM) images integrating the advantages of optical contrast and acoustic resolution have been widely used in brain studies. However, there exists a trade-off between scanning speed and image resolution. Compared with traditional raster scanning, rotational scanning provides good opportunities for fast PAM imaging by optimizing the scanning mechanism. Recently, there is a trend to incorporate deep learning into the scanning process to further increase the scanning speed.Yet, most such attempts are performed for raster scanning while those for rotational scanning are relatively rare. In this study, we propose a novel and well-performing super-resolution framework for rotational scanning-based PAM imaging. To eliminate adjacent rows' displacements due to subject motion or high-frequency scanning distortion,we introduce a registration module across odd and even rows in the preprocessing and incorporate displacement degradation in the training. Besides, gradient-based patch selection is proposed to increase the probability of blood vessel patches being selected for training. A Transformer-based network with a global receptive field is applied for better performance. Experimental results on both synthetic and real datasets demonstrate the effectiveness and generalizability of our proposed framework for rotationally scanned PAM images'super-resolution, both quantitatively and qualitatively. Code is available at https://github.com/11710615/PAMSR.git. △ Less

Submitted 12 December, 2023; originally announced December 2023.

arXiv:2311.09198 [pdf, other]

Never Lost in the Middle: Improving Large Language Models via Attention Strengthening Question Answering

Authors: Junqing He, Kunhao Pan, Xiaoqun Dong, Zhuoyang Song, Yibo Liu, Yuxin Liang, Hao Wang, Qianguo Sun, Songxin Zhang, Zejian Xie, Jiaxing Zhang

Abstract: While large language models (LLMs) are equipped with longer text input capabilities than before, they are struggling to seek correct information in long contexts. The "lost in the middle" problem challenges most LLMs, referring to the dramatic decline in accuracy when correct information is located in the middle. To overcome this crucial issue, this paper proposes to enhance the information search… ▽ More While large language models (LLMs) are equipped with longer text input capabilities than before, they are struggling to seek correct information in long contexts. The "lost in the middle" problem challenges most LLMs, referring to the dramatic decline in accuracy when correct information is located in the middle. To overcome this crucial issue, this paper proposes to enhance the information searching and reflection ability of LLMs in long contexts via specially designed tasks called Attention Strengthening Multi-doc QA (ASM QA). Following these tasks, our model excels in focusing more precisely on the desired information. Experimental results show substantial improvement in Multi-doc QA and other benchmarks, superior to state-of-the-art models by 13.7% absolute gain in shuffled settings, by 21.5% in passage retrieval task. We release our model, Ziya-Reader to promote related research in the community. △ Less

Submitted 15 November, 2023; originally announced November 2023.

arXiv:2311.07977 [pdf, ps, other]

Unbounded Sharing of Nonlocality Using Projective Measurements

Authors: S. Sasmal, S. Kanjilal, A. K. Pan

Abstract: It is a common perception that a sharp projective measurement in one side of the Bell experiment destroys the entanglement of the shared state, thereby preventing the demonstration of sequential sharing of nonlocality. In contrast, we introduce a local randomness-assisted projective measurement protocol, enabling the sharing of nonlocality by an arbitrary number of sequential observers (Bobs) with… ▽ More It is a common perception that a sharp projective measurement in one side of the Bell experiment destroys the entanglement of the shared state, thereby preventing the demonstration of sequential sharing of nonlocality. In contrast, we introduce a local randomness-assisted projective measurement protocol, enabling the sharing of nonlocality by an arbitrary number of sequential observers (Bobs) with a single spatially separated party Alice. Subsequently, a crucial feature of the interplay between the degrees of incompatibility of observables of both parties is revealed, enabling the unbounded sharing of nonlocality. Our findings, not only offer a new paradigm for understanding the fundamental nature of incompatibility in demonstrating quantum nonlocality but also pave a new path for various information processing tasks based on local randomness-assisted projective measurement. △ Less

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2311.04631 [pdf, other]

Self-testing of an unbounded number of mutually commuting local observables

Authors: Sneha Munshi, A. K. Pan

Abstract: Based on the optimal quantum violation of suitable Bell's inequality, the device-independent self-testing of state and observables has been reported. It is well-studied that locally commuting or compatible observables cannot be used to reveal quantum nonlocality. Therefore, the self-testing of commuting local observables cannot be possible through the Bell test. In this work, we demonstrate the se… ▽ More Based on the optimal quantum violation of suitable Bell's inequality, the device-independent self-testing of state and observables has been reported. It is well-studied that locally commuting or compatible observables cannot be used to reveal quantum nonlocality. Therefore, the self-testing of commuting local observables cannot be possible through the Bell test. In this work, we demonstrate the self-testing of a set of mutually commuting local observables. Such certification has not hitherto been reported. We show that the optimal quantum violations of suitably formulated bilocality and n-locality inequalities in networks uniquely fix the observables of one party to be mutually commuting. In particular, we first demonstrate that in a two-input-arbitrary-party star network, two commuting local observables can be self-tested. Further, by considering an arbitrary-input three-party bilocal network scenario, we demonstrate the self-testing of an unbounded number of mutually commuting local observables. △ Less

Submitted 8 November, 2023; originally announced November 2023.

arXiv:2311.04621 [pdf, other]

doi 10.1002/andp.202300060

Optimal quantum violations of n-locality inequalities with conditional dependence on inputs

Authors: Sneha Munshi, A. K. Pan

Abstract: Bell experiment in the network gives rise to a form of quantum nonlocality which is conceptually different from traditional multipartite Bell nonlocality. Conventional multipartite Bell experiment features a single source that distributes physical systems to multiple parties. In contrast, the network Bell experiment features multiple independent sources. This work considers a nontrivial quantum ne… ▽ More Bell experiment in the network gives rise to a form of quantum nonlocality which is conceptually different from traditional multipartite Bell nonlocality. Conventional multipartite Bell experiment features a single source that distributes physical systems to multiple parties. In contrast, the network Bell experiment features multiple independent sources. This work considers a nontrivial quantum network, the star-network configuration in an arbitrary input scenario involving n independent sources and (n+1) parties, including n edge parties and one central party. Each of the n edge parties shares a physical system with the central party. We consider that the central party received an arbitrary m number of inputs, and each edge party receives 2^{m-1} number of inputs. The joint probabilities of the system are bounded by some linear constraints. We show that this behaviour of the joint probabilities in turn imposes conditional dependence on the inputs of the edge parties such that the observables of each edge party are bounded by few linear constraints. We derive a family of generalized n-locality inequalities and demonstrate its optimal quantum violation. We introduce an elegant sum-of-squares approach that enables the optimization in quantum theory without specifying the dimension of the quantum system. The optimal quantum value self-tests the observables of each edge party along with the conditional dependence. The observables of the central party along with the quantum state are also self-tested from the optimization procedure itself. Further, we characterize the network nonlocality and examine its correspondence with suitably derived standard Bell nonlocality. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Journal ref: Annalen l der Physik 2300060, 1-15, 2023

arXiv:2311.04583 [pdf, other]

doi 10.1103/PhysRevA.107.022425

Nonlocal correlations in an asymmetric quantum network

Authors: Souradeep Sasmal, Shyam Sundar Mahato, Alok Kumar Pan

Abstract: The nonlocality revealed in a multiparty multisource network Bell experiment is conceptually different than the standard multiparty Bell nonlocality involving a single common source. Here, by introducing variants of asymmetric bilocal as well as trilocal network scenarios, we go beyond the typical bilocal network scenario where both the edge parties have an equal number of measurement settings. We… ▽ More The nonlocality revealed in a multiparty multisource network Bell experiment is conceptually different than the standard multiparty Bell nonlocality involving a single common source. Here, by introducing variants of asymmetric bilocal as well as trilocal network scenarios, we go beyond the typical bilocal network scenario where both the edge parties have an equal number of measurement settings. We first introduce an asymmetric bilocal network where one of the edge parties (say, Alice) receives $2^{n-1}$ inputs and the other edge party (say, Charlie) receives $n$ inputs. We derive two variants of asymmetric bilocality inequalities and demonstrate their optimal quantum violations. Further, we explore two types of asymmetric trilocal scenarios: (i) when two edge parties receive $2^{n-1}$ inputs each and the other edge party receives $n$ inputs, and (ii) when one edge party receives $2^{n-1}$ inputs, and the other two edge parties have $n$ inputs each. We use an elegant sum-of-squares technique that enables us to evaluate the quantum optimal values of the proposed network inequalities without assuming the dimension of the systems for both the asymmetric bilocal as well as the trilocal scenarios. Further, we demonstrate the robustness of the quantum violations of the proposed inequalities in the presence of white noise. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Journal ref: Phys. Rev. A 107, 022425 (2023)

arXiv:2311.04568 [pdf, other]

doi 10.1103/PhysRevA.107.012615

Sharing preparation contextuality in Bell experiment by arbitrary pair of sequential observers

Authors: Asmita Kumari, Alok Kumar Pan

Abstract: Based on the quantum violation of bipartite Bell inequality, it has been demonstrated that the sharing of non-locality can be demonstrated for at most two sequential observers at one end and at most one-pair of observers at both ends. In this work, we study the sharing of non-locality and preparation contextuality based on a bipartite Bell inequality, involving arbitrary $n$ measurements by one pa… ▽ More Based on the quantum violation of bipartite Bell inequality, it has been demonstrated that the sharing of non-locality can be demonstrated for at most two sequential observers at one end and at most one-pair of observers at both ends. In this work, we study the sharing of non-locality and preparation contextuality based on a bipartite Bell inequality, involving arbitrary $n$ measurements by one party and $2^{n-1}$ measurements by other party. Such a Bell inequality has two bounds, the local bound and the preparation non-contextual bound, which is smaller than the local bound. We show that while non-locality can be shared only by first pair of the sequential observers, the preparation contextuality can be shared by arbitrary pair of independent sequential observers at both ends. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Comments: 8 pages, 3 Figures

Journal ref: Physical Review A 107, 012615 (2023)

arXiv:2311.04497 [pdf, other]

doi 10.1103/PhysRevA.107.022204

Device-independent certification of degeneracy-breaking measurements

Authors: Prabuddha Roy, Shyam Sundar Mahato, Sumit Mukherjee, A. K. Pan

Abstract: In a device-independent Bell test, the devices are considered to be black boxes and the dimension of the system remains unspecified. The dichotomic observables involved in such a Bell test can be degenerate and one may invoke a suitable measurement scheme to lift the degeneracy. However, the standard Bell test cannot account for whether or up to what extent the degeneracy is lifted, as the effect… ▽ More In a device-independent Bell test, the devices are considered to be black boxes and the dimension of the system remains unspecified. The dichotomic observables involved in such a Bell test can be degenerate and one may invoke a suitable measurement scheme to lift the degeneracy. However, the standard Bell test cannot account for whether or up to what extent the degeneracy is lifted, as the effect of lifting the degeneracy can only be reflected in the post-measurement states, which the standard Bell tests do not certify. In this work, we demonstrate the device-independent certification of degeneracy-breaking measurement based on the sequential Bell test by multiple observers who perform degeneracy-breaking unsharp measurements characterized by positive-operator-valued measures (POVMs) - the noisy variants of projectors. The optimal quantum violation of Clauser-Horne-Shimony-Holt inequality by multiple sequential observers eventually enables us to certify up to what extent the degeneracy has been lifted. In particular, our protocol certifies the upper bound on the number of POVMs used for performing such measurements along with the entangled state and measurement observables. We use an elegant sum-of-squares approach that powers such certification of degeneracy-breaking measurements. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Journal ref: Phys. Rev. A 107, 022204 (2023)

arXiv:2311.04492 [pdf, other]

doi 10.1007/s40509-023-00300-9

Sharing nonlocality in a network using the quantum violation of chain network inequality

Authors: Rahul Kumar, A. K. Pan

Abstract: Based on the quantum violation of suitable $n$-local inequality in a star network for arbitrary $m$ inputs, we demonstrate the sharing of nonlocality in the network. Such a network features an arbitrary $n$ number of independent sources, $n$ edge parties, and a central party. Each party receives arbitrary $m$ inputs. We consider two different types of sharing of nonlocality in the network. i) The… ▽ More Based on the quantum violation of suitable $n$-local inequality in a star network for arbitrary $m$ inputs, we demonstrate the sharing of nonlocality in the network. Such a network features an arbitrary $n$ number of independent sources, $n$ edge parties, and a central party. Each party receives arbitrary $m$ inputs. We consider two different types of sharing of nonlocality in the network. i) The symmetric case - when the sharing of nonlocality is considered across all edge parties. ii) The asymmetric case - when the sharing of nonlocality is considered across only one edge party. For simplicity, we first consider the bilocal scenario $(n=2)$ with three inputs $m=3$ and demonstrate that while in the symmetric case at most two sequential observers can share nonlocality, in the asymmetric case at most four sequential observers can share nonlocality. We extend the study to $n$-local scenario by assuming each party receives three inputs and show that in the symmetric case the result remains the same for any $n$, but in the asymmetrical case, an unbounded number of sequential observers can share nonlocality across one edge for a sufficiently large value of $n$. We further extend our result for arbitrary $m$ input in $n$-local scenario. We demonstrate that for $m\geq 4$, in the symmetric case at most one sequential observer can share nonlocality irrespective of the value of $n$. For the asymmetric case, we analytically show that there exists $n(k)$ for which an arbitrary $k$ number of sequential observers can share the nonlocality across one edge. The optimal quantum violation of $m$-input $n$-local inequality is derived through an elegant SOS approach without specifying the dimension of the quantum system. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Comments: arXiv admin note: text overlap with arXiv:2212.14325

Journal ref: Quantum Studies, 10, 353-372 (2023)

arXiv:2311.04490 [pdf, other]

Generalized parity-oblivious communication games powered by quantum preparation contextuality

Authors: Prabuddha Roy, A. K. Pan

Abstract: The parity-oblivious random-access-code (PORAC) is a class of communication games involving a sender (Alice) and a receiver (Bob). In such games, Alice's amount of communication to Bob is constraint by the parity-oblivious (PO) conditions, so that the parity information of her inputs remains oblivious to Bob. The PO condition in an operational theory is equivalently represented in an ontological m… ▽ More The parity-oblivious random-access-code (PORAC) is a class of communication games involving a sender (Alice) and a receiver (Bob). In such games, Alice's amount of communication to Bob is constraint by the parity-oblivious (PO) conditions, so that the parity information of her inputs remains oblivious to Bob. The PO condition in an operational theory is equivalently represented in an ontological model that satisfies the preparation noncontextuality. In this paper, we provide a nontrivial generalization of the existing two-level PORAC and derive the winning probability of the game in the preparation noncontextual ontological model. We demonstrate that the quantum theory outperforms the preparation noncontextual model by predicting higher winning probability in our generalized PORAC. △ Less

Submitted 8 November, 2023; originally announced November 2023.

arXiv:2311.04485 [pdf, other]

doi 10.1088/1367-2630/acb4b5

Device-independent self-testing of unsharp measurements

Authors: Prabuddha Roy, A. K. Pan

Abstract: Semi-device-independent certification of an unsharp instrument has recently been demonstrated [New J. Phys. 21, 083034 (2019)] based on the sequential sharing of quantum advantages in a prepare-measure communication game by assuming the system to be qubit. In this work, we provide device-independent (DI) self-testing of the unsharp instrument through the quantum violation of two Bell inequalities… ▽ More Semi-device-independent certification of an unsharp instrument has recently been demonstrated [New J. Phys. 21, 083034 (2019)] based on the sequential sharing of quantum advantages in a prepare-measure communication game by assuming the system to be qubit. In this work, we provide device-independent (DI) self-testing of the unsharp instrument through the quantum violation of two Bell inequalities where the devices are uncharacterized and the dimension of the system remains unspecified. We introduce an elegant sum-of-squares approach to derive the dimension-independent optimal quantum violation of Bell inequalities which plays a crucial role. Note that the standard Bell test cannot self-test the post-measurement states and consequently cannot self-test unsharp instrument. The sequential Bell test possess the potential to self-test an unsharp instrument. We demonstrate that there exists a trade-off between the maximum sequential quantum violations of the Clauser-Horne-Shimony-Holt inequality, and they form an optimal pair that enables the DI self-testing of the entangled state, the observables, and the unsharpness parameter. Further, we extend our study to the case of elegant Bell inequality and we argue that it has two classical bounds - the local bound and the non-trivial preparation non-contextual bound, lower than the local bound. Based on the sharing of preparation contextuality by three independent sequential observers, we demonstrate the DI self-testing of two unsharpness parameters. Since an actual experimental scenario involves losses and imperfection, we demonstrate robustness of our certification to noise. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Journal ref: New J. Phys. 25 013040 (2023)

arXiv:2311.04484 [pdf, other]

doi 10.1016/j.physleta.2023.128898

Leggett-Garg test of macrorealism using indefinite causal order of measurements

Authors: A. K. Pan

Abstract: Macrorealism is a belief that constitutes the core of our perception of reality in the everyday world. The Leggett-Garg (LG) test is a conceptually elegant approach for probing the compatibility between the notion of macrorealism and quantum theory. However, a conclusive LG test hinges on how one fixes the operational invasiveness loophole, i.e., how the statistical form of non-invasive measurabil… ▽ More Macrorealism is a belief that constitutes the core of our perception of reality in the everyday world. The Leggett-Garg (LG) test is a conceptually elegant approach for probing the compatibility between the notion of macrorealism and quantum theory. However, a conclusive LG test hinges on how one fixes the operational invasiveness loophole, i.e., how the statistical form of non-invasive measurability assumption is guaranteed in an LG test. Despite many attempts to close this loophole, no consensus has been achieved yet. In this work, we propose a simple and elegant scheme based on indefinite causal order in quantum switch experiment, which enables us to close this loophole, and eventually, the LG test becomes a conclusive test of macrorealism. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Journal ref: Physics Letters A, 478,128898 (2023)

arXiv:2311.03301 [pdf, other]

Ziya2: Data-centric Learning is All LLMs Need

Authors: Ruyi Gan, Ziwei Wu, Renliang Sun, Junyu Lu, Xiaojun Wu, Dixiang Zhang, Kunhao Pan, Junqing He, Yuanhe Tian, Ping Yang, Qi Yang, Hao Wang, Jiaxing Zhang, Yan Song

Abstract: Various large language models (LLMs) have been proposed in recent years, including closed- and open-source ones, continually setting new records on multiple benchmarks. However, the development of LLMs still faces several issues, such as high cost of training models from scratch, and continual pre-training leading to catastrophic forgetting, etc. Although many such issues are addressed along the l… ▽ More Various large language models (LLMs) have been proposed in recent years, including closed- and open-source ones, continually setting new records on multiple benchmarks. However, the development of LLMs still faces several issues, such as high cost of training models from scratch, and continual pre-training leading to catastrophic forgetting, etc. Although many such issues are addressed along the line of research on LLMs, an important yet practical limitation is that many studies overly pursue enlarging model sizes without comprehensively analyzing and optimizing the use of pre-training data in their learning process, as well as appropriate organization and leveraging of such data in training LLMs under cost-effective settings. In this work, we propose Ziya2, a model with 13 billion parameters adopting LLaMA2 as the foundation model, and further pre-trained on 700 billion tokens, where we focus on pre-training techniques and use data-centric optimization to enhance the learning process of Ziya2 on different stages. We define three data attributes and firstly establish data-centric scaling laws to illustrate how different data impacts LLMs. Experiments show that Ziya2 significantly outperforms other models in multiple benchmarks especially with promising results compared to representative open-source ones. Ziya2 (Base) is released at https://huggingface.co/IDEA-CCNL/Ziya2-13B-Base and https://modelscope.cn/models/Fengshenbang/Ziya2-13B-Base/summary. △ Less

Submitted 4 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

arXiv:2310.20411 [pdf, other]

A New Kilohertz Gravitational-Wave Feature from Rapidly Rotating Core-Collapse Supernovae

Authors: He-Feng Hsieh, Rubén Cabezón, Li-Ting Ma, Kuo-Chuan Pan

Abstract: We present self-consistent three-dimensional core-collapse supernova simulations of a rotating $20M_\odot$ progenitor model with various initial angular velocities from $0.0$ to $4.0$ rad s$^{-1}$ using a smoothed particle hydrodynamics code, SPHYNX, and a grid-based hydrodynamics code, FLASH. We identify two strong gravitational-wave features, with peak frequencies of $\sim300$ Hz and $\sim1.3$ k… ▽ More We present self-consistent three-dimensional core-collapse supernova simulations of a rotating $20M_\odot$ progenitor model with various initial angular velocities from $0.0$ to $4.0$ rad s$^{-1}$ using a smoothed particle hydrodynamics code, SPHYNX, and a grid-based hydrodynamics code, FLASH. We identify two strong gravitational-wave features, with peak frequencies of $\sim300$ Hz and $\sim1.3$ kHz in the first $100$ ms postbounce. We demonstrate that these two features are associated with the $m=1$ deformation from the proto-neutron star (PNS) modulation induced by the low-$T/|W|$ instability, regardless of the simulation code. The $300$ Hz feature is present in models with an initial angular velocity between $1.0$ and $4.0$ rad s$^{-1}$, while the $1.3$ kHz feature is present only in a narrower range, from $1.5$ to $3.5$ rad s$^{-1}$. We show that the $1.3$ kHz signal originates from the high-density inner core of the PNS, and the $m=1$ deformation triggers a strong asymmetric distribution of electron anti-neutrinos. In addition to the $300$ Hz and $1.3$ kHz features, we also observe one weaker but noticeable gravitational-wave feature from higher-order modes in the range between $1.5$ and $3.5$ rad s$^{-1}$. Its peak frequency is around $800$ Hz initially and gradually increases to $900-1000$ Hz. Therefore, in addition to the gravitational bounce signal, the detection of the $300$ Hz, $1.3$ kHz, the higher-order mode, and even the related asymmetric emission of neutrinos, could provide additional diagnostics to estimate the initial angular velocity of a collapsing core. △ Less

Submitted 31 October, 2023; originally announced October 2023.

Comments: 20 pages, 14 figures,. Accepted for publication in the Astrophysical Journal

arXiv:2310.15501 [pdf, other]

Evolution of MHD Torus and Mass Outflow Around Spinning AGN

Authors: Ramiz Aktar, Kuo-Chuan Pan, Toru Okuda

Abstract: We perform axisymmetric, two-dimensional magnetohydrodynamic (MHD) simulations to investigate accretion flows around spinning AGN. To mimic the space-time geometry of spinning black holes, we consider effective Kerr potential, and the mass of the black holes is $10^8 M_{\odot}$. We initialize the accretion disc with a magnetized torus by adopting the toroidal component of the magnetic vector poten… ▽ More We perform axisymmetric, two-dimensional magnetohydrodynamic (MHD) simulations to investigate accretion flows around spinning AGN. To mimic the space-time geometry of spinning black holes, we consider effective Kerr potential, and the mass of the black holes is $10^8 M_{\odot}$. We initialize the accretion disc with a magnetized torus by adopting the toroidal component of the magnetic vector potential. The initial magnetic field strength is set by using the plasma beta parameter ($β_0$). We observe self-consistent turbulence generated by magneto rotational instability (MRI) in the disc. The MRI turbulence transports angular momentum in the disc, resulting in an angular momentum distribution that approaches a Keplerian distribution. We investigate the effect of the magnetic field on the dynamics of the torus and associated mass outflow from the disc around a maximally spinning black hole $(a_k = 0.99)$. For the purpose of our analysis, we investigate the magnetic state of our simulation model. The model $β_0 = 10$ indicates the behaviour similar to the "magnetically arrested disk (MAD)'' state, and all the other low magnetic model remains in the SANE state. We observe that mass outflow rates are significantly enhanced with the increased magnetic field in the disc. We find a positive correlation between the magnetic field and mass outflow rates. We also investigate the effect of black hole spin on the magnetized torus evolution. However, we have not found any significant effect of black hole spin on mass outflows in our model. Finally, we discuss the possible astrophysical applications of our simulation results. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Comments: 15 pages, 13 figures (2 appendix figures), Accepted for publication in MNRAS

arXiv:2310.04922 [pdf, ps, other]

Robust Multivariate Detection and Estimation with Fault Frequency Content Information

Authors: Jingwei Dong, Kaikai Pan, Sergio Pequito, Peyman Mohajerin Esfahani

Abstract: This paper studies the problem of fault detection and estimation (FDE) for linear time-invariant (LTI) systems with a particular focus on frequency content information of faults, possibly as multiple disjoint continuum ranges, and under both disturbances and stochastic noise. To ensure the worst-case fault sensitivity in the considered frequency ranges and mitigate the effects of disturbances and… ▽ More This paper studies the problem of fault detection and estimation (FDE) for linear time-invariant (LTI) systems with a particular focus on frequency content information of faults, possibly as multiple disjoint continuum ranges, and under both disturbances and stochastic noise. To ensure the worst-case fault sensitivity in the considered frequency ranges and mitigate the effects of disturbances and noise, an optimization framework incorporating a mixed H_/H2 performance index is developed to compute the optimal detection filter. Moreover, a thresholding rule is proposed to guarantee both the false alarm rate (FAR) and the fault detection rate (FDR). Next, shifting attention to fault estimation in specific frequency ranges, an exact reformulation of the optimal estimation filter design using the restricted Hinf performance index is derived, which is inherently non-convex. However, focusing on finite frequency samples and fixed poles, a lower bound is established via a highly tractable quadratic programming (QP) problem. This lower bound together with an alternating optimization (AO) approach to the original estimation problem leads to a suboptimality gap for the overall estimation filter design. The effectiveness of the proposed approaches is validated through a synthetic non-minimum phase system and an application of the multi-area power system. △ Less

Submitted 15 May, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

Comments: 32pages, 15 figures

arXiv:2310.02821 [pdf, other]

Improving Vision Anomaly Detection with the Guidance of Language Modality

Authors: Dong Chen, Kaihang Pan, Guoming Wang, Yueting Zhuang, Siliang Tang

Abstract: Recent years have seen a surge of interest in anomaly detection for tackling industrial defect detection, event detection, etc. However, existing unsupervised anomaly detectors, particularly those for the vision modality, face significant challenges due to redundant information and sparse latent space. Conversely, the language modality performs well due to its relatively single data. This paper ta… ▽ More Recent years have seen a surge of interest in anomaly detection for tackling industrial defect detection, event detection, etc. However, existing unsupervised anomaly detectors, particularly those for the vision modality, face significant challenges due to redundant information and sparse latent space. Conversely, the language modality performs well due to its relatively single data. This paper tackles the aforementioned challenges for vision modality from a multimodal point of view. Specifically, we propose Cross-modal Guidance (CMG), which consists of Cross-modal Entropy Reduction (CMER) and Cross-modal Linear Embedding (CMLE), to tackle the redundant information issue and sparse space issue, respectively. CMER masks parts of the raw image and computes the matching score with the text. Then, CMER discards irrelevant pixels to make the detector focus on critical contents. To learn a more compact latent space for the vision anomaly detector, CMLE learns a correlation structure matrix from the language modality, and then the latent space of vision modality will be learned with the guidance of the matrix. Thereafter, the vision latent space will get semantically similar images closer. Extensive experiments demonstrate the effectiveness of the proposed methods. Particularly, CMG outperforms the baseline that only uses images by 16.81%. Ablation experiments further confirm the synergy among the proposed methods, as each component depends on the other to achieve optimal performance. △ Less

Submitted 4 October, 2023; originally announced October 2023.

Comments: 9 pages, 10 figures

arXiv:2309.09526 [pdf, other]

DFIL: Deepfake Incremental Learning by Exploiting Domain-invariant Forgery Clues

Authors: Kun Pan, Yin Yifang, Yao Wei, Feng Lin, Zhongjie Ba, Zhenguang Liu, ZhiBo Wang, Lorenzo Cavallaro, Kui Ren

Abstract: The malicious use and widespread dissemination of deepfake pose a significant crisis of trust. Current deepfake detection models can generally recognize forgery images by training on a large dataset. However, the accuracy of detection models degrades significantly on images generated by new deepfake methods due to the difference in data distribution. To tackle this issue, we present a novel increm… ▽ More The malicious use and widespread dissemination of deepfake pose a significant crisis of trust. Current deepfake detection models can generally recognize forgery images by training on a large dataset. However, the accuracy of detection models degrades significantly on images generated by new deepfake methods due to the difference in data distribution. To tackle this issue, we present a novel incremental learning framework that improves the generalization of deepfake detection models by continual learning from a small number of new samples. To cope with different data distributions, we propose to learn a domain-invariant representation based on supervised contrastive learning, preventing overfit to the insufficient new data. To mitigate catastrophic forgetting, we regularize our model in both feature-level and label-level based on a multi-perspective knowledge distillation approach. Finally, we propose to select both central and hard representative samples to update the replay set, which is beneficial for both domain-invariant representation learning and rehearsal-based knowledge preserving. We conduct extensive experiments on four benchmark datasets, obtaining the new state-of-the-art average forgetting rate of 7.01 and average accuracy of 85.49 on FF++, DFDC-P, DFD, and CDF2. Our code is released at https://github.com/DeepFakeIL/DFIL. △ Less

Submitted 18 September, 2023; originally announced September 2023.

Comments: Accepted by ACMMM2023

arXiv:2308.13666 [pdf, other]

A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run

Authors: C. Fletcher, J. Wood, R. Hamburg, P. Veres, C. M. Hui, E. Bissaldi, M. S. Briggs, E. Burns, W. H. Cleveland, M. M. Giles, A. Goldstein, B. A. Hristov, D. Kocevski, S. Lesage, B. Mailyan, C. Malacaria, S. Poolakkil, A. von Kienlin, C. A. Wilson-Hodge, The Fermi Gamma-ray Burst Monitor Team, M. Crnogorčević, J. DeLaunay, A. Tohuvavohu, R. Caputo, S. B. Cenko , et al. (1674 additional authors not shown)

Abstract: We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses,… ▽ More We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses, the Targeted Search and the Untargeted Search, we investigate whether there are any coincident GRBs associated with the GWs. We also search the Swift-BAT rate data around the GW times to determine whether a GRB counterpart is present. No counterparts are found. Using both the Fermi-GBM Targeted Search and the Swift-BAT search, we calculate flux upper limits and present joint upper limits on the gamma-ray luminosity of each GW. Given these limits, we constrain theoretical models for the emission of gamma-rays from binary black hole mergers. △ Less

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2308.10025 [pdf, other]

I3: Intent-Introspective Retrieval Conditioned on Instructions

Authors: Kaihang Pan, Juncheng Li, Wenjie Wang, Hao Fei, Hongye Song, Wei Ji, Jun Lin, Xiaozhong Liu, Tat-Seng Chua, Siliang Tang

Abstract: Recent studies indicate that dense retrieval models struggle to perform well on a wide variety of retrieval tasks that lack dedicated training data, as different retrieval tasks often entail distinct search intents. To address this challenge, in this work we leverage instructions to flexibly describe retrieval intents and introduce I3, a unified retrieval system that performs Intent-Introspective… ▽ More Recent studies indicate that dense retrieval models struggle to perform well on a wide variety of retrieval tasks that lack dedicated training data, as different retrieval tasks often entail distinct search intents. To address this challenge, in this work we leverage instructions to flexibly describe retrieval intents and introduce I3, a unified retrieval system that performs Intent-Introspective retrieval across various tasks, conditioned on Instructions without any task-specific training. I3 innovatively incorporates a pluggable introspector in a parameter-isolated manner to comprehend specific retrieval intents by jointly reasoning over the input query and instruction, and seamlessly integrates the introspected intent into the original retrieval model for intent-aware retrieval. Furthermore, we propose progressively-pruned intent learning. It utilizes extensive LLM-generated data to train I3 phase-by-phase, embodying two key designs: progressive structure pruning and drawback extrapolation-based data refinement. Extensive experiments show that in the BEIR benchmark, I3 significantly outperforms baseline methods designed with task-specific retrievers, achieving state-of-the-art zero-shot performance without any task-specific tuning. △ Less

Submitted 25 April, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

Comments: Accepted by SIGIR 2024

arXiv:2308.04793 [pdf, other]

Cosmic ray calorimetry in star-forming galaxy populations and implications for their contribution to the extra-galactic $γ$-ray background

Authors: Ellis R. Owen, Albert K. H. Kong, Kuo-Chuan Pan

Abstract: Star-forming galaxies (SFGs) have been established as an important source population in the extra-galactic $γ$-ray background (EGB). Their intensive star-formation creates an abundance of environments able to accelerate particles, and these build-up a rich sea of cosmic rays (CRs). Above GeV energies, CR protons can undergo hadronic interactions with their environment to produce $γ$-rays. SFGs can… ▽ More Star-forming galaxies (SFGs) have been established as an important source population in the extra-galactic $γ$-ray background (EGB). Their intensive star-formation creates an abundance of environments able to accelerate particles, and these build-up a rich sea of cosmic rays (CRs). Above GeV energies, CR protons can undergo hadronic interactions with their environment to produce $γ$-rays. SFGs can operate as CR proton "calorimeters", where a large fraction of the CR energy is converted to $γ$-rays. However, CRs also deposit energy and momentum to modify the thermal and hydrodynamic conditions of the gas in SFGs, and can become a powerful driver of outflows. Such outflows are ubiquitous among some types of SFGs, and have the potential to severely degrade their CR proton calorimetry. This diminishes their contribution to the EGB. In this work, we adopt a self-consistent treatment of particle transport in outflows from SFGs to assess their calorimetry. We use 1D numerical treatments of galactic outflows driven by CRs and thermal gas pressure, accounting for the dynamical effects and interactions of CRs. We show the impact CR-driven flows have on the relative contribution of SFG populations to the EGB, and investigate the properties of SFGs that contribute most strongly. △ Less

Submitted 9 August, 2023; originally announced August 2023.

Comments: 8 pages, 4 figures, 1 table. Presented at the 38th International Cosmic Ray Conference (ICRC2023)

Journal ref: PoS (ICRC2023), 554

arXiv:2308.04152 [pdf, other]

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

Authors: Juncheng Li, Kaihang Pan, Zhiqi Ge, Minghe Gao, Wei Ji, Wenqiao Zhang, Tat-Seng Chua, Siliang Tang, Hanwang Zhang, Yueting Zhuang

Abstract: Recent advancements in Multimodal Large Language Models (MLLMs) have been utilizing Visual Prompt Generators (VPGs) to convert visual features into tokens that LLMs can recognize. This is achieved by training the VPGs on millions of image-caption pairs, where the VPG-generated tokens of images are fed into a frozen LLM to generate the corresponding captions. However, this image-captioning based tr… ▽ More Recent advancements in Multimodal Large Language Models (MLLMs) have been utilizing Visual Prompt Generators (VPGs) to convert visual features into tokens that LLMs can recognize. This is achieved by training the VPGs on millions of image-caption pairs, where the VPG-generated tokens of images are fed into a frozen LLM to generate the corresponding captions. However, this image-captioning based training objective inherently biases the VPG to concentrate solely on the primary visual contents sufficient for caption generation, often neglecting other visual details. This shortcoming results in MLLMs' underperformance in comprehending demonstrative instructions consisting of multiple, interleaved, and multimodal instructions that demonstrate the required context to complete a task. To address this issue, we introduce a generic and lightweight Visual Prompt Generator Complete module (VPG-C), which can infer and complete the missing details essential for comprehending demonstrative instructions. Further, we propose a synthetic discriminative training strategy to fine-tune VPG-C, eliminating the need for supervised demonstrative instructions. As for evaluation, we build DEMON, a comprehensive benchmark for demonstrative instruction understanding. Synthetically trained with the proposed strategy, VPG-C achieves significantly stronger zero-shot performance across all tasks of DEMON. Further evaluation on the MME and OwlEval benchmarks also demonstrate the superiority of VPG-C. Our benchmark, code, and pre-trained models are available at https://github.com/DCDmllm/Cheetah. △ Less

Submitted 25 May, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

Comments: Accepted by ICLR 2024 (Spotlight)

arXiv:2308.03822 [pdf, other]

Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi , et al. (1750 additional authors not shown)

Abstract: Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect… ▽ More Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: 24 pages, 5 figures

Report number: LIGO-P2300080

arXiv:2307.16180 [pdf, other]

Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models

Authors: Keyu Pan, Yawen Zeng

Abstract: The field of large language models (LLMs) has made significant progress, and their knowledge storage capacity is approaching that of human beings. Furthermore, advanced techniques, such as prompt learning and reinforcement learning, are being employed to address ethical concerns and hallucination problems associated with LLMs, bringing them closer to aligning with human values. This situation natu… ▽ More The field of large language models (LLMs) has made significant progress, and their knowledge storage capacity is approaching that of human beings. Furthermore, advanced techniques, such as prompt learning and reinforcement learning, are being employed to address ethical concerns and hallucination problems associated with LLMs, bringing them closer to aligning with human values. This situation naturally raises the question of whether LLMs with human-like abilities possess a human-like personality? In this paper, we aim to investigate the feasibility of using the Myers-Briggs Type Indicator (MBTI), a widespread human personality assessment tool, as an evaluation metric for LLMs. Specifically, extensive experiments will be conducted to explore: 1) the personality types of different LLMs, 2) the possibility of changing the personality types by prompt engineering, and 3) How does the training dataset affect the model's personality. Although the MBTI is not a rigorous assessment, it can still reflect the similarity between LLMs and human personality. In practice, the MBTI has the potential to serve as a rough indicator. Our codes are available at https://github.com/HarderThenHarder/transformers_tasks/tree/main/LLM/llms_mbti. △ Less

Submitted 30 July, 2023; originally announced July 2023.

arXiv:2306.01919 [pdf, other]

Characterizing the Directionality of Gravitational Wave Emission from Matter Motions within Core-collapse Supernovae

Authors: Michael A. Pajkos, Steven J. VanCamp, Kuo-Chuan Pan, David Vartanyan, Nils Deppe, Sean M. Couch

Abstract: We analyze the directional dependence of the gravitational wave (GW) emission from 15 3D neutrino radiation hydrodynamic simulations of core-collapse supernovae. Using spin weighted spherical harmonics, we develop a new analytic technique to quantify the evolution of the distribution of GW emission over all angles. We construct a physics-informed toy model that can be used to approximate GW distri… ▽ More We analyze the directional dependence of the gravitational wave (GW) emission from 15 3D neutrino radiation hydrodynamic simulations of core-collapse supernovae. Using spin weighted spherical harmonics, we develop a new analytic technique to quantify the evolution of the distribution of GW emission over all angles. We construct a physics-informed toy model that can be used to approximate GW distributions for general ellipsoid-like systems, and use it to provide closed form expressions for the distribution of GWs for different CCSN phases. Using these toy models, we approximate the PNS dynamics during multiple CCSN stages and obtain similar GW distributions to simulation outputs. When considering all viewing angles, we apply this new technique to quantify the evolution of preferred directions of GW emission. For nonrotating cases, this dominant viewing angle drifts isotropically throughout the supernova, set by the dynamical timescale of the protoneutron star. For rotating cases, during core bounce and the following tens of ms, the strongest GW signal is observed along the equator. During the accretion phase, comparable -- if not stronger -- GW amplitudes are generated along the axis of rotation, which can be enhanced by the low T/|W| instability. We show two dominant factors influencing the directionality of GW emission are the degree of initial rotation and explosion morphology. Lastly, looking forward, we note the sensitive interplay between GW detector site and supernova orientation, along with its effect on detecting individual polarization modes. △ Less

Submitted 25 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

Comments: 34 pages, 17 Figures, accepted in ApJ

arXiv:2305.07065 [pdf, other]

doi 10.3847/1538-4357/acd4bd

Stellar Characterization and Radius Inflation of Hyades M Dwarf Stars From the APOGEE Survey

Authors: Fábio Wanderley, Katia Cunha, Diogo Souto, Verne V. Smith, Lyra Cao, Marc Pinsonneault, C. Allende Prieto, Kevin Covey, Thomas Masseron, Ilaria Pascucci, Keivan G. Stassun, Ryan Terrien, Galen J. Bergsten, Dmitry Bizyaev, José G. Fernández-Trincado, Henrik Jönsson, Sten Hasselquist, Jon A. Holtzman, Richard R. Lane, Suvrath Mahadevan, Steven R. Majewski, Dante Minniti, Kaike Pan, Javier Serna, Jennifer Sobeck , et al. (1 additional authors not shown)

Abstract: We present a spectroscopic analysis of a sample of 48 M dwarf stars ($0.2 M_{\odot}< M < 0.6 M_{\odot}$) from the Hyades open cluster using high-resolution H-band spectra from the SDSS/APOGEE survey. Our methodology adopts spectrum synthesis with LTE MARCS model atmospheres, along with the APOGEE DR17 line list, to determine effective temperatures, surface gravities, metallicities, and projected r… ▽ More We present a spectroscopic analysis of a sample of 48 M dwarf stars ($0.2 M_{\odot}< M < 0.6 M_{\odot}$) from the Hyades open cluster using high-resolution H-band spectra from the SDSS/APOGEE survey. Our methodology adopts spectrum synthesis with LTE MARCS model atmospheres, along with the APOGEE DR17 line list, to determine effective temperatures, surface gravities, metallicities, and projected rotational velocities. The median metallicity obtained for the Hyades M dwarfs is [M/H]= 0.09$\pm$0.03 dex, indicating a small internal uncertainty and good agreement with optical results for Hyades red-giants. Overall, the median radii are larger than predicted by stellar models by 1.6$\pm$2.3\% and 2.4$\pm$2.3\%, relative to a MIST and DARTMOUTH isochrone, respectively. We emphasize, however, that these isochrones are different and the fractional radius inflation for the fully- and partially-convective regimes have distinct behaviors depending on the isochrone. Using a MIST isochrone there is no evidence of radius inflation for the fully convective stars, while for the partially convective M-dwarfs the radii are inflated by 2.7$\pm$2.1\%, which is in agreement with predictions from models that include magnetic fields. For the partially-convective stars, rapid-rotators present on average higher inflation levels than slow-rotators. The comparison with SPOTS isochrone models indicates that the derived M dwarf radii can be explained by accounting for stellar spots in the photosphere of the stars, with 76\% of the studied M dwarfs having up to 20\% spot coverage, and the most inflated stars with $\sim$20 -- 40\% spot coverage. △ Less

Submitted 11 May, 2023; originally announced May 2023.

Comments: Accepted for publication by The Astrophysical Journal (ApJ)

arXiv:2305.03996 [pdf, ps, other]

Optimized Dimensionality Reduction for Moment-based Distributionally Robust Optimization

Authors: Shiyi Jiang, Jianqiang Cheng, Kai Pan, Zuo-Jun Max Shen

Abstract: Moment-based distributionally robust optimization (DRO) provides an optimization framework to integrate statistical information with traditional optimization approaches. Under this framework, one assumes that the underlying joint distribution of random parameters runs in a distributional ambiguity set constructed by moment information and makes decisions against the worst-case distribution within… ▽ More Moment-based distributionally robust optimization (DRO) provides an optimization framework to integrate statistical information with traditional optimization approaches. Under this framework, one assumes that the underlying joint distribution of random parameters runs in a distributional ambiguity set constructed by moment information and makes decisions against the worst-case distribution within the set. Although most moment-based DRO problems can be reformulated as semidefinite programming (SDP) problems that can be solved in polynomial time, solving high-dimensional SDPs is still time-consuming. Unlike existing approximation approaches that first reduce the dimensionality of random parameters and then solve the approximated SDPs, we propose an optimized dimensionality reduction (ODR) approach. We first show that the ranks of the matrices in the SDP reformulations are small, by which we are then motivated to integrate the dimensionality reduction of random parameters with the subsequent optimization problems. Such integration enables two outer and one inner approximations of the original problem, all of which are low-dimensional SDPs that can be solved efficiently. More importantly, these approximations can theoretically achieve the optimal value of the original high-dimensional SDPs. As these approximations are nonconvex SDPs, we develop modified Alternating Direction Method of Multipliers (ADMM) algorithms to solve them efficiently. We demonstrate the effectiveness of our proposed ODR approach and algorithm in solving two practical problems. Numerical results show significant advantages of our approach on the computational time and solution quality over the three best possible benchmark approaches. Our approach can obtain an optimal or near-optimal (mostly within 0.1%) solution and reduce the computational time by up to three orders of magnitude. △ Less

Submitted 31 October, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

arXiv:2304.08393 [pdf, other]

Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin , et al. (1670 additional authors not shown)

Abstract: Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated… ▽ More Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated signals from strong lensing by 1) performing targeted searches for subthreshold signals, 2) calculating the degree of overlap amongst the intrinsic parameters and sky location of pairs of signals, 3) comparing the similarities of the spectrograms amongst pairs of signals, and 4) performing dual-signal Bayesian analysis that takes into account selection effects and astrophysical knowledge. We also search for distortions to the gravitational waveform caused by 1) frequency-independent phase shifts in strongly lensed images, and 2) frequency-dependent modulation of the amplitude and phase due to point masses. None of these searches yields significant evidence for lensing. Finally, we use the non-detection of gravitational-wave lensing to constrain the lensing rate based on the latest merger-rate estimates and the fraction of dark matter composed of compact objects. △ Less

Submitted 17 April, 2023; originally announced April 2023.

Comments: 28 pages, 11 figures

Report number: LIGO-P2200031

arXiv:2304.02662 [pdf, other]

doi 10.3847/1538-4357/acc9af

Exploring the Observability of Surviving Companions of Stripped-Envelope Supernovae: A Case Study of Type Ic SN 2020oi

Authors: Hsin-Pei Chen, Shiau-Jie Rau, Kuo-Chuan Pan

Abstract: Stripped-envelope supernovae (SE SNe) were considered as the explosions of single massive stars with strong stellar winds, while later observations favor binary origins. One direct evidence to support the binary origins is to find the surviving companions of SE SNe since previous numerical studies suggested that the binary companion should survive the supernova impact and could be detectable. Rece… ▽ More Stripped-envelope supernovae (SE SNe) were considered as the explosions of single massive stars with strong stellar winds, while later observations favor binary origins. One direct evidence to support the binary origins is to find the surviving companions of SE SNe since previous numerical studies suggested that the binary companion should survive the supernova impact and could be detectable. Recently, Gagliano et al. (2022) reported that the nearby Type Ic SN 2020oi in M100 (~17.1 Mpc) resulted from a binary system based on the HST photometric and spectroscopic observation. Based on the suggested binary properties of SN 2020oi, we conduct two-dimensional hydrodynamics simulations of supernova-companion interactions and the subsequent post-impact evolution of the companion. Our results suggest that a surviving companion becomes brighter in two orders of magnitude and temporarily redder after the SN impact. The companion might be detectable with the JWST NIRCam short wavelength channel in a few years. Furthermore, the predicted magnitudes of surviving companions show a significant magnitude gradient around the peak. This could be another indicator to identify the surviving companion from a SE SN. △ Less

Submitted 5 April, 2023; originally announced April 2023.

Comments: 14 pages, 9 figures, accepted by ApJ

arXiv:2303.12314 [pdf, other]

Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization

Authors: Kaihang Pan, Juncheng Li, Hongye Song, Jun Lin, Xiaozhong Liu, Siliang Tang

Abstract: Prompt tuning is a parameter-efficient method, which learns soft prompts and conditions frozen language models to perform specific downstream tasks. Though effective, prompt tuning under few-shot settings on the one hand heavily relies on a good initialization of soft prompts. On the other hand, it can easily overfit to few-shot training samples, thereby undermining generalizability. Existing work… ▽ More Prompt tuning is a parameter-efficient method, which learns soft prompts and conditions frozen language models to perform specific downstream tasks. Though effective, prompt tuning under few-shot settings on the one hand heavily relies on a good initialization of soft prompts. On the other hand, it can easily overfit to few-shot training samples, thereby undermining generalizability. Existing works leverage pre-training or supervised meta-learning to initialize soft prompts but they fail to data-efficiently generalize to unseen downstream tasks. To address the above problems, this paper proposes a novel Self-sUpervised meta-Prompt learning framework with MEta-gradient Regularization for few-shot generalization (SUPMER). SUPMER leverages self-supervised meta-learning with a diverse set of well-designed meta-training tasks to learn a universal prompt initialization for efficient adaptation using only unlabeled data. Additionally, it jointly meta-learns a gradient regularization function to transform raw gradients into a domain-generalizable direction, thus alleviating the problem of overfitting. Extensive experiments show that SUPMER achieves better performance for different few-shot downstream tasks, and also exhibits a stronger domain generalization ability. The code for SUPMER will be available at https://github.com/beepkh/SUPMER. △ Less

Submitted 23 October, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

Comments: Accepted by EMNLP 2023 (Findings)

arXiv:2302.03676 [pdf, other]

doi 10.3847/1538-4365/acdc9f

Open data from the third observing run of LIGO, Virgo, KAGRA and GEO

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, A. Allocca , et al. (1719 additional authors not shown)

Abstract: The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasti… ▽ More The global network of gravitational-wave observatories now includes five detectors, namely LIGO Hanford, LIGO Livingston, Virgo, KAGRA, and GEO 600. These detectors collected data during their third observing run, O3, composed of three phases: O3a starting in April of 2019 and lasting six months, O3b starting in November of 2019 and lasting five months, and O3GK starting in April of 2020 and lasting 2 weeks. In this paper we describe these data and various other science products that can be freely accessed through the Gravitational Wave Open Science Center at https://gwosc.org. The main dataset, consisting of the gravitational-wave strain time series that contains the astrophysical signals, is released together with supporting data useful for their analysis and documentation, tutorials, as well as analysis software packages. △ Less

Submitted 7 February, 2023; originally announced February 2023.

Comments: 27 pages, 3 figures

Report number: LIGO-P2200316

arXiv:2301.07688 [pdf, other]

doi 10.3847/1538-4365/acda98

The Eighteenth Data Release of the Sloan Digital Sky Surveys: Targeting and First Spectra from SDSS-V

Authors: Andrés Almeida, Scott F. Anderson, Maria Argudo-Fernández, Carles Badenes, Kat Barger, Jorge K. Barrera-Ballesteros, Chad F. Bender, Erika Benitez, Felipe Besser, Dmitry Bizyaev, Michael R. Blanton, John Bochanski, Jo Bovy, William Nielsen Brandt, Joel R. Brownstein, Johannes Buchner, Esra Bulbul, Joseph N. Burchett, Mariana Cano Díaz, Joleen K. Carlberg, Andrew R. Casey, Vedant Chandra, Brian Cherinka, Cristina Chiappini, Abigail A. Coker , et al. (129 additional authors not shown)

Abstract: The eighteenth data release of the Sloan Digital Sky Surveys (SDSS) is the first one for SDSS-V, the fifth generation of the survey. SDSS-V comprises three primary scientific programs, or "Mappers": Milky Way Mapper (MWM), Black Hole Mapper (BHM), and Local Volume Mapper (LVM). This data release contains extensive targeting information for the two multi-object spectroscopy programs (MWM and BHM),… ▽ More The eighteenth data release of the Sloan Digital Sky Surveys (SDSS) is the first one for SDSS-V, the fifth generation of the survey. SDSS-V comprises three primary scientific programs, or "Mappers": Milky Way Mapper (MWM), Black Hole Mapper (BHM), and Local Volume Mapper (LVM). This data release contains extensive targeting information for the two multi-object spectroscopy programs (MWM and BHM), including input catalogs and selection functions for their numerous scientific objectives. We describe the production of the targeting databases and their calibration- and scientifically-focused components. DR18 also includes ~25,000 new SDSS spectra and supplemental information for X-ray sources identified by eROSITA in its eFEDS field. We present updates to some of the SDSS software pipelines and preview changes anticipated for DR19. We also describe three value-added catalogs (VACs) based on SDSS-IV data that have been published since DR17, and one VAC based on the SDSS-V data in the eFEDS field. △ Less

Submitted 6 July, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

Comments: Accepted to ApJS

Showing 1–50 of 497 results for author: Pan, K