arXiv:2406.19969 [pdf, other]

Enhancing Terrestrial Net Primary Productivity Estimation with EXP-CASA: A Novel Light Use Efficiency Model Approach

Authors: Guanzhou Chen, Kaiqi Zhang, Xiaodong Zhang, Hong Xie, Haobo Yang, Xiaoliang Tan, Tong Wang, Yule Ma, Qing Wang, Jinzhou Cao, Weihong Cui

Abstract: The Light Use Efficiency model, epitomized by the CASA model, is extensively applied in the quantitative estimation of vegetation Net Primary Productivity. However, the classic CASA model is marked by significant complexity: the estimation of environmental stress parameters, in particular, necessitates multi-source observation data, adding to the complexity and uncertainty of the model's operation. Additionally, the saturation effect of the Normalized Difference Vegetation Index (NDVI), a key variable in the CASA model, weakened the accuracy of CASA's NPP predictions in densely vegetated areas. To address these limitations, this study introduces the Exponential-CASA (EXP-CASA) model. The EXP-CASA model effectively improves the CASA model by using novel functions for estimating the fraction of absorbed photosynthetically active radiation (FPAR) and environmental stress, by utilizing long-term observational data from FLUXNET and MODIS surface reflectance data. In a comparative analysis of NPP estimation accuracy among four different NPP products, EXP-CASA ($R^2 = 0.68$, $\mathrm{RMSE} = 1.1\,\mathrm{gC \cdot m^{-2} \cdot d^{-1}}$) outperforms others, followed by GLASS-NPP, and lastly MODIS-NPP and classic CASA. Additionally, this research assesses the EXP-CASA model's adaptability to various vegetation indices, evaluates the sensitivity and stability of its parameters over time, and compares its accuracy against other leading NPP estimation products. The findings reveal that the EXP-CASA model exhibits strong adaptability to diverse vegetation indices and stability of model parameters over time series. By introducing a novel estimation approach that optimizes model construction, the EXP-CASA model remarkably improves the accuracy of NPP estimations and paves the way for global-scale, consistent, and continuous assessment of vegetation NPP.

Submitted 28 June, 2024; originally announced June 2024.

arXiv:2406.17248 [pdf, other]

MindSpore Quantum: A User-Friendly, High-Performance, and AI-Compatible Quantum Computing Framework

Authors: Xusheng Xu, Jiangyu Cui, Zidong Cui, Runhong He, Qingyu Li, Xiaowei Li, Yanling Lin, Jiale Liu, Wuxin Liu, Jiale Lu, Maolin Luo, Chufan Lyu, Shijie Pan, Mosharev Pavel, Runqiu Shu, Jialiang Tang, Ruoqian Xu, Shu Xu, Kang Yang, Fan Yu, Qingguo Zeng, Haiying Zhao, Qiang Zheng, Junyuan Zhou, Xu Zhou, et al. (14 additional authors not shown)

Abstract: We introduce MindSpore Quantum, a pioneering hybrid quantum-classical framework with a primary focus on the design and implementation of noisy intermediate-scale quantum (NISQ) algorithms. Leveraging the robust support of MindSpore, an advanced open-source deep learning training/inference framework, MindSpore Quantum exhibits exceptional efficiency in the design and training of variational quantum algorithms on both CPU and GPU platforms, delivering remarkable performance. Furthermore, this framework places a strong emphasis on enhancing the operational efficiency of quantum algorithms when executed on real quantum hardware. This encompasses the development of algorithms for quantum circuit compilation and qubit mapping, crucial components for achieving optimal performance on quantum processors. In addition to the core framework, we introduce QuPack, a meticulously crafted quantum computing acceleration engine. QuPack significantly accelerates the simulation speed of MindSpore Quantum, particularly in variational quantum eigensolver (VQE), quantum approximate optimization algorithm (QAOA), and tensor network simulations, providing astonishing speed. This combination of cutting-edge technologies empowers researchers and practitioners to explore the frontiers of quantum computing with unprecedented efficiency and performance.

Submitted 10 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.17012 [pdf, other]

The HYENAS project: a prediction for the X-ray undetected galaxy groups

Authors: Weiguang Cui, Fred Jennings, Romeel Dave, Arif Babul, Ghassem Gozaliasl

Abstract: Galaxy groups contain the majority of bound mass with a significant portion of baryons due to the combination of halo mass and abundance (Cui 2024). Hence they serve as a crucial missing piece in the puzzle of galaxy formation and the evolution of large-scale structures in the Universe. In observations, mass-complete group catalogues are normally derived from galaxy redshift surveys detected through various three-dimensional group-finding algorithms. Confirming the reality of such groups, particularly in the X-rays, is critical for ensuring robust studies of galaxy evolution in these environments. Recent works have reported numerous optical groups that are X-ray undetected (see, e.g., Popesso et al. 2024), sparking debates regarding the reasons for the unexpectedly low hot gas fraction in galaxy groups. To address this issue, we utilise zoomed-in simulations of galaxy groups from the novel HYENAS project to explore the range of hot gas fractions within galaxy groups and investigate the intrinsic factors behind the observed variability in X-ray emission. We find that the halo formation time can play a critical role -- we see that groups in halos that formed earlier exhibit up to an order of magnitude brighter X-ray luminosities compared to those formed later. This suggests that undetected X-ray groups are preferentially late-formed halos and highlights the connection between gas fraction and halo formation time in galaxy groups. Accounting for these biases in galaxy group identification is essential for advancing our understanding of galaxy formation and achieving precision in cosmological studies.

Submitted 24 June, 2024; originally announced June 2024.

Comments: 10 pages, 3 figures. Comments are welcome

arXiv:2406.16103 [pdf, other]

Populating Galaxies Into Halos Via Machine Learning on the Simba Simulation

Authors: Pratyush Kumar Das, Romeel Davé, Weiguang Cui

Abstract: We present machine learning (ML)-based pipelines designed to populate galaxies into dark matter halos from N-body simulations. These pipelines predict galaxy stellar mass ($M_*$), star formation rate (SFR), atomic and molecular gas contents, and metallicities, and can be easily extended to other galaxy properties and simulations. Our approach begins by categorizing galaxies into central and satellite classifications, followed by their ML classification into quenched (Q) and star-forming (SF) galaxies. We then develop regressors specifically for the SF galaxies within both central and satellite subgroups. We train the model on the $(100\mathrm{h^{-1}Mpc})^3$ Simba galaxy formation simulation at $z=0$. Our pipeline yields robust predictions for stellar mass and metallicity and offers significant improvements for SFR and gas properties compared to previous works, achieving an unbiased scatter of less than 0.2 dex around true Simba values for the halo-$M_{\rm HI}$ relation of central galaxies. We also show the effectiveness of the ML-based pipelines at $z=1,2$. Interestingly, we find that training on fraction-based properties (e.g. $M_{\rm HI}$/$M_{*}$) and then multiplying by the ML-predicted $M_{*}$ yields improved predictions versus directly training on the property value, for many quantities across redshifts. However, we find that the ML-predicted scatter around the mean is lower than the true scatter, leading to artificially suppressed distribution functions at high values. To alleviate this, we add a "ML scatter bias", finely tuned to recover the true distribution functions, critical for accurate predictions of integrated quantities such as $\rm{HI}$ intensity maps.

Submitted 23 June, 2024; originally announced June 2024.

Comments: 21 pages, 13 figures, Submitted to MNRAS

arXiv:2406.15555 [pdf, other]

doi 10.1093/mnras/stae1566

Reconsidering the dynamical states of galaxy clusters using PCA and UMAP

Authors: Roan Haggar, Federico De Luca, Marco De Petris, Elizaveta Sazonova, James E. Taylor, Alexander Knebe, Meghan E. Gray, Frazer R. Pearce, Ana Contreras-Santos, Weiguang Cui, Ulrike Kuchner, Robert A. Mostoghiu Paun, Chris Power

Abstract: Numerous metrics exist to quantify the dynamical state of galaxy clusters, both observationally and within simulations. Many of these correlate strongly with one another, but it is not clear whether all of these measures probe the same intrinsic properties. In this work, we use two different statistical approaches -- principal component analysis (PCA) and uniform manifold approximation and projection (UMAP) -- to investigate which dynamical properties of a cluster are in fact the best descriptors of its dynamical state. We use measurements taken directly from The Three Hundred suite of galaxy cluster simulations, as well as morphological properties calculated using mock X-ray and SZ maps of the same simulated clusters. We find that four descriptions of dynamical state naturally arise, and although correlations exist between these, a given cluster can be "dynamically relaxed" according to all, none, or some of these four descriptions. These results demonstrate that it is highly important for future observational and theoretical studies to consider in which sense clusters are dynamically relaxed. Cluster dynamical states are complex and multi-dimensional, and so it is not meaningful to classify them simply as "relaxed" and "unrelaxed" based on a single linear scale.

Submitted 21 June, 2024; originally announced June 2024.

Comments: 18 pages, 7 figures, 3 tables, accepted for publication in MNRAS

arXiv:2406.10410 [pdf]

doi 10.1029/2023JD039920

Changes in mesoscale convective system precipitation structures in response to a warming climate

Authors: Wenjun Cui, Thomas Galarneau, Kimberly Hoogewind

Abstract: Mesoscale convective systems (MCSs) are crucial components of the hydrological cycle and often produce flash floods. Given their impact, it is crucial to understand how they will change under a warming climate. This study uses a satellite- and radar-based MCS tracking algorithm on convection-permitting climate model simulations and examines changes in MCS properties and precipitation structures between historical and future simulations. An underestimation in MCS total precipitation is evident in historical simulation compared to observations, due to model's depiction of MCS precipitation area and summertime occurrence frequency. Under pseudo-global warming, increases in MCS frequency and total warm season precipitation are observed, most notably in the southern U.S. The precipitation intensity and precipitating area generated by future MCSs also rises and results in an increase in precipitation volume. MCS precipitation structures are further classified into convective core and stratiform regions to understand how change in these structures contributes to future rainfall changes. In a warmer climate, the stratiform region demonstrates minimal change in size, but increases in mean precipitation rate and mean maximum precipitation rate by 15% and 29% are noted, respectively. A more robust future response is observed in the convective core region, with its size, mean precipitation rate and mean maximum precipitation rate increasing significantly by 24%, 37% and 42%, respectively. Finally, by examining the environmental properties of MCS initial condition, future intensification of convective rain may be attributed to a combined effect of substantial increases in atmospheric instability and moisture availability.

Submitted 14 June, 2024; originally announced June 2024.

arXiv:2406.09813 [pdf, other]

Diffuse X-ray Explorer: a high-resolution X-ray spectroscopic sky surveyor on the China Space Station

Authors: Hai Jin, Junjie Mao, Liubiao Chen, Naihui Chen, Wei Cui, Bo Gao, Jinjin Li, Xinfeng Li, Jiejia Liu, Jia Quan, Chunyang Jiang, Guole Wang, Le Wang, Qian Wang, Sifan Wang, Aimin Xiao, Shuo Zhang

Abstract: DIffuse X-ray Explorer (DIXE) is a proposed high-resolution X-ray spectroscopic sky surveyor on the China Space Station (CSS). DIXE will focus on studying hot baryons in the Milky Way. Galactic hot baryons like the X-ray emitting Milky Way halo and eROSITA bubbles are best observed in the sky survey mode with a large field of view. DIXE will take advantage of the orbital motion of the CSS to scan a large fraction of the sky. High-resolution X-ray spectroscopy, enabled by superconducting microcalorimeters based on the transition-edge sensor (TES) technology, will probe the physical properties (e.g., temperature, density, elemental abundances, kinematics) of the Galactic hot baryons. This will complement the high-resolution imaging data obtained with the eROSITA mission. Here we present the preliminary design of DIXE. The payload consists mainly of a detector assembly and a cryogenic cooling system. The key components of the detector assembly are a microcalorimeter array and frequency-domain multiplexing readout electronics. To provide a working temperature for the detector assembly, the cooling system consists of an adiabatic demagnetization refrigerator and a mechanical cryocooler system.

Submitted 14 June, 2024; originally announced June 2024.

Comments: 12 pages, 6 figures, the full version is published by Journal of Low Temperature Physics

arXiv:2406.09261 [pdf, other]

Non-Invertible Surface Defects in 2+1d QFTs from Half Spacetime Gauging

Authors: Wei Cui, Babak Haghighat, Lorenzo Ruggeri

Abstract: We study duality defects in 2+1d theories with $\mathbb{Z}^{(0)}_N\times\mathbb{Z}^{(1)}_N$ global symmetry and trivial mixed 't Hooft anomaly. By gauging these symmetries simultaneously in half of the spacetime, we define duality defects for theories that are self-dual under gauging. We calculate the fusion rules involving duality defects and show that they obey a fusion 2-category. We also construct the corresponding symmetry topological field theory, a four-dimensional BF theory on a slab which realizes the duality defects on the boundary upon shrinking the interval. Furthermore, we provide explicit examples of such duality defects in $U(1)\times U(1)$ gauge theories and in more general product theories. Finally, we find duality defects in non-Lagrangian theories obtained by compactification of 6d $\mathcal{N}=(2,0)$ SCFTs of type $A_{N-1}$ on various three-manifolds.

Submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.08698 [pdf, other]

Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

Abstract: In this work we search for signals generated by ultra-heavy dark matter in data from the Large High Altitude Air Shower Observatory (LHAASO). We look for possible gamma-ray emission from dark matter annihilation or decay in 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter, since they have a low astrophysical $\gamma$-ray background and a large amount of dark matter. Analyzing more than 700 days of LHAASO observational data, we detect no significant dark matter signal from 1 TeV to 1 EeV. Accordingly, we derive the most stringent constraints on the ultra-heavy dark matter annihilation cross-section up to EeV. The constraints on the lifetime of dark matter in decay mode are also derived.

Submitted 12 June, 2024; originally announced June 2024.

Comments: 17 pages, 12 figures, accepted by PRL

arXiv:2406.04761 [pdf, other]

\texttt{Simba}-\texttt{C}: the evolution of the thermal and chemical properties in the intragroup medium

Authors: Renier T. Hough, Zhiwei Shao, Weiguang Cui, S. Ilani Loubser, Arif Babul, Romeel Davé, Douglas Rennehan, Chiaki Kobayashi

Abstract: The newly updated \texttt{GIZMO} and \texttt{Simba} based simulation, \texttt{Simba-C}, with its new stellar feedback, chemical enrichment, and recalibrated AGN feedback, allows for a detailed study of the intragroup medium X-ray properties. We discuss the impact of various physical mechanisms, e.g. stellar and AGN feedback, and chemical enrichment, on the composition and the global scaling relations of nearby galaxy groups. We also study the evolution ($z=2$ to $0$) of the global properties for the $1\,\mathrm{keV}$ temperature groups. \texttt{Simba-C} shows improved consistent matching with the observations of all X-ray scaling relations compared to \texttt{Simba}. It is well known that AGN feedback has a significant influence on $L_{X,0.5-2.0}-T_{spec,corr}$, $S_{500/2500}-T_{spec,corr}$, and gas mass fractions, with our \texttt{Simba-C} results consistent with it. Our recalibrated AGN feedback strength also showed an additional improvement in gas entropy, which now aligns with CLoGS observations. The updated stellar feedback and chemical enrichment model is shown to play an important role in our understanding of the chemical abundance ratios and their evolution within galaxy groups. In particular, we find that \texttt{Simba-C} produces an increase in the amount of heavier elements (specifically Si and Fe) relative to O, compared to \texttt{Simba}.

Submitted 7 June, 2024; originally announced June 2024.

Comments: 20 pages, 13 figures, 2 tables, accepted by MNRAS on 6 June 2024

arXiv:2406.03829 [pdf, other]

How much do we know the halo mass function? Predictions beyond resolution

Authors: Weiguang Cui

Abstract: As a common gravitation virialized object in the standard $\Lambda$CDM cosmology, dark matter halo connects from the large-scale structure all the way down to galaxy and star formation. However, as the nature of dark matter particles is still unclear, the smallest halo that can be formed in the universe is still unknown. Based on some simple assumptions, this paper uses the \textsc{hmf} package to investigate different halo functions used to quantify its number and mass distributions -- the halo mass function and the integrated/differential mass function (IMF/DMF) respectively. The halo mass in this study extends from the galaxy cluster to the dark matter particle mass at the GeV scale. Surprisingly, different fitting functions for the HMF are in remarkable agreement, a scatter within 2 orders of magnitude, down to dark matter particle mass, of which the halo mass spans about 80 orders of magnitude and the HMF covers over 100 orders of magnitude. The DMF reveals an interesting and consistent peak at $\sim 10^{13}\,h^{-1}\,M_{\odot}$, which implies galaxy groups have the highest contribution to the total matter mass. Furthermore, the effects of cosmology parameters on these halo functions are also examined with the most massive halos, or these halo functions at the most massive halo mass end, more sensitive to them. Different behaviours of these halo functions due to the changes in cosmology parameters can be used to break the degeneracy between them.

Submitted 6 June, 2024; originally announced June 2024.

Comments: 10 pages, 7 figures. Comments are welcome

arXiv:2405.20142 [pdf, other]

MSSC-BiMamba: Multimodal Sleep Stage Classification and Early Diagnosis of Sleep Disorders with Bidirectional Mamba

Authors: Chao Zhang, Weirong Cui, Jingjing Guo

Abstract: Monitoring sleep states is essential for evaluating sleep quality and diagnosing sleep disorders. Traditional manual staging is time-consuming and prone to subjective bias, often resulting in inconsistent outcomes. Here, we developed an automated model for sleep staging and disorder classification to enhance diagnostic accuracy and efficiency. Considering the characteristics of polysomnography (PSG) multi-lead sleep monitoring, we designed a multimodal sleep state classification model, MSSC-BiMamba, that combines an Efficient Channel Attention (ECA) mechanism with a Bidirectional State Space Model (BSSM). The ECA module allows for weighting data from different sensor channels, thereby amplifying the influence of diverse sensor inputs. Additionally, the implementation of bidirectional Mamba (BiMamba) enables the model to effectively capture the multidimensional features and long-range dependencies of PSG data. The developed model demonstrated impressive performance on sleep stage classification tasks on both the ISRUC-S3 and ISRUC-S1 datasets, respectively containing data with healthy and unhealthy sleep patterns. Also, the model exhibited a high accuracy for sleep health prediction when evaluated on a combined dataset consisting of ISRUC and Sleep-EDF. Our model, which can effectively handle diverse sleep conditions, is the first to apply BiMamba to sleep staging with multimodal PSG data, showing substantial gains in computational and memory efficiency over traditional Transformer-style models. This method enhances sleep health management by making monitoring more accessible and extending advanced healthcare through innovative technology.

Submitted 30 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

Comments: 10 pages

arXiv:2405.17440 [pdf, other]

CataLM: Empowering Catalyst Design Through Large Language Models

Authors: Ludi Wang, Xueqing Chen, Yi Du, Yuanchun Zhou, Yang Gao, Wenjuan Cui

Abstract: The field of catalysis holds paramount importance in shaping the trajectory of sustainable development, prompting intensive research efforts to leverage artificial intelligence (AI) in catalyst design. Presently, the fine-tuning of open-source large language models (LLMs) has yielded significant breakthroughs across various domains such as biology and healthcare. Drawing inspiration from these advancements, we introduce CataLM (Catalytic Language Model), a large language model tailored to the domain of electrocatalytic materials. Our findings demonstrate that CataLM exhibits remarkable potential for facilitating human-AI collaboration in catalyst knowledge exploration and design. To the best of our knowledge, CataLM stands as the pioneering LLM dedicated to the catalyst domain, offering novel avenues for catalyst discovery and development.

Submitted 12 May, 2024; originally announced May 2024.

arXiv:2405.17239 [pdf, other]

The Three Hundred project: Estimating the dependence of gas filaments on the mass of galaxy clusters

Authors: Sara Santoni, Marco De Petris, Gustavo Yepes, Antonio Ferragamo, Matteo Bianconi, Meghan E. Gray, Ulrike Kuchner, Frazer R. Pearce, Weiguang Cui, Stefano Ettori

Abstract: Galaxy clusters are located in the densest areas of the universe and are intricately connected to larger structures through the filamentary network of the Cosmic Web. In this scenario, matter flows from areas of lower density to higher density. As a result, the properties of galaxy clusters are deeply influenced by the filaments that are attached to them, which are quantified by a parameter known as connectivity. We explore the dependence of gas-traced filaments connected to galaxy clusters on the mass and dynamical state of the cluster. Moreover, we evaluate the effectiveness of the cosmic web extraction procedure from the gas density maps of simulated cluster regions. Using the DisPerSE cosmic web finder, we identify filamentary structures from 3D gas particle distribution in 324 simulated regions of $30 \, h^{-1}$ Mpc side from The Three Hundred hydrodynamical simulation at redshifts z=0, 1, and 2. We estimate the connectivity at various apertures for $\sim3000$ groups and clusters spanning a mass range from $10^{13} \, h^{-1} \, M_{\odot}$ to $10^{15} \, h^{-1} \, M_{\odot}$. Relationships between connectivity and cluster properties like radius, mass, dynamical state and hydrostatic mass bias are explored. We show that the connectivity is strongly correlated with the mass of galaxy clusters, with more massive clusters being on average more connected. This finding aligns with previous studies in literature, both from observational and simulated data sets. Additionally, we observe a dependence of the connectivity on the aperture at which it is estimated. We find that connectivity decreases with cosmic time, while no dependencies on the dynamical state and hydrostatic mass bias of the cluster are found. Lastly, we observe a significant agreement between the connectivity measured from gas-traced and mock-galaxies-traced filaments in the simulation.

Submitted 27 May, 2024; originally announced May 2024.

Comments: 12 pages, 11 figures

arXiv:2405.15571 [pdf, other]

RCInvestigator: Towards Better Investigation of Anomaly Root Causes in Cloud Computing Systems

Authors: Shuhan Liu, Yunfan Zhou, Lu Ying, Yuan Tian, Jue Zhang, Shandan Zhou, Weiwei Cui, Qingwei Lin, Thomas Moscibroda, Haidong Zhang, Di Weng, Yingcai Wu

Abstract: Finding the root causes of anomalies in cloud computing systems quickly is crucial to ensure availability and efficiency since accurate root causes can guide engineers to take appropriate actions to address the anomalies and maintain customer satisfaction. However, it is difficult to investigate and identify the root causes based on large-scale and high-dimension monitoring data collected from complex cloud computing environments. Due to the inherently dynamic characteristics of cloud computing systems, the existing approaches in practice largely rely on manual analyses for flexibility and reliability, but massive unpredictable factors and high data complexity make the process time-consuming. Despite recent advances in automated detection and investigation approaches, the speed and quality of root cause analyses remain limited by the lack of expert involvement in these approaches. The limitations found in the current solutions motivate us to propose a visual analytics approach that facilitates the interactive investigation of the anomaly root causes in cloud computing systems. We identified three challenges, namely, a) modeling databases for the root cause investigation, b) inferring root causes from large-scale time series, and c) building comprehensible investigation results. In collaboration with domain experts, we addressed these challenges with RCInvestigator, a novel visual analytics system that establishes a tight collaboration between human and machine and assists experts in investigating the root causes of cloud computing system anomalies. We evaluated the effectiveness of RCInvestigator through two use cases based on real-world data and received positive feedback from experts.

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.13808 [pdf, other]

Hybrid Quantum-Classical Normalizing Flow

Authors: Anlei Zhang, Wei Cui

Abstract: With the rapid development of quantum computing technology, we have entered the era of noisy intermediate-scale quantum (NISQ) computers. Therefore, designing quantum algorithms that adapt to the hardware conditions of current NISQ devices and can preliminarily solve some practical problems has become the focus of researchers. In this paper, we focus on quantum generative models in the field of quantum machine learning, and propose a hybrid quantum-classical normalizing flow (HQCNF) model based on parameterized quantum circuits. Based on the ideas of classical normalizing flow models and the characteristics of parameterized quantum circuits, we cleverly design the form of the ansatz and the hybrid method of quantum and classical computing, and derive the form of the loss function in the case that quantum computing is involved. We test our model on the image generation problem. Experimental results show that our model is capable of generating images of good quality. Compared with other quantum generative models, such as quantum generative adversarial networks (QGAN), our model achieves a lower (better) Fréchet inception distance (FID) score, and compared with classical generative models, we can complete the image generation task with significantly fewer parameters. These results prove the advantage of our proposed model.

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.11826 [pdf, other]

Data quality control system and long-term performance monitor of the LHAASO-KM2A

Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To ensure the reliability of the LHAASO-KM2A data, a three-level quality control system has been established. It is used to monitor the status of detector units, stability of reconstructed parameters and the performance of the array based on observations of the Crab Nebula and Moon shadow. This paper will introduce the control system and its application on the LHAASO-KM2A data collected from August 2021 to July 2023. During this period, the pointing and angular resolution of the array were stable. From the observations of the Moon shadow and Crab Nebula, the results achieved using the two methods are consistent with each other. According to the observation of the Crab Nebula at energies from 25 TeV to 100 TeV, the time averaged pointing errors are estimated to be $-0.003^{\circ} \pm 0.005^{\circ}$ and $0.001^{\circ} \pm 0.006^{\circ}$ in the R.A. and Dec directions, respectively.

Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: 15 pages, 9 figures

arXiv:2405.11299 [pdf, other]

The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving

Authors: Pai Zeng, Zhenyu Ning, Jieru Zhao, Weihao Cui, Mengwei Xu, Liwei Guo, Xusheng Chen, Yizhou Shan

Abstract: We survey the large language model (LLM) serving area to understand the intricate dynamics between cost-efficiency and accuracy, which is magnified by the growing need for longer contextual understanding when deploying models at a massive scale. Our findings reveal that works in this space optimize along three distinct but conflicting goals: improving serving context length (C), improving serving accuracy (A), and improving serving performance (P). Drawing inspiration from the CAP theorem in databases, we propose a CAP principle for LLM serving, which suggests that any optimization can improve at most two of these three goals simultaneously. Our survey categorizes existing works within this framework. We find the definition and continuity of user-perceived measurement metrics are crucial in determining whether a goal has been met, akin to prior CAP databases in the wild. We recognize the CAP principle for LLM serving as a guiding principle, rather than a formal theorem, to inform designers of the inherent and dynamic trade-offs in serving models. As serving accuracy and performance have been extensively studied, this survey focuses on works that extend serving context length and address the resulting challenges.

Submitted 26 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.07691 [pdf, other]

Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: The first source catalog of the Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source has been carried out. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) is compatible with NGC 4278 within $\sim0.03$ degree. Variation analysis shows an indication of the variability at a few months level in the TeV band, which is consistent with low frequency observations. Based on these observations, we report the detection of TeV $γ$-ray emissions from this low-luminosity AGN NGC 4278. The observations by LHAASO-WCDA during the active period have a significance level of 8.8\,$σ$ with best-fit photon spectral index $\varGamma=2.56\pm0.14$ and a flux $f_{1-10\,\rm{TeV}}=(7.0\pm1.1_{\rm{sta}}\pm0.35_{\rm{syst}})\times10^{-13}\,\rm{photons\,cm^{-2}\,s^{-1}}$, or approximately $5\%$ of the Crab Nebula. The discovery of VHE from NGC 4278 indicates that the compact, weak radio jet can efficiently accelerate particles and emit TeV photons.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 11 pages, 5 figures

arXiv:2405.07559 [pdf, other]

doi 10.1007/s10909-024-03131-z

Preliminary Design of Detector Assembly for DIXE

Authors: Jiejia Liu, Sifan Wang, Hai Jin, Qian Wang, Wei Cui

Abstract: Diffuse X-ray Explorer (DIXE) is a proposed X-ray spectroscopic survey experiment for the China Space Station. Its detector assembly (DA) contains the transition edge sensor (TES) microcalorimeter and readout electronics based on the superconducting quantum interference device (SQUID) on the cold stage. The cold stage is thermally connected to the ADR stage, and a Kevlar suspension is used to stabilize and isolate it from the 4 K environment. TES and SQUID are both sensitive to the magnetic field, so a hybrid shielding structure consisting of an outer Cryoperm shield and an inner niobium shield is used to attenuate the magnetic field. In addition, IR/optical/UV photons can produce shot noise and thus degrade the energy resolution of the TES microcalorimeter. A blocking filter assembly is designed to minimize the effects. In it, five filters are mounted at different temperature stages, reducing the probability of IR/optical/UV photons reaching the detector through multiple reflections between filters and absorption. This paper will describe the preliminary design of the detector assembly and its optimization.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 13 pages, 6 figures. Submitted version, the full version is published by Journal of Low Temperature Physics

arXiv:2404.17489 [pdf, other]

Tabular Data Contrastive Learning via Class-Conditioned and Feature-Correlation Based Augmentation

Authors: Wei Cui, Rasa Hosseinzadeh, Junwei Ma, Tongzi Wu, Yi Sui, Keyvan Golestan

Abstract: Contrastive learning is a model pre-training technique that first creates similar views of the original data, and then encourages the data and its corresponding views to be close in the embedding space. Contrastive learning has witnessed success in image and natural language data, thanks to the domain-specific augmentation techniques that are both intuitive and effective. Nonetheless, in the tabular domain, the predominant augmentation technique for creating views is through corrupting tabular entries via swapping values, which is not as sound or effective. We propose a simple yet powerful improvement to this augmentation technique: corrupting tabular data conditioned on class identity. Specifically, when corrupting a specific tabular entry from an anchor row, instead of randomly sampling a value in the same feature column from the entire table uniformly, we only sample from rows that are identified to be within the same class as the anchor row. We assume the semi-supervised learning setting, and adopt the pseudo labeling technique for obtaining class identities over all table rows. We also explore the novel idea of selecting features to be corrupted based on feature correlation structures. Extensive experiments show that the proposed approach consistently outperforms the conventional corruption method for tabular data classification tasks. Our code is available at https://github.com/willtop/Tabular-Class-Conditioned-SSL.

Submitted 30 April, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

Comments: 14 pages, 4 algorithms, 3 figures, 5 tables
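
The class-conditioned corruption described in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation (see their repository for that); the function name `class_conditioned_corrupt`, the toy table, and the pseudo-labels are all hypothetical, and the choice of which columns to corrupt is simply passed in rather than derived from feature correlations.

```python
import random


def class_conditioned_corrupt(rows, labels, anchor_idx, corrupt_cols, rng):
    """Return a corrupted copy (a "view") of rows[anchor_idx].

    For each column in corrupt_cols, the anchor's value is replaced by the
    value found in the same column of a randomly chosen row that shares the
    anchor's (pseudo-)label, instead of a row drawn from the whole table.
    """
    anchor_label = labels[anchor_idx]
    # Donor rows: every row with the same class as the anchor (anchor included).
    donors = [i for i, lab in enumerate(labels) if lab == anchor_label]
    view = list(rows[anchor_idx])
    for col in corrupt_cols:
        donor = rng.choice(donors)
        view[col] = rows[donor][col]
    return view


# Toy table: two classes with clearly separated feature values (hypothetical data).
rows = [
    [1.0, 10.0, 100.0],
    [2.0, 20.0, 200.0],
    [9.0, 90.0, 900.0],
    [8.0, 80.0, 800.0],
]
labels = [0, 0, 1, 1]  # hypothetical pseudo-labels, one per row
rng = random.Random(0)
view = class_conditioned_corrupt(rows, labels, anchor_idx=0, corrupt_cols=[1, 2], rng=rng)
print(view)
```

Because donors are restricted to the anchor's class, the corrupted columns of the view can only take values that already occur in class 0, which is the property the abstract argues makes the augmentation sounder than table-wide swapping.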

arXiv:2404.16655 [pdf]

Rational Designing of Anthocyanidins-Directed Near-Infrared Two-Photon Fluorescence Probes

Authors: Xiu-e Zhang, Xue Wei, Wei-Bo Cui, Jin-Pu Bai, Aynur Matyusup, Jing-Fu Guo, Hui Li, Ai-Min Ren

Abstract: Recently, two-photon fluorescent probes based on anthocyanidins molecules have attracted extensive attention due to their outstanding photophysical properties. However, there are only a few two-photon excited fluorescent probes that really meet the requirements of relatively long emission wavelengths (>600 nm), large two-photon absorption (TPA) cross sections (300 GM), significant Stokes shift (>80 nm), and high fluorescence intensity. Herein, the photophysical properties of a series of anthocyanidins with the same substituents but different fluorophore skeletons were investigated in detail. Compared with b-series molecules, a-series molecules with a six-membered ring in the backbone have a slightly higher reorganization energy. This results in more energy loss upon light excitation, enabling the reaction products to detect NTR through a larger Stokes shift. More importantly, there is very little decrease in fluorescence intensity as the Stokes shift increases. These features are extremely valuable for high-resolution NTR detection. In light of this, novel 2a-n (n=1-5) compounds are designed, which are accomplished by inhibiting the twisted intramolecular charge transfer (TICT) effect through alkyl cyclization, azetidine ring and extending π conjugation. Among them, 2a-3 gains long emission spectrum (λem=691.42 nm), noticeable TPA cross section (957.36 GM), and large Stokes shift (110.88 nm), indicating that it serves as a promising candidate for two-photon fluorescent dyes. It is hoped that this work will offer some insightful theoretical direction for the development of novel high performance anthocyanin fluorescent materials.

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.16425 [pdf, other]

Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a, whose bright peak was also detected by the Swift Burst Alert Telescope and Konus-Wind through off-line analyses. At a redshift of $z=4.859$, EP240315a showed a much longer and more complicated light curve in the soft X-ray band than in gamma-rays. Benefiting from a large field-of-view ($\sim$3600 deg$^2$) and a high sensitivity, EP-WXT captured the earlier engine activation and extended late engine activity through a continuous detection. With a peak X-ray flux at the faint end of previously known high-$z$ GRBs, the detection of EP240315a demonstrates the great potential for EP to study the early universe via GRBs.

Submitted 25 April, 2024; originally announced April 2024.

Comments: 41 pages, 8 figures, 7 tables

arXiv:2404.14709 [pdf, ps, other]

SC-HVPPNet: Spatial and Channel Hybrid-Attention Video Post-Processing Network with CNN and Transformer

Authors: Tong Zhang, Wenxue Cui, Shaohui Liu, Feng Jiang

Abstract: Convolutional Neural Network (CNN) and Transformer have attracted much attention recently for video post-processing (VPP). However, the interaction between CNN and Transformer in existing VPP methods is not fully explored, leading to inefficient communication between the local and global extracted features. In this paper, we explore the interaction between CNN and Transformer in the task of VPP, and propose a novel Spatial and Channel Hybrid-Attention Video Post-Processing Network (SC-HVPPNet), which can cooperatively exploit the image priors in both spatial and channel domains. Specifically, in the spatial domain, a novel spatial attention fusion module is designed, in which two attention weights are generated to fuse the local and global representations collaboratively. In the channel domain, a novel channel attention fusion module is developed, which can blend the deep representations at the channel dimension dynamically. Extensive experiments show that SC-HVPPNet notably boosts video restoration quality, with average bitrate savings of 5.29%, 12.42%, and 13.09% for Y, U, and V components in the VTM-11.0-NNVC RA configuration.

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.14691 [pdf, other]

Towards Fast Setup and High Throughput of GPU Serverless Computing

Authors: Han Zhao, Weihao Cui, Quan Chen, Shulai Zhang, Zijun Li, Jingwen Leng, Chao Li, Deze Zeng, Minyi Guo

Abstract: Integrating GPUs into serverless computing platforms is crucial for improving efficiency. However, existing solutions for GPU-enabled serverless computing platforms face two significant problems due to coarse-grained GPU management: long setup time and low function throughput. To address these issues, we propose SAGE, a GPU serverless framework with fast setup and high throughput. First, based on the data knowability of GPU function ahead of actual execution, SAGE devises the parallelized function setup mechanism, which parallelizes the data preparation and context creation. In this way, SAGE achieves fast setup of GPU function invocations. Second, SAGE further proposes the sharing-based memory management mechanism, which shares the read-only memory and context memory across multiple invocations of the same function. The memory sharing mechanism avoids repeated data preparation and thus unnecessary data-loading contention. As a consequence, the function throughput could be improved. Our experimental results show that SAGE reduces function duration by 11.3X and improves function density by 1.22X compared to the state-of-the-art serverless platform.

Submitted 22 April, 2024; originally announced April 2024.

arXiv:2404.12608 [pdf, other]

Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations

Authors: Sibei Chen, Yeye He, Weiwei Cui, Ju Fan, Song Ge, Haidong Zhang, Dongmei Zhang, Surajit Chaudhuri

Abstract: Spreadsheets are widely recognized as the most popular end-user programming tools, which blend the power of formula-based computation with an intuitive table-based interface. Today, spreadsheets are used by billions of users to manipulate tables, most of whom are neither database experts nor professional programmers. Despite the success of spreadsheets, authoring complex formulas remains challenging, as non-technical users need to look up and understand non-trivial formula syntax. To address this pain point, we leverage the observation that there is often an abundance of similar-looking spreadsheets in the same organization, which not only have similar data, but also share similar computation logic encoded as formulas. We develop an Auto-Formula system that can accurately predict formulas that users want to author in a target spreadsheet cell, by learning and adapting formulas that already exist in similar spreadsheets, using contrastive-learning techniques inspired by "similar-face recognition" from computer vision. Extensive evaluations on over 2K test formulas extracted from real enterprise spreadsheets show the effectiveness of Auto-Formula over alternatives. Our benchmark data is available at https://github.com/microsoft/Auto-Formula to facilitate future research.

Submitted 18 April, 2024; originally announced April 2024.

Comments: full version of a paper to appear in SIGMOD 2024

arXiv:2404.09124 [pdf, other]

Discovery of a new IW And-type dwarf nova with both tilted disk and tidal instability

Authors: Yongkang Sun, Xin Li, Qige Ao, Wenyuan Cui, Bowen Zhang, Yang Huang, Jianrong Shi, Linlin Li, Jifeng Liu

Abstract: IW And-type dwarf novae are anomalous Z Cam stars featuring outbursts that happen during standstill states, which are not expected in the standard disk instability model. The physical mechanisms for these variations remain unclear. In this study, we report the discovery of a new candidate IW And-type dwarf nova J0652+2436, identified with its frequent outbursts from the slowly rising standstill states. Luckily, the TESS observations during a long standstill state and the earlier K2 observations give a chance to find the orbital and negative superhump period in the light curve of J0652+2436, allowing the measurement of its mass ratio of 0.366. This mass ratio is marginally possible for the tidal instability to set in according to previous SPH simulations. Thus, we propose that the outbursts in J0652+2436 are likely to be caused by the growing accretion disk during standstills, in favor of the previous hypothesis of the mechanisms lying in all IW And stars. We conclude that J0652+2436 might be the first IW And star with both a precessing tilted disk and tidal instability, which will be an important laboratory for studying the accretion disk dynamics and help understand the IW And phenomenon.

Submitted 13 April, 2024; originally announced April 2024.

Comments: 12 pages, 12 figures, accepted to MNRAS

arXiv:2404.05400 [pdf, other]

doi 10.1051/epjconf/202429300013

Generating Galaxy Clusters Mass Density Maps from Mock Multiview Images via Deep Learning

Authors: Daniel de Andres, Weiguang Cui, Gustavo Yepes, Marco De Petris, Gianmarco Aversano, Antonio Ferragamo, Federico De Luca, A. Jiménez Muñoz

Abstract: Galaxy clusters are composed of dark matter, gas and stars. Their dark matter component, which amounts to around 80\% of the total mass, cannot be directly observed but can be traced by the distribution of diffused gas and galaxy members. In this work, we aim to infer the cluster's projected total mass distribution from mock observational data, i.e. stars, Sunyaev-Zeldovich, and X-ray, by training deep learning models. To this end, we have created a multiview images dataset from The Three Hundred simulation that is optimal for training Machine Learning models. We further study deep learning architectures based on the U-Net to account for single-input and multi-input models. We show that the predicted mass distribution agrees well with the true one.

Submitted 9 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

Comments: To appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

arXiv:2404.04801 [pdf, ps, other]

doi 10.1007/s41605-024-00467-8

LHAASO-KM2A detector simulation using Geant4

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with large altitude difference (30 m) and huge coverage (1.3 km^2). In this paper, the design of the KM2A simulation code G4KM2A based on Geant4 is introduced. The process of G4KM2A is optimized mainly in memory consumption to avoid memory overflow. Some simplifications are used to significantly speed up the execution of G4KM2A. The running time is reduced by at least 30 times compared to full detector simulation. The particle distributions and the core/angle resolution comparison between simulation and experimental data of the full KM2A array are also presented, which show good agreement.

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2404.03376 [pdf, other]

doi 10.1051/epjconf/202429300037

3D scaling laws and projection effects in The300-NIKA2 Sunyaev-Zeldovich Large Program Twin Samples

Authors: A. Paliwal, W. Cui, D. de Andrés, M. De Petris, A. Ferragamo, C. Hanser, J. -F. Macías-Pérez, F. Mayet, A. Moyer-Anin, M. Muñoz-Echeverría, L. Perotto, E. Rasia, G. Yepes

Abstract: The abundance of galaxy clusters with mass and redshift is a well-known cosmological probe. The cluster mass is a key parameter for studies that aim to constrain cosmological parameters using galaxy clusters, making it critical to understand and properly account for the errors in its estimates. Subsequently, it becomes important to correctly calibrate scaling relations between observables like the integrated Compton parameter and the mass of the cluster. The NIKA2 Sunyaev-Zeldovich Large program (LPSZ) enables one to map the intracluster medium profiles in the mm-wavelength band with great details (resolution of $11 \ \mathrm{\&}\ 17^{\prime \prime}$ at $1.2 \ \mathrm{\&}\ 2 $ mm, respectively) and hence, to estimate the cluster hydrostatic mass more precisely than previous SZ observations. However, there are certain systematic effects which can only be accounted for with the use of simulations. For this purpose, we employ THE THREE HUNDRED simulations which have been modelled with a range of physics modules to simulate galaxy clusters. The so-called twin samples are constructed by picking synthetic clusters of galaxies with properties close to the observational targets of the LPSZ. In particular, we use the Compton parameter maps and projected total mass maps of these twin samples along 29 different lines of sight. We investigate the scatter that projection induces on the total masses. Eventually, we consider the statistical values along different lines of sight to construct a kind of 3D scaling law between the integrated Compton parameter, total mass, and overdensity of the galaxy clusters to determine the overdensity that is least impacted by the projection effect.

Submitted 4 April, 2024; originally announced April 2024.

Comments: to appear in Proc. of the mm Universe 2023 conference, Grenoble (France), June 2023, published by F. Mayet et al. (Eds), EPJ Web of conferences, EDP Sciences

arXiv:2404.02837 [pdf, other]

Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models

Authors: Wanyun Cui, Qianle Wang

Abstract: This paper reveals the phenomenon of parameter heterogeneity in large language models (LLMs). We find that a small subset of "cherry" parameters exhibit a disproportionately large influence on model performance, while the vast majority of parameters have minimal impact. This heterogeneity is found to be prevalent across different model families, scales, and types. Motivated by this observation, we propose CherryQ, a novel quantization method that unifies the optimization of mixed-precision parameters. CherryQ identifies and preserves the critical cherry parameters in high precision while aggressively quantizing the remaining parameters to low precision. Extensive experiments demonstrate the effectiveness of CherryQ. CherryQ outperforms existing quantization approaches in terms of perplexity and downstream task performance. Notably, our 3-bit quantized Vicuna-1.5 exhibits competitive performance compared to its 16-bit counterpart. These findings highlight the potential of CherryQ for enabling efficient deployment of LLMs by taking advantage of parameter heterogeneity.

Submitted 3 April, 2024; originally announced April 2024.

arXiv:2404.00505 [pdf, other]

doi 10.1109/TMLCN.2024.3384329

Transfer Learning with Reconstruction Loss

Authors: Wei Cui, Wei Yu

Abstract: In most applications of utilizing neural networks for mathematical optimization, a dedicated model is trained for each specific optimization objective. However, in many scenarios, several distinct yet correlated objectives or tasks often need to be optimized on the same set of problem inputs. Instead of independently training a different neural network for each problem, it would be more efficient to exploit the correlations between these objectives and to train multiple neural network models with shared model parameters and feature representations. To achieve this, this paper first establishes the concept of common information: the shared knowledge required for solving the correlated tasks, then proposes a novel approach for model training by adding into the model an additional reconstruction stage associated with a new reconstruction loss. This loss is for reconstructing the common information starting from a selected hidden layer in the model. The proposed approach encourages the learned features to be general and transferable, and therefore can be readily used for efficient transfer learning. For numerical simulations, three applications are studied: transfer learning on classifying MNIST handwritten digits, the device-to-device wireless network power allocation, and the multiple-input-single-output network downlink beamforming and localization. Simulation results suggest that the proposed approach is highly efficient in data and model complexity, is resilient to over-fitting, and has competitive performance.

Submitted 11 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

Comments: 16 pages, 5 figures. To appear in IEEE Transactions on Machine Learning in Communications and Networking (TMLCN)

arXiv:2404.00321 [pdf, other]

doi 10.3847/1538-4357/ad3931

Intrinsic mass-richness relation of clusters from THE THREE HUNDRED hydrodynamic simulations

Authors: Mingjing Chen, Weiguang Cui, Wenjuan Fang, Zhonglue Wen

Abstract: The main systematics in cluster cosmology is the uncertainty in the mass-observable relation. In this paper, we focus on the most direct cluster observable in optical surveys, i.e. richness, and constrain the intrinsic mass-richness (MR) relation of clusters in THE THREE HUNDRED hydrodynamic simulations with two runs: GIZMO-SIMBA and GADGET-X. We find that modeling the richness at fixed halo mass with a skewed Gaussian distribution yields a simpler and smaller scatter compared to the commonly used log-normal distribution. Additionally, we observe that baryon models have a significant impact on the scatter, while exhibiting no influence on the mass dependence and a slight effect on the amplitude in the MR relation. We select member galaxies based on both stellar mass $M_\star$ and absolute magnitude $\mathscr{M}$. We demonstrate that the MR relation obtained from these two selections can be converted to each other by using the $M_\star-\mathscr{M}$ relation. Finally, we provide a 7-parameter fitting result comprehensively capturing the dependence of the MR relation on both stellar mass cutoff and redshift.

Submitted 2 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

Comments: accepted to ApJ

Journal ref: The Astrophysical Journal (2024)

arXiv:2403.16125 [pdf, other]

A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters

Authors: Chunyu Xue, Weihao Cui, Han Zhao, Quan Chen, Shulai Zhang, Pengyu Yang, Jing Yang, Shaobo Li, Minyi Guo

Abstract: Joint consideration of scheduling and adaptive parallelism offers great opportunities for improving the training efficiency of large models on heterogeneous GPU clusters. However, integrating adaptive parallelism into a cluster scheduler expands the cluster scheduling space. The new space is the product of the original scheduling space and the parallelism exploration space of adaptive parallelism (also a product of pipeline, data, and tensor parallelism). The exponentially enlarged scheduling space and ever-changing optimal parallelism plan from adaptive parallelism together result in the contradiction between low-overhead and accurate performance data acquisition for efficient cluster scheduling. This paper presents Crius, a training system for efficiently scheduling multiple large models with adaptive parallelism in a heterogeneous cluster. Crius proposes a novel scheduling granularity called Cell. It represents a job with deterministic resources and pipeline stages. The exploration space of Cell is shrunk to the product of only data and tensor parallelism, thus exposing the potential for accurate and low-overhead performance estimation. Crius then accurately estimates Cells and efficiently schedules training jobs. When a Cell is selected as a scheduling choice, its represented job runs with the optimal parallelism plan explored. Experimental results show that Crius reduces job completion time by up to 48.9% and schedules large models with up to 1.49x cluster throughput improvement.

Submitted 24 March, 2024; originally announced March 2024.
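
The Crius abstract above says that fixing a job's resources and pipeline stages (a Cell) shrinks the parallelism exploration space to the product of data and tensor parallelism only. The following is a toy sketch of that counting argument, not the Crius implementation: the function names are hypothetical, and it assumes every plan uses all GPUs exactly, so that the pipeline, data, and tensor degrees multiply to the GPU count.

```python
from itertools import product


def divisors(n):
    """Return all positive divisors of n in ascending order."""
    return [d for d in range(1, n + 1) if n % d == 0]


def full_parallelism_space(num_gpus):
    """All (pipeline, data, tensor) degrees whose product equals num_gpus."""
    ds = divisors(num_gpus)
    return [
        (pp, dp, tp)
        for pp, dp, tp in product(ds, repeat=3)
        if pp * dp * tp == num_gpus
    ]


def cell_parallelism_space(num_gpus, pipeline_stages):
    """(data, tensor) degrees left to explore once the GPU count and the
    number of pipeline stages are fixed (hypothetical reading of a Cell)."""
    if num_gpus % pipeline_stages != 0:
        return []
    per_stage = num_gpus // pipeline_stages
    return [(dp, per_stage // dp) for dp in divisors(per_stage)]


full = full_parallelism_space(16)
cell = cell_parallelism_space(16, pipeline_stages=2)
print(len(full), len(cell))  # prints "15 4"
```

For 16 GPUs the unconstrained space has 15 candidate plans, while a Cell with 2 pipeline stages leaves only 4 (data, tensor) combinations to estimate.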

arXiv:2403.13338 [pdf, other]

Adaptive Critical Subgraph Mining for Cognitive Impairment Conversion Prediction with T1-MRI-based Brain Network

Authors: Yilin Leng, Wenju Cui, Bai Chen, Xi Jiang, Shuangqing Chen, Jian Zheng

Abstract: Predicting the conversion to early-stage dementia is critical for mitigating its progression but remains challenging due to subtle cognitive impairments and structural brain changes. Traditional T1-weighted magnetic resonance imaging (T1-MRI) research focuses on identifying brain atrophy regions but often fails to address the intricate connectivity between them. This limitation underscores the necessity of focusing on inter-regional connectivity for a comprehensive understanding of the brain's complex network. Moreover, there is a pressing demand for methods that adaptively preserve and extract critical information, particularly specialized subgraph mining techniques for brain networks. These are essential for developing high-quality feature representations that reveal critical spatial impacts of structural brain changes and their topology. In this paper, we propose Brain-SubGNN, a novel graph representation network to mine and enhance critical subgraphs based on T1-MRI. This network provides a subgraph-level interpretation, enhancing interpretability and insights for graph analysis. The process begins by extracting node features and a correlation matrix between nodes to construct a task-oriented brain network. Brain-SubGNN then adaptively identifies and enhances critical subgraphs, capturing both loop and neighbor subgraphs. This method reflects the loop topology and local changes, indicative of long-range connections, and maintains local and global brain attributes. Extensive experiments validate the effectiveness and advantages of Brain-SubGNN, demonstrating its potential as a powerful tool for understanding and diagnosing early-stage dementia. Source code is available at https://github.com/Leng-10/Brain-SubGNN.

Submitted 26 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

Comments: 20 pages

arXiv:2403.12135 [pdf, other]

Evolution and distribution of superbubbles in simulated Milky Way-like galaxies

Authors: Chengzhe Li, Hui Li, Wei Cui, Federico Marinacci, Laura V. Sales, Mark Vogelsberger, Paul Torrey

Abstract: Stellar feedback plays a crucial role in regulating baryon cycles of a galactic ecosystem, and may manifest itself in the formation of superbubbles in the interstellar medium. In this work, we used a set of high-resolution simulations to systematically study the properties and evolution of superbubbles in galactic environments. The simulations were based on the SMUGGLE galaxy formation framework using the hydrodynamical moving-mesh code Arepo, reaching a spatial resolution of $\sim 4 \, \rm pc$ and mass resolution of $\sim 10^3 \, \rm M_{\odot}$. We identified superbubbles and tracked their time evolution using the parent stellar associations within the bubbles. The X-ray luminosity-size distribution of superbubbles in the fiducial run is largely consistent with the observations of nearby galaxies. The size of superbubbles shows a double-peaked distribution, with the peaks attributed to early feedback (radiative and stellar wind feedback) and supernova feedback. The early feedback tends to suppress the subsequent supernova feedback, and it is strongly influenced by star formation efficiency, which regulates the environmental density. Our results show that the volume filling factor of hot gas ($T > 10^{5.5} ~\mathrm{K}$) is about $12 \%$ averaged over a region of 4 kpc in height and 20 kpc in radius centered on the disk of the galaxy. Overall, the properties of superbubbles are sensitive to the choice of subgrid galaxy formation models and can, therefore, be used to constrain these models.

Submitted 18 March, 2024; originally announced March 2024.

Comments: Accepted for publication in MNRAS, 14 pages, 10 figures, 4 tables

arXiv:2403.10010 [pdf, other]

doi 10.1103/PhysRevLett.132.131002

Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, et al. (256 additional authors not shown)

Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at $3.67 \pm 0.05 \pm 0.15$ PeV. Below the knee, the spectral index is found to be $-2.7413 \pm 0.0004 \pm 0.0050$, while above the knee, it is $-3.128 \pm 0.005 \pm 0.027$, with the sharpness of the transition measured with a statistical error of 2%. The mean logarithmic mass of cosmic rays is almost heavier than helium in the whole measured energy range. It decreases from 1.7 at 0.3 PeV to 1.3 at 3 PeV, representing a 24% decline following a power law with an index of $-0.1200 \pm 0.0003 \pm 0.0341$. This is equivalent to an increase in abundance of light components. Above the knee, the mean logarithmic mass exhibits a power law trend towards heavier components, which is the reverse of the behavior observed in the all-particle energy spectrum. Additionally, the knee position and the change in power-law index are approximately the same. These findings suggest that the knee observed in the all-particle spectrum corresponds to the knee of the light component, rather than the medium-heavy components.

Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

Comments: 8 pages, 3 figures

Journal ref: Physical Review Letters 132, 131002 (2024)
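
The LHAASO-KM2A abstract above quotes a drop in mean logarithmic mass from 1.7 at 0.3 PeV to 1.3 at 3 PeV, a 24% decline, and a power-law index of about -0.12. A short arithmetic check shows these figures are mutually consistent. This is only a sketch: it assumes the power law is applied directly to the mean logarithmic mass as a function of energy, uses the central value of the index, and ignores the quoted uncertainties; the variable and function names are illustrative.

```python
# Values quoted in the abstract (central values only).
LN_A_AT_0P3_PEV = 1.7   # mean logarithmic mass at 0.3 PeV
LN_A_AT_3_PEV = 1.3     # mean logarithmic mass at 3 PeV
INDEX = -0.12           # quoted power-law index below the knee


def power_law(value_at_e0, e0, e, index):
    """Scale a quantity from energy e0 to energy e as (e / e0) ** index."""
    return value_at_e0 * (e / e0) ** index


# One decade in energy (0.3 PeV -> 3 PeV) with index -0.12.
predicted = power_law(LN_A_AT_0P3_PEV, 0.3, 3.0, INDEX)

# Fractional decline implied by the power law, and by the rounded endpoints.
decline_from_index = 1.0 - (3.0 / 0.3) ** INDEX
decline_from_endpoints = (LN_A_AT_0P3_PEV - LN_A_AT_3_PEV) / LN_A_AT_0P3_PEV

print(f"predicted value at 3 PeV: {predicted:.2f}")
print(f"decline from index:       {decline_from_index:.1%}")
print(f"decline from endpoints:   {decline_from_endpoints:.1%}")
```

Under these assumptions the power law predicts about 1.29 at 3 PeV and a decline of roughly 24% per decade in energy, matching the quoted endpoint of 1.3 to within rounding.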

arXiv:2403.09957 [pdf, other]

Suppression of Star Formation in Galaxy Pairs

Authors: Shuai Feng, Shi-Yin Shen, Fang-Ting Yuan, Wen-Xin Zhong, Wen-Yuan Cui, Lin-Lin Li

Abstract: We investigate the suppression of star formation in galaxy pairs based on the isolated galaxy pair sample derived from the SDSS survey. By comparing the star formation rate between late-type galaxies in galaxy pairs and those in the isolated environment, we detect the signal of star formation suppression in galaxy pairs at $d_p < 100$ kpc and $200$ kpc $< d_p < 350$ kpc. The occurrence of star formation suppression in these late-type galaxies requires their companion galaxies to have an early-type morphology ($n_s > 2.5$). Star formation suppression in wide galaxy pairs with $200$ kpc $< d_p < 350$ kpc mainly occurs in massive late-type galaxies, while in close galaxy pairs with $d_p < 100$ kpc, it only appears in late-type galaxies with a massive companion ($\log M_\star > 11.0$), nearly independent of their own stellar mass. Based on these findings, we infer that star formation suppression in wide galaxy pairs is actually a result of galaxy conformity, while in close galaxy pairs, it stems from the influence of hot circum-galactic medium surrounding companion galaxies.

Submitted 14 March, 2024; originally announced March 2024.

Comments: 10 pages, 4 figures, accepted for publication in ApJ

arXiv:2403.09273 [pdf, other]

doi 10.1093/mnras/stae568

Identifying Galaxy Cluster Mergers with Deep Neural Networks using Idealized Compton-y and X-ray maps

Authors: Ashleigh R. Arendt, Yvette C. Perrott, Ana Contreras-Santos, Daniel de Andres, Weiguang Cui, Douglas Rennehan

Abstract: We present a novel deep learning approach to identify galaxy clusters that are undergoing a merger. This paper uses massive galaxy clusters spanning $0 \leq z \leq 2$ from The Three Hundred project, a suite of hydrodynamic re-simulations of 324 large galaxy clusters. Mock, idealised Compton-$y$ and X-ray maps were constructed for the sample, capturing them out to a radius of $2R_{200}$. The idealised nature of these maps means they do not consider observational effects such as foreground or background astrophysical objects, any spatial resolution limits or restriction on X-ray energy bands. Half of the maps belong to a merging population as defined by a mass increase $\Delta M/M \geq 0.75$, and the other half serve as a control, relaxed population. We employ a convolutional neural network architecture and train the model to classify clusters into one of the groups. A best-performing model was able to correctly distinguish between the two populations with a balanced accuracy (BA) and recall of 0.77, ROC-AUC of 0.85, PR-AUC of 0.55 and $F_{1}$ score of 0.53. Using a multichannel model relative to a single channel model, we obtain a 3% improvement in BA score, and a 6% improvement in $F_{1}$ score. We use a saliency interpretation approach to discern the regions most important to each classification decision. By analysing radially binned saliency values we find a preference to utilise regions out to larger distances for mergers with respect to non-mergers, greater than $\sim1.2 R_{200}$ and $\sim0.7 R_{200}$ for SZ and X-ray respectively.

Submitted 14 March, 2024; originally announced March 2024.

Comments: 15 pages, 17 figures, published in MNRAS

arXiv:2403.09031 [pdf, other]

Projected Gradient Descent for Spectral Compressed Sensing via Symmetric Hankel Factorization

Authors: Jinsheng Li, Wei Cui, Xu Zhang

Abstract: Current spectral compressed sensing methods via Hankel matrix completion employ symmetric factorization to demonstrate the low-rank property of the Hankel matrix. However, previous non-convex gradient methods only utilize asymmetric factorization to achieve spectral compressed sensing. In this paper, we propose a novel nonconvex projected gradient descent method for spectral compressed sensing via symmetric factorization named Symmetric Hankel Projected Gradient Descent (SHGD), which updates only one matrix and avoids a balancing regularization term. SHGD reduces the computation and storage costs by about half compared to the prior gradient method based on asymmetric factorization. Moreover, the symmetric factorization employed in our work differs from the prior low-rank factorization model, introducing a new factorization ambiguity under complex orthogonal transformation. Novel distance metrics are designed for our factorization method and a linear convergence guarantee to the desired signal is established with $O(r^2\log(n))$ observations. Numerical simulations demonstrate the superior performance of the proposed SHGD method in phase transitions and computation efficiency compared to state-of-the-art methods.

Submitted 13 March, 2024; originally announced March 2024.

Comments: accepted in IEEE Transactions on Signal Processing

arXiv:2403.05497 [pdf, other]

Les Houches Lectures on Community Ecology: From Niche Theory to Statistical Mechanics

Authors: Wenping Cui, Robert Marsland III, Pankaj Mehta

Abstract: Ecosystems are among the most interesting and well-studied examples of self-organized complex systems. Community ecology, the study of how species interact with each other and the environment, has a rich tradition. Over the last few years, there has been a growing theoretical and experimental interest in these problems from the physics and quantitative biology communities. Here, we give an overview of community ecology, highlighting the deep connections between ecology and statistical physics. We start by introducing the two classes of mathematical models that have served as the workhorses of community ecology: Consumer Resource Models (CRM) and the generalized Lotka-Volterra models (GLV). We place a special emphasis on graphical methods and general principles. We then review recent works showing a deep and surprising connection between ecological dynamics and constrained optimization. We then shift our focus by analyzing these same models in "high-dimensions" (i.e. in the limit where the number of species and resources in the ecosystem becomes large) and discuss how such complex ecosystems can be analyzed using methods from the statistical physics of disordered systems such as the cavity method and Random Matrix Theory.

Submitted 8 March, 2024; originally announced March 2024.

Comments: 48 pages, 9 figures, Les Houches Theoretical Biophysics Summer School 2023

arXiv:2403.04299 [pdf, other]

LitSim: A Conflict-aware Policy for Long-term Interactive Traffic Simulation

Authors: Haojie Xin, Xiaodong Zhang, Renzhi Tang, Songyang Yan, Qianrui Zhao, Chunze Yang, Wen Cui, Zijiang Yang

Abstract: Simulation is pivotal in evaluating the performance of autonomous driving systems due to the advantages of high efficiency and low cost compared to on-road testing. Bridging the gap between simulation and the real world requires realistic agent behaviors. However, the existing works have the following shortcomings in achieving this goal: (1) log replay offers realistic scenarios but often leads to collisions due to the absence of dynamic interactions, and (2) both heuristic-based and data-based solutions, which are parameterized and trained on real-world datasets, encourage interactions but often deviate from real-world data over long horizons. In this work, we propose LitSim, a long-term interactive simulation approach that maximizes realism by minimizing the interventions in the log. Specifically, our approach primarily uses log replay to ensure realism and intervenes only when necessary to prevent potential conflicts. We then encourage interactions among the agents and resolve the conflicts, thereby reducing the risk of unrealistic behaviors. We train and validate our model on the real-world dataset NGSIM, and the experimental results demonstrate that LitSim outperforms the currently popular approaches in terms of realism and reactivity.

Submitted 1 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

Comments: 9 pages, 6 figures, under review

arXiv:2403.00350 [pdf, other]

Eckart streaming with nonlinear high-order harmonics: an example at gigahertz

Authors: Shiyu Li, Weiwei Cui, Thierry Baasch, Bin Wang, Zhixiong Gong

Abstract: Acoustic streaming shows great potential in applications such as bubble dynamics, cell aggregation, and nano-sized particle isolation in the biomedical and drug industries. As the acoustic shock distance decreases with the increase of incident frequency, the nonlinear propagation effect will play a role in acoustic streaming, e.g., Eckart (bulk) streaming at a few gigahertz (GHz). However, the theory of source terms of bulk streaming is still missing at this stage when high-order acoustic harmonics play a role. In this paper, we derive the source term including the contribution of higher-order harmonics. The streaming-induced hydrodynamic flow is assumed to be incompressible and no shock wave occurs during the nonlinear acoustic propagation as restricted by the traditional Goldberg number $\Gamma < 1$ or $\Gamma \approx 1$, which indicates the importance of nonlinearity relative to dissipation. The derived force terms allow evaluating bulk streaming with high-order harmonics at GHz and provide an exact expression compared to the existing empirical formulas. Numerical results show that the contribution of higher-order harmonics increases the streaming flow velocity by more than 20%. We show that the expression introduced by Nyborg should be avoided in numerical computations as it includes part of the acoustic radiation force that does not lead to acoustic streaming.

Submitted 1 March, 2024; originally announced March 2024.

Comments: 11 pages, 7 figures

arXiv:2402.19388 [pdf, other]

A model of pan-immunity maintenance by horizontal gene transfer in the ecological dynamics of bacteria and phages

Authors: Wenping Cui, Jemma M. Fendley, Sriram Srikant, Boris Shraiman

Abstract: Phages and their bacterial hosts are locked in an evolutionary competition which in small and closed systems typically results in the extinction of one or the other. To resist phages bacteria have evolved numerous defense systems, which nevertheless are still overcome by specific phage counter-defense mechanisms. These defense/counter-defense systems are a major element of microbial genetic diversity and have been demonstrated to propagate between strains by horizontal gene transfer (HGT). It has been proposed that the totality of defense systems found in microbial communities collectively form a distributed "pan-immune" system with individual elements moving between strains via ubiquitous HGT. Here, we formulate a Lotka-Volterra type model of a host/phage system interacting via a combinatorial variety of defense/counter-defense systems and show that HGT enables stable maintenance of diverse defense/counter-defense genes in the microbial pan-genome even when individual microbial strains inevitably undergo extinction. This stability requires the HGT rate to be sufficiently high to ensure that some descendant of a "dying" strain survives thanks to the immunity acquired through HGT from the community at large, thus establishing a new strain. This mechanism of persistence for the pan-immune gene pool is fundamentally similar to the "island migration" model of ecological diversity, with genes moving between genomes instead of species migrating between islands.

Submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.19256 [pdf, ps, other]

Collet-Eckmann maps in the unicritical family

Authors: Magnus Aspenberg, Mats Bylund, Weiwei Cui

Abstract: In this paper we study perturbations of complex unicritical polynomials satisfying the Collet-Eckmann condition. We show that Collet-Eckmann parameters are Lebesgue density points of the complement of the Mandelbrot set (i.e. the connectedness locus).

Submitted 29 February, 2024; originally announced February 2024.

MSC Class: 37F10; 37F12; 37F15

arXiv:2402.19111 [pdf, other]

Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling

Authors: Wenxue Cui, Xingtao Wang, Xiaopeng Fan, Shaohui Liu, Xinwei Gao, Debin Zhao

Abstract: Existing image compressed sensing (CS) coding frameworks usually solve an inverse problem based on measurement coding and optimization-based image reconstruction, and they still face the following two challenges: 1) The widely used random sampling matrix, such as the Gaussian Random Matrix (GRM), usually leads to low measurement coding efficiency. 2) The optimization-based reconstruction methods generally have a much higher computational complexity. In this paper, we propose a new CNN based image CS coding framework using local structural sampling (dubbed CSCNet) that includes three functional modules: local structural sampling, measurement coding and Laplacian pyramid reconstruction. In the proposed framework, instead of GRM, a new local structural sampling matrix is first developed, which is able to enhance the correlation between the measurements through a local perceptual sampling strategy. Besides, the designed local structural sampling matrix can be jointly optimized with the other functional modules during the training process. After sampling, the measurements with high correlations are produced, which are then coded into final bitstreams by the third-party image codec. Finally, a Laplacian pyramid reconstruction network is proposed to efficiently recover the target image from the measurement domain to the image domain. Extensive experimental results demonstrate that the proposed scheme outperforms the existing state-of-the-art CS coding methods, while maintaining fast computational speed.

Submitted 29 February, 2024; originally announced February 2024.

Comments: Accepted by ACM Transactions on Multimedia Computing Communications and Applications (TOMM)

arXiv:2402.17237 [pdf, other]

Image-Text Matching with Multi-View Attention

Authors: Rui Cheng, Wanqing Cui

Abstract: Existing two-stream models for image-text matching show good performance while ensuring retrieval speed and have received extensive attention from industry and academia. These methods use a single representation to encode image and text separately and get a matching score with cosine similarity or the inner product of vectors. However, the performance of the two-stream model is often sub-optimal. On the one hand, it is difficult for a single representation to cover complex content comprehensively. On the other hand, because this framework lacks interaction, it is challenging to match multiple meanings, which leads to information being ignored. To address the problems mentioned above and improve the performance of the two-stream model, we propose a multi-view attention approach for two-stream image-text matching, MVAM (Multi-View Attention Model). It first learns multiple image and text representations by diverse attention heads with different view codes. It then concatenates these representations into one for matching. A diversity objective is also used to promote diversity between attention heads. With this method, models are able to encode images and text from different views and attend to more key points, so we can get representations that contain more information. When doing retrieval tasks, the matching scores between images and texts can be calculated from different aspects, leading to better matching performance. Experiment results on MSCOCO and Flickr30K show that our proposed model brings improvements over existing models. Further case studies show that different attention heads can focus on different contents and finally obtain a more comprehensive representation.

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.13625 [pdf, other]

MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning

Authors: Wanqing Cui, Keping Bi, Jiafeng Guo, Xueqi Cheng

Abstract: Since commonsense information has been recorded significantly less frequently than its existence, language models pre-trained by text generation have difficulty learning sufficient commonsense knowledge. Several studies have leveraged text retrieval to augment the models' commonsense ability. Unlike text, images capture commonsense information inherently, but little effort has been made to effectively utilize them. In this work, we propose a novel Multi-mOdal REtrieval (MORE) augmentation framework to leverage both text and images to enhance the commonsense ability of language models. Extensive experiments on the Common-Gen task have demonstrated the efficacy of MORE based on the pre-trained models of both single and multiple modalities.

Submitted 13 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

Comments: Published as a conference paper at ACL Findings 2024

arXiv:2402.13568 [pdf, other]

The Effect of AGN Feedback on the Lyman-α Forest Signature of Galaxy Protoclusters at z~2.3

Authors: Chenze Dong, Khee-Gan Lee, Romeel Davé, Weiguang Cui, Daniele Sorini

Abstract: The intergalactic medium (IGM) in the vicinity of galaxy protoclusters is an interesting testbed for studying complex baryonic effects such as gravitational shocks and feedback. Here, we utilize hydrodynamical simulations from the SIMBA and The Three Hundred suites to study the mechanisms influencing large-scale Lyman-$α$ transmission in 2<z<2.5 protoclusters observed in the COSMOS field. We focus on the matter overdensity-Lyman-$α$ transmission relation $(δ_m-δ_F)$ on Megaparsec-scales in these protoclusters, which is hypothesized to be sensitive to the feedback implementations. The lower-density regions represented by the SIMBA-100 cosmological volume trace the power-law $δ_m-δ_F$ relationship often known as the fluctuating Gunn-Peterson approximation (FGPA). This trend is continued into higher-density regions covered by the 300-GadgetMUSIC simulations that implement stellar feedback only. The 300-GadgetX and 300-SIMBA simulations, with AGN thermal and AGN jet feedback respectively, exhibit progressively more Lyman-$α$ transmission at fixed overdensity. Of the 7 protoclusters observed in the CLAMATO$\times$COSTCO data, only 2 appear consistent with the FGPA. The others exhibit clear deviations: 4 follow the trend of AGN X-ray thermal feedback models while the COSTCO-I protocluster appears to reflect intense jet feedback. The large discrepancy with the stellar-feedback-only 300-GadgetMUSIC model disfavours large-scale heating from gravitational collapse and/or stellar feedback. This indicates that some form of AGN feedback is likely at play in the observed protoclusters, and possibly long-ranged AGN jets in the case of COSTCO-I. While more detailed and resolved simulations are required to move forward, our findings open new avenues for probing AGN feedback at Cosmic Noon.

Submitted 21 February, 2024; originally announced February 2024.

Comments: 12 pages, 4 figures, submitted to MNRAS

arXiv:2402.11347 [pdf, other]

PhaseEvo: Towards Unified In-Context Prompt Optimization for Large Language Models

Authors: Wendi Cui, Jiaxin Zhang, Zhuohang Li, Hao Sun, Damien Lopez, Kamalika Das, Bradley Malin, Sricharan Kumar

Abstract: Crafting an ideal prompt for Large Language Models (LLMs) is a challenging task that demands significant resources and expert human input. Existing work treats the optimization of prompt instruction and in-context learning examples as distinct problems, leading to sub-optimal prompt performance. This research addresses this limitation by establishing a unified in-context prompt optimization framework, which aims to achieve joint optimization of the prompt instruction and examples. However, formulating such optimization in the discrete and high-dimensional natural language space introduces challenges in terms of convergence and computational efficiency. To overcome these issues, we present PhaseEvo, an efficient automatic prompt optimization framework that combines the generative capability of LLMs with the global search proficiency of evolution algorithms. Our framework features a multi-phase design incorporating innovative LLM-based mutation operators to enhance search efficiency and accelerate convergence. We conduct an extensive evaluation of our approach across 35 benchmark tasks. The results demonstrate that PhaseEvo significantly outperforms the state-of-the-art baseline methods by a large margin whilst maintaining good efficiency.

Submitted 17 February, 2024; originally announced February 2024.

Comments: 50 pages, 9 figures, 26 tables

Showing 1–50 of 800 results for author: Cui, W