subscribe to arXiv mailings

Fracture Characteristics of Rare-earth Phosphate under Molten Calcium Magnesium Aluminosilicate Corrosion

Authors: Subrato Sarkar, Rahul Rahul, Bishnu Pada Majee, Keith Bryce, Lucy Zhang, Liping Huang, Jie Lian, Suvranu De

Abstract: The fracture characteristics of LuPO4 rare-earth phosphate environmental barrier coating (EBC) material under molten calcium-magnesium aluminosilicate (CMAS) corrosion is quantified. EBCs are crucial for protecting SiC-based ceramic matrix composite components in the hot section of gas turbine engines. Recent research has highlighted the potential of rare-earth phosphates as better EBC materials t… ▽ More The fracture characteristics of LuPO4 rare-earth phosphate environmental barrier coating (EBC) material under molten calcium-magnesium aluminosilicate (CMAS) corrosion is quantified. EBCs are crucial for protecting SiC-based ceramic matrix composite components in the hot section of gas turbine engines. Recent research has highlighted the potential of rare-earth phosphates as better EBC materials than third-generation rare-earth silicates for CMAS corrosion resistance. However, the fracture of EBCs under CMAS corrosion during service remains a significant concern. This work investigates the fracture behavior of LuPO4 using mesoscale simulation and experiments. The model uses micrographs taken from fabricated EBC samples for mesoscale fracture simulations. The simulation results are compared with experimental fracture toughness data and validated using statistical tests (p<0.01). The simulation results and experimental observations demonstrate that LuPO4 may exhibit higher fracture resistance than Lu2SiO5 rare-earth silicate under similar CMAS corrosion conditions, offering potential insights for future EBC design and development. △ Less

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2406.17686 [pdf, other]

The wave-like disk oscillations of mono-age stellar populations in the Solar neighbourhood from Gaia DR3

Authors: Tao Wang, Bing-Qiu Chen, Jian-Hui Lian, Mao-Sheng Xiang, Xiao-Wei Liu

Abstract: The North-South asymmetry in the number density and bulk velocity of stars in the solar neighborhood provides valuable insights into the formation and evolution of the Milky Way disk. Our objective is to investigate the wave-like disk oscillations of mono-age stellar populations in the Solar neighbourhood using data from Gaia Data Release 3. We have selected a comprehensive sample of main sequence… ▽ More The North-South asymmetry in the number density and bulk velocity of stars in the solar neighborhood provides valuable insights into the formation and evolution of the Milky Way disk. Our objective is to investigate the wave-like disk oscillations of mono-age stellar populations in the Solar neighbourhood using data from Gaia Data Release 3. We have selected a comprehensive sample of main sequence turn off stars. The ages of these stars can be accurately determined using isochrone fitting methods. Our findings indicate that the north-south density and mean vertical velocity asymmetries remain consistent across all age groups.The uniformity of perturbations across all subsamples suggests that all populations are responding to the same external influence, which likely affects them irrespective of their age. Moreover, the fact that these perturbations appear consistently implies they could be either ongoing or recent. Regarding vertical velocity dispersions, we observe that older stars exhibit larger dispersions. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 6 pages, 4 figures, accepted for publication in MNRAS Letters

arXiv:2406.11394 [pdf, other]

Disk Assembly of the Milky Way Suggested from the Time-resolved Chemical Abundance

Authors: Enci Wang, Jianhui Lian, Yingjie Peng, Xin Wang

Abstract: Both simulations and observations suggest that the disk assembly of galaxies is governed by the interplay between coplanar gas inflow, ex-planar gas outflow and in-situ star formation on the disk, known as the leaky accretion disk. This scenario predicts a strong connection between radial distributions of star formation and chemical abundances. The Milky Way, being the sole galaxy where we can rel… ▽ More Both simulations and observations suggest that the disk assembly of galaxies is governed by the interplay between coplanar gas inflow, ex-planar gas outflow and in-situ star formation on the disk, known as the leaky accretion disk. This scenario predicts a strong connection between radial distributions of star formation and chemical abundances. The Milky Way, being the sole galaxy where we can reliably measure star formation histories and the corresponding temporally-resolved chemical abundances with individual stars, provides a unique opportunity to scrutinize this scenario. Based on the recent large spectroscopic and photometric surveys of Milky Way stars, we obtain the radial profiles of magnesium abundance ([Mg/H]) and star formation rate (SFR) surface density at different lookback time. We find the radial profiles of [Mg/H] can be well-reproduced using the leaky accretion disk model with only two free parameters for stars formed within 4 Gyr, as well as the flattening at large radii of metallicity profiles traced by HII regions and Cepheids. Furthermore, the constraint effective yield of the Milky Way and nearby galaxies show broad consistency with the theoretical predictions from stellar chemical evolution model with a mass-loading factor of 0-2. These results support that the recent assembly of the Milky Way adheres to the leaky accretion disk scenario, bridging the disk formation of our home galaxy to the big picture of disk formation in the Universe. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 11 pages, 4 figures, accepted to ApJ

arXiv:2406.10969 [pdf, other]

The 95GeV Excesses in the $\mathbb{Z}_3$-symmetric Next-to Minimal Supersymmetric Standard Model

Authors: Jingwei Lian

Abstract: Recent analyses by CMS and ATLAS suggest a deviation in the di-photon channel at approximately 95 GeV, alongside a previously observed excess in $b\bar{b}$ signals at a similar mass by LEP, potentially hinting at a new scalar particle. This study explores this possibility within the framework of the well-established $\mathbb{Z}_3$-symmetric Next-to-Minimal Supersymmetric Standard Model. A comprehe… ▽ More Recent analyses by CMS and ATLAS suggest a deviation in the di-photon channel at approximately 95 GeV, alongside a previously observed excess in $b\bar{b}$ signals at a similar mass by LEP, potentially hinting at a new scalar particle. This study explores this possibility within the framework of the well-established $\mathbb{Z}_3$-symmetric Next-to-Minimal Supersymmetric Standard Model. A comprehensive parameter scan was conducted, integrating constraints from dark matter relic density, direct detection experiments, and the properties of the observed 125 GeV Higgs boson. The results demonstrate that the model can accommodate the observed excesses with a singlet-dominated CP-even Higgs boson near 95 GeV. The model accurately predicts signal strengths of the di-photon and $b\bar{b}$ channels at a level of $1σ$. Furthermore, it accounts for the measured dark matter relic abundance through Bino-dominated neutralinos co-annihilation with Wino-like electroweakinos, all while remaining consistent with existing LHC constraints. These findings pave the way for future validation at the high-luminosity LHC and linear colliders, which may offer crucial tests of the model's predictions. △ Less

Submitted 16 June, 2024; originally announced June 2024.

Comments: 30 pages, 6 figures,

arXiv:2406.05604 [pdf, other]

The size of the Milky Way galaxy

Authors: Jianhui Lian, Gail Zasowski, Bingqiu Chen, Julie Imig, Tao Wang, Nicholas Boardman, Xiaowei Liu

Abstract: The size of a galaxy is one of the fundamental parameters that reflects its growth and assembly history. Traditionally, the size of the Milky Way has been characterized by the scale length of the disk, based on the assumption of an exponential density profile. Earlier scale length measurements suggest the Milky Way is an overly compact galaxy, compared to similar galaxies of its mass. These size m… ▽ More The size of a galaxy is one of the fundamental parameters that reflects its growth and assembly history. Traditionally, the size of the Milky Way has been characterized by the scale length of the disk, based on the assumption of an exponential density profile. Earlier scale length measurements suggest the Milky Way is an overly compact galaxy, compared to similar galaxies of its mass. These size measurements, however, ignore the presence of the bulge, and the assumption of a single-exponential disk profile faces growing challenges from the recent observations. The half-light radius is an alternative size measurement that is independent of the galaxy density profile and has been widely used to quantify the size of external galaxies. Here we report the half-light radius of the Milky Way, derived from a new measurement of the age-resolved Galactic surface brightness profile in an unprecedentedly wide radial range from ${\rm R=0}$ to 17~kpc. We find a broken surface brightness profile with a nearly flat distribution between 3.5 and 7.5 kpc, which results in a half-light radius of 5.75$\pm$0.38 kpc, significantly larger than the scale-length inferred from the canonical single-exponential disk profile but in good consistency with local disk galaxies of similar mass. Because our density profile can be decomposed by stellar age and extrapolated backwards in time, we can also confirm that the size history of the Milky Way is broadly consistent with high-redshift galaxies but with systematically smaller size at each look back time. Our results suggest that the Milky Way is a typical disk galaxy regarding its size and has likely experienced inefficient secular size growth. △ Less

Submitted 28 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

Comments: 30 pages, 4figures, published online in Nature Astronomy on 27 June 2024, https://rdcu.be/dL3z5. Here is the version prior to the peer review

arXiv:2406.01706 [pdf, other]

The Extremely Metal Rich Knot of Stars at the Heart of the Galaxy

Authors: Hans-Walter Rix, Vedant Chandra, Gail Zasowski, Annalisa Pillepich, Sergey Khoperskov, Sofia Feltzing, Rosemary F. Wyse, Neige Frankel, Danny Horta, Juna Kollmeier, Keivan G. Stassun, Melissa Ness, Jonathan C. Bird, David L. Nidever, Jose G. Fernandez, João A. Amarante, Chervin F. Laporte, Jianhui Lian

Abstract: We show with Gaia XP spectroscopy that extremely metal-rich stars in the Milky Way (EMR; $[M/H]_{XP} > 0.5$) - but only those - are largely confined to a tight "knot" at the center of the Galaxy. This EMR knot is round in projection, has a fairly abrupt edge near $\sim 1.5$kpc, and is a dynamically hot system. This central knot also contains very metal-rich (VMR; $+0.2\le [M/H]_{XP} \le +0.4$) sta… ▽ More We show with Gaia XP spectroscopy that extremely metal-rich stars in the Milky Way (EMR; $[M/H]_{XP} > 0.5$) - but only those - are largely confined to a tight "knot" at the center of the Galaxy. This EMR knot is round in projection, has a fairly abrupt edge near $\sim 1.5$kpc, and is a dynamically hot system. This central knot also contains very metal-rich (VMR; $+0.2\le [M/H]_{XP} \le +0.4$) stars. However, in contrast to EMR stars, the bulk of VMR stars form an extended, highly flattened distribution in the inner Galaxy ($R_{\mathrm{GC}}\lesssim 5$ kpc). We draw on TNG50 simulations of Milky Way analogs for context and find that compact, metal-rich knots confined to $<1.5$kpc are a universal feature. In typical simulated analogs, the top 5-10% most metal-rich stars are confined to a central knot; however, in our Milky Way data this fraction is only 0.1%. Dust-penetrating wide-area near-infrared spectroscopy, such as SDSS-V, will be needed for a rigorous estimate of the fraction of stars in the Galactic EMR knot. Why in our Milky Way only EMR giants are confined to such a central knot remains to be explained. Remarkably, the central few kiloparsecs of the Milky Way harbor both the highest concentration of metal-poor stars (the `poor old heart') and almost all EMR stars. This highlights the stellar population diversity at the bottom of galactic potential wells. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 11 pages, 7 figures, submitted to ApJ

arXiv:2405.21027 [pdf, other]

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

Authors: Jiesong Lian

Abstract: A popular approach for solving zero-sum games is to maintain populations of policies to approximate the Nash Equilibrium (NE). Previous studies have shown that Policy Space Response Oracle (PSRO) algorithm is an effective multi-agent reinforcement learning framework for solving such games. However, repeatedly training new policies from scratch to approximate Best Response (BR) to opponents' mixed… ▽ More A popular approach for solving zero-sum games is to maintain populations of policies to approximate the Nash Equilibrium (NE). Previous studies have shown that Policy Space Response Oracle (PSRO) algorithm is an effective multi-agent reinforcement learning framework for solving such games. However, repeatedly training new policies from scratch to approximate Best Response (BR) to opponents' mixed policies at each iteration is both inefficient and costly. While some PSRO variants initialize a new policy by inheriting from past BR policies, this approach limits the exploration of new policies, especially against challenging opponents. To address this issue, we propose Fusion-PSRO, which employs policy fusion to initialize policies for better approximation to BR. By selecting high-quality base policies from meta-NE, policy fusion fuses the base policies into a new policy through model averaging. This approach allows the initialized policies to incorporate multiple expert policies, making it easier to handle difficult opponents compared to inheriting from past BR policies or initializing from scratch. Moreover, our method only modifies the policy initialization phase, allowing its application to nearly all PSRO variants without additional training overhead. Our experiments on non-transitive matrix games, Leduc Poker, and the more complex Liars Dice demonstrate that Fusion-PSRO enhances the performance of nearly all PSRO variants, achieving lower exploitability. △ Less

Submitted 21 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

Comments: 20 pages, 5 figures

arXiv:2405.12569 [pdf, other]

TypeII-CsiNet: CSI Feedback with TypeII Codebook

Authors: Yiliang Sang, Ke Ma, Yang Ming, Jin Lian, Zhaocheng Wang

Abstract: The latest TypeII codebook selects partial strongest angular-delay ports for the feedback of downlink channel state information (CSI), whereas its performance is limited due to the deficiency of utilizing the correlations among the port coefficients. To tackle this issue, we propose a tailored autoencoder named TypeII-CsiNet to effectively integrate the TypeII codebook with deep learning, wherein… ▽ More The latest TypeII codebook selects partial strongest angular-delay ports for the feedback of downlink channel state information (CSI), whereas its performance is limited due to the deficiency of utilizing the correlations among the port coefficients. To tackle this issue, we propose a tailored autoencoder named TypeII-CsiNet to effectively integrate the TypeII codebook with deep learning, wherein three novel designs are developed for sufficiently boosting the sum rate performance. Firstly, a dedicated pre-processing module is designed to sort the selected ports for reserving the correlations of their corresponding coefficients. Secondly, a position-filling layer is developed in the decoder to fill the feedback coefficients into their ports in the recovered CSI matrix, so that the corresponding angular-delay-domain structure is adequately leveraged to enhance the reconstruction accuracy. Thirdly, a two-stage loss function is proposed to improve the sum rate performance while avoiding the trapping in local optimums during model training. Simulation results verify that our proposed TypeII-CsiNet outperforms the TypeII codebook and existing deep learning benchmarks. △ Less

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.10640 [pdf, other]

COMET: NFT Price Prediction with Wallet Profiling

Authors: Tianfu Wang, Liwei Deng, Chao Wang, Jianxun Lian, Yue Yan, Nicholas Jing Yuan, Qi Zhang, Hui Xiong

Abstract: As the non-fungible token (NFT) market flourishes, price prediction emerges as a pivotal direction for investors gaining valuable insight to maximize returns. However, existing works suffer from a lack of practical definitions and standardized evaluations, limiting their practical application. Moreover, the influence of users' multi-behaviour transactions that are publicly accessible on NFT price… ▽ More As the non-fungible token (NFT) market flourishes, price prediction emerges as a pivotal direction for investors gaining valuable insight to maximize returns. However, existing works suffer from a lack of practical definitions and standardized evaluations, limiting their practical application. Moreover, the influence of users' multi-behaviour transactions that are publicly accessible on NFT price is still not explored and exhibits challenges. In this paper, we address these gaps by presenting a practical and hierarchical problem definition. This approach unifies both collection-level and token-level task and evaluation methods, which cater to varied practical requirements of investors. To further understand the impact of user behaviours on the variation of NFT price, we propose a general wallet profiling framework and develop a COmmunity enhanced Multi-bEhavior Transaction graph model, named COMET. COMET profiles wallets with a comprehensive view and considers the impact of diverse relations and interactions within the NFT ecosystem on NFT price variations, thereby improving prediction performance. Extensive experiments conducted in our deployed system demonstrate the superiority of COMET, underscoring its potential in the insight toolkit for NFT investors. △ Less

Submitted 2 July, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

Comments: Accepted by KDD 2024 (ADS Track)

arXiv:2405.07964 [pdf, other]

Early phase simultaneous multi-band observations of Type II supernova SN 2024ggi with Mephisto

Authors: Xinlei Chen, Brajesh Kumar, Xinzhong Er, Helong Guo, Yuan-Pei Yang, Weikang Lin, Yuan Fang, Guowang Du, Chenxu Liu, Jiewei Zhao, Tianyu Zhang, Yuxi Bao, Xingzhu Zou, Yu Pan, Yu Wang, Xufeng Zhu, Kaushik Chatterjee, Xiangkun Liu, Dezi Liu, Edoardo P. Lagioia, Geeta Rangwal, Shiyan Zhong, Jinghua Zhang, Jianhui Lian, Yongzhi Cai , et al. (2 additional authors not shown)

Abstract: We present early-phase good cadence simultaneous multi-band ($ugi$, $vrz$--bands) imaging of nearby supernova SN 2024ggi, which exploded in the nearby galaxy, NGC~3621. A quick follow-up was conducted within less than a day after the explosion and continued $\sim$23 days. The $uvg$-band light curves display a rapid rise ($\sim$1.4 mag day$^{-1}$) to maximum in $\sim$4 days and absolute magnitude… ▽ More We present early-phase good cadence simultaneous multi-band ($ugi$, $vrz$--bands) imaging of nearby supernova SN 2024ggi, which exploded in the nearby galaxy, NGC~3621. A quick follow-up was conducted within less than a day after the explosion and continued $\sim$23 days. The $uvg$-band light curves display a rapid rise ($\sim$1.4 mag day$^{-1}$) to maximum in $\sim$4 days and absolute magnitude $M_{g}\sim$--17.75 mag. The post-peak decay rate in redder bands is $\sim$0.01 mag day$^{-1}$. Different colors (e.g., $u-g$ and $v-r$) of SN~2024ggi are slightly redder than SN~2023ixf. A significant rise ($\sim$12.5 kK) in black-body temperature (optical) was noticed within $\sim$2 days after the explosion, which successively decreased, indicating shock break out inside a dense circumstellar medium (CSM) surrounding the progenitor. Using semi-analytical modeling, the ejecta mass and progenitor radius were estimated as 1.2 M$_{\odot}$ and $\sim$550 R$_{\odot}$, respectively. The archival deep images ($g,r,i,z$-bands) from the Dark Energy Camera Legacy Survey (DECaLS) were examined, and a possible progenitor was detected in each band ($\sim$22--22.5 mag) and had a mass range of 14--17 M$_{\odot}$. △ Less

Submitted 13 May, 2024; originally announced May 2024.

Comments: Pages 9, Table 1, Figures 7

arXiv:2405.00527 [pdf, other]

ChatBI: Towards Natural Language to Complex Business Intelligence SQL

Authors: Jinqing Lian, Xinyi Liu, Yingxia Shao, Yang Dong, Ming Wang, Zhang Wei, Tianqi Wan, Ming Dong, Hailin Yan

Abstract: The Natural Language to SQL (NL2SQL) technology provides non-expert users who are unfamiliar with databases the opportunity to use SQL for data analysis.Converting Natural Language to Business Intelligence (NL2BI) is a popular practical scenario for NL2SQL in actual production systems. Compared to NL2SQL, NL2BI introduces more challenges. In this paper, we propose ChatBI, a comprehensive and eff… ▽ More The Natural Language to SQL (NL2SQL) technology provides non-expert users who are unfamiliar with databases the opportunity to use SQL for data analysis.Converting Natural Language to Business Intelligence (NL2BI) is a popular practical scenario for NL2SQL in actual production systems. Compared to NL2SQL, NL2BI introduces more challenges. In this paper, we propose ChatBI, a comprehensive and efficient technology for solving the NL2BI task. First, we analyze the interaction mode, an important module where NL2SQL and NL2BI differ in use, and design a smaller and cheaper model to match this interaction mode. In BI scenarios, tables contain a huge number of columns, making it impossible for existing NL2SQL methods that rely on Large Language Models (LLMs) for schema linking to proceed due to token limitations. The higher proportion of ambiguous columns in BI scenarios also makes schema linking difficult. ChatBI combines existing view technology in the database community to first decompose the schema linking problem into a Single View Selection problem and then uses a smaller and cheaper machine learning model to select the single view with a significantly reduced number of columns. The columns of this single view are then passed as the required columns for schema linking into the LLM. Finally, ChatBI proposes a phased process flow different from existing process flows, which allows ChatBI to generate SQL containing complex semantics and comparison relations more accurately. We have deployed ChatBI on Baidu's data platform and integrated it into multiple product lines for large-scale production task evaluation. The obtained results highlight its superiority in practicality, versatility, and efficiency. At the same time, compared with the current mainstream NL2SQL technology under our real BI scenario data tables and queries, it also achieved the best results. △ Less

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2405.00347 [pdf, ps, other]

Origin of the Very High Energy γ-rays in the Low-luminosity Active Galactic Nucleus NGC 4278

Authors: Ji-Shun Lian, Jia-Xuan Li, Xin-Ke Hu, Ying-Ying Gan, Tan-Zheng Wu, Jin Zhang

Abstract: NGC 4278, a Low-luminosity active galactic nucleus (AGN), is generally classified as a low-ionization nuclear emission line region (LINER) type AGN. Recently, it is reported to be associated with a very high energy (VHE) $γ$-ray source 1LHAASO J1219+2915 in the first Large High Altitude Air Shower Observatory (LHAASO) source catalog. However, no associated counterpart has been detected by Fermi-LA… ▽ More NGC 4278, a Low-luminosity active galactic nucleus (AGN), is generally classified as a low-ionization nuclear emission line region (LINER) type AGN. Recently, it is reported to be associated with a very high energy (VHE) $γ$-ray source 1LHAASO J1219+2915 in the first Large High Altitude Air Shower Observatory (LHAASO) source catalog. However, no associated counterpart has been detected by Fermi-LAT. By analyzing its X-ray observation data from Swift-XRT, we find it is in a high-flux state on MJD 59546, with the X-ray flux more than one order of magnitude higher than that observed $\sim$ 11.7 year earlier by Chandra. We propose that the detection of VHE $γ$-rays from NGC 4278 may be attributed to the presence of an active nucleus displaying behavior similar to that of a BL Lac. To reproduce its spectral energy distribution (SED), we employ a one-zone leptonic model, typically used for fitting broadband SEDs of BL Lacs, and find that smaller values for both Doppler factor ($δ$) and magnetic field strength ($B$) are required than that of typical TeV BL Lacs. Furthermore, NGC 4278 exhibits significantly lower luminosity in both radio and TeV bands when compared with typical TeV BL Lacs. In the radio-luminosity vs. Eddington-ratio plane, NGC 4278 shows greater similarity to Seyfert galaxies and LINERs rather than BL Lacs; however, it still roughly follows the extension towards lower luminosity seen in BL Lacs. △ Less

Submitted 1 May, 2024; originally announced May 2024.

Comments: 13 pages, 8 figures, 2 tables, Submitted, Comments are welcome

arXiv:2404.07687 [pdf, other]

Chaos in Motion: Unveiling Robustness in Remote Heart Rate Measurement through Brain-Inspired Skin Tracking

Authors: Jie Wang, Jing Lian, Minjie Ma, Junqiang Lei, Chunbiao Li, Bin Li, Jizhao Liu

Abstract: Heart rate is an important physiological indicator of human health status. Existing remote heart rate measurement methods typically involve facial detection followed by signal extraction from the region of interest (ROI). These SOTA methods have three serious problems: (a) inaccuracies even failures in detection caused by environmental influences or subject movement; (b) failures for special patie… ▽ More Heart rate is an important physiological indicator of human health status. Existing remote heart rate measurement methods typically involve facial detection followed by signal extraction from the region of interest (ROI). These SOTA methods have three serious problems: (a) inaccuracies even failures in detection caused by environmental influences or subject movement; (b) failures for special patients such as infants and burn victims; (c) privacy leakage issues resulting from collecting face video. To address these issues, we regard the remote heart rate measurement as the process of analyzing the spatiotemporal characteristics of the optical flow signal in the video. We apply chaos theory to computer vision tasks for the first time, thus designing a brain-inspired framework. Firstly, using an artificial primary visual cortex model to extract the skin in the videos, and then calculate heart rate by time-frequency analysis on all pixels. Our method achieves Robust Skin Tracking for Heart Rate measurement, called HR-RST. The experimental results show that HR-RST overcomes the difficulty of environmental influences and effectively tracks the subject movement. Moreover, the method could extend to other body parts. Consequently, the method can be applied to special patients and effectively protect individual privacy, offering an innovative solution. △ Less

Submitted 11 April, 2024; originally announced April 2024.

Comments: 8 pages, 10 figures

arXiv:2403.06465 [pdf, other]

doi 10.1145/3589335.3651242

RecAI: Leveraging Large Language Models for Next-Generation Recommender Systems

Authors: Jianxun Lian, Yuxuan Lei, Xu Huang, Jing Yao, Wei Xu, Xing Xie

Abstract: This paper introduces RecAI, a practical toolkit designed to augment or even revolutionize recommender systems with the advanced capabilities of Large Language Models (LLMs). RecAI provides a suite of tools, including Recommender AI Agent, Recommendation-oriented Language Models, Knowledge Plugin, RecExplainer, and Evaluator, to facilitate the integration of LLMs into recommender systems from mult… ▽ More This paper introduces RecAI, a practical toolkit designed to augment or even revolutionize recommender systems with the advanced capabilities of Large Language Models (LLMs). RecAI provides a suite of tools, including Recommender AI Agent, Recommendation-oriented Language Models, Knowledge Plugin, RecExplainer, and Evaluator, to facilitate the integration of LLMs into recommender systems from multifaceted perspectives. The new generation of recommender systems, empowered by LLMs, are expected to be more versatile, explainable, conversational, and controllable, paving the way for more intelligent and user-centric recommendation experiences. We hope the open-source of RecAI can help accelerate evolution of new advanced recommender systems. The source code of RecAI is available at \url{https://github.com/microsoft/RecAI}. △ Less

Submitted 11 March, 2024; originally announced March 2024.

Comments: 4 pages. Webconf 2024 demo track

MSC Class: 68T50

arXiv:2403.05063 [pdf, other]

Aligning Large Language Models for Controllable Recommendations

Authors: Wensheng Lu, Jianxun Lian, Wei Zhang, Guanghua Li, Mingyang Zhou, Hao Liao, Xing Xie

Abstract: Inspired by the exceptional general intelligence of Large Language Models (LLMs), researchers have begun to explore their application in pioneering the next generation of recommender systems - systems that are conversational, explainable, and controllable. However, existing literature primarily concentrates on integrating domain-specific knowledge into LLMs to enhance accuracy, often neglecting th… ▽ More Inspired by the exceptional general intelligence of Large Language Models (LLMs), researchers have begun to explore their application in pioneering the next generation of recommender systems - systems that are conversational, explainable, and controllable. However, existing literature primarily concentrates on integrating domain-specific knowledge into LLMs to enhance accuracy, often neglecting the ability to follow instructions. To address this gap, we initially introduce a collection of supervised learning tasks, augmented with labels derived from a conventional recommender model, aimed at explicitly improving LLMs' proficiency in adhering to recommendation-specific instructions. Subsequently, we develop a reinforcement learning-based alignment procedure to further strengthen LLMs' aptitude in responding to users' intentions and mitigating formatting errors. Through extensive experiments on two real-world datasets, our method markedly advances the capability of LLMs to comply with instructions within recommender systems, while sustaining a high level of accuracy performance. △ Less

Submitted 8 March, 2024; originally announced March 2024.

Comments: 13 pages

MSC Class: 68T50

arXiv:2403.04483 [pdf, other]

GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability

Authors: Zihan Luo, Xiran Song, Hong Huang, Jianxun Lian, Chenhao Zhang, Jinqi Jiang, Xing Xie

Abstract: Evaluating and enhancing the general capabilities of large language models (LLMs) has been an important research topic. Graph is a common data structure in the real world, and understanding graph data is a crucial part for advancing general intelligence. To evaluate and enhance the graph understanding abilities of LLMs, in this paper, we propose a benchmark named GraphInstruct, which comprehensive… ▽ More Evaluating and enhancing the general capabilities of large language models (LLMs) has been an important research topic. Graph is a common data structure in the real world, and understanding graph data is a crucial part for advancing general intelligence. To evaluate and enhance the graph understanding abilities of LLMs, in this paper, we propose a benchmark named GraphInstruct, which comprehensively includes 21 classical graph reasoning tasks, providing diverse graph generation pipelines and detailed reasoning steps. Based on GraphInstruct, we further construct GraphLM through efficient instruction-tuning, which shows prominent graph understanding capability. In order to enhance the LLM with graph reasoning capability as well, we propose a step mask training strategy, and construct a model named GraphLM+. As one of the pioneering efforts to enhance the graph understanding and reasoning abilities of LLMs, extensive experiments have demonstrated the superiority of GraphLM and GraphLM+ over other LLMs. We look forward to more researchers exploring the potential of LLMs in the graph data mining domain through GraphInstruct. Our code for generating GraphInstruct is released publicly at: https://github.com/CGCL-codes/GraphInstruct. △ Less

Submitted 2 April, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

Comments: 9 pages

arXiv:2403.00529 [pdf, other]

VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis

Authors: Weiwei Lin, Chenhang He, Man-Wai Mak, Jiachen Lian, Kong Aik Lee

Abstract: Achieving nuanced and accurate emulation of human voice has been a longstanding goal in artificial intelligence. Although significant progress has been made in recent years, the mainstream of speech synthesis models still relies on supervised speaker modeling and explicit reference utterances. However, there are many aspects of human voice, such as emotion, intonation, and speaking style, for whic… ▽ More Achieving nuanced and accurate emulation of human voice has been a longstanding goal in artificial intelligence. Although significant progress has been made in recent years, the mainstream of speech synthesis models still relies on supervised speaker modeling and explicit reference utterances. However, there are many aspects of human voice, such as emotion, intonation, and speaking style, for which it is hard to obtain accurate labels. In this paper, we propose VoxGenesis, a novel unsupervised speech synthesis framework that can discover a latent speaker manifold and meaningful voice editing directions without supervision. VoxGenesis is conceptually simple. Instead of mapping speech features to waveforms deterministically, VoxGenesis transforms a Gaussian distribution into speech distributions conditioned and aligned by semantic tokens. This forces the model to learn a speaker distribution disentangled from the semantic content. During the inference, sampling from the Gaussian distribution enables the creation of novel speakers with distinct characteristics. More importantly, the exploration of latent space uncovers human-interpretable directions associated with specific speaker characteristics such as gender attributes, pitch, tone, and emotion, allowing for voice editing by manipulating the latent codes along these identified directions. We conduct extensive experiments to evaluate the proposed VoxGenesis using both subjective and objective metrics, finding that it produces significantly more diverse and realistic speakers with distinct characteristics than the previous approaches. We also show that latent space manipulation produces consistent and human-identifiable effects that are not detrimental to the speech quality, which was not possible with previous approaches. Audio samples of VoxGenesis can be found at: \url{https://bit.ly/VoxGenesis}. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Comments: preprint

arXiv:2402.18899 [pdf, other]

Aligning Language Models for Versatile Text-based Item Retrieval

Authors: Yuxuan Lei, Jianxun Lian, Jing Yao, Mingqi Wu, Defu Lian, Xing Xie

Abstract: This paper addresses the gap between general-purpose text embeddings and the specific demands of item retrieval tasks. We demonstrate the shortcomings of existing models in capturing the nuances necessary for zero-shot performance on item retrieval tasks. To overcome these limitations, we propose generate in-domain dataset from ten tasks tailored to unlocking models' representation ability for ite… ▽ More This paper addresses the gap between general-purpose text embeddings and the specific demands of item retrieval tasks. We demonstrate the shortcomings of existing models in capturing the nuances necessary for zero-shot performance on item retrieval tasks. To overcome these limitations, we propose generate in-domain dataset from ten tasks tailored to unlocking models' representation ability for item retrieval. Our empirical studies demonstrate that fine-tuning embedding models on the dataset leads to remarkable improvements in a variety of retrieval tasks. We also illustrate the practical application of our refined model in a conversational setting, where it enhances the capabilities of LLM-based Recommender Agents like Chat-Rec. Our code is available at https://github.com/microsoft/RecAI. △ Less

Submitted 29 February, 2024; originally announced February 2024.

Comments: 4 pages,1 figures, 4 tables

arXiv:2402.15847 [pdf, other]

Unified Interpretation of Muon g-2 anomaly, 95 GeV Diphoton, and $b\bar{b}$ Excesses in the General Next-to-Minimal Supersymmetric Standard Model

Authors: Junjie Cao, Xinglong Jia, Jingwei Lian

Abstract: We investigate three intriguing anomalies within the framework of the General Next-to-Minimal Supersymmetric Standard Model. These anomalies include a significant deviation of the experimental results for the muon anomalous magnetic moment from its Standard Model prediction, with a confidence level of $5.1σ$; a joint observation by the CMS and ATLAS collaborations of a diphoton excess with a local… ▽ More We investigate three intriguing anomalies within the framework of the General Next-to-Minimal Supersymmetric Standard Model. These anomalies include a significant deviation of the experimental results for the muon anomalous magnetic moment from its Standard Model prediction, with a confidence level of $5.1σ$; a joint observation by the CMS and ATLAS collaborations of a diphoton excess with a local significance of $3.1 σ$ in the invariant mass distribution around 95.4 GeV; and a reported excess in the $b\bar{b}$ production at LEP with a local significance of $2.3 σ$. Through analytical and numerical analyses, we provide unified interpretations across an extensive parameter space that remain consistent with current experimental restrictions from data on the Higgs boson at 125 GeV, B-physics measurements, dark matter observables, as well as existing searches for supersymmetry and extra Higgs bosons. We attribute the muon anomaly to loops involving muon-smuon-neutralino and muon-sneutrino-chargino interactions, while attributing the diphoton and $b \bar{b}$ excesses to the resonant production of a singlet-dominated scalar. These proposed solutions are poised for experimental tests at the high-luminosity LHC and future linear colliders. △ Less

Submitted 24 February, 2024; originally announced February 2024.

Comments: 38 pages, 7 figures

arXiv:2402.15466 [pdf, other]

Automatic treatment planning for radiotherapy: a cross-modality and protocol study

Authors: Gregory Szalkowski, Xuanang Xu, Shiva Das, Pew-Thian Yap, Jun Lian

Abstract: This study investigates the applicability of 3D dose predictions from a model trained on one modality to a cross-modality automated planning workflow. Additionally, we explore the impact of integrating a multi-criteria optimizer on adapting predictions to different clinical preferences. Using a previously created three-stage UNet in-house model trained on the 2020 AAPM OpenKBP challenge dataset (3… ▽ More This study investigates the applicability of 3D dose predictions from a model trained on one modality to a cross-modality automated planning workflow. Additionally, we explore the impact of integrating a multi-criteria optimizer on adapting predictions to different clinical preferences. Using a previously created three-stage UNet in-house model trained on the 2020 AAPM OpenKBP challenge dataset (340 head and neck plans, planned using 9-field static IMRT), we retrospectively generated dose predictions for 20 patients. These dose predictions were used to generate deliverable IMRT, VMAT, and Tomotherapy plans using the fallback plan functionality in Raystation. The deliverable plans were evaluated against the dose predictions based on primary clinical goals. A new set of plans was also generated using MCO-based optimization with predicted dose values as constraints. The mimicking approach accurately replicated the predicted dose distributions across different modalities, with slight deviations in spinal cord and external contour maximum doses. MCO customization significantly reduced doses to OARs prioritized by our institution while maintaining target coverage. All tested plans met clinical deliverability standards, evidenced by a delivery QA gamma analysis passing rate above 98%. Our findings show that a model trained only on IMRT plans can contribute to planning across various modalities. Additionally, integrating predictions as constraints in an MCO-based workflow, rather than direct dose mimicking, enables a flexible, warm-start approach for treatment planning. Together, these approaches have the potential to significantly decrease plan turnaround time and quality variance, both at high resource medical centers that can train in-house models, and smaller centers that can adapt a model from another institution with minimal effort. △ Less

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.14493 [pdf, ps, other]

An Improved Pseudopolynomial Time Algorithm for Subset Sum

Authors: Lin Chen, Jiayi Lian, Yuchen Mao, Guochuan Zhang

Abstract: We investigate pseudo-polynomial time algorithms for Subset Sum. Given a multi-set $X$ of $n$ positive integers and a target $t$, Subset Sum asks whether some subset of $X$ sums to $t$. Bringmann proposes an $\tilde{O}(n + t)$-time algorithm [Bringmann SODA'17], and an open question has naturally arisen: can Subset Sum be solved in $O(n + w)$ time? Here $w$ is the maximum integer in $X$. We make a… ▽ More We investigate pseudo-polynomial time algorithms for Subset Sum. Given a multi-set $X$ of $n$ positive integers and a target $t$, Subset Sum asks whether some subset of $X$ sums to $t$. Bringmann proposes an $\tilde{O}(n + t)$-time algorithm [Bringmann SODA'17], and an open question has naturally arisen: can Subset Sum be solved in $O(n + w)$ time? Here $w$ is the maximum integer in $X$. We make a progress towards resolving the open question by proposing an $\tilde{O}(n + \sqrt{wt})$-time algorithm. △ Less

Submitted 4 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: In first version, we falsely claimed that our algorithm is also able to reconstruct a subset that sums to t. In the latest version, we removed this false claim and explained why we cannot do reconstruction

arXiv:2402.11426 [pdf, ps, other]

Approximating Partition in Near-Linear Time

Authors: Lin Chen, Jiayi Lian, Yuchen Mao, Guochuan Zhang

Abstract: We propose an $\widetilde{O}(n + 1/\eps)$-time FPTAS (Fully Polynomial-Time Approximation Scheme) for the classical Partition problem. This is the best possible (up to a polylogarithmic factor) assuming SETH (Strong Exponential Time Hypothesis) [Abboud, Bringmann, Hermelin, and Shabtay'22]. Prior to our work, the best known FPTAS for Partition runs in $\widetilde{O}(n + 1/\eps^{5/4})$ time [Deng,… ▽ More We propose an $\widetilde{O}(n + 1/\eps)$-time FPTAS (Fully Polynomial-Time Approximation Scheme) for the classical Partition problem. This is the best possible (up to a polylogarithmic factor) assuming SETH (Strong Exponential Time Hypothesis) [Abboud, Bringmann, Hermelin, and Shabtay'22]. Prior to our work, the best known FPTAS for Partition runs in $\widetilde{O}(n + 1/\eps^{5/4})$ time [Deng, Jin and Mao'23, Wu and Chen'22]. Our result is obtained by solving a more general problem of weakly approximating Subset Sum. △ Less

Submitted 6 April, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

Comments: To appear in STOC2024

arXiv:2402.02411 [pdf, other]

Physics-Inspired Degradation Models for Hyperspectral Image Fusion

Authors: Jie Lian, Lizhi Wang, Lin Zhu, Renwei Dian, Zhiwei Xiong, Hua Huang

Abstract: The fusion of a low-spatial-resolution hyperspectral image (LR-HSI) with a high-spatial-resolution multispectral image (HR-MSI) has garnered increasing research interest. However, most fusion methods solely focus on the fusion algorithm itself and overlook the degradation models, which results in unsatisfactory performance in practical scenarios. To fill this gap, we propose physics-inspired degra… ▽ More The fusion of a low-spatial-resolution hyperspectral image (LR-HSI) with a high-spatial-resolution multispectral image (HR-MSI) has garnered increasing research interest. However, most fusion methods solely focus on the fusion algorithm itself and overlook the degradation models, which results in unsatisfactory performance in practical scenarios. To fill this gap, we propose physics-inspired degradation models (PIDM) to model the degradation of LR-HSI and HR-MSI, which comprises a spatial degradation network (SpaDN) and a spectral degradation network (SpeDN). SpaDN and SpeDN are designed based on two insights. First, we employ spatial warping and spectral modulation operations to simulate lens aberrations, thereby introducing non-uniformity into the spatial and spectral degradation processes. Second, we utilize asymmetric downsampling and parallel downsampling operations to separately reduce the spatial and spectral resolutions of the images, thus ensuring the matching of spatial and spectral degradation processes with specific physical characteristics. Once SpaDN and SpeDN are established, we adopt a self-supervised training strategy to optimize the network parameters and provide a plug-and-play solution for fusion methods. Comprehensive experiments demonstrate that our proposed PIDM can boost the fusion performance of existing fusion methods in practical scenarios. △ Less

Submitted 4 February, 2024; originally announced February 2024.

arXiv:2401.10015 [pdf, other]

Towards Hierarchical Spoken Language Dysfluency Modeling

Authors: Jiachen Lian, Gopala Anumanchipalli

Abstract: Speech disfluency modeling is the bottleneck for both speech therapy and language learning. However, there is no effective AI solution to systematically tackle this problem. We solidify the concept of disfluent speech and disfluent speech modeling. We then present Hierarchical Unconstrained Disfluency Modeling (H-UDM) approach, the hierarchical extension of UDM that addresses both disfluency trans… ▽ More Speech disfluency modeling is the bottleneck for both speech therapy and language learning. However, there is no effective AI solution to systematically tackle this problem. We solidify the concept of disfluent speech and disfluent speech modeling. We then present Hierarchical Unconstrained Disfluency Modeling (H-UDM) approach, the hierarchical extension of UDM that addresses both disfluency transcription and detection to eliminate the need for extensive manual annotation. Our experimental findings serve as clear evidence of the effectiveness and reliability of the methods we have introduced, encompassing both transcription and detection tasks. △ Less

Submitted 21 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

Comments: 2024 EACL. Hierarchical extension of our previous workshop paper arXiv:2312.12810

arXiv:2401.08649 [pdf, other]

Deep Pulse-Coupled Neural Networks

Authors: Zexiang Yi, Jing Lian, Yunliang Qi, Zhaofei Yu, Huajin Tang, Yide Ma, Jizhao Liu

Abstract: Spiking Neural Networks (SNNs) capture the information processing mechanism of the brain by taking advantage of spiking neurons, such as the Leaky Integrate-and-Fire (LIF) model neuron, which incorporates temporal dynamics and transmits information via discrete and asynchronous spikes. However, the simplified biological properties of LIF ignore the neuronal coupling and dendritic structure of real… ▽ More Spiking Neural Networks (SNNs) capture the information processing mechanism of the brain by taking advantage of spiking neurons, such as the Leaky Integrate-and-Fire (LIF) model neuron, which incorporates temporal dynamics and transmits information via discrete and asynchronous spikes. However, the simplified biological properties of LIF ignore the neuronal coupling and dendritic structure of real neurons, which limits the spatio-temporal dynamics of neurons and thus reduce the expressive power of the resulting SNNs. In this work, we leverage a more biologically plausible neural model with complex dynamics, i.e., a pulse-coupled neural network (PCNN), to improve the expressiveness and recognition performance of SNNs for vision tasks. The PCNN is a type of cortical model capable of emulating the complex neuronal activities in the primary visual cortex. We construct deep pulse-coupled neural networks (DPCNNs) by replacing commonly used LIF neurons in SNNs with PCNN neurons. The intra-coupling in existing PCNN models limits the coupling between neurons only within channels. To address this limitation, we propose inter-channel coupling, which allows neurons in different feature maps to interact with each other. Experimental results show that inter-channel coupling can efficiently boost performance with fewer neurons, synapses, and less training time compared to widening the networks. For instance, compared to the LIF-based SNN with wide VGG9, DPCNN with VGG9 uses only 50%, 53%, and 73% of neurons, synapses, and training time, respectively. Furthermore, we propose receptive field and time dependent batch normalization (RFTD-BN) to speed up the convergence and performance of DPCNNs. △ Less

Submitted 24 December, 2023; originally announced January 2024.

arXiv:2401.06633 [pdf, other]

Ada-Retrieval: An Adaptive Multi-Round Retrieval Paradigm for Sequential Recommendations

Authors: Lei Li, Jianxun Lian, Xiao Zhou, Xing Xie

Abstract: Retrieval models aim at selecting a small set of item candidates which match the preference of a given user. They play a vital role in large-scale recommender systems since subsequent models such as rankers highly depend on the quality of item candidates. However, most existing retrieval models employ a single-round inference paradigm, which may not adequately capture the dynamic nature of user pr… ▽ More Retrieval models aim at selecting a small set of item candidates which match the preference of a given user. They play a vital role in large-scale recommender systems since subsequent models such as rankers highly depend on the quality of item candidates. However, most existing retrieval models employ a single-round inference paradigm, which may not adequately capture the dynamic nature of user preferences and stuck in one area in the item space. In this paper, we propose Ada-Retrieval, an adaptive multi-round retrieval paradigm for recommender systems that iteratively refines user representations to better capture potential candidates in the full item space. Ada-Retrieval comprises two key modules: the item representation adapter and the user representation adapter, designed to inject context information into items' and users' representations. The framework maintains a model-agnostic design, allowing seamless integration with various backbone models such as RNNs or Transformers. We perform experiments on three widely used public datasets, incorporating five powerful sequential recommenders as backbone models. Our results demonstrate that Ada-Retrieval significantly enhances the performance of various base models, with consistent improvements observed across different datasets. Our code and data are publicly available at: https://github.com/ll0ruc/Ada-Retrieval. △ Less

Submitted 31 January, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

Comments: 9 pages, Accepted to AAAI2024

arXiv:2312.12810 [pdf, other]

Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection

Authors: Jiachen Lian, Carly Feng, Naasir Farooqi, Steve Li, Anshul Kashyap, Cheol Jun Cho, Peter Wu, Robbie Netzorg, Tingle Li, Gopala Krishna Anumanchipalli

Abstract: Dysfluent speech modeling requires time-accurate and silence-aware transcription at both the word-level and phonetic-level. However, current research in dysfluency modeling primarily focuses on either transcription or detection, and the performance of each aspect remains limited. In this work, we present an unconstrained dysfluency modeling (UDM) approach that addresses both transcription and dete… ▽ More Dysfluent speech modeling requires time-accurate and silence-aware transcription at both the word-level and phonetic-level. However, current research in dysfluency modeling primarily focuses on either transcription or detection, and the performance of each aspect remains limited. In this work, we present an unconstrained dysfluency modeling (UDM) approach that addresses both transcription and detection in an automatic and hierarchical manner. UDM eliminates the need for extensive manual annotation by providing a comprehensive solution. Furthermore, we introduce a simulated dysfluent dataset called VCTK++ to enhance the capabilities of UDM in phonetic transcription. Our experimental results demonstrate the effectiveness and robustness of our proposed methods in both transcription and detection tasks. △ Less

Submitted 20 December, 2023; originally announced December 2023.

Comments: 2023 ASRU

arXiv:2312.11111 [pdf, other]

The Good, The Bad, and Why: Unveiling Emotions in Generative AI

Authors: Cheng Li, Jindong Wang, Yixuan Zhang, Kaijie Zhu, Xinyi Wang, Wenxin Hou, Jianxun Lian, Fang Luo, Qiang Yang, Xing Xie

Abstract: Emotion significantly impacts our daily behaviors and interactions. While recent generative AI models, such as large language models, have shown impressive performance in various tasks, it remains unclear whether they truly comprehend emotions. This paper aims to address this gap by incorporating psychological theories to gain a holistic understanding of emotions in generative AI models. Specifica… ▽ More Emotion significantly impacts our daily behaviors and interactions. While recent generative AI models, such as large language models, have shown impressive performance in various tasks, it remains unclear whether they truly comprehend emotions. This paper aims to address this gap by incorporating psychological theories to gain a holistic understanding of emotions in generative AI models. Specifically, we propose three approaches: 1) EmotionPrompt to enhance AI model performance, 2) EmotionAttack to impair AI model performance, and 3) EmotionDecode to explain the effects of emotional stimuli, both benign and malignant. Through extensive experiments involving language and multi-modal models on semantic understanding, logical reasoning, and generation tasks, we demonstrate that both textual and visual EmotionPrompt can boost the performance of AI models while EmotionAttack can hinder it. Additionally, EmotionDecode reveals that AI models can comprehend emotional stimuli akin to the mechanism of dopamine in the human brain. Our work heralds a novel avenue for exploring psychology to enhance our understanding of generative AI models. △ Less

Submitted 7 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

Comments: International Conference on Machine Learning (ICML) 2024; an extension to EmotionPrompt (arXiv:2307.11760)

arXiv:2312.07070 [pdf, other]

The thickness of galaxy disks from z=5 to 0 probed by JWST

Authors: Jianhui Lian, Li Luo

Abstract: Although thick disk is a structure prevalent in local disk galaxies and also present in our home Galaxy, its formation and evolution is still unclear. Whether the thick disk is born thick and/or gradually heated to be thick after formation is under debate. To disentangle these two scenarios, one effective approach is to inspect the thickness of young disk galaxies in the high redshift Universe. In… ▽ More Although thick disk is a structure prevalent in local disk galaxies and also present in our home Galaxy, its formation and evolution is still unclear. Whether the thick disk is born thick and/or gradually heated to be thick after formation is under debate. To disentangle these two scenarios, one effective approach is to inspect the thickness of young disk galaxies in the high redshift Universe. In this work we study the vertical structure of 191 edge-on galaxies spanning redshift from 0.2 to 5 using JWST NIRCAM imaging observations. For each galaxy, we retrieve the vertical surface brightness profile at 1 ${R_e}$ and fit a sech$^2$ function that has been convolved with the line spread function. The obtained scale height of galaxies at $z>1.5$ show no clear dependence on redshift, with a median value in remarkable agreement with that of the Milky Way's thick disk. This suggests that local thick disks are already thick when they were formed in the early times and secular heating is unlikely the main driver of thick disk formation. For galaxies at $z<1.5$, however, the disk scale height decreases systematically towards lower redshift, with low-redshift galaxies having comparable scale height with that of the Milky Way's thin disk. This cosmic evolution of disk thickness favors an upside-down formation scenario of galaxy disks. △ Less

Submitted 12 December, 2023; originally announced December 2023.

Comments: Accepted by ApJL,9 pages, and 5 figures. Comments welcome!

arXiv:2312.01755 [pdf, ps, other]

X-Ray Polarization Variability of High Spectral Peak BL Lacertaes: Cases of 1ES 1959+650 and PKS 2155-304

Authors: Xin-Ke Hu, Yu-Wei Yu, Jin Zhang, Tan-Zheng Wu, Ji-Shun Lian, Xiang-Gao Wang, Hai-Ming Zhang, En-Wei Liang

Abstract: The high-energy-peaked BL Lacertae objects (HBLs) are the main targets of the Imaging X-ray Polarimetry Explorer (IXPE) for investigating the mechanisms of radiation and particle acceleration in jets. In this paper, we report the first IXPE observations of two HBLs, 1ES 1959+650 and PKS 2155--304. Both sources exhibit X-ray polarization with a confidence level exceeding 99\%, as well as significan… ▽ More The high-energy-peaked BL Lacertae objects (HBLs) are the main targets of the Imaging X-ray Polarimetry Explorer (IXPE) for investigating the mechanisms of radiation and particle acceleration in jets. In this paper, we report the first IXPE observations of two HBLs, 1ES 1959+650 and PKS 2155--304. Both sources exhibit X-ray polarization with a confidence level exceeding 99\%, as well as significant variability in polarization across different time intervals and energy ranges. Notably, PKS 2155--304 demonstrates the highest X-ray polarization among all blazars detected by IXPE within its entire energy band (2--8 keV), with a polarization degree of $Π_{\rm X}=21.9\%\pm1.9\%$ (MDP$_{99}\sim$6.0\%). An even higher polarization is observed in the 3--4 keV band, reaching $Π_{\rm X}=28.6\%\pm2.7\%$ (MDP$_{99}\sim$8.1\%) with a confidence level of 10.8$σ$. Furthermore, no polarization is detected above 5 keV energy band. For 1ES 1959+650, the highest detected polarization degree in the 2--8 keV band is $Π_{\rm X}=12.4\%\pm0.7\%$ (MDP$_{99}\sim$2.2\%), with an electric vector position angle (EVPA) of $ψ_{\rm X}=19.7^{\circ}\pm1.6^{\circ}$. The X-ray polarization of 1ES 1959+650 exhibits evident variability, accompanied by the variations of $ψ_{\rm X}$, flux, spectrum, and energy bin. We discuss possible implications of these observational findings, including the variability in polarization, rotation of EVPA, and transition between synchrotron and synchrotron-self-Compton. We speculate that the X-rays observed during different IXPE observations originate from distinct regions in the jet and may involve diverse mechanisms for particle acceleration. △ Less

Submitted 28 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

Comments: The Astrophysical Journal Letters, In Press

arXiv:2311.15519 [pdf, other]

Chemo-dynamical Nature of the Anticenter Stream and Monoceros Ring

Authors: Yi Qiao, Baitian Tang, Jianhui Lian, Jing Li, Cheng Xu

Abstract: In the epoch of deep photometric surveys, a large number of substructures, e.g., over-densities, streams, were identified. With the help of astrometry and spectroscopy, the community revealed a complex picture of our Milky Way (MW) after investigating their origins. Off-plane substructures Anticenter Stream (ACS) and Monoceros Ring (MNC), once considered as dissolving dwarf galaxies, were later fo… ▽ More In the epoch of deep photometric surveys, a large number of substructures, e.g., over-densities, streams, were identified. With the help of astrometry and spectroscopy, the community revealed a complex picture of our Milky Way (MW) after investigating their origins. Off-plane substructures Anticenter Stream (ACS) and Monoceros Ring (MNC), once considered as dissolving dwarf galaxies, were later found to share similar kinematics and metallicity with the Galactic outer thin disk. In this work, we aim to chemically tag ACS and MNC with high-accuracy abundances from the APOGEE survey. By extrapolating chemical abundance trends in the outer thin disk region (10 < Rgc < 18 kpc, 0 < |Zgc| < 3kpc), we found that ACS and MNC stars show consistent chemical abundances as the extrapolating values for 12 elements, including C, N, O, Mg, Al, Si, K, Ca, Cr, Mn, Co and Ni. The similar chemical patterns indicate that ACS and MNC have similar star formation history as the MW outer thin disk, meanwhile, we also excluded their dwarf galaxy association, as they are distinctive in multiple chemical spaces. The ages of ACS and MNC stars are consistent with the time of the first Sgr dSph passage, indicating their possible connection. △ Less

Submitted 25 December, 2023; v1 submitted 26 November, 2023; originally announced November 2023.

Comments: 11 pages, 9 figures, 1 table, accepted for publication in ApJ

arXiv:2311.14304 [pdf, other]

AdaMedGraph: Adaboosting Graph Neural Networks for Personalized Medicine

Authors: Jie Lian, Xufang Luo, Caihua Shan, Dongqi Han, Varut Vardhanabhuti, Dongsheng Li

Abstract: Precision medicine tailored to individual patients has gained significant attention in recent times. Machine learning techniques are now employed to process personalized data from various sources, including images, genetics, and assessments. These techniques have demonstrated good outcomes in many clinical prediction tasks. Notably, the approach of constructing graphs by linking similar patients a… ▽ More Precision medicine tailored to individual patients has gained significant attention in recent times. Machine learning techniques are now employed to process personalized data from various sources, including images, genetics, and assessments. These techniques have demonstrated good outcomes in many clinical prediction tasks. Notably, the approach of constructing graphs by linking similar patients and then applying graph neural networks (GNNs) stands out, because related information from analogous patients are aggregated and considered for prediction. However, selecting the appropriate edge feature to define patient similarity and construct the graph is challenging, given that each patient is depicted by high-dimensional features from diverse sources. Previous studies rely on human expertise to select the edge feature, which is neither scalable nor efficient in pinpointing crucial edge features for complex diseases. In this paper, we propose a novel algorithm named \ours, which can automatically select important features to construct multiple patient similarity graphs, and train GNNs based on these graphs as weak learners in adaptive boosting. \ours{} is evaluated on two real-world medical scenarios and shows superiors performance. △ Less

Submitted 24 November, 2023; originally announced November 2023.

Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 9 pages

arXiv:2311.10947 [pdf, other]

doi 10.1145/3637528.3671802

RecExplainer: Aligning Large Language Models for Explaining Recommendation Models

Authors: Yuxuan Lei, Jianxun Lian, Jing Yao, Xu Huang, Defu Lian, Xing Xie

Abstract: Recommender systems are widely used in online services, with embedding-based models being particularly popular due to their expressiveness in representing complex signals. However, these models often function as a black box, making them less transparent and reliable for both users and developers. Recently, large language models (LLMs) have demonstrated remarkable intelligence in understanding, rea… ▽ More Recommender systems are widely used in online services, with embedding-based models being particularly popular due to their expressiveness in representing complex signals. However, these models often function as a black box, making them less transparent and reliable for both users and developers. Recently, large language models (LLMs) have demonstrated remarkable intelligence in understanding, reasoning, and instruction following. This paper presents the initial exploration of using LLMs as surrogate models to explaining black-box recommender models. The primary concept involves training LLMs to comprehend and emulate the behavior of target recommender models. By leveraging LLMs' own extensive world knowledge and multi-step reasoning abilities, these aligned LLMs can serve as advanced surrogates, capable of reasoning about observations. Moreover, employing natural language as an interface allows for the creation of customizable explanations that can be adapted to individual user preferences. To facilitate an effective alignment, we introduce three methods: behavior alignment, intention alignment, and hybrid alignment. Behavior alignment operates in the language space, representing user preferences and item information as text to mimic the target model's behavior; intention alignment works in the latent space of the recommendation model, using user and item representations to understand the model's behavior; hybrid alignment combines both language and latent spaces. Comprehensive experiments conducted on three public datasets show that our approach yields promising results in understanding and mimicking target models, producing high-quality, high-fidelity, and distinct explanations. Our code is available at https://github.com/microsoft/RecAI. △ Less

Submitted 22 June, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

Comments: 12 pages, 9 figures, 5 tables

arXiv:2311.10779 [pdf, other]

Knowledge Plugins: Enhancing Large Language Models for Domain-Specific Recommendations

Authors: Jing Yao, Wei Xu, Jianxun Lian, Xiting Wang, Xiaoyuan Yi, Xing Xie

Abstract: The significant progress of large language models (LLMs) provides a promising opportunity to build human-like systems for various practical applications. However, when applied to specific task domains, an LLM pre-trained on a general-purpose corpus may exhibit a deficit or inadequacy in two types of domain-specific knowledge. One is a comprehensive set of domain data that is typically large-scale… ▽ More The significant progress of large language models (LLMs) provides a promising opportunity to build human-like systems for various practical applications. However, when applied to specific task domains, an LLM pre-trained on a general-purpose corpus may exhibit a deficit or inadequacy in two types of domain-specific knowledge. One is a comprehensive set of domain data that is typically large-scale and continuously evolving. The other is specific working patterns of this domain reflected in the data. The absence or inadequacy of such knowledge impacts the performance of the LLM. In this paper, we propose a general paradigm that augments LLMs with DOmain-specific KnowledgE to enhance their performance on practical applications, namely DOKE. This paradigm relies on a domain knowledge extractor, working in three steps: 1) preparing effective knowledge for the task; 2) selecting the knowledge for each specific sample; and 3) expressing the knowledge in an LLM-understandable way. Then, the extracted knowledge is incorporated through prompts, without any computational cost of model fine-tuning. We instantiate the general paradigm on a widespread application, i.e. recommender systems, where critical item attributes and collaborative filtering signals are incorporated. Experimental results demonstrate that DOKE can substantially improve the performance of LLMs in specific domains. △ Less

Submitted 16 November, 2023; originally announced November 2023.

arXiv:2311.05459 [pdf, other]

Observational constraints on the origin of the elements. VIII. Constraining the Barium, Strontium and Yttrium chemical evolution in metal-poor stars

Authors: G. Guiglion, M. Bergemann, N. Storm, J. Lian, G. Cescutti, A. Serenelli

Abstract: Recently Lian et al. (2023), thanks to Gaia-ESO data, studied the chemical evolution of neutron-capture elements in the regime [Fe/H]>-1. We aim here to complement this study down to [Fe/H]=-3, and focus on Ba, Y, Sr, and abundance ratios of [Ba/Y] and [Sr/Y], which give comprehensive views on s-process nucleosynthesis channels. We measured LTE and NLTE abundances of Ba, Y, and Sr in 323 Galactic… ▽ More Recently Lian et al. (2023), thanks to Gaia-ESO data, studied the chemical evolution of neutron-capture elements in the regime [Fe/H]>-1. We aim here to complement this study down to [Fe/H]=-3, and focus on Ba, Y, Sr, and abundance ratios of [Ba/Y] and [Sr/Y], which give comprehensive views on s-process nucleosynthesis channels. We measured LTE and NLTE abundances of Ba, Y, and Sr in 323 Galactic metal-poor stars using high-resolution optical spectra with high S/N. We used the spectral fitting code TSFitPy, together with 1D model atmospheres using previously determined LTE and NLTE atmospheric parameters. The NLTE effects are on the order of -0.1 to ~0.2dex depending on the element. T he ratio between heavy and light s-process elements [Ba/Y] varies weakly with [Fe/H] even in the metal-poor regime, consistently with the behavior in the metal-rich regime. The [Ba/Y] scatter at a given metallicity is larger than the abundance measurement uncertainties. Homogeneous chemical evolution models with different yields prescriptions are unable to accurately reproduce the [Ba/Y] scatter at low-[Fe/H]. Adopting the stochastic chemical evolution model by Cescutti & Chaippini (2014) allows to reproduce the observed scatter in the abundance pattern of [Ba/Y] and [Ba/Sr]. With our observations, we rule out the need for an arbitrary scaling of the r-process contribution as previously suggested by the model authors. We have showed how important it is to properly include NLTE effects when measuring chemical abundances, especially in the metal-poor regime. This work shows that the choice of the Galactic chemical evolution model (stochastic vs. 1-zone) is key when comparing models to observations. The upcoming surveys such as 4MOST and WEAVE will deliver high quality spectra of many thousands of metal-poor stars, and this work gives a typical case study of what could be achieved with such surveys. △ Less

Submitted 9 November, 2023; originally announced November 2023.

Comments: 8 pages, 8 figures, submitted to A&A, comments welcome

arXiv:2310.13260 [pdf, other]

A Data-Centric Multi-Objective Learning Framework for Responsible Recommendation Systems

Authors: Xu Huang, Jianxun Lian, Hao Wang, Defu Lian, Xing Xie

Abstract: Recommendation systems effectively guide users in locating their desired information within extensive content repositories. Generally, a recommendation model is optimized to enhance accuracy metrics from a user utility standpoint, such as click-through rate or matching relevance. However, a responsible industrial recommendation system must address not only user utility (responsibility to users) bu… ▽ More Recommendation systems effectively guide users in locating their desired information within extensive content repositories. Generally, a recommendation model is optimized to enhance accuracy metrics from a user utility standpoint, such as click-through rate or matching relevance. However, a responsible industrial recommendation system must address not only user utility (responsibility to users) but also other objectives, including increasing platform revenue (responsibility to platforms), ensuring fairness (responsibility to content creators), and maintaining unbiasedness (responsibility to long-term healthy development). Multi-objective learning is a potent approach for achieving responsible recommendation systems. Nevertheless, current methods encounter two challenges: difficulty in scaling to heterogeneous objectives within a unified framework, and inadequate controllability over objective priority during optimization, leading to uncontrollable solutions. In this paper, we present a data-centric optimization framework, MoRec, which unifies the learning of diverse objectives. MoRec is a tri-level framework: the outer level manages the balance between different objectives, utilizing a proportional-integral-derivative (PID)-based controller to ensure a preset regularization on the primary objective. The middle level transforms objective-aware optimization into data sampling weights using sign gradients. The inner level employs a standard optimizer to update model parameters with the sampled data. Consequently, MoRec can flexibly support various objectives while maintaining the original model intact. Comprehensive experiments on two public datasets and one industrial dataset showcase the effectiveness, controllability, flexibility, and Pareto efficiency of MoRec, making it highly suitable for real-world implementation. △ Less

Submitted 19 October, 2023; originally announced October 2023.

Comments: 10 pages

arXiv:2310.08436 [pdf, other]

95 GeV Diphoton and $b \bar{b}$ Excesses in the General Next-to-Minimal Supersymmetric Standard Model

Authors: Junjie Cao, Xinglong Jia, Jingwei Lian, Lei Meng

Abstract: The CMS and ATLAS collaborations recently published their results searching for light Higgs bosons, using the complete Run 2 data of the LHC. Both reported an excess in the diphoton invariant mass distribution at $m_{γγ} \simeq 95.4~{\rm GeV}$ with compatible signal strengths. The combined result corresponded to a local significance of $3.1σ$. Besides, the mass of the diphoton signal coincided wit… ▽ More The CMS and ATLAS collaborations recently published their results searching for light Higgs bosons, using the complete Run 2 data of the LHC. Both reported an excess in the diphoton invariant mass distribution at $m_{γγ} \simeq 95.4~{\rm GeV}$ with compatible signal strengths. The combined result corresponded to a local significance of $3.1σ$. Besides, the mass of the diphoton signal coincided with that of the $b\bar{b}$ excess observed at the LEP. Given the remarkable theoretical advantages of the general Next-to-Minimal Supersymmetric Standard Model, we interpret these excesses by the resonant productions of the singlet-dominated CP-even Higgs boson predicted by the theory. Using both analytic formulae and numerical results, we show that the idea can interpret the excesses by broad parameter space without contradicting current experimental restrictions, including those from the 125~{\rm GeV} Higgs data, the dark matter relic abundance and direct detection experiments, and the collider searches for supersymmetry and extra Higgs bosons. Although the explanations are scarcely affected by present Higgs data and the LHC search for supersymmetry, the dark matter physics may leave footprints on them. We also survey the other signals of the light Higgs boson at the LHC. △ Less

Submitted 19 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

Comments: 34 pages, 11 figures

arXiv:2310.05962 [pdf, other]

Improving the Performance of R17 Type-II Codebook with Deep Learning

Authors: Ke Ma, Yiliang Sang, Yang Ming, Jin Lian, Chang Tian, Zhaocheng Wang

Abstract: The Type-II codebook in Release 17 (R17) exploits the angular-delay-domain partial reciprocity between uplink and downlink channels to select part of angular-delay-domain ports for measuring and feeding back the downlink channel state information (CSI), where the performance of existing deep learning enhanced CSI feedback methods is limited due to the deficiency of sparse structures. To address th… ▽ More The Type-II codebook in Release 17 (R17) exploits the angular-delay-domain partial reciprocity between uplink and downlink channels to select part of angular-delay-domain ports for measuring and feeding back the downlink channel state information (CSI), where the performance of existing deep learning enhanced CSI feedback methods is limited due to the deficiency of sparse structures. To address this issue, we propose two new perspectives of adopting deep learning to improve the R17 Type-II codebook. Firstly, considering the low signal-to-noise ratio of uplink channels, deep learning is utilized to accurately select the dominant angular-delay-domain ports, where the focal loss is harnessed to solve the class imbalance problem. Secondly, we propose to adopt deep learning to reconstruct the downlink CSI based on the feedback of the R17 Type-II codebook at the base station, where the information of sparse structures can be effectively leveraged. Besides, a weighted shortcut module is designed to facilitate the accurate reconstruction. Simulation results demonstrate that our proposed methods could improve the sum rate performance compared with its traditional R17 Type-II codebook and deep learning benchmarks. △ Less

Submitted 13 September, 2023; originally announced October 2023.

Comments: Accepted by IEEE GLOBECOM 2023, conference version of Arxiv:2305.08081

arXiv:2309.15203 [pdf, other]

Eve Said Yes: AirBone Authentication for Head-Wearable Smart Voice Assistant

Authors: Chenpei Huang, Hui Zhong, Jie Lian, Pavana Prakash, Dian Shi, Yuan Xu, Miao Pan

Abstract: Recent advances in machine learning and natural language processing have fostered the enormous prosperity of smart voice assistants and their services, e.g., Alexa, Google Home, Siri, etc. However, voice spoofing attacks are deemed to be one of the major challenges of voice control security, and never stop evolving such as deep-learning-based voice conversion and speech synthesis techniques. To so… ▽ More Recent advances in machine learning and natural language processing have fostered the enormous prosperity of smart voice assistants and their services, e.g., Alexa, Google Home, Siri, etc. However, voice spoofing attacks are deemed to be one of the major challenges of voice control security, and never stop evolving such as deep-learning-based voice conversion and speech synthesis techniques. To solve this problem outside the acoustic domain, we focus on head-wearable devices, such as earbuds and virtual reality (VR) headsets, which are feasible to continuously monitor the bone-conducted voice in the vibration domain. Specifically, we identify that air and bone conduction (AC/BC) from the same vocalization are coupled (or concurrent) and user-level unique, which makes them suitable behavior and biometric factors for multi-factor authentication (MFA). The legitimate user can defeat acoustic domain and even cross-domain spoofing samples with the proposed two-stage AirBone authentication. The first stage answers \textit{whether air and bone conduction utterances are time domain consistent (TC)} and the second stage runs \textit{bone conduction speaker recognition (BC-SR)}. The security level is hence increased for two reasons: (1) current acoustic attacks on smart voice assistants cannot affect bone conduction, which is in the vibration domain; (2) even for advanced cross-domain attacks, the unique bone conduction features can detect adversary's impersonation and machine-induced vibration. Finally, AirBone authentication has good usability (the same level as voice authentication) compared with traditional MFA and those specially designed to enhance smart voice security. Our experimental results show that the proposed AirBone authentication is usable and secure, and can be easily equipped by commercial off-the-shelf head wearables with good user experience. △ Less

Submitted 26 September, 2023; originally announced September 2023.

Comments: 13 pages, 12 figures

arXiv:2309.14764 [pdf, other]

InvKA: Gait Recognition via Invertible Koopman Autoencoder

Authors: Fan Li, Dong Liang, Jing Lian, Qidong Liu, Hegui Zhu, Jizhao Liu

Abstract: Most current gait recognition methods suffer from poor interpretability and high computational cost. To improve interpretability, we investigate gait features in the embedding space based on Koopman operator theory. The transition matrix in this space captures complex kinematic features of gait cycles, namely the Koopman operator. The diagonal elements of the operator matrix can represent the over… ▽ More Most current gait recognition methods suffer from poor interpretability and high computational cost. To improve interpretability, we investigate gait features in the embedding space based on Koopman operator theory. The transition matrix in this space captures complex kinematic features of gait cycles, namely the Koopman operator. The diagonal elements of the operator matrix can represent the overall motion trend, providing a physically meaningful descriptor. To reduce the computational cost of our algorithm, we use a reversible autoencoder to reduce the model size and eliminate convolutional layers to compress its depth, resulting in fewer floating-point operations. Experimental results on multiple datasets show that our method reduces computational cost to 1% compared to state-of-the-art methods while achieving competitive recognition accuracy 98% on non-occlusion datasets. △ Less

Submitted 27 September, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

arXiv:2309.13232 [pdf]

doi 10.1007/s11128-024-04398-7

Semiquantum private comparison via cavity QED

Authors: Xin Xu, Jiang-Yuan Lian, Tian-Yu Ye

Abstract: In this paper, we design the first semiquantum private comparison (SQPC) protocol which is realized via cavity quantum electrodynamics (QED) by making use of the evolution law of atom. With the help of a semi-honest third party (TP), the proposed protocol can compare the equality of private inputs from two semiquantum parties who only have limited quantum capabilities. The proposed protocol uses p… ▽ More In this paper, we design the first semiquantum private comparison (SQPC) protocol which is realized via cavity quantum electrodynamics (QED) by making use of the evolution law of atom. With the help of a semi-honest third party (TP), the proposed protocol can compare the equality of private inputs from two semiquantum parties who only have limited quantum capabilities. The proposed protocol uses product states as initial quantum resource and employs none of unitary operations, quantum entanglement swapping operation or delay lines. Security proof turns out that it can defeat both the external attack and the internal attack. △ Less

Submitted 9 May, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

Comments: 21 pages, 7 figures, 2 tables

Journal ref: Quantum Information Processing, 2024, 23: 174

arXiv:2309.12239 [pdf, other]

ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems

Authors: Jinqing Lian, Xinyi Zhang, Yingxia Shao, Zenglin Pu, Qingfeng Xiang, Yawen Li, Bin Cui

Abstract: The past decade has seen rapid growth of distributed stream data processing systems. Under these systems, a stream application is realized as a Directed Acyclic Graph (DAG) of operators, where the level of parallelism of each operator has a substantial impact on its overall performance. However, finding optimal levels of parallelism remains challenging. Most existing methods are heavily coupled wi… ▽ More The past decade has seen rapid growth of distributed stream data processing systems. Under these systems, a stream application is realized as a Directed Acyclic Graph (DAG) of operators, where the level of parallelism of each operator has a substantial impact on its overall performance. However, finding optimal levels of parallelism remains challenging. Most existing methods are heavily coupled with the topological graph of operators, unable to efficiently tune under-provisioned jobs. They either insufficiently use previous tuning experience by treating successively tuning independently, or explore the configuration space aggressively, violating the Service Level Agreements (SLA). To address the above problems, we propose ContTune, a continuous tuning system for stream applications. It is equipped with a novel Big-small algorithm, in which the Big phase decouples the tuning from the topological graph by decomposing the job tuning problem into sub-problems that can be solved concurrently. We propose a conservative Bayesian Optimization (CBO) technique in the Small phase to speed up the tuning process by utilizing the previous observations. It leverages the state-of-the-art (SOTA) tuning method as conservative exploration to avoid SLA violations. Experimental results show that ContTune reduces up to 60.75% number of reconfigurations under synthetic workloads and up to 57.5% number of reconfigurations under real workloads, compared to the SOTA method DS2. △ Less

Submitted 21 September, 2023; originally announced September 2023.

arXiv:2309.09088 [pdf, other]

Enhancing GAN-Based Vocoders with Contrastive Learning Under Data-limited Condition

Authors: Haoming Guo, Seth Z. Zhao, Jiachen Lian, Gopala Anumanchipalli, Gerald Friedland

Abstract: Vocoder models have recently achieved substantial progress in generating authentic audio comparable to human quality while significantly reducing memory requirement and inference time. However, these data-hungry generative models require large-scale audio data for learning good representations. In this paper, we apply contrastive learning methods in training the vocoder to improve the perceptual q… ▽ More Vocoder models have recently achieved substantial progress in generating authentic audio comparable to human quality while significantly reducing memory requirement and inference time. However, these data-hungry generative models require large-scale audio data for learning good representations. In this paper, we apply contrastive learning methods in training the vocoder to improve the perceptual quality of the vocoder without modifying its architecture or adding more data. We design an auxiliary task with mel-spectrogram contrastive learning to enhance the utterance-level quality of the vocoder model under data-limited conditions. We also extend the task to include waveforms to improve the multi-modality comprehension of the model and address the discriminator overfitting problem. We optimize the additional task simultaneously with GAN training objectives. Our results show that the tasks improve model performance substantially in data-limited settings. △ Less

Submitted 18 December, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

arXiv:2308.16505 [pdf, other]

Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations

Authors: Xu Huang, Jianxun Lian, Yuxuan Lei, Jing Yao, Defu Lian, Xing Xie

Abstract: Recommender models excel at providing domain-specific item recommendations by leveraging extensive user behavior data. Despite their ability to act as lightweight domain experts, they struggle to perform versatile tasks such as providing explanations and engaging in conversations. On the other hand, large language models (LLMs) represent a significant step towards artificial general intelligence,… ▽ More Recommender models excel at providing domain-specific item recommendations by leveraging extensive user behavior data. Despite their ability to act as lightweight domain experts, they struggle to perform versatile tasks such as providing explanations and engaging in conversations. On the other hand, large language models (LLMs) represent a significant step towards artificial general intelligence, showcasing remarkable capabilities in instruction comprehension, commonsense reasoning, and human interaction. However, LLMs lack the knowledge of domain-specific item catalogs and behavioral patterns, particularly in areas that diverge from general world knowledge, such as online e-commerce. Finetuning LLMs for each domain is neither economic nor efficient. In this paper, we bridge the gap between recommender models and LLMs, combining their respective strengths to create a versatile and interactive recommender system. We introduce an efficient framework called \textbf{InteRecAgent}, which employs LLMs as the brain and recommender models as tools. We first outline a minimal set of essential tools required to transform LLMs into InteRecAgent. We then propose an efficient workflow within InteRecAgent for task execution, incorporating key components such as memory components, dynamic demonstration-augmented task planning, and reflection. InteRecAgent enables traditional recommender systems, such as those ID-based matrix factorization models, to become interactive systems with a natural language interface through the integration of LLMs. Experimental results on several public datasets show that InteRecAgent achieves satisfying performance as a conversational recommender system, outperforming general-purpose LLMs. The source code of InteRecAgent is released at https://aka.ms/recagent. △ Less

Submitted 29 January, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

Comments: 18 pages, 17 figures, 7 tables

arXiv:2308.09256 [pdf, other]

On Block Cholesky Decomposition for Sparse Inverse Covariance Estimation

Authors: Xiaoning Kang, Jiayi Lian, Xinwei Deng

Abstract: The modified Cholesky decomposition is popular for inverse covariance estimation, but often needs pre-specification on the full information of variable ordering. In this work, we propose a block Cholesky decomposition (BCD) for estimating inverse covariance matrix under the partial information of variable ordering, in the sense that the variables can be divided into several groups with available o… ▽ More The modified Cholesky decomposition is popular for inverse covariance estimation, but often needs pre-specification on the full information of variable ordering. In this work, we propose a block Cholesky decomposition (BCD) for estimating inverse covariance matrix under the partial information of variable ordering, in the sense that the variables can be divided into several groups with available ordering among groups, but variables within each group have no orderings. The proposed BCD model provides a unified framework for several existing methods including the modified Cholesky decomposition and the Graphical lasso. By utilizing the partial information on variable ordering, the proposed BCD model guarantees the positive definiteness of the estimated matrix with statistically meaningful interpretation. Theoretical results are established under regularity conditions. Simulation and case studies are conducted to evaluate the proposed BCD model. △ Less

Submitted 17 August, 2023; originally announced August 2023.

Journal ref: Statistica Sinica 2023

arXiv:2308.07821 [pdf, ps, other]

A Nearly Quadratic-Time FPTAS for Knapsack

Authors: Lin Chen, Jiayi Lian, Yuchen Mao, Guochuan Zhang

Abstract: We investigate polynomial-time approximation schemes for the classic 0-1 knapsack problem. The previous algorithm by Deng, Jin, and Mao (SODA'23) has approximation factor $1 + \eps$ with running time $\widetilde{O}(n + \frac{1}{\eps^{2.2}})$. There is a lower Bound of $(n + \frac{1}{\eps})^{2-o(1)}$ conditioned on the hypothesis that $(\min, +)$ has no truly subquadratic algorithm. We close the ga… ▽ More We investigate polynomial-time approximation schemes for the classic 0-1 knapsack problem. The previous algorithm by Deng, Jin, and Mao (SODA'23) has approximation factor $1 + \eps$ with running time $\widetilde{O}(n + \frac{1}{\eps^{2.2}})$. There is a lower Bound of $(n + \frac{1}{\eps})^{2-o(1)}$ conditioned on the hypothesis that $(\min, +)$ has no truly subquadratic algorithm. We close the gap by proposing an approximation scheme that runs in $\widetilde{O}(n + \frac{1}{\eps^2})$ time. △ Less

Submitted 29 April, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

arXiv:2308.02925 [pdf, other]

ConvFormer: Revisiting Transformer for Sequential User Modeling

Authors: Hao Wang, Jianxun Lian, Mingqi Wu, Haoxuan Li, Jiajun Fan, Wanyue Xu, Chaozhuo Li, Xing Xie

Abstract: Sequential user modeling, a critical task in personalized recommender systems, focuses on predicting the next item a user would prefer, requiring a deep understanding of user behavior sequences. Despite the remarkable success of Transformer-based models across various domains, their full potential in comprehending user behavior remains untapped. In this paper, we re-examine Transformer-like archit… ▽ More Sequential user modeling, a critical task in personalized recommender systems, focuses on predicting the next item a user would prefer, requiring a deep understanding of user behavior sequences. Despite the remarkable success of Transformer-based models across various domains, their full potential in comprehending user behavior remains untapped. In this paper, we re-examine Transformer-like architectures aiming to advance state-of-the-art performance. We start by revisiting the core building blocks of Transformer-based methods, analyzing the effectiveness of the item-to-item mechanism within the context of sequential user modeling. After conducting a thorough experimental analysis, we identify three essential criteria for devising efficient sequential user models, which we hope will serve as practical guidelines to inspire and shape future designs. Following this, we introduce ConvFormer, a simple but powerful modification to the Transformer architecture that meets these criteria, yielding state-of-the-art results. Additionally, we present an acceleration technique to minimize the complexity associated with processing extremely long sequences. Experiments on four public datasets showcase ConvFormer's superiority and confirm the validity of our proposed criteria. △ Less

Submitted 8 October, 2023; v1 submitted 5 August, 2023; originally announced August 2023.

arXiv:2308.01111 [pdf, other]

doi 10.1093/mnras/stad2390

Observational constraints on the origin of the elements. VI. Origin and evolution of neutron-capture elements as probed by the Gaia-ESO survey

Authors: Jianhui Lian, Nicholas Storm, Guillaume Guiglion, Aldo Serenelli, Benoit Cote, Amanda I. Karakas, Nick Boardman, Maria Bergemann

Abstract: Most heavy elements beyond the iron peak are synthesized via neutron capture processes. The nature of the astrophysical sites of neutron capture processes is still very unclear. In this work we explore the observational constraints of the chemical abundances of s-process and r-process elements on the sites of neutron-capture processes by applying Galactic chemical evolution (GCE) models to the dat… ▽ More Most heavy elements beyond the iron peak are synthesized via neutron capture processes. The nature of the astrophysical sites of neutron capture processes is still very unclear. In this work we explore the observational constraints of the chemical abundances of s-process and r-process elements on the sites of neutron-capture processes by applying Galactic chemical evolution (GCE) models to the data from Gaia-ESO large spectroscopic stellar survey. For the r-process, the [Eu/Fe]-[Fe/H] distribution suggests a short delay time of the site that produces Eu. Other independent observations (e.g., NS-NS binaries), however, suggest a significant fraction of long delayed ($>1$Gyr) neutron star mergers (NSM). When assuming NSM as the only r-process sites, these two observational constraints are inconsistent at above 1$σ$ level. Including short delayed r-process sites like magneto-rotational supernova can resolve this inconsistency. For the s-process, we find a weak metallicity dependence of the [Ba/Y] ratio, which traces the s-process efficiency. Our GCE model with up-to-date yields of AGB stars qualitatively reproduces this metallicity dependence, but the model predicts a much higher [Ba/Y] ratio compared to the data. This mismatch suggests that the s-process efficiency of low mass AGB stars in the current AGB nucleosynthesis models could be overestimated. △ Less

Submitted 21 September, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

Comments: 14 pages, 11 figures, accepted by MNRAS

arXiv:2307.13887 [pdf, other]

doi 10.3847/1538-4357/ace9b8

A Tale of Two Disks: Mapping the Milky Way with the Final Data Release of APOGEE

Authors: Julie Imig, Cathryn Price, Jon A. Holtzman, Alexander Stone-Martinez, Steven R. Majewski, David H. Weinberg, Jennifer A. Johnson, Carlos Allende Prieto, Rachael L. Beaton, Timothy C. Beers, Dmitry Bizyaev, Michael R. Blanton, Joel R. Brownstein, Katia Cunha, José G. Fernández-Trincado, Diane K. Feuillet, Sten Hasselquist, Christian R. Hayes, Henrik Jönsson, Richard R. Lane, Jianhui Lian, Szabolcs Mészáros, David L. Nidever, Annie C. Robin, Matthew Shetrone , et al. (2 additional authors not shown)

Abstract: We present new maps of the Milky Way disk showing the distribution of metallicity ([Fe/H]), $α$-element abundances ([Mg/Fe]), and stellar age, using a sample of 66,496 red giant stars from the final data release (DR17) of the Apache Point Observatory Galactic Evolution Experiment (APOGEE) survey. We measure radial and vertical gradients, quantify the distribution functions for age and metallicity,… ▽ More We present new maps of the Milky Way disk showing the distribution of metallicity ([Fe/H]), $α$-element abundances ([Mg/Fe]), and stellar age, using a sample of 66,496 red giant stars from the final data release (DR17) of the Apache Point Observatory Galactic Evolution Experiment (APOGEE) survey. We measure radial and vertical gradients, quantify the distribution functions for age and metallicity, and explore chemical clock relations across the Milky Way for the low-$α$ disk, high-$α$ disk, and total population independently. The low-$α$ disk exhibits a negative radial metallicity gradient of $-0.06 \pm 0.001$ dex kpc$^{-1}$, which flattens with distance from the midplane. The high-$α$ disk shows a flat radial gradient in metallicity and age across nearly all locations of the disk. The age and metallicity distribution functions shift from negatively skewed in the inner Galaxy to positively skewed at large radius. Significant bimodality in the [Mg/Fe]-[Fe/H] plane and in the [Mg/Fe]-age relation persist across the entire disk. The age estimates have typical uncertainties of $\sim0.15$ in $\log$(age) and may be subject to additional systematic errors, which impose limitations on conclusions drawn from this sample. Nevertheless, these results act as critical constraints on galactic evolution models, constraining which physical processes played a dominant role in the formation of the Milky Way disk. We discuss how radial migration predicts many of the observed trends near the solar neighborhood and in the outer disk, but an additional more dramatic evolution history, such as the multi-infall model or a merger event, is needed to explain the chemical and age bimodality elsewhere in the Galaxy. △ Less

Submitted 25 July, 2023; originally announced July 2023.

Comments: 41 pages, 32 figures, accepted to ApJ

Journal ref: ApJ 954 124 (2023)

arXiv:2307.12582 [pdf, ps, other]

Faster Algorithms for Bounded Knapsack and Bounded Subset Sum Via Fine-Grained Proximity Results

Authors: Lin Chen, Jiayi Lian, Yuchen Mao, Guochuan Zhang

Abstract: We investigate pseudopolynomial-time algorithms for Bounded Knapsack and Bounded Subset Sum. Recent years have seen a growing interest in settling their fine-grained complexity with respect to various parameters. For Bounded Knapsack, the number of items $n$ and the maximum item weight $w_{\max}$ are two of the most natural parameters that have been studied extensively in the literature. The previ… ▽ More We investigate pseudopolynomial-time algorithms for Bounded Knapsack and Bounded Subset Sum. Recent years have seen a growing interest in settling their fine-grained complexity with respect to various parameters. For Bounded Knapsack, the number of items $n$ and the maximum item weight $w_{\max}$ are two of the most natural parameters that have been studied extensively in the literature. The previous best running time in terms of $n$ and $w_{\max}$ is $O(n + w^3_{\max})$ [Polak, Rohwedder, Wegrzycki '21]. There is a conditional lower bound of $O((n + w_{\max})^{2-o(1)})$ based on $(\min,+)$-convolution hypothesis [Cygan, Mucha, Wegrzycki, Wlodarczyk '17]. We narrow the gap significantly by proposing a $\tilde{O}(n + w^{12/5}_{\max})$-time algorithm. Note that in the regime where $w_{\max} \approx n$, our algorithm runs in $\tilde{O}(n^{12/5})$ time, while all the previous algorithms require $Ω(n^3)$ time in the worst case. For Bounded Subset Sum, we give two algorithms running in $\tilde{O}(nw_{\max})$ and $\tilde{O}(n + w^{3/2}_{\max})$ time, respectively. These results match the currently best running time for 0-1 Subset Sum. Prior to our work, the best running times (in terms of $n$ and $w_{\max}$) for Bounded Subset Sum is $\tilde{O}(n + w^{5/3}_{\max})$ [Polak, Rohwedder, Wegrzycki '21] and $\tilde{O}(n + μ_{\max}^{1/2}w_{\max}^{3/2})$ [implied by Bringmann '19 and Bringmann, Wellnitz '21], where $μ_{\max}$ refers to the maximum multiplicity of item weights. △ Less

Submitted 4 December, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

Comments: To appear in SODA2024

Showing 1–50 of 198 results for author: Lian, J