subscribe to arXiv mailings

Anomalous Enhancement of the Electrocatalytic Hydrogen Evolution Reaction in AuPt Nanoclusters

Authors: Jiahui Kang, Jan Kloppenburg, Jiali Sheng, Zhenyu Xu, Kristoffer Meinander, Hua Jiang, Zhong-Peng Lv, Esko I. Kauppinen, Qiang Zhang, Xi Chen, Olli Ikkala, Miguel A. Caro, Bo Peng

Abstract: Energy- and resource-efficient electrocatalytic water splitting is of paramount importance to enable sustainable hydrogen production. The best bulk catalyst for the hydrogen evolution reaction (HER), i.e., platinum, is one of the scarcest elements on Earth. The use of raw material for HER can be dramatically reduced by utilizing nanoclusters. In addition, nanoalloying can further improve the perfo… ▽ More Energy- and resource-efficient electrocatalytic water splitting is of paramount importance to enable sustainable hydrogen production. The best bulk catalyst for the hydrogen evolution reaction (HER), i.e., platinum, is one of the scarcest elements on Earth. The use of raw material for HER can be dramatically reduced by utilizing nanoclusters. In addition, nanoalloying can further improve the performance of these nanoclusters. In this paper, we present results for HER on nanometer-sized ligand-free AuPt nanoclusters grafted on carbon nanotubes. These results demonstrate excellent monodispersity and a significant reduction of the overpotential for the electrocatalytic HER. We utilize atomistic machine learning techniques to elucidate the atomic-scale origin of the synergistic effect between Pt and Au. We show that the presence of surface Au atoms, known to be poor HER catalysts, in a Pt(core)/AuPt(shell) nanocluster structure, drives an anomalous enhancement of the inherently high catalytic activity of Pt atoms. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.07050 [pdf, other]

DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification

Authors: Jiamu Sheng, Jingyi Zhou, Jiong Wang, Peng Ye, Jiayuan Fan

Abstract: The effectiveness and efficiency of modeling complex spectral-spatial relations are both crucial for Hyperspectral image (HSI) classification. Most existing methods based on CNNs and transformers still suffer from heavy computational burdens and have room for improvement in capturing the global-local spectral-spatial feature representation. To this end, we propose a novel lightweight parallel desi… ▽ More The effectiveness and efficiency of modeling complex spectral-spatial relations are both crucial for Hyperspectral image (HSI) classification. Most existing methods based on CNNs and transformers still suffer from heavy computational burdens and have room for improvement in capturing the global-local spectral-spatial feature representation. To this end, we propose a novel lightweight parallel design called lightweight dual-stream Mamba-convolution network (DualMamba) for HSI classification. Specifically, a parallel lightweight Mamba and CNN block are first developed to extract global and local spectral-spatial features. First, the cross-attention spectral-spatial Mamba module is proposed to leverage the global modeling of Mamba at linear complexity. Within this module, dynamic positional embedding is designed to enhance the spatial location information of visual sequences. The lightweight spectral/spatial Mamba blocks comprise an efficient scanning strategy and a lightweight Mamba design to efficiently extract global spectral-spatial features. And the cross-attention spectral-spatial fusion is designed to learn cross-correlation and fuse spectral-spatial features. Second, the lightweight spectral-spatial residual convolution module is proposed with lightweight spectral and spatial branches to extract local spectral-spatial features through residual learning. Finally, the adaptive global-local fusion is proposed to dynamically combine global Mamba features and local convolution features for a global-local spectral-spatial representation. Compared with state-of-the-art HSI classification methods, experimental results demonstrate that DualMamba achieves significant classification accuracy on three public HSI datasets and a superior reduction in model parameters and floating point operations (FLOPs). △ Less

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2406.01934 [pdf, other]

Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking

Authors: Zefeng Zhang, Jiawei Sheng, Chuang Zhang, Yunzhi Liang, Wenyuan Zhang, Siqi Wang, Tingwen Liu

Abstract: Multimodal Entity Linking (MEL) aims to link ambiguous mentions in multimodal contexts to entities in a multimodal knowledge graph. A pivotal challenge is to fully leverage multi-element correlations between mentions and entities to bridge modality gap and enable fine-grained semantic matching. Existing methods attempt several local correlative mechanisms, relying heavily on the automatically lear… ▽ More Multimodal Entity Linking (MEL) aims to link ambiguous mentions in multimodal contexts to entities in a multimodal knowledge graph. A pivotal challenge is to fully leverage multi-element correlations between mentions and entities to bridge modality gap and enable fine-grained semantic matching. Existing methods attempt several local correlative mechanisms, relying heavily on the automatically learned attention weights, which may over-concentrate on partial correlations. To mitigate this issue, we formulate the correlation assignment problem as an optimal transport (OT) problem, and propose a novel MEL framework, namely OT-MEL, with OT-guided correlation assignment. Thereby, we exploit the correlation between multimodal features to enhance multimodal fusion, and the correlation between mentions and entities to enhance fine-grained matching. To accelerate model prediction, we further leverage knowledge distillation to transfer OT assignment knowledge to attention mechanism. Experimental results show that our model significantly outperforms previous state-of-the-art baselines and confirm the effectiveness of the OT-guided correlation assignment. △ Less

Submitted 5 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: Findings of ACL 2024

arXiv:2405.16222 [pdf, other]

Proposal for a Quantum Mechanical Test of Gravity at Millimeter Scale

Authors: Yu Cheng, Jiadu Lin, Jie Sheng, Tsutomu T. Yanagida

Abstract: The experimental verification of the Newton law of gravity at small scales has been a longstanding challenge. Recently, torsion balance experiments have successfully measured gravitational force at the millimeter scale. However, testing gravity force on quantum mechanical wave function at small scales remains difficult. In this paper, we propose a novel experiment that utilizes the Josephson effec… ▽ More The experimental verification of the Newton law of gravity at small scales has been a longstanding challenge. Recently, torsion balance experiments have successfully measured gravitational force at the millimeter scale. However, testing gravity force on quantum mechanical wave function at small scales remains difficult. In this paper, we propose a novel experiment that utilizes the Josephson effect to detect the different evolution of quantum phase induced from the potential difference caused by gravity. We demonstrate that this experiment can test gravity quantum mechanically at the millimeter scale, and also has a potential to investigate the parity invariance of gravity at small scales. △ Less

Submitted 25 May, 2024; originally announced May 2024.

Comments: 6 pages, 3 figures

arXiv:2404.09445 [pdf, other]

Exploring Text-to-Motion Generation with Human Preference

Authors: Jenny Sheng, Matthieu Lin, Andrew Zhao, Kevin Pruvost, Yu-Hui Wen, Yangguang Li, Gao Huang, Yong-Jin Liu

Abstract: This paper presents an exploration of preference learning in text-to-motion generation. We find that current improvements in text-to-motion generation still rely on datasets requiring expert labelers with motion capture systems. Instead, learning from human preference data does not require motion capture systems; a labeler with no expertise simply compares two generated motions. This is particular… ▽ More This paper presents an exploration of preference learning in text-to-motion generation. We find that current improvements in text-to-motion generation still rely on datasets requiring expert labelers with motion capture systems. Instead, learning from human preference data does not require motion capture systems; a labeler with no expertise simply compares two generated motions. This is particularly efficient because evaluating the model's output is easier than gathering the motion that performs a desired task (e.g. backflip). To pioneer the exploration of this paradigm, we annotate 3,528 preference pairs generated by MotionGPT, marking the first effort to investigate various algorithms for learning from preference data. In particular, our exploration highlights important design choices when using preference data. Additionally, our experimental results show that preference learning has the potential to greatly improve current text-to-motion generative models. Our code and dataset are publicly available at https://github.com/THU-LYJ-Lab/InstructMotion}{https://github.com/THU-LYJ-Lab/InstructMotion to further facilitate research in this area. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: Accepted to CVPR 2024 HuMoGen Workshop

arXiv:2404.08715 [pdf, other]

Differentially Private Log-Location-Scale Regression Using Functional Mechanism

Authors: Jiewen Sheng, Xiaolei Fang

Abstract: This article introduces differentially private log-location-scale (DP-LLS) regression models, which incorporate differential privacy into LLS regression through the functional mechanism. The proposed models are established by injecting noise into the log-likelihood function of LLS regression for perturbed parameter estimation. We will derive the sensitivities utilized to determine the magnitude of… ▽ More This article introduces differentially private log-location-scale (DP-LLS) regression models, which incorporate differential privacy into LLS regression through the functional mechanism. The proposed models are established by injecting noise into the log-likelihood function of LLS regression for perturbed parameter estimation. We will derive the sensitivities utilized to determine the magnitude of the injected noise and prove that the proposed DP-LLS models satisfy $ε$-differential privacy. In addition, we will conduct simulations and case studies to evaluate the performance of the proposed models. The findings suggest that predictor dimension, training sample size, and privacy budget are three key factors impacting the performance of the proposed DP-LLS regression models. Moreover, the results indicate that a sufficiently large training dataset is needed to simultaneously ensure decent performance of the proposed models and achieve a satisfactory level of privacy protection. △ Less

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2403.05833 [pdf]

Room temperature single-photon terahertz detection with thermal Rydberg atoms

Authors: Danyang Li, Zhengyang Bai, Xiaoliang Zuo, Yuelong Wu, Jiteng Sheng, Haibin Wu

Abstract: Single-photon terahertz (THz) detection is one of the most demanding technology for a variety of fields and could lead to many breakthroughs. Although its significant progress has been made in the last two decades, operating it at room temperature still remains a great challenge. Here, we demonstrate, for the first time, the room temperature THz detector at single-photon levels based on nonlinear… ▽ More Single-photon terahertz (THz) detection is one of the most demanding technology for a variety of fields and could lead to many breakthroughs. Although its significant progress has been made in the last two decades, operating it at room temperature still remains a great challenge. Here, we demonstrate, for the first time, the room temperature THz detector at single-photon levels based on nonlinear wave mixing in thermal Rydberg atomic vapor. The low-energy THz photons are coherently upconverted to the high-energy optical photons via a nondegenerate Rydberg state involved six-wave-mixing process, and therefore, the single-photon THz detection is achieved by a conventional optical single-photon counting module. The noise equivalent power of such a detector is reached to be 9.5*10^-19 W/Hz^1/2, which is more than four orders of magnitude lower than the state-of-the-art room temperature THz detectors. The optimum quantum efficiency of the whole wave-mixing process is about 4.3% with 40.6 dB dynamic range, and the maximum conversion bandwidth is 172 MHz, which is all-optically controllable. The developed fast and continuous-wave single-photon THz detector at room temperature operation has a great potential to be portable and chip-scale, and could be revolutionary for a wide range of applications in remote sensing, wireless communication, biomedical diagnostics, and quantum optics. △ Less

Submitted 9 March, 2024; originally announced March 2024.

arXiv:2403.04521 [pdf, other]

Uncertainty-Aware Relational Graph Neural Network for Few-Shot Knowledge Graph Completion

Authors: Qian Li, Shu Guo, Yinjia Chen, Cheng Ji, Jiawei Sheng, Jianxin Li

Abstract: Few-shot knowledge graph completion (FKGC) aims to query the unseen facts of a relation given its few-shot reference entity pairs. The side effect of noises due to the uncertainty of entities and triples may limit the few-shot learning, but existing FKGC works neglect such uncertainty, which leads them more susceptible to limited reference samples with noises. In this paper, we propose a novel unc… ▽ More Few-shot knowledge graph completion (FKGC) aims to query the unseen facts of a relation given its few-shot reference entity pairs. The side effect of noises due to the uncertainty of entities and triples may limit the few-shot learning, but existing FKGC works neglect such uncertainty, which leads them more susceptible to limited reference samples with noises. In this paper, we propose a novel uncertainty-aware few-shot KG completion framework (UFKGC) to model uncertainty for a better understanding of the limited data by learning representations under Gaussian distribution. Uncertainty representation is first designed for estimating the uncertainty scope of the entity pairs after transferring feature representations into a Gaussian distribution. Further, to better integrate the neighbors with uncertainty characteristics for entity features, we design an uncertainty-aware relational graph neural network (UR-GNN) to conduct convolution operations between the Gaussian distributions. Then, multiple random samplings are conducted for reference triples within the Gaussian distribution to generate smooth reference representations during the optimization. The final completion score for each query instance is measured by the designed uncertainty optimization to make our approach more robust to the noises in few-shot scenarios. Experimental results show that our approach achieves excellent performance on two benchmark datasets compared to its competitors. △ Less

Submitted 21 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

arXiv:2403.01962 [pdf, other]

An Efficient Model-Based Approach on Learning Agile Motor Skills without Reinforcement

Authors: Haojie Shi, Tingguang Li, Qingxu Zhu, Jiapeng Sheng, Lei Han, Max Q. -H. Meng

Abstract: Learning-based methods have improved locomotion skills of quadruped robots through deep reinforcement learning. However, the sim-to-real gap and low sample efficiency still limit the skill transfer. To address this issue, we propose an efficient model-based learning framework that combines a world model with a policy network. We train a differentiable world model to predict future states and use i… ▽ More Learning-based methods have improved locomotion skills of quadruped robots through deep reinforcement learning. However, the sim-to-real gap and low sample efficiency still limit the skill transfer. To address this issue, we propose an efficient model-based learning framework that combines a world model with a policy network. We train a differentiable world model to predict future states and use it to directly supervise a Variational Autoencoder (VAE)-based policy network to imitate real animal behaviors. This significantly reduces the need for real interaction data and allows for rapid policy updates. We also develop a high-level network to track diverse commands and trajectories. Our simulated results show a tenfold sample efficiency increase compared to reinforcement learning methods such as PPO. In real-world testing, our policy achieves proficient command-following performance with only a two-minute data collection period and generalizes well to new speeds and paths. △ Less

Submitted 18 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

Comments: Accepted by ICRA2024

arXiv:2403.01422 [pdf, other]

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies

Authors: Zhende Song, Chenchen Wang, Jiamu Sheng, Chi Zhang, Gang Yu, Jiayuan Fan, Tao Chen

Abstract: Development of multimodal models has marked a significant step forward in how machines understand videos. These models have shown promise in analyzing short video clips. However, when it comes to longer formats like movies, they often fall short. The main hurdles are the lack of high-quality, diverse video data and the intensive work required to collect or annotate such data. In face of these chal… ▽ More Development of multimodal models has marked a significant step forward in how machines understand videos. These models have shown promise in analyzing short video clips. However, when it comes to longer formats like movies, they often fall short. The main hurdles are the lack of high-quality, diverse video data and the intensive work required to collect or annotate such data. In face of these challenges, we propose MovieLLM, a novel framework designed to synthesize consistent and high-quality video data for instruction tuning. The pipeline is carefully designed to control the style of videos by improving textual inversion technique with powerful text generation capability of GPT-4. As the first framework to do such thing, our approach stands out for its flexibility and scalability, empowering users to create customized movies with only one description. This makes it a superior alternative to traditional data collection methods. Our extensive experiments validate that the data produced by MovieLLM significantly improves the performance of multimodal models in understanding complex video narratives, overcoming the limitations of existing datasets regarding scarcity and bias. △ Less

Submitted 24 June, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

arXiv:2402.19149 [pdf, other]

Experimental Test of Quantum Nonlocality from Contextuality

Authors: Jianqi Sheng, Dongkai Zhang, Lixiang Chen

Abstract: There are two powerful arguments against the possibility of extending quantum mechanics, the violation of Bell inequalities and the Kochen-Specker theorem, but the connection between the two remains confused. Following the distinctive strategy proposed by Cabello [Phys. Rev. Lett. 127, 070401 (2021)], Bell inequalities can be violated by state-independent contextuality sets. However, the experimen… ▽ More There are two powerful arguments against the possibility of extending quantum mechanics, the violation of Bell inequalities and the Kochen-Specker theorem, but the connection between the two remains confused. Following the distinctive strategy proposed by Cabello [Phys. Rev. Lett. 127, 070401 (2021)], Bell inequalities can be violated by state-independent contextuality sets. However, the experimental realization of such ideas is challenging as it requires high-dimensional entanglement. Orbital angular momentum provides an unlimited state space and the number of effective dimensions can be readily tailored as required. We performed an experimental test of non-locality based on Bell inequalities from contextuality, using orbital angular momentum entanglement in a bipartite photonic system. Our experiment not only shows a new way to produce non-locality but also connects contextuality and non-locality, two fundamental quantum resources that are critical for quantum computation and secure communication tasks. △ Less

Submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.14514 [pdf, other]

Detecting the Féeton Fifth Force by Superconducting Josephson Junctions

Authors: Yu Cheng, Jie Sheng, Tsutomu T. Yanagida

Abstract: The intriguing $U(1)_{B-L}$ extension of the standard model predicts a fifth force between particles carrying $B-L$ charges. The mediator is the $B-L$ gauge boson called Féeton. In this letter, we propose a novel experimental design to detect the quantum phase difference caused by this fifth force using a superconducting Josephson junction. We find that the experiment has the best sensitivity to t… ▽ More The intriguing $U(1)_{B-L}$ extension of the standard model predicts a fifth force between particles carrying $B-L$ charges. The mediator is the $B-L$ gauge boson called Féeton. In this letter, we propose a novel experimental design to detect the quantum phase difference caused by this fifth force using a superconducting Josephson junction. We find that the experiment has the best sensitivity to the gauge coupling when the gauge boson is within the mass range of $10^{-2}\,$eV to $100\,$eV. △ Less

Submitted 22 February, 2024; originally announced February 2024.

Comments: 6 pages, 3 figures

arXiv:2402.13473 [pdf, other]

Learning Highly Dynamic Behaviors for Quadrupedal Robots

Authors: Chong Zhang, Jiapeng Sheng, Tingguang Li, He Zhang, Cheng Zhou, Qingxu Zhu, Rui Zhao, Yizheng Zhang, Lei Han

Abstract: Learning highly dynamic behaviors for robots has been a longstanding challenge. Traditional approaches have demonstrated robust locomotion, but the exhibited behaviors lack diversity and agility. They employ approximate models, which lead to compromises in performance. Data-driven approaches have been shown to reproduce agile behaviors of animals, but typically have not been able to learn highly d… ▽ More Learning highly dynamic behaviors for robots has been a longstanding challenge. Traditional approaches have demonstrated robust locomotion, but the exhibited behaviors lack diversity and agility. They employ approximate models, which lead to compromises in performance. Data-driven approaches have been shown to reproduce agile behaviors of animals, but typically have not been able to learn highly dynamic behaviors. In this paper, we propose a learning-based approach to enable robots to learn highly dynamic behaviors from animal motion data. The learned controller is deployed on a quadrupedal robot and the results show that the controller is able to reproduce highly dynamic behaviors including sprinting, jumping and sharp turning. Various behaviors can be activated through human interaction using a stick with markers attached to it. Based on the motion pattern of the stick, the robot exhibits walking, running, sitting and jumping, much like the way humans interact with a pet. △ Less

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.07730 [pdf, other]

Continuum of spin excitations in an ordered magnet

Authors: Jieming Sheng, Le Wang, Wenrui Jiang, Han Ge, Nan Zhao, Tiantian Li, Maiko Kofu, Dehong Yu, Wei Zhu, Jia-Wei Mei, Zhentao Wang, Liusuo Wu

Abstract: Continuum of spin excitations observed in inelastic neutron scattering experiments are often considered as a strong evidence of quantum spin liquid formation. When quantum spin liquid is indeed the ground state of a disorder-free magnetic compound, the elementary excitation is no longer the conventional spin waves (magnons). Instead, the magnons fractionalize into spinons, leaving only a two-spino… ▽ More Continuum of spin excitations observed in inelastic neutron scattering experiments are often considered as a strong evidence of quantum spin liquid formation. When quantum spin liquid is indeed the ground state of a disorder-free magnetic compound, the elementary excitation is no longer the conventional spin waves (magnons). Instead, the magnons fractionalize into spinons, leaving only a two-spinon continuum detectable in inelastic neutron scattering experiments. For a clean ordered antiferromagnet, it was unclear if we can observe a continuous spectrum similar to the ones in a quantum spin liquid state. Here we show that the magnetically ordered state in Na$_2$BaCo(PO$_4$)$_2$ is able to host a spin excitation continuum induced by strong quantum fluctuations. Thus, a second thought is necessary when concluding such continuum as signature of quantum spin liquid in new material explorations. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: 22 pages,9 figures

arXiv:2401.13318 [pdf, other]

Magnetic structure and Ising-like antiferromagnetism in the bilayer triangular lattice compound NdZnPO

Authors: Han Ge, Tiantian Li, S. E. Nikitin, Nan Zhao, Fangli Li, Huanpeng Bu, Jiayue Yuan, Jian Chen, Ying Fu, Jiong Yang, Le Wang, Ping Miao, Qiang Zhang, Ines Puente-Orench, Andrey Podlesnyak, Jieming Sheng, Liusuo Wu

Abstract: The complex interplay of spin frustration and quantum fluctuations in low-dimensional quantum materials leads to a variety of intriguing phenomena. This research focuses on a detailed analysis of the magnetic behavior exhibited by NdZnPO, a bilayer spin-1/2 triangular lattice antiferromagnet. The investigation employs magnetization, specific heat, and powder neutron scattering measurements. At zer… ▽ More The complex interplay of spin frustration and quantum fluctuations in low-dimensional quantum materials leads to a variety of intriguing phenomena. This research focuses on a detailed analysis of the magnetic behavior exhibited by NdZnPO, a bilayer spin-1/2 triangular lattice antiferromagnet. The investigation employs magnetization, specific heat, and powder neutron scattering measurements. At zero field, a long-range magnetic order is observed at $T_{\rm N}=1.64~\rm K$. Powder neutron diffraction experiments show the Ising-like magnetic moments along the $c$-axis, revealing a stripe-like magnetic structure with three equivalent magnetic propagation vectors. Application of a magnetic field along the $c$-axis suppresses the antiferromagnetic order, leading to a fully polarized ferromagnetic state above $B_{\rm c}=4.5~\rm T$. This transition is accompanied by notable enhancements in the nuclear Schottky contribution. Moreover, the absence of spin frustration and expected field-induced plateau-like phases are remarkable observations. Detailed calculations of magnetic dipolar interactions revealed complex couplings reminiscent of a honeycomb lattice, suggesting the potential emergence of Kitaev-like physics within this system. This comprehensive study of the magnetic properties of NdZnPO highlights unresolved intricacies, underscoring the imperative for further exploration to unveil the underlying governing mechanisms. △ Less

Submitted 24 January, 2024; originally announced January 2024.

Comments: 11 pages, 6 figures

arXiv:2401.12732 [pdf, other]

doi 10.1145/3616855.3635794

CDRNP: Cross-Domain Recommendation to Cold-Start Users via Neural Process

Authors: Xiaodong Li, Jiawei Sheng, Jiangxia Cao, Wenyuan Zhang, Quangang Li, Tingwen Liu

Abstract: Cross-domain recommendation (CDR) has been proven as a promising way to tackle the user cold-start problem, which aims to make recommendations for users in the target domain by transferring the user preference derived from the source domain. Traditional CDR studies follow the embedding and mapping (EMCDR) paradigm, which transfers user representations from the source to target domain by learning a… ▽ More Cross-domain recommendation (CDR) has been proven as a promising way to tackle the user cold-start problem, which aims to make recommendations for users in the target domain by transferring the user preference derived from the source domain. Traditional CDR studies follow the embedding and mapping (EMCDR) paradigm, which transfers user representations from the source to target domain by learning a user-shared mapping function, neglecting the user-specific preference. Recent CDR studies attempt to learn user-specific mapping functions in meta-learning paradigm, which regards each user's CDR as an individual task, but neglects the preference correlations among users, limiting the beneficial information for user representations. Moreover, both of the paradigms neglect the explicit user-item interactions from both domains during the mapping process. To address the above issues, this paper proposes a novel CDR framework with neural process (NP), termed as CDRNP. Particularly, it develops the meta-learning paradigm to leverage user-specific preference, and further introduces a stochastic process by NP to capture the preference correlations among the overlapping and cold-start users, thus generating more powerful mapping functions by mapping the user-specific preference and common preference correlations to a predictive probability distribution. In addition, we also introduce a preference remainer to enhance the common preference from the overlapping users, and finally devises an adaptive conditional decoder with preference modulation to make prediction for cold-start users with items in the target domain. Experimental results demonstrate that CDRNP outperforms previous SOTA methods in three real-world CDR scenarios. △ Less

Submitted 12 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

Comments: WSDM'2024 Oral

arXiv:2401.11091 [pdf]

A family of rare-earth Quasi-One-Dimensional spin-chain compounds K2RENb5O15 (RE=Ce,Pr,Nd,Sm,Gd-Ho) with large interchain distance

Authors: Qingyuan Zeng, Han Ge, Maofeng Wu, Shaoheng Ruan, Tiantian Li, Zhaosheng Wang, Jingxin Li, Langsheng Ling, Wei Tong, Shuai Huang, Andi Liu, Jin Zhou, Zhengcai Xia, Jieming Sheng, Liusuo Wu, Zhaoming Tian

Abstract: One-dimensional spin chain systems have received special attention to discover the novel magnetic ground states and emergent phenomena, while the magnetic studies on rare-earth (RE)-based 1D spin chain materials are still rare. Here, we report the synthesis, structure and magnetic behaviors on a family of tetragonal tungsten-bronze structure K2RENb5O15 (RE = Ce, Pr, Nd, Sm, Gd-Ho) compounds, which… ▽ More One-dimensional spin chain systems have received special attention to discover the novel magnetic ground states and emergent phenomena, while the magnetic studies on rare-earth (RE)-based 1D spin chain materials are still rare. Here, we report the synthesis, structure and magnetic behaviors on a family of tetragonal tungsten-bronze structure K2RENb5O15 (RE = Ce, Pr, Nd, Sm, Gd-Ho) compounds, which consist of 1D linear spin-chain structure built by RE3+ ions along the c-axis and well spatially separated by the nonmagnetic K/Nb-O polyhedrons with large interchain distances of ~ 8.80-8.88 Å in the ab-plane. The low temperature magnetic measurements reveal the absence of long-range magnetic order down to 1.8 K for all serial K2RENb5O15 compounds and the dominant ferromagnetic interactions for RE=Ce,Dy and antiferromagnetic interactions for other members. Among them, K2GdNb5O15 with spin only magnetic moment S=7/2, exhibits a long-range magnetic order with TN~0.31 K and strong spin fluctuations at low temperatures due to its low-dimension characteristics. Moreover, a large magnetocaloric effect under low field change of 0-2 T is realized at temperatures below 1 K for K2GdNb5O15, letting it as an ideal candidate for adiabatic magnetic refrigeration applications at sub-kelvin temperatures. The K2RENb5O15 become a rare family of insulting RE-based magnets to explore the novel 1D spin chain physics beyond the 3d TM-based counterparts, in terms of its combination of low dimension, strong spin-orbital coupling and the rich diversity of RE ions. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 27 pages, 11 figures

arXiv:2401.07033 [pdf, other]

Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning

Authors: Lu Wang, Mayukh Das, Fangkai Yang, Junjie Sheng, Bo Qiao, Hang Dong, Si Qin, Victor Rühle, Chetan Bansal, Eli Cortez, Íñigo Goiri, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

Abstract: Oversubscription is a prevalent practice in cloud services where the system offers more virtual resources, such as virtual cores in virtual machines, to users or applications than its available physical capacity for reducing revenue loss due to unused/redundant capacity. While oversubscription can potentially lead to significant enhancement in efficient resource utilization, the caveat is that it… ▽ More Oversubscription is a prevalent practice in cloud services where the system offers more virtual resources, such as virtual cores in virtual machines, to users or applications than its available physical capacity for reducing revenue loss due to unused/redundant capacity. While oversubscription can potentially lead to significant enhancement in efficient resource utilization, the caveat is that it comes with the risks of overloading and introducing jitter at the level of physical nodes if all the co-located virtual machines have high utilization. Thus suitable oversubscription policies which maximize utilization while mitigating risks are paramount for cost-effective seamless cloud experiences. Most cloud platforms presently rely on static heuristics-driven decisions about oversubscription activation and limits, which either leads to overloading or stranded resources. Designing an intelligent oversubscription policy that can adapt to resource utilization patterns and jointly optimizes benefits and risks is, largely, an unsolved problem. We address this challenge with our proposed novel HuMan-in-the-loop Protoypical Imitation Learning (ProtoHAIL) framework that exploits approximate symmetries in utilization patterns to learn suitable policies. Also, our human-in-the-loop (knowledge-infused) training allows for learning safer policies that are robust to noise and sparsity. Our empirical investigations on real data show orders of magnitude reduction in risk and significant increase in benefits (saving stranded cores) in Microsoft cloud platform for 1st party (internal services). △ Less

Submitted 13 January, 2024; originally announced January 2024.

Comments: 9 pages, 3 figures

arXiv:2312.15637 [pdf, other]

doi 10.1016/j.physletb.2024.138735

Thermal Relic Right-Handed Neutrino Dark Matter

Authors: Yu Cheng, Jie Sheng, Tsutomu T. Yanagida

Abstract: It is known that two heavy Majorana right-handed neutrinos are sufficient to generate the baryon asymmetry in the present universe. Thus, it is interesting to identify the third right-handed neutrino $N$ with the dark matter. We impose a new discrete symmetry $Z_2$ on this dark matter neutrino to stabilize it. However, the $U(1)_{B-L}$ gauge boson $A'$ couples to the right-handed neutrino $N$. If… ▽ More It is known that two heavy Majorana right-handed neutrinos are sufficient to generate the baryon asymmetry in the present universe. Thus, it is interesting to identify the third right-handed neutrino $N$ with the dark matter. We impose a new discrete symmetry $Z_2$ on this dark matter neutrino to stabilize it. However, the $U(1)_{B-L}$ gauge boson $A'$ couples to the right-handed neutrino $N$. If the $B-L$ breaking scale $V_{B-L}$ is sufficiently low, the dark matter neutrino $N$ can be in the thermal bath. We find that the thermal relic $N$ can explain the dark matter abundance for the $B-L$ breaking scale $ V_{B-L} \sim O(10)\,$TeV. After considering all the constraints from the existing experiments, a narrow mass region of the thermal produced right-handed neutrino dark matter $N$ is still surviving. △ Less

Submitted 21 May, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

Comments: 6 pages, 2 figures

arXiv:2312.13758 [pdf, other]

Ultraheavy Atomic Dark Matter Freeze-Out through Rearrangement

Authors: Yu-Cheng Qiu, Jie Sheng, Liang Tan, Chuan-Yang Xing

Abstract: Atomic dark matter is usually considered to be produced asymmetrically in the early Universe. In this work, we first propose that the symmetric atomic dark matter can be thermally produced through the freeze-out mechanism. The dominant atom anti-atom annihilation channel is the atomic rearrangement. It has a geometrical cross section much larger than that of elementary fermions. After the atomic f… ▽ More Atomic dark matter is usually considered to be produced asymmetrically in the early Universe. In this work, we first propose that the symmetric atomic dark matter can be thermally produced through the freeze-out mechanism. The dominant atom anti-atom annihilation channel is the atomic rearrangement. It has a geometrical cross section much larger than that of elementary fermions. After the atomic formation, this annihilation process further depletes dark matter particles and finally freezes out. To give the observed dark matter relic, the dark atoms are naturally ultraheavy, ranging from $10^6$ to $10^{10} \,\mathrm{GeV}$. △ Less

Submitted 8 May, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: 8 pages, 4 figures, version to appear in Phys. Rev. D

arXiv:2312.11774 [pdf, other]

Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation

Authors: Yuze He, Yushi Bai, Matthieu Lin, Jenny Sheng, Yubin Hu, Qi Wang, Yu-Hui Wen, Yong-Jin Liu

Abstract: By lifting the pre-trained 2D diffusion models into Neural Radiance Fields (NeRFs), text-to-3D generation methods have made great progress. Many state-of-the-art approaches usually apply score distillation sampling (SDS) to optimize the NeRF representations, which supervises the NeRF optimization with pre-trained text-conditioned 2D diffusion models such as Imagen. However, the supervision signal… ▽ More By lifting the pre-trained 2D diffusion models into Neural Radiance Fields (NeRFs), text-to-3D generation methods have made great progress. Many state-of-the-art approaches usually apply score distillation sampling (SDS) to optimize the NeRF representations, which supervises the NeRF optimization with pre-trained text-conditioned 2D diffusion models such as Imagen. However, the supervision signal provided by such pre-trained diffusion models only depends on text prompts and does not constrain the multi-view consistency. To inject the cross-view consistency into diffusion priors, some recent works finetune the 2D diffusion model with multi-view data, but still lack fine-grained view coherence. To tackle this challenge, we incorporate multi-view image conditions into the supervision signal of NeRF optimization, which explicitly enforces fine-grained view consistency. With such stronger supervision, our proposed text-to-3D method effectively mitigates the generation of floaters (due to excessive densities) and completely empty spaces (due to insufficient densities). Our quantitative evaluations on the T$^3$Bench dataset demonstrate that our method achieves state-of-the-art performance over existing text-to-3D methods. We will make the code publicly available. △ Less

Submitted 18 December, 2023; originally announced December 2023.

arXiv:2312.03290 [pdf, other]

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

Authors: Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin, Hongyuan Zha, Xiangfeng Wang

Abstract: The formidable capacity for zero- or few-shot decision-making in language agents encourages us to pose a compelling question: Can language agents be alternatives to PPO agents in traditional sequential decision-making tasks? To investigate this, we first take environments collected in OpenAI Gym as our testbeds and ground them to textual environments that construct the TextGym simulator. This allo… ▽ More The formidable capacity for zero- or few-shot decision-making in language agents encourages us to pose a compelling question: Can language agents be alternatives to PPO agents in traditional sequential decision-making tasks? To investigate this, we first take environments collected in OpenAI Gym as our testbeds and ground them to textual environments that construct the TextGym simulator. This allows for straightforward and efficient comparisons between PPO agents and language agents, given the widespread adoption of OpenAI Gym. To ensure a fair and effective benchmarking, we introduce $5$ levels of scenario for accurate domain-knowledge controlling and a unified RL-inspired framework for language agents. Additionally, we propose an innovative explore-exploit-guided language (EXE) agent to solve tasks within TextGym. Through numerical experiments and ablation studies, we extract valuable insights into the decision-making capabilities of language agents and make a preliminary evaluation of their potential to be alternatives to PPO in classical sequential decision-making problems. This paper sheds light on the performance of language agents and paves the way for future research in this exciting domain. Our code is publicly available at~\url{https://github.com/mail-ecnu/Text-Gym-Agents}. △ Less

Submitted 5 December, 2023; originally announced December 2023.

arXiv:2311.03892 [pdf]

Magnetic-field tuned anisotropic quantum phase transition in the distorted kagome antiferromagnet Nd3BWO9

Authors: Fangyuan song, Han Ge, Andi Liu, Yuqi Qin, Yuyan Han, Langsheng Ling, Songliu Yuan, Zhongwen Ouyang, Jieming Sheng, Liusuo Wu, Zhaoming Tian

Abstract: Rare-earth (RE) kagome-lattice magnets offer an excellent platform to discover the novel magnetic phase as well as quantum phase transition tuned by non-thermal control parameters, while the experimental realizations remain largely unexplored. Here, we report the discovery of magnetic-field (B)-induced anisotropic quantum phase transition in a distorted kagome antiferromagnet Nd3BWO9 with TN~0.32… ▽ More Rare-earth (RE) kagome-lattice magnets offer an excellent platform to discover the novel magnetic phase as well as quantum phase transition tuned by non-thermal control parameters, while the experimental realizations remain largely unexplored. Here, we report the discovery of magnetic-field (B)-induced anisotropic quantum phase transition in a distorted kagome antiferromagnet Nd3BWO9 with TN~0.32 K. The isothermal magnetizations at 0.05 K exhibit the spin-flop like metamagnetic crossover behaviors with different fractional magnetization anomalies for B perpendicular (B // c-axis) and parallel (B // a*-axis) to the kagome plane, respectively. In combination with the thermodynamic measurements, the field-temperature (B-T) phase diagrams for both field directions are constructed and that reveal the existence of several field-induced magnetic states. Along the c-axis, a proximate quantum bicritical point is observed near the metamagnetic crossover, which separates the low-field antiferromagnetic (AFM) phase and the intermediate AFM phase. While, for B // a*, another intermediate magnetic phase (IAFM2) appears between the low-field AFM phase and intermediate AFM (IAFM1) phase, giving rise to a tetracritical point. These results support the anisotropic field-induced metamagnetic quantum criticalities in Nd3BWO9, making it as a rare kagome antiferromagnet to investigate the quantum multi-criticality driven by spin frustration. △ Less

Submitted 7 November, 2023; originally announced November 2023.

Comments: 22 pages, 7 figures

MSC Class: 81-05 ACM Class: E.1

arXiv:2311.03162 [pdf, other]

Phase-field simulations of the effect of temperature and interface for zirconium $δ\mbox{-}$hydrides

Authors: Zi-Hang Chen, Jie Sheng, Yu Liu, Xiao-Ming Shi, Houbing Huang, Ke Xu, Yue-Chao Wang, Shuai Wu, Bo Sun, Hai-Feng Liu, Hai-Feng Song

Abstract: Hydride precipitation in zirconium cladding materials can damage their integrity and durability.Service temperature and material defects have a significant effect on the dynamic growth of hydrides. In this study, we have developed a phase field model based on the assumption of elastic behaviour within a specific temperature range (613-653K). This model allows us to study the influence of temperatu… ▽ More Hydride precipitation in zirconium cladding materials can damage their integrity and durability.Service temperature and material defects have a significant effect on the dynamic growth of hydrides. In this study, we have developed a phase field model based on the assumption of elastic behaviour within a specific temperature range (613-653K). This model allows us to study the influence of temperature and interfacial effects on the morphology, stress, and average growth rate of zirconium hydride. The results suggest that changes in temperature and interfacial energy influence the aspect ratio and average growth rate of the hydride morphology. The ultimate determinant of hydride orientation is the loss of interfacial coherence, primarily induced by interfacial dislocation defects and quantifiable by the mismatch degree $q$. An escalation in interfacial coherence loss leads to a transition of hydride growth from horizontal to vertical, accompanied by the onset of redirection behaviour. Interestingly, redirection occurs at a critical mismatch level, denoted $q_c$, and remains unaffected by variations in temperature and interfacial energy. However, this redirection leads to an increase in the maximum stress, which may influence the direction of hydride crack propagation. This research highlights the importance of interfacial coherence and provides valuable insights into the morphology and growth kinetics of hydrides in zirconium alloys. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: 13 pages (text), 8 figures (text), 1 table (text), 6 pages (SI), 2 figures (SI)

arXiv:2310.05420 [pdf, other]

Féeton ($B-L$ Gauge Boson) Dark Matter for the 511-keV Gamma-Ray Excess and the Prediction of Low-energy Neutrino Flux

Authors: Yu Cheng, Weikang Lin, Jie Sheng, Tsutomu T. Yanagida

Abstract: The féeton is the gauge boson of the $U(1)_{B-L}$ gauge theory. If the gauge coupling constant is extremely small, it becomes a candidate for dark matter. We show that its decay to a pair of electron and positron explains the observed Galactic 511-keV gamma-ray excess in a consistent manner. This féeton dark matter decays mainly into pairs of neutrino and anti-neutrino. Future low-energy experimen… ▽ More The féeton is the gauge boson of the $U(1)_{B-L}$ gauge theory. If the gauge coupling constant is extremely small, it becomes a candidate for dark matter. We show that its decay to a pair of electron and positron explains the observed Galactic 511-keV gamma-ray excess in a consistent manner. This féeton dark matter decays mainly into pairs of neutrino and anti-neutrino. Future low-energy experiments with improved directional capability make it possible to capture those neutrino signals. The seesaw-motivated parameter space predicts a relatively short féeton lifetime comparable to the current cosmological constraint. △ Less

Submitted 9 May, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

Comments: 7 pages, 4 figures. Published in Chinese Physics C

arXiv:2310.02977 [pdf, other]

T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation

Authors: Yuze He, Yushi Bai, Matthieu Lin, Wang Zhao, Yubin Hu, Jenny Sheng, Ran Yi, Juanzi Li, Yong-Jin Liu

Abstract: Recent methods in text-to-3D leverage powerful pretrained diffusion models to optimize NeRF. Notably, these methods are able to produce high-quality 3D scenes without training on 3D data. Due to the open-ended nature of the task, most studies evaluate their results with subjective case studies and user experiments, thereby presenting a challenge in quantitatively addressing the question: How has c… ▽ More Recent methods in text-to-3D leverage powerful pretrained diffusion models to optimize NeRF. Notably, these methods are able to produce high-quality 3D scenes without training on 3D data. Due to the open-ended nature of the task, most studies evaluate their results with subjective case studies and user experiments, thereby presenting a challenge in quantitatively addressing the question: How has current progress in Text-to-3D gone so far? In this paper, we introduce T$^3$Bench, the first comprehensive text-to-3D benchmark containing diverse text prompts of three increasing complexity levels that are specially designed for 3D generation. To assess both the subjective quality and the text alignment, we propose two automatic metrics based on multi-view images produced by the 3D contents. The quality metric combines multi-view text-image scores and regional convolution to detect quality and view inconsistency. The alignment metric uses multi-view captioning and GPT-4 evaluation to measure text-3D consistency. Both metrics closely correlate with different dimensions of human judgments, providing a paradigm for efficiently evaluating text-to-3D models. The benchmarking results, shown in Fig. 1, reveal performance differences among an extensive 10 prevalent text-to-3D methods. Our analysis further highlights the common struggles for current methods on generating surroundings and multi-object scenes, as well as the bottleneck of leveraging 2D guidance for 3D generation. Our project page is available at: https://t3bench.com. △ Less

Submitted 17 April, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

Comments: Under review

arXiv:2310.00434 [pdf, other]

DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models

Authors: Zhiyao Sun, Tian Lv, Sheng Ye, Matthieu Lin, Jenny Sheng, Yu-Hui Wen, Minjing Yu, Yong-Jin Liu

Abstract: The generation of stylistic 3D facial animations driven by speech presents a significant challenge as it requires learning a many-to-many mapping between speech, style, and the corresponding natural facial motion. However, existing methods either employ a deterministic model for speech-to-motion mapping or encode the style using a one-hot encoding scheme. Notably, the one-hot encoding approach fai… ▽ More The generation of stylistic 3D facial animations driven by speech presents a significant challenge as it requires learning a many-to-many mapping between speech, style, and the corresponding natural facial motion. However, existing methods either employ a deterministic model for speech-to-motion mapping or encode the style using a one-hot encoding scheme. Notably, the one-hot encoding approach fails to capture the complexity of the style and thus limits generalization ability. In this paper, we propose DiffPoseTalk, a generative framework based on the diffusion model combined with a style encoder that extracts style embeddings from short reference videos. During inference, we employ classifier-free guidance to guide the generation process based on the speech and style. In particular, our style includes the generation of head poses, thereby enhancing user perception. Additionally, we address the shortage of scanned 3D talking face data by training our model on reconstructed 3DMM parameters from a high-quality, in-the-wild audio-visual dataset. Extensive experiments and user study demonstrate that our approach outperforms state-of-the-art methods. The code and dataset are at https://diffposetalk.github.io . △ Less

Submitted 14 May, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

Comments: SIGGRAPH 2024 (Journal Track). Project page: https://diffposetalk.github.io/

arXiv:2309.12043 [pdf, other]

Dark Matter Annihilation via Breit-Wigner Enhancement with Heavier Mediator

Authors: Yu Cheng, Shao-Feng Ge, Jie Sheng, Tsutomu T. Yanagida

Abstract: We propose a new scenario that both the dark matter freeze-out in the early Universe and its possible annihilation for indirect detection around a supermassive black hole are enhanced by a Breit-Wigner resonance. With the mediator mass larger than the total initial dark matter mass, this annihilation is almost forbidden at late times. Thus, the stringent cosmic microwave background and indirect de… ▽ More We propose a new scenario that both the dark matter freeze-out in the early Universe and its possible annihilation for indirect detection around a supermassive black hole are enhanced by a Breit-Wigner resonance. With the mediator mass larger than the total initial dark matter mass, this annihilation is almost forbidden at late times. Thus, the stringent cosmic microwave background and indirect detection constraints do not apply. However, a supermassive black hole can accelerate the dark matter particles to reactivate this resonant annihilation whose subsequent decay to photons leaves a unique signal. The running Fermi-LAT and the future COSI satellites can test this scenario. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: 6 pages, 3 figures

arXiv:2309.01974 [pdf]

Anomalous Thermodynamic Cost of Clock Synchronization

Authors: Cheng Yang, Jiteng Sheng, Haibin Wu

Abstract: Clock synchronization is critically important in positioning, navigation and timing systems. While its performance has been intensively studied in a wide range of disciplines, much less is known for the fundamental thermodynamics of clock synchronization, what limits the precision and how to optimize the energy cost for clock synchronization. Here, we report the first experimental investigation of… ▽ More Clock synchronization is critically important in positioning, navigation and timing systems. While its performance has been intensively studied in a wide range of disciplines, much less is known for the fundamental thermodynamics of clock synchronization, what limits the precision and how to optimize the energy cost for clock synchronization. Here, we report the first experimental investigation of two stochastic clocks synchronization, unveiling the thermodynamic relation between the entropy cost and clock synchronization in an open cavity optomechanical system. Two autonomous clocks are synchronized spontaneously by engineering the controllable photon-mediated dissipative optomechanical coupling and the disparate decay rates of hybrid modes. The measured dependence of the degree of synchronization on entropy cost exhibits an unexpected non-monotonic characteristic, indicating that the perfect clock synchronization does not cost the maximum entropy and there exists an optimum. The investigation of transient dynamics of clock synchronization exposes a trade-off between energy and time consumption. Our results reveal the fundamental relation between clock synchronization and thermodynamics, and have a great potential for precision measurements, distributed quantum networks, and biological science. △ Less

Submitted 5 September, 2023; originally announced September 2023.

arXiv:2308.15143 [pdf, other]

doi 10.1038/s42256-024-00861-3

Lifelike Agility and Play in Quadrupedal Robots using Reinforcement Learning and Generative Pre-trained Models

Authors: Lei Han, Qingxu Zhu, Jiapeng Sheng, Chong Zhang, Tingguang Li, Yizheng Zhang, He Zhang, Yuzhen Liu, Cheng Zhou, Rui Zhao, Jie Li, Yufeng Zhang, Rui Wang, Wanchao Chi, Xiong Li, Yonghui Zhu, Lingzhu Xiang, Xiao Teng, Zhengyou Zhang

Abstract: Knowledge from animals and humans inspires robotic innovations. Numerous efforts have been made to achieve agile locomotion in quadrupedal robots through classical controllers or reinforcement learning approaches. These methods usually rely on physical models or handcrafted rewards to accurately describe the specific system, rather than on a generalized understanding like animals do. Here we propo… ▽ More Knowledge from animals and humans inspires robotic innovations. Numerous efforts have been made to achieve agile locomotion in quadrupedal robots through classical controllers or reinforcement learning approaches. These methods usually rely on physical models or handcrafted rewards to accurately describe the specific system, rather than on a generalized understanding like animals do. Here we propose a hierarchical framework to construct primitive-, environmental- and strategic-level knowledge that are all pre-trainable, reusable and enrichable for legged robots. The primitive module summarizes knowledge from animal motion data, where, inspired by large pre-trained models in language and image understanding, we introduce deep generative models to produce motor control signals stimulating legged robots to act like real animals. Then, we shape various traversing capabilities at a higher level to align with the environment by reusing the primitive module. Finally, a strategic module is trained focusing on complex downstream tasks by reusing the knowledge from previous levels. We apply the trained hierarchical controllers to the MAX robot, a quadrupedal robot developed in-house, to mimic animals, traverse complex obstacles and play in a designed challenging multi-agent chase tag game, where lifelike agility and strategy emerge in the robots. △ Less

Submitted 6 July, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

Comments: Published in Nature Machine Intelligence, Vol. 7, 2024

Journal ref: Nature Machine Intelligence, Vol. 7, 2024

arXiv:2308.03273 [pdf, other]

Learning Terrain-Adaptive Locomotion with Agile Behaviors by Imitating Animals

Authors: Tingguang Li, Yizheng Zhang, Chong Zhang, Qingxu Zhu, Jiapeng sheng, Wanchao Chi, Cheng Zhou, Lei Han

Abstract: In this paper, we present a general learning framework for controlling a quadruped robot that can mimic the behavior of real animals and traverse challenging terrains. Our method consists of two steps: an imitation learning step to learn from motions of real animals, and a terrain adaptation step to enable generalization to unseen terrains. We capture motions from a Labrador on various terrains to… ▽ More In this paper, we present a general learning framework for controlling a quadruped robot that can mimic the behavior of real animals and traverse challenging terrains. Our method consists of two steps: an imitation learning step to learn from motions of real animals, and a terrain adaptation step to enable generalization to unseen terrains. We capture motions from a Labrador on various terrains to facilitate terrain adaptive locomotion. Our experiments demonstrate that our policy can traverse various terrains and produce a natural-looking behavior. We deployed our method on the real quadruped robot Max via zero-shot simulation-to-reality transfer, achieving a speed of 1.1 m/s on stairs climbing. △ Less

Submitted 6 August, 2023; originally announced August 2023.

Comments: 7 pages, 5 figures. To be published in IROS 2023

arXiv:2308.02103 [pdf, other]

Prompt2Gaussia: Uncertain Prompt-learning for Script Event Prediction

Authors: Shiyao Cui, Xin Cong, Jiawei Sheng, Xuebin Wang, Tingwen Liu, Jinqiao Shi

Abstract: Script Event Prediction (SEP) aims to predict the subsequent event for a given event chain from a candidate list. Prior research has achieved great success by integrating external knowledge to enhance the semantics, but it is laborious to acquisite the appropriate knowledge resources and retrieve the script-related knowledge. In this paper, we regard public pre-trained language models as knowledge… ▽ More Script Event Prediction (SEP) aims to predict the subsequent event for a given event chain from a candidate list. Prior research has achieved great success by integrating external knowledge to enhance the semantics, but it is laborious to acquisite the appropriate knowledge resources and retrieve the script-related knowledge. In this paper, we regard public pre-trained language models as knowledge bases and automatically mine the script-related knowledge via prompt-learning. Still, the scenario-diversity and label-ambiguity in scripts make it uncertain to construct the most functional prompt and label token in prompt learning, i.e., prompt-uncertainty and verbalizer-uncertainty. Considering the innate ability of Gaussian distribution to express uncertainty, we deploy the prompt tokens and label tokens as random variables following Gaussian distributions, where a prompt estimator and a verbalizer estimator are proposed to estimate their probabilistic representations instead of deterministic representations. We take the lead to explore prompt-learning in SEP and provide a fresh perspective to enrich the script semantics. Our method is evaluated on the most widely used benchmark and a newly proposed large-scale one. Experiments show that our method, which benefits from knowledge evoked from pre-trained language models, outperforms prior baselines by 1.46\% and 1.05\% on two benchmarks, respectively. △ Less

Submitted 3 August, 2023; originally announced August 2023.

Comments: 16 pages

arXiv:2306.16920 [pdf, other]

A multiphase-field model for simulating the hydrogen-induced multi-spot corrosion on the surface of polycrystalline metals: Application to uranium metal

Authors: Jie Sheng, Yu Liu, Xiao-Ming Shi, Yue-Chao Wang, Zi-Hang Chen, Ke Xu, Shuai Wu, Hou-Bing Huang, Bo Sun, Hai-Feng Liu, Hai-Feng Song

Abstract: Hydrogen-induced multi-spot corrosion on the surface of polycrystalline rare metals is a complex process, which involves the interactions between phases (metal, hydride and oxide), grain orientations, grain boundaries, and corrosion spots. To accurately simulate this process and comprehend the underlying physics, a theoretical method is required that includes the following mechanisms: i) hydrogen… ▽ More Hydrogen-induced multi-spot corrosion on the surface of polycrystalline rare metals is a complex process, which involves the interactions between phases (metal, hydride and oxide), grain orientations, grain boundaries, and corrosion spots. To accurately simulate this process and comprehend the underlying physics, a theoretical method is required that includes the following mechanisms: i) hydrogen diffusion, ii) phase transformation, iii) elastic interactions between phases, especially, the interactions between the oxide film and the hydride, iv) elastic interactions between grains, and v) interactions between hydrogen solutes and grain boundaries. In this study, we report a multiphase-field model that incorporates all these requirements, and conduct a comprehensive study of hydrogen-induced spot corrosion on the uranium metal surface, including the investigation of the oxide film, multi-spot corrosion, grain orientation, and grain boundary in the monocrystal, bicrystal, and polycrystal systems. The results indicate that the oxide film can inhibit the growth of hydrides and plays a crucial role in determining the correct morphology of the hydride at the triple junction of phases. The elastic interaction between multiple corrosion spots causes the merging of corrosion spots and promotes the growth of hydrides. The introduction of grain orientations and grain boundaries results in a variety of intriguing intracrystalline and intergranular hydride morphologies. The model presented here is generally applicable to the hydrogen-induced multi-spot corrosion on any rare metal surface. △ Less

Submitted 30 June, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

Comments: 22 pages (text), 16 figures (text), 2 tables (text), 9 pages (SI), 13 figures (SI)

arXiv:2306.09695 [pdf, other]

Bose-Einstein condensation of a two-magnon bound state in a spin-one triangular lattice

Authors: Jieming Sheng, Jia-Wei Mei, Le Wang, Wenrui Jiang, Lei Xu, Han Ge, Nan Zhao, Tiantian Li, Andrea Candini, Bin Xi, Jize Zhao, Ying Fu, Jiong Yang, Yuanzhu Zhang, Giorgio Biasiol, Shanmin Wang, Jinlong Zhu, Ping Miao, Xin Tong, Dapeng Yu, Richard Mole, Long Ma, Zhitao Zhang, Zhongwen Ouyang, Wei Tong , et al. (6 additional authors not shown)

Abstract: Interactions of collective excitations often lead to rich emergent phenomena in many-particle quantum systems. In ordered magnets, the elementary excitations are spin waves (magnons), which obey Bose-Einstein statistics. Similar to the Cooper pairs in superconductors, magnons can be paired into bound states under attractive interactions. Even more interestingly, the Zeeman coupling to a magnetic f… ▽ More Interactions of collective excitations often lead to rich emergent phenomena in many-particle quantum systems. In ordered magnets, the elementary excitations are spin waves (magnons), which obey Bose-Einstein statistics. Similar to the Cooper pairs in superconductors, magnons can be paired into bound states under attractive interactions. Even more interestingly, the Zeeman coupling to a magnetic field acts as a chemical potential that can tune the particle density through a quantum critical point (QCP), beyond which a ``hidden order'' is predicted to exist. However, experimental confirmation of this QCP and the associated new state of matter remain elusive. Here we report direct observation of the Bose-Einstein condensation (BEC) of the two-magnon bound state in Na$_2$BaNi(PO$_4$)$_2$. Comprehensive thermodynamic measurements confirmed the existence of a two-dimensional BEC-QCP at the saturation field. Inelastic neutron scattering experiments were performed to accurately establish the magnetic exchange model. An exact solution of the model found stable 2-magnon bound states that were further confirmed by an electron spin resonance (ESR) experiment, demonstrating that the QCP is due to the pair condensation and the phase below saturation field is the long-sought-after spin nematic (SN) phase. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Comments: 53 pages, 28 figures

arXiv:2306.08964 [pdf, other]

Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification

Authors: Jingyi Zhou, Jiamu Sheng, Jiayuan Fan, Peng Ye, Tong He, Bin Wang, Tao Chen

Abstract: The effectiveness of spectral-spatial feature learning is crucial for the hyperspectral image (HSI) classification task. Diffusion models, as a new class of groundbreaking generative models, have the ability to learn both contextual semantics and textual details from the distinct timestep dimension, enabling the modeling of complex spectral-spatial relations in HSIs. However, existing diffusion-ba… ▽ More The effectiveness of spectral-spatial feature learning is crucial for the hyperspectral image (HSI) classification task. Diffusion models, as a new class of groundbreaking generative models, have the ability to learn both contextual semantics and textual details from the distinct timestep dimension, enabling the modeling of complex spectral-spatial relations in HSIs. However, existing diffusion-based HSI classification methods only utilize manually selected single-timestep single-stage features, limiting the full exploration and exploitation of rich contextual semantics and textual information hidden in the diffusion model. To address this issue, we propose a novel diffusion-based feature learning framework that explores Multi-Timestep Multi-Stage Diffusion features for HSI classification for the first time, called MTMSD. Specifically, the diffusion model is first pretrained with unlabeled HSI patches to mine the connotation of unlabeled data, and then is used to extract the multi-timestep multi-stage diffusion features. To effectively and efficiently leverage multi-timestep multi-stage features,two strategies are further developed. One strategy is class & timestep-oriented multi-stage feature purification module with the inter-class and inter-timestep prior for reducing the redundancy of multi-stage features and alleviating memory constraints. The other one is selective timestep feature fusion module with the guidance of global features to adaptively select different timestep features for integrating texture and semantics. Both strategies facilitate the generality and adaptability of the MTMSD framework for diverse patterns of different HSI data. Extensive experiments are conducted on four public HSI datasets, and the results demonstrate that our method outperforms state-of-the-art methods for HSI classification, especially on the challenging Houston 2018 dataset. △ Less

Submitted 3 June, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

arXiv:2306.05353 [pdf, other]

Negotiated Reasoning: On Provably Addressing Relative Over-Generalization

Authors: Junjie Sheng, Wenhao Li, Bo Jin, Hongyuan Zha, Jun Wang, Xiangfeng Wang

Abstract: Over-generalization is a thorny issue in cognitive science, where people may become overly cautious due to past experiences. Agents in multi-agent reinforcement learning (MARL) also have been found to suffer relative over-generalization (RO) as people do and stuck to sub-optimal cooperation. Recent methods have shown that assigning reasoning ability to agents can mitigate RO algorithmically and em… ▽ More Over-generalization is a thorny issue in cognitive science, where people may become overly cautious due to past experiences. Agents in multi-agent reinforcement learning (MARL) also have been found to suffer relative over-generalization (RO) as people do and stuck to sub-optimal cooperation. Recent methods have shown that assigning reasoning ability to agents can mitigate RO algorithmically and empirically, but there has been a lack of theoretical understanding of RO, let alone designing provably RO-free methods. This paper first proves that RO can be avoided when the MARL method satisfies a consistent reasoning requirement under certain conditions. Then we introduce a novel reasoning framework, called negotiated reasoning, that first builds the connection between reasoning and RO with theoretical justifications. After that, we propose an instantiated algorithm, Stein variational negotiated reasoning (SVNR), which uses Stein variational gradient descent to derive a negotiation policy that provably avoids RO in MARL under maximum entropy policy iteration. The method is further parameterized with neural networks for amortized learning, making computation efficient. Numerical experiments on many RO-challenged environments demonstrate the superiority and efficiency of SVNR compared to state-of-the-art methods in addressing RO. △ Less

Submitted 8 June, 2023; originally announced June 2023.

Comments: 21 pages

arXiv:2306.00657 [pdf, other]

Associated Production of Neutrino and Dark Fermion at Future Lepton Colliders

Authors: Shao-Feng Ge, Kai Ma, Xiao-Dong Ma, Jie Sheng

Abstract: Fermionic dark matter can be pairly produced and hence searched with missing energy at colliders. We extend such probe to the associated production of a neutrino and a dark sector fermion at the future $e^+ e^-$ colliders such as CEPC, FCC-ee, ILC, and CLIC. Two typical processes, the mono-photon and electron-positron pair productions associated with missing energy, can serve the purpose. While th… ▽ More Fermionic dark matter can be pairly produced and hence searched with missing energy at colliders. We extend such probe to the associated production of a neutrino and a dark sector fermion at the future $e^+ e^-$ colliders such as CEPC, FCC-ee, ILC, and CLIC. Two typical processes, the mono-photon and electron-positron pair productions associated with missing energy, can serve the purpose. While the mono-photon search prevails at CEPC, FCC-ee, and ILC, the $e^+ e^- \met$ channel has more significant contributions at CLIC with much higher collision energy $\sqrt s$. The beam polarizations can help further suppressing the SM backgrounds to enhance the signal significance while differential cross sections can distinguish the Lorentz structure of various effective operators. The combined sensitivity can reach well above $1\tev$ at CEPC/FCC-ee and ILC while it further touches 30\,TeV at CLIC. Comparing with the updated results from the direct detection experiments (XENON1T, PandaX-II, PandaX-4T, LZ, and XENONnT), astrophysical $X/γ$-ray observations, and cosmological constraints for the sub-MeV absorption dark matter, the collider searches are actually more sensitive and hence can provide a complementary approach to addressing the dark fermions. △ Less

Submitted 21 September, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

Comments: 34pages, 17 captioned figures; v2: Title and text are improved

arXiv:2304.10045 [pdf, other]

ID-MixGCL: Identity Mixup for Graph Contrastive Learning

Authors: Gehang Zhang, Bowen Yu, Jiangxia Cao, Xinghua Zhang, Jiawei Sheng, Chuan Zhou, Tingwen Liu

Abstract: Graph contrastive learning (GCL) has recently achieved substantial advancements. Existing GCL approaches compare two different ``views'' of the same graph in order to learn node/graph representations. The underlying assumption of these studies is that the graph augmentation strategy is capable of generating several different graph views such that the graph views are structurally different but sema… ▽ More Graph contrastive learning (GCL) has recently achieved substantial advancements. Existing GCL approaches compare two different ``views'' of the same graph in order to learn node/graph representations. The underlying assumption of these studies is that the graph augmentation strategy is capable of generating several different graph views such that the graph views are structurally different but semantically similar to the original graphs, and thus the ground-truth labels of the original and augmented graph/nodes can be regarded identical in contrastive learning. However, we observe that this assumption does not always hold. For instance, the deletion of a super-node within a social network can exert a substantial influence on the partitioning of communities for other nodes. Similarly, any perturbation to nodes or edges in a molecular graph will change the labels of the graph. Therefore, we believe that augmenting the graph, accompanied by an adaptation of the labels used for the contrastive loss, will facilitate the encoder to learn a better representation. Based on this idea, we propose ID-MixGCL, which allows the simultaneous interpolation of input nodes and corresponding identity labels to obtain soft-confidence samples, with a controllable degree of change, leading to the capture of fine-grained representations from self-supervised training on unlabeled graphs. Experimental results demonstrate that ID-MixGCL improves performance on graph classification and node classification tasks, as demonstrated by significant improvements on the Cora, IMDB-B, IMDB-M, and PROTEINS datasets compared to state-of-the-art techniques, by 3-29% absolute points. △ Less

Submitted 17 January, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

Comments: 10 pages, 7 figures, accepted by IEEE BigData 2023

arXiv:2304.03891 [pdf, other]

doi 10.1145/3511808.3557262

Contrastive Cross-Domain Sequential Recommendation

Authors: Jiangxia Cao, Xin Cong, Jiawei Sheng, Tingwen Liu, Bin Wang

Abstract: Cross-Domain Sequential Recommendation (CDSR) aims to predict future interactions based on user's historical sequential interactions from multiple domains. Generally, a key challenge of CDSR is how to mine precise cross-domain user preference based on the intra-sequence and inter-sequence item interactions. Existing works first learn single-domain user preference only with intra-sequence item inte… ▽ More Cross-Domain Sequential Recommendation (CDSR) aims to predict future interactions based on user's historical sequential interactions from multiple domains. Generally, a key challenge of CDSR is how to mine precise cross-domain user preference based on the intra-sequence and inter-sequence item interactions. Existing works first learn single-domain user preference only with intra-sequence item interactions, and then build a transferring module to obtain cross-domain user preference. However, such a pipeline and implicit solution can be severely limited by the bottleneck of the designed transferring module, and ignores to consider inter-sequence item relationships. In this paper, we propose C^2DSR to tackle the above problems to capture precise user preferences. The main idea is to simultaneously leverage the intra- and inter- sequence item relationships, and jointly learn the single- and cross- domain user preferences. Specifically, we first utilize a graph neural network to mine inter-sequence item collaborative relationship, and then exploit sequential attentive encoder to capture intra-sequence item sequential relationship. Based on them, we devise two different sequential training objectives to obtain user single-domain and cross-domain representations. Furthermore, we present a novel contrastive cross-domain infomax objective to enhance the correlation between single- and cross- domain user representations by maximizing their mutual information. To validate the effectiveness of C^2DSR, we first re-split four e-comerce datasets, and then conduct extensive experiments to demonstrate the effectiveness of our approach C^2DSR. △ Less

Submitted 7 April, 2023; originally announced April 2023.

Comments: This paper has been accepted by CIKM 2022

arXiv:2304.02997 [pdf, other]

doi 10.1103/PhysRevD.107.123013

Right-Handed Neutrino Dark Matter with Forbidden Annihilation

Authors: Yu Cheng, Shao-Feng Ge, Jie Sheng, Tsutomu T. Yanagida

Abstract: The seesaw mechanism with three right-handed neutrinos has one as a well-motivated dark matter candidate if stable and the other two can explain baryon asymmetry via the thermal leptogenesis scenario. We explore the possibility of introducing additional particles to make the right-handed neutrino dark matter in thermal equilibrium and freeze out through a forbidden annihilation channel. Nowadays i… ▽ More The seesaw mechanism with three right-handed neutrinos has one as a well-motivated dark matter candidate if stable and the other two can explain baryon asymmetry via the thermal leptogenesis scenario. We explore the possibility of introducing additional particles to make the right-handed neutrino dark matter in thermal equilibrium and freeze out through a forbidden annihilation channel. Nowadays in the Universe, this forbidden channel can be reactivated by a strong gravitational potential such as the supermassive black hole in our galaxy center. The Fermi-LAT gamma ray data and dark matter relic density require this right-handed neutrino dark matter to have mass below $100\,$GeV and the existence of an additional boson $φ$ that can be tested at future lepton colliders. △ Less

Submitted 8 November, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

Comments: 7 pages, 1 figures

arXiv:2304.02328 [pdf, other]

doi 10.1109/TASLP.2023.3345146

Enhancing Multimodal Entity and Relation Extraction with Variational Information Bottleneck

Authors: Shiyao Cui, Jiangxia Cao, Xin Cong, Jiawei Sheng, Quangang Li, Tingwen Liu, Jinqiao Shi

Abstract: This paper studies the multimodal named entity recognition (MNER) and multimodal relation extraction (MRE), which are important for multimedia social platform analysis. The core of MNER and MRE lies in incorporating evident visual information to enhance textual semantics, where two issues inherently demand investigations. The first issue is modality-noise, where the task-irrelevant information in… ▽ More This paper studies the multimodal named entity recognition (MNER) and multimodal relation extraction (MRE), which are important for multimedia social platform analysis. The core of MNER and MRE lies in incorporating evident visual information to enhance textual semantics, where two issues inherently demand investigations. The first issue is modality-noise, where the task-irrelevant information in each modality may be noises misleading the task prediction. The second issue is modality-gap, where representations from different modalities are inconsistent, preventing from building the semantic alignment between the text and image. To address these issues, we propose a novel method for MNER and MRE by Multi-Modal representation learning with Information Bottleneck (MMIB). For the first issue, a refinement-regularizer probes the information-bottleneck principle to balance the predictive evidence and noisy information, yielding expressive representations for prediction. For the second issue, an alignment-regularizer is proposed, where a mutual information-based item works in a contrastive manner to regularize the consistent text-image representations. To our best knowledge, we are the first to explore variational IB estimation for MNER and MRE. Experiments show that MMIB achieves the state-of-the-art performances on three public benchmarks. △ Less

Submitted 5 April, 2023; originally announced April 2023.

Journal ref: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2024

arXiv:2304.01563 [pdf, other]

Attribute-Consistent Knowledge Graph Representation Learning for Multi-Modal Entity Alignment

Authors: Qian Li, Shu Guo, Yangyifei Luo, Cheng Ji, Lihong Wang, Jiawei Sheng, Jianxin Li

Abstract: The multi-modal entity alignment (MMEA) aims to find all equivalent entity pairs between multi-modal knowledge graphs (MMKGs). Rich attributes and neighboring entities are valuable for the alignment task, but existing works ignore contextual gap problems that the aligned entities have different numbers of attributes on specific modality when learning entity representations. In this paper, we propo… ▽ More The multi-modal entity alignment (MMEA) aims to find all equivalent entity pairs between multi-modal knowledge graphs (MMKGs). Rich attributes and neighboring entities are valuable for the alignment task, but existing works ignore contextual gap problems that the aligned entities have different numbers of attributes on specific modality when learning entity representations. In this paper, we propose a novel attribute-consistent knowledge graph representation learning framework for MMEA (ACK-MMEA) to compensate the contextual gaps through incorporating consistent alignment knowledge. Attribute-consistent KGs (ACKGs) are first constructed via multi-modal attribute uniformization with merge and generate operators so that each entity has one and only one uniform feature in each modality. The ACKGs are then fed into a relation-aware graph neural network with random dropouts, to obtain aggregated relation representations and robust entity representations. In order to evaluate the ACK-MMEA facilitated for entity alignment, we specially design a joint alignment loss for both entity and attribute evaluation. Extensive experiments conducted on two benchmark datasets show that our approach achieves excellent performance compared to its competitors. △ Less

Submitted 4 April, 2023; originally announced April 2023.

arXiv:2303.10584 [pdf, other]

doi 10.1103/PhysRevB.107.125126

Magnetic phase diagrams and large magnetocaloric effects of the two-dimensional antiferromagnetic triangular lattice of Gd$^{3+}$ ions in KBaGd(BO$_3$)$_2$

Authors: Z. M. Song, N. Zhao, H. Ge, T. T. Li, J. Yang, L. Wang, Y. Fu, Y. Z. Zhang, S. M. Wang, J. W. Mei, H. He, S. Guo, L. S. Wu, J. M. Sheng

Abstract: We report a detailed study of the magnetic properties of KBaGd(BO$_3$)$_2$, in which magnetic Gd$^{3+}$ ($S=7/2$) ions form into two-dimensional triangular layers. Magnetization, specific heat and magnetocaloric effect (MCE) measurements have been performed on KBaGd(BO$_3$)$_2$ single crystals. The results show that a long-range antiferromagnetic state is established below $T_{\rm N}=0.24$ K. In z… ▽ More We report a detailed study of the magnetic properties of KBaGd(BO$_3$)$_2$, in which magnetic Gd$^{3+}$ ($S=7/2$) ions form into two-dimensional triangular layers. Magnetization, specific heat and magnetocaloric effect (MCE) measurements have been performed on KBaGd(BO$_3$)$_2$ single crystals. The results show that a long-range antiferromagnetic state is established below $T_{\rm N}=0.24$ K. In zero fields, only about half of the full entropy is released at $T_{\rm N}$, indicating that not all the magnetic moments are frozen below the ordering temperature, as expected from the geometrical frustration of the triangular spin lattice. Further studies under external fields were performed down to 50 mK, and the magnetic phase diagrams are established with magnetic fields applied both within and perpendicular to the triangular plane. KBaGd(BO$_3$)$_2$ serves as an example of a two-dimensional triangular lattice with large spin values ($S=7/2$) and can be directly compared with the iso-structure KBaR(BO$_3$)$_2$ (R = Dy-Yb) family of doublet ground states, which exhibit effective spins of $S=1/2$. △ Less

Submitted 19 March, 2023; originally announced March 2023.

Comments: 8 pages, 5 figures

Journal ref: Physical Review B 107, 125126 (2023)

arXiv:2303.10578 [pdf, other]

doi 10.1103/PhysRevMaterials.7.034401

Quasi One-Dimensional Ising-like Antiferromagnetism in the Rare-earth Perovskite Oxide TbScO$_3$

Authors: Nan Zhao, Jieming Sheng, Jinchen Wang, Han Ge, Tiantian Li, Jiong Yang, Shanmin Wang, Ping Miao, Hua He, Xin Tong, Wei Bao, Er-Jia Guo, Richard Mole, Dehong Yu, Andrey A. Podlesnyak, Liusuo Wu

Abstract: The rare-earth perovskite TbScO$_3$ has been widely used as a substrate for the growth of epitaxial ferroelectric and multiferroic thin films, while its detailed low-temperature magnetic properties were rarely reported. In this paper, we performed detailed magnetization, specific heat and single crystal neutron scattering measurements, along with the crystalline electric field calculations to stud… ▽ More The rare-earth perovskite TbScO$_3$ has been widely used as a substrate for the growth of epitaxial ferroelectric and multiferroic thin films, while its detailed low-temperature magnetic properties were rarely reported. In this paper, we performed detailed magnetization, specific heat and single crystal neutron scattering measurements, along with the crystalline electric field calculations to study the low-temperature magnetic properties of TbScO$_3$. All our results suggest the magnetic Tb$^{3+}$ has an Ising-like pseudo-doublet ground state at low temperatures. Due to the constrain of local point symmetry, these Tb$^{3+}$ Ising moments are confined in the $ab$ plane with a tilt angle of $\varphi = \pm48^{\mathrm{o}}$ to the $a$ axis. In zero field, the system undergoes an antiferromagnetic phase transition at $T_{\mathrm{N}}=2.53$ K, and forms a $G_xA_y$ noncollinear magnetic structure below $T_{\mathrm{N}}$. We find the dipole-dipole interactions play an important role to determine the magnetic ground state, which are also responsible for the quasi-one-dimensional magnetism in TbScO$_3$. The significant anisotropic diffuse scatterings further confirm the quasi-one-dimensional magnetism along the $c$ axis. The magnetic phase diagram with the field along the easy $b$ axis is well established. In addition to the $G_xA_y$ antiferromagnetic state, there is an exotic field-induced phase emerged near the critical field $B_{\mathrm{c}}\simeq0.7$ T, where three-dimensional magnetic order is suppressed but strong one-dimensional correlations may still exist. △ Less

Submitted 19 March, 2023; originally announced March 2023.

Comments: 10 pages, 5 figures

Journal ref: Physical Review MATERIALS 7, 034401 (2023)

arXiv:2303.10405 [pdf, other]

doi 10.1103/PhysRevB.105.014441

Antiferromagnetism and Ising Ground States in the Rare-earth Garnet Nd$_3$Ga$_5$O$_{12}$

Authors: N. Zhao, H. Ge, L. Zhou, Z. M. Song, J. Yang, T. T. Li, L. Wang, Y. Fu, Y. F. Zhang, J. B. Xu, S. M. Wang, J. W. Mei, X. Tong, L. S. Wu, J. M. Sheng

Abstract: In this paper, we investigate the low temperature magnetic properties of the rare-earth garnet compound Nd$_3$Ga$_5$O$_{12}$ in detail by means of magnetization, specific heat and magnetocaloric effect measurements. The magnetic thermal properties along with the crystal field calculations reveal that the Nd$^{3+}$ ions form into a frustrated hyper-kagome lattice with connected triangles have an Is… ▽ More In this paper, we investigate the low temperature magnetic properties of the rare-earth garnet compound Nd$_3$Ga$_5$O$_{12}$ in detail by means of magnetization, specific heat and magnetocaloric effect measurements. The magnetic thermal properties along with the crystal field calculations reveal that the Nd$^{3+}$ ions form into a frustrated hyper-kagome lattice with connected triangles have an Ising-like ground state with the easy axis along the local [100], [010] and [001] directions. Instead of a quantum spin liquid ground state, an antiferromagnetically ordered state is found below $T_{\mathrm{N}}=0.52~\rm K$. With applying field in the [111] direction, the antiferromagnetic order is suppressed at the critical field of $B_{\mathrm{c}}=0.75~\rm T$, and enhancement of the critical fluctuations with linear crossover behaviors is observed near the critical point. △ Less

Submitted 18 March, 2023; originally announced March 2023.

Comments: 7 pages, 6 figures

Journal ref: Physical Review B 105, 014441 (2022)

arXiv:2303.08673 [pdf, other]

doi 10.1103/PhysRevMaterials.6.085001

Successive magnetic orderings in the Ising spin chain magnet DyNi$_5$Ge$_3$

Authors: H. Ge, L. Zhang, N. Zhao, J. Yang, L. Wang, L. Zhou, Y. Fu, T. T. Li, Z. M. Song, F. Ding, J. B. Xu, Y. F. Zhang, S. M. Wang, J. W. Mei, X. Tong, P. Miao, H. He, Q. Zhanghang, L. S. Wu, J. M. Sheng

Abstract: In this report, we investigated a new rare earth based one-dimensional Ising spin chain magnet~\DNG~by means of magnetization, specific heat and powder neutron diffraction measurements. Due to the crystalline electrical field splitting, the magnetic Dy ions share an Ising like ground doublet state. Owning to the local point symmetry, these Ising moments form into two canted magnetic sublattices, w… ▽ More In this report, we investigated a new rare earth based one-dimensional Ising spin chain magnet~\DNG~by means of magnetization, specific heat and powder neutron diffraction measurements. Due to the crystalline electrical field splitting, the magnetic Dy ions share an Ising like ground doublet state. Owning to the local point symmetry, these Ising moments form into two canted magnetic sublattices, which were further confirmed by the angle-dependent magnetization measurement. In zero fields, two successive antiferromagnetic phase transitions were found at temperatures $T_{\mathrm{N1}}=6~\rm K$ and $T_{\mathrm{N2}}=5~\rm K$, respectively. Only part of the moments are statically ordered in this intermediate state between $T_{\mathrm{N1}}$ and $T_{\mathrm{N2}}$. Powder neutron diffraction experiments at different temperatures were performed as well. An incommensurate magnetic propagation vector of $\mathbf{k_{\rm m}}=(0.5,0.4,0.5)$ was identified. The refined spin configurations through the irreducible representation analysis confirmed that these Ising spins are canted in the crystal $ab$~plane. △ Less

Submitted 15 March, 2023; originally announced March 2023.

Comments: 9 pages, 7 figures

Journal ref: Physical Review MATERIALS 6, 085001 (2022)

arXiv:2303.08661 [pdf, other]

doi 10.1103/PhysRevMaterials.7.024423

Magnetic phase diagram and multiple field-induced states in the intermetallic triangular-lattice antiferromagnet NdAuAl$_4$Ge$_2$ with Ising-like spins

Authors: Mengru Cong, Han Ge, Lei Zhang, Weijun Ren, Nan Zhao, Tiantian Li, Shanmin Wang, Jinlong Zhu, Jiawei Mei, Qiang Zhang, Jieming Sheng, Fei Gao, Bing Li, Zhidong Zhang, Liusuo Wu

Abstract: Geometrical frustration and the enhancement of strong quantum fluctuations in two-dimensional triangular antiferromagnets can lead to various intriguing phenomena. Here, we studied the spin-1/2 triangular lattice antiferromagnet NdAuAl$_4$Ge$_2$. Thermodynamic and transport properties, such as magnetization and specific heat together with the resistivity measurements were performed. In zero field,… ▽ More Geometrical frustration and the enhancement of strong quantum fluctuations in two-dimensional triangular antiferromagnets can lead to various intriguing phenomena. Here, we studied the spin-1/2 triangular lattice antiferromagnet NdAuAl$_4$Ge$_2$. Thermodynamic and transport properties, such as magnetization and specific heat together with the resistivity measurements were performed. In zero field, two successive phase transitions were observed at $T_{\rm N1}=1.75\pm 0.02$ K and $T_{\rm N2}=0.49\pm 0.02$ K, respectively. Under magnetic field, $\rm XXZ$-type anisotropy was revealed, with the moments pointing along the easy $c$ axis. For $B\parallel c$, multiple field-induced states were observed, and the magnetic phase diagram was established based on the specific heat and magnetization data. The temperature-dependent resistivity measurements indicate that NdAuAl$_4$Ge$_2$ is a good metal. It is very likely that both the long-range RKKY interactions and the geometrical frustration play an important roles in this case. △ Less

Submitted 15 March, 2023; originally announced March 2023.

Comments: 9 pages, 7 figures

Journal ref: Physical Review MATERIALS 7, 024423 (2023)

arXiv:2302.09838 [pdf, other]

JNDMix: JND-Based Data Augmentation for No-reference Image Quality Assessment

Authors: Jiamu Sheng, Jiayuan Fan, Peng Ye, Jianjian Cao

Abstract: Despite substantial progress in no-reference image quality assessment (NR-IQA), previous training models often suffer from over-fitting due to the limited scale of used datasets, resulting in model performance bottlenecks. To tackle this challenge, we explore the potential of leveraging data augmentation to improve data efficiency and enhance model robustness. However, most existing data augmentat… ▽ More Despite substantial progress in no-reference image quality assessment (NR-IQA), previous training models often suffer from over-fitting due to the limited scale of used datasets, resulting in model performance bottlenecks. To tackle this challenge, we explore the potential of leveraging data augmentation to improve data efficiency and enhance model robustness. However, most existing data augmentation methods incur a serious issue, namely that it alters the image quality and leads to training images mismatching with their original labels. Additionally, although only a few data augmentation methods are available for NR-IQA task, their ability to enrich dataset diversity is still insufficient. To address these issues, we propose a effective and general data augmentation based on just noticeable difference (JND) noise mixing for NR-IQA task, named JNDMix. In detail, we randomly inject the JND noise, imperceptible to the human visual system (HVS), into the training image without any adjustment to its label. Extensive experiments demonstrate that JNDMix significantly improves the performance and data efficiency of various state-of-the-art NR-IQA models and the commonly used baseline models, as well as the generalization ability. More importantly, JNDMix facilitates MANIQA to achieve the state-of-the-art performance on LIVEC and KonIQ-10k. △ Less

Submitted 20 February, 2023; originally announced February 2023.

Comments: Accepted by 48th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

arXiv:2301.12201 [pdf]

doi 10.1088/0256-307X/40/12/126101

Chiral Dirac fermion in a collinear antiferromagnet

Authors: Ao Zhang, Ke Deng, Jieming Sheng, Pengfei Liu, Shiv Kumar, Kenya Shimada, Zhicheng Jiang, Zhengtai Liu, Dawei Shen, Jiayu Li, Jun Ren, Le Wang, Liang Zhou, Yoshihisa Ishikawa, Qiang Zhang, Garry McIntyre, Dehong Yu, Enke Liu, Liusuo Wu, Chaoyu Chen, Qihang Liu

Abstract: In a Dirac semimetal, the massless Dirac fermion has zero chirality, leading to surface states connected adiabatically to a topologically trivial surface state as well as vanishing anomalous Hall effect (AHE). Recently, it is predicted that in the nonrelativistic limit of certain collinear antiferromagnets, there exists a type of chiral Dirac-like fermion, whose dispersion manifests four-fold dege… ▽ More In a Dirac semimetal, the massless Dirac fermion has zero chirality, leading to surface states connected adiabatically to a topologically trivial surface state as well as vanishing anomalous Hall effect (AHE). Recently, it is predicted that in the nonrelativistic limit of certain collinear antiferromagnets, there exists a type of chiral Dirac-like fermion, whose dispersion manifests four-fold degenerate crossing points formed by spin-degenerate linear bands, with topologically protected Fermi arcs. Such unconventional chiral fermion, protected by a hidden SU(2) symmetry in the hierarchy of an enhanced crystallographic group, namely spin space group, is not experimentally verified yet. Here, by angle-resolved photoemission spectroscopy measurements, we reveal the surface origin of the electron pocket at the Fermi surface in collinear antiferromagnet CoNb3S6. Combining with neutron diffraction and first-principles calculations, we suggest a multidomain collinear AFM configuration, rendering the the existence of the Fermi-arc surface states induced by chiral Dirac-like fermions. Our work provides spectral evidence of the chiral Dirac-like fermion caused by particular spin symmetry in CoNb3S6, paving an avenue for exploring new emergent phenomena in antiferromagnets with unconventional quasiparticle excitations. △ Less

Submitted 17 December, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

Comments: 19 pages, 4 figures

Journal ref: Chinese Physics Letters 40, 126101 (2023)

arXiv:2301.11621 [pdf, other]

Event Causality Extraction with Event Argument Correlations

Authors: Shiyao Cui, Jiawei Sheng, Xin Cong, QuanGang Li, Tingwen Liu, Jinqiao Shi

Abstract: Event Causality Identification (ECI), which aims to detect whether a causality relation exists between two given textual events, is an important task for event causality understanding. However, the ECI task ignores crucial event structure and cause-effect causality component information, making it struggle for downstream applications. In this paper, we explore a novel task, namely Event Causality… ▽ More Event Causality Identification (ECI), which aims to detect whether a causality relation exists between two given textual events, is an important task for event causality understanding. However, the ECI task ignores crucial event structure and cause-effect causality component information, making it struggle for downstream applications. In this paper, we explore a novel task, namely Event Causality Extraction (ECE), aiming to extract the cause-effect event causality pairs with their structured event information from plain texts. The ECE task is more challenging since each event can contain multiple event arguments, posing fine-grained correlations between events to decide the causeeffect event pair. Hence, we propose a method with a dual grid tagging scheme to capture the intra- and inter-event argument correlations for ECE. Further, we devise a event type-enhanced model architecture to realize the dual grid tagging scheme. Experiments demonstrate the effectiveness of our method, and extensive analyses point out several future directions for ECE. △ Less

Submitted 27 January, 2023; originally announced January 2023.

Comments: Accepted to COLING2022

Showing 1–50 of 144 results for author: Sheng, J