Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness

Authors: Honghao Chen, Yurong Zhang, Xiaokun Feng, Xiangxiang Chu, Kaiqi Huang

Abstract: Robustness is a vital aspect to consider when deploying deep learning models into the wild. Numerous studies have been dedicated to the robustness of vision transformers (ViTs), which have dominated as the mainstream backbone choice for vision tasks since the dawn of the 2020s. Recently, some large kernel convnets have made a comeback with impressive performance and efficiency. However, it still remains unclear whether large kernel networks are robust and what their robustness can be attributed to. In this paper, we first conduct a comprehensive evaluation of large kernel convnets' robustness and their differences from typical small kernel counterparts and ViTs on six diverse robustness benchmark datasets. Then, to analyze the underlying factors behind their strong robustness, we design experiments from both quantitative and qualitative perspectives to reveal large kernel convnets' intriguing properties that are completely different from typical convnets. Our experiments demonstrate for the first time that pure CNNs can achieve exceptional robustness comparable or even superior to that of ViTs. Our analysis of occlusion invariance, kernel attention patterns and frequency characteristics provides novel insights into the source of robustness.

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08398 [pdf, other]

Delocalization of skin steady states

Authors: Xu Feng, Shu Chen

Abstract: The skin effect, characterized by the tendency of particles to accumulate at the boundaries, has been extensively studied in non-Hermitian systems. In this work, we propose an intuitive Lindbladian composed of two chains with reversed skin localization. The skin steady state is gradually delocalized as the interchain coupling increases. In the single-body scenario, it corresponds to a shift in the scaling of the Liouvillian gap $Δ$ from $Δ\propto N^0$ to $Δ\propto N^{-2}$. Notably, exact diagonalization results reveal a system-size sensitivity of the single-particle Liouvillian spectrum, inherited from the non-Hermitian effective Hamiltonian's system-size sensitivity. We predict that even an arbitrarily small coupling will induce dramatic changes in the Liouvillian spectrum and steady state in the thermodynamic limit, a phenomenon we term the critical Liouvillian skin effect. Additionally, in the many-body scenario, by employing the stochastic Schrödinger equation to unravel the Lindblad master equation, it is revealed that the scaling behavior of steady-state entanglement changes from the area law to the logarithmic law. This work demonstrates the delocalization of both single-body and many-body skin steady states, introducing a novel mechanism for inducing entanglement transitions beyond the quantum Zeno effect.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 10 pages, 7 figures

arXiv:2407.06510 [pdf, other]

On the Northward Shift of the Heliospheric Current Sheet at the End of Solar Cycle 24

Authors: Huichao Li, Xueshang Feng

Abstract: Since solar cycle 16, the heliospheric current sheet (HCS) has been found to be shifted southward during the late declining to minimum phase. However, this trend is broken at the end of solar cycle 24. In this paper, we analyze the shift of the HCS by using information obtained from a coronal model and in situ data provided by the near-Earth OMNI database and the Parker Solar Probe (PSP). Coronal potential field source surface (PFSS) modeling results show that the northward shift is established at the beginning of 2018 and remains stable for about two years. Interplanetary magnetic field data obtained at and within 1 au also support the northward shift, as the southern polarity T appears more frequently than the northern polarity A between 2018 and 2020. Both the model results and the in situ observations obtained by PSP imply that the HCS shift is established in the corona and then propagates into the heliosphere. The quadrupole term still has a significant influence on the formation of the HCS shift.

Submitted 8 July, 2024; originally announced July 2024.

Comments: Accepted by MNRAS

arXiv:2407.04713 [pdf]

16-channel Photonic Solver for Optimization Problems on a Silicon Chip

Authors: Jiayi Ouyang, Shengping Liu, Ziyue Yang, Wei Wang, Xue Feng, Yongzhuo Li, Yidong Huang

Abstract: In this article, we propose a programmable 16-channel photonic solver for quadratic unconstrained binary optimization (QUBO) problems. The solver is based on a hybrid optoelectronic scheme including a photonic chip and the corresponding electronic driving circuit. The photonic chip is fabricated on a silicon-on-insulator (SOI) substrate and integrates high-speed electro-optic modulators, thermo-optic phase shifters and photodetectors to conduct the 16-dimensional optical vector-matrix multiplication (OVMM). Owing to the parallel and low-latency propagation of lightwaves, the calculation of the QUBO cost function can be accelerated. In addition, an electronic processor is employed to run the heuristic algorithm that searches for the optimal solution. In the experiment, two 16-dimensional randomly generated QUBO problems are solved with high success probabilities. To our knowledge, this is the largest-scale programmable on-chip photonic solver reported to date. Moreover, the computing speed of the OVMM on the photonic chip is ~2 TFLOP/s. This shows the potential for solving such optimization problems quickly with integrated photonic systems.

Submitted 5 June, 2024; originally announced July 2024.

arXiv:2407.04251 [pdf, other]

Unified Interpretation of Smoothing Methods for Negative Sampling Loss Functions in Knowledge Graph Embedding

Authors: Xincan Feng, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

Abstract: Knowledge Graphs (KGs) are fundamental resources in knowledge-intensive tasks in NLP. Due to the limitation of manually creating KGs, KG Completion (KGC) has an important role in automatically completing KGs by scoring their links with KG Embedding (KGE). To handle many entities in training, KGE relies on Negative Sampling (NS) loss that can reduce the computational cost by sampling. Since the appearance frequencies for each link are at most one in KGs, sparsity is an essential and inevitable problem. The NS loss is no exception. As a solution, the NS loss in KGE relies on smoothing methods like Self-Adversarial Negative Sampling (SANS) and subsampling. However, it is uncertain what kind of smoothing method is suitable for this purpose due to the lack of theoretical understanding. This paper provides theoretical interpretations of the smoothing methods for the NS loss in KGE and induces a new NS loss, Triplet Adaptive Negative Sampling (TANS), that can cover the characteristics of the conventional smoothing methods. Experimental results of TransE, DistMult, ComplEx, RotatE, HAKE, and HousE on FB15k-237, WN18RR, and YAGO3-10 datasets and their sparser subsets show the soundness of our interpretation and performance improvement by our TANS.

Submitted 5 July, 2024; originally announced July 2024.

Comments: 9 pages, 4 figures, 2 tables; accepted to workshop RepL4NLP held in conjunction with ACL 2024

arXiv:2407.04222 [pdf, ps, other]

Regge trajectories for the hidden bottom tetraquarks in the diquark picture

Authors: Jia-Qi Xie, He Song, Xia Feng, Jiao-Kai Chen

Abstract: We apply the newly proposed Regge trajectory relation for the heavy-light diquarks alongside the Regge trajectory formula for the heavy-heavy systems to investigate the hidden bottom tetraquarks $(bq)(\bar{b}\bar{q}')$ $(q,\,q'=u,\,d,\,s)$. The Regge trajectory formulas for the hidden bottom tetraquarks are obtained. The masses of the $λ$-mode excited states and the $ρ$-mode excited states of the hidden bottom tetraquarks are estimated. Additionally, the $λ$-trajectories and the $ρ$-trajectories of the tetraquarks with hidden bottom are discussed. We find that the behavior of the $ρ$-trajectories is different from that of the $λ$-trajectories. The $ρ$-trajectories behave as $M{\sim}x_ρ^{1/2}$ $(x_ρ=n_r,\,l)$ because the $ρ$-mode excitations are in the diquark and antidiquark, with both the diquark $(bq)$ and antidiquark $(\bar{b}\bar{q}')$ being heavy-light systems. Conversely, the $λ$-trajectories behave as $M{\sim}x_λ^{2/3}$ $(x_λ=N_r,\,L)$ because the $λ$-mode excitations occur between the diquark $(bq)$ and antidiquark $(\bar{b}\bar{q}')$, with the diquark $(bq)$ and antidiquark $(\bar{b}\bar{q}')$ forming a heavy-heavy system $(bq)(\bar{b}\bar{q}')$. Moreover, not only the $λ$-trajectories but also the $ρ$-trajectories for the hidden bottom tetraquarks are concave downwards in the $(M^2,\,x)$ plane.

Submitted 4 July, 2024; originally announced July 2024.

Comments: 12 pages, 5 figures, 10 tables

arXiv:2407.00728 [pdf, other]

Magnetic Excitations in Ferromagnetically Coupled Spin-1 Nanographenes

Authors: Elia Turco, Fupeng Wu, Gonçalo Catarina, Nils Krane, Ji Ma, Roman Fasel, Xinliang Feng, Pascal Ruffieux

Abstract: In the quest for high-spin building blocks to form covalently bonded 1D or 2D materials with controlled magnetic interactions, $π$-electron magnetism provides an ideal framework to engineer large ferromagnetic interactions between nanographenes. As a first step in this direction, we investigate the spin properties of ferromagnetically coupled triangulenes, triangular nanographenes with spin $S = 1$. Combining in-solution synthesis of rationally designed molecular precursors and on-surface synthesis, we achieve covalently bonded $S = 2$ triangulene dimers and $S = 3$ trimers on Au(111). Starting from the triangulene dimer, we thoroughly characterize its low-energy magnetic excitations using inelastic electron tunneling spectroscopy (IETS). IETS reveals conductance steps identified as a quintet to triplet excitation, and a zero-bias peak stemming from higher-order spin-spin scattering of the 5-fold degenerate ferromagnetic ground state. The Heisenberg picture captures the relevant parameters of inter-triangulene ferromagnetic exchange, and its successful extension to the larger $S = 3$ system confirms the model's accuracy. We expect that the addition of ferromagnetically coupled building blocks to the toolbox of magnetic nanographenes opens new opportunities to design carbon materials with complex magnetic ground states.

Submitted 30 June, 2024; originally announced July 2024.

Comments: 38 pages, 5 Figures

arXiv:2407.00569 [pdf, other]

Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models

Authors: Weihong Zhong, Xiaocheng Feng, Liang Zhao, Qiming Li, Lei Huang, Yuxuan Gu, Weitao Ma, Yuan Xu, Bing Qin

Abstract: Though advanced in understanding visual information with human languages, Large Vision-Language Models (LVLMs) still suffer from multimodal hallucinations. A natural concern is that during multimodal interaction, the generated hallucinations could influence the LVLMs' subsequent generation. Thus, we raise a question: When presented with a query relevant to the previously generated hallucination, will LVLMs be misled and respond incorrectly, even though the ground visual information exists? To answer this, we propose a framework called MMHalSnowball to evaluate LVLMs' behaviors when encountering generated hallucinations, where LVLMs are required to answer specific visual questions within a curated hallucinatory conversation. Crucially, our experiment shows that the performance of open-source LVLMs drops by at least $31\%$, indicating that LVLMs are prone to accept the generated hallucinations and make false claims that they would not have supported without distractions. We term this phenomenon Multimodal Hallucination Snowballing. To mitigate this, we further propose a training-free method called Residual Visual Decoding, where we revise the output distribution of LVLMs with the one derived from the residual visual input, providing models with direct access to the visual information. Experiments show that our method can mitigate more than $24\%$ of the snowballed multimodal hallucination while maintaining capabilities.

Submitted 29 June, 2024; originally announced July 2024.

Comments: Accepted to ACL 2024 Main Conference. 21 pages, 20 figures

arXiv:2406.19659 [pdf]

Object Space is Embodied

Authors: Shan Xu, Xinran Feng, Yuannan Li, Jia Liu

Abstract: The perceived similarity between objects has often been attributed to their physical and conceptual features, such as appearance and animacy, and the theoretical framework of object space is accordingly conceived. Here, we extend this framework by proposing that object space may also be defined by embodied features, specifically action possibilities that objects afford to an agent (i.e., affordance) and their spatial relation with the agent (i.e., situatedness). To test this proposal, we quantified the embodied features with a set of action atoms. We found that embodied features explained the subjective similarity among familiar objects along with the objects' visual features. This observation was further replicated with novel objects. Our study demonstrates that embodied features, which place objects within an ecological context, are essential in constructing object space in the human visual system, emphasizing the importance of incorporating embodiment as a fundamental dimension in our understanding of the visual world.

Submitted 28 June, 2024; originally announced June 2024.

arXiv:2406.16522 [pdf, other]

Why are non-radial solar eruptions less frequent than radial ones?

Authors: Qingjun Liu, Chaowei Jiang, Xuesheng Feng, Pingbing Zuo, Yi Wang

Abstract: Coronal mass ejections from the Sun are not always initiated along a radial trajectory; such non-radial eruptions are well known to be caused by the asymmetry of the pre-eruption magnetic configuration, which is primarily determined by the uneven distribution of magnetic flux at the photosphere. Therefore, it is naturally expected that non-radial eruptions should be rather common, at least as frequent as radial ones, given the typically asymmetrical nature of photospheric magnetic flux. However, statistical studies have shown that only a small fraction of eruptions display non-radial behavior. Here we aim to shed light on this counterintuitive fact, based on a series of numerical simulations of eruption initiation in bipolar fields with different asymmetric flux distributions. As the asymmetry of the flux distribution increases, the eruption direction tends to deviate further away from the radial path, accompanied by a decrease in eruption intensity. When the asymmetry is too strong, no eruption is triggered, indicating that excessively inclined eruptions cannot occur. Therefore, our simulations suggest that asymmetry plays a negative role in producing eruptions, potentially explaining the lower frequency of non-radial solar eruptions compared to radial ones. With increasing asymmetry, the degree of non-potentiality the field can attain is reduced. Consequently, the intensity of the pre-eruption current sheet decreases, and reconnection becomes less efficient, resulting in weaker eruptions.

Submitted 24 June, 2024; originally announced June 2024.

Comments: 7 pages, 5 figures, accepted by MNRAS Letters

arXiv:2406.16005 [pdf, other]

A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications

Authors: Lei Chen, Shi Liu, Chenxi Wang, Haoran Ma, Yifan Qiao, Zhe Wang, Chenggang Wu, Youyou Lu, Xiaobing Feng, Huimin Cui, Shan Lu, Harry Xu

Abstract: With rapid advances in network hardware, far memory has gained a great deal of traction due to its ability to break the memory capacity wall. Existing far memory systems fall into one of two data paths: one that uses the kernel's paging system to transparently access far memory at the page granularity, and a second that bypasses the kernel, fetching data at the object granularity. While it is generally believed that object fetching outperforms paging due to its fine-grained access, it requires significantly more compute resources to run object-level LRU and eviction. We built Atlas, a hybrid data plane enabled by a runtime-kernel co-design that simultaneously enables accesses via these two data paths to provide high efficiency for real-world applications. Atlas uses always-on profiling to continuously measure page locality. For workloads that already have good locality, paging is used to fetch data, whereas for those without, object fetching is employed. Object fetching moves objects that are accessed close in time to contiguous local space, dynamically improving locality and making the execution increasingly amenable to paging, which is much more resource-efficient. Our evaluation shows that Atlas improves the throughput (e.g., by 1.5x and 3.2x) and reduces the tail latency (e.g., by one and two orders of magnitude) when using remote memory, compared with AIFM and Fastswap, the state-of-the-art techniques respectively in the two categories.

Submitted 23 June, 2024; originally announced June 2024.

arXiv:2406.15807 [pdf]

Low-Voltage Electron Emission by Graphene-hBN-graphene Heterostructure

Authors: Zhexuan Wang, Fang Liu, Kaiyu Cui, Xue Feng, Wei Zhang, Yidong Huang

Abstract: Scanning Electron Microscopes (SEM) with low-energy electron sources (accelerating voltage of less than 1000 V) are needed in many application scenarios. Tunneling junctions can potentially achieve low-voltage and planar-type electron sources with good emission current density. However, further lowering the extracting voltage while maintaining the emission current density remains challenging. In this paper, we report a low-voltage planar-type electron source based on graphene-hBN-graphene heterostructures (GBGH) under a very low out-of-plane extracting voltage. The external electric field strength applied to the electron sources is only $4 \times 10^4$ V/m, and an accelerating voltage as low as 20 V is realized. Steady electron emission of over 1 nA and an operating duration of several hours are observed from the GBGH with a size of 59.29 $\mu$m$^2$ in our experiments, and the maximum emission current density reaches 7 mA/cm$^2$. Great electrical contacts, extremely low thickness, and excellent layer properties of two-dimensional (2D) materials lead to easily fabricated, miniature on-chip electron sources, which would significantly contribute to the development of next-generation free electron devices.

Submitted 22 June, 2024; originally announced June 2024.

arXiv:2406.15796 [pdf, other]

Rethinking Entity-level Unlearning for Large Language Models

Authors: Weitao Ma, Xiaocheng Feng, Weihong Zhong, Lei Huang, Yangfan Ye, Bing Qin

Abstract: Large language model unlearning has gained increasing attention due to its potential to mitigate security and privacy concerns. Current research predominantly focuses on Instance-level unlearning, specifically aiming at forgetting predefined instances of sensitive content. However, a notable gap still exists in exploring the deletion of complete entity-related information, which is crucial in many real-world scenarios, such as copyright protection. To this end, we propose a novel task of Entity-level unlearning, where the entity-related knowledge within the target model is supposed to be entirely erased. Given the challenge of practically accessing all entity-related knowledge within a model, we begin by simulating entity-level unlearning scenarios through fine-tuning models to introduce pseudo entities. Following this, we develop baseline methods inspired by trending unlearning techniques and conduct a detailed comparison of their effectiveness in this task. Extensive experiments reveal that current unlearning algorithms struggle to achieve effective entity-level unlearning. Additionally, our analyses further indicate that entity-related knowledge injected through fine-tuning is more susceptible than original entities from pre-training during unlearning, highlighting the necessity for more thorough pseudo-entity injection methods to make them closer to pre-trained knowledge.

Submitted 22 June, 2024; originally announced June 2024.

Comments: Work in progress

arXiv:2406.14457 [pdf, other]

Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue

Authors: Huifang Du, Shuqin Li, Minghao Wu, Xuejing Feng, Yuan-Fang Li, Haofen Wang

Abstract: Reinforcement learning (RL) is a powerful approach to enhance task-oriented dialogue (TOD) systems. However, existing RL methods tend to mainly focus on generation tasks, such as dialogue policy learning (DPL) or response generation (RG), while neglecting dialogue state tracking (DST) for understanding. This narrow focus prevents the systems from achieving globally optimal performance because it overlooks the interdependence between understanding and generation. Additionally, RL methods face challenges with sparse and delayed rewards, which complicate training and optimization. To address these issues, we extend RL into both understanding and generation tasks by introducing step-by-step rewards throughout the token generation. The understanding reward increases as more slots are correctly filled in DST, while the generation reward grows with the accurate inclusion of user requests. Our approach provides a balanced optimization aligned with task completion. Experimental results demonstrate that our approach effectively enhances the performance of TOD systems and achieves new state-of-the-art results on three widely used datasets, including MultiWOZ2.0, MultiWOZ2.1, and In-Car. Our approach also shows superior few-shot ability in low-resource settings compared to current models.

Submitted 20 June, 2024; originally announced June 2024.

arXiv:2406.13285 [pdf, ps, other]

The extremal problem for weighted combined energy and $ρ-$Nitsche type inequality

Authors: Ting Peng, Chaochuan Wang, Xiaogao Feng

Abstract: Let $A_1$ and $A_2$ be two circular annuli and let $ρ$ be a radial metric defined in the annulus $A_2$. We study the existence and uniqueness of the extremal problem for weighted combined energy between $A_1$ and $A_2$, and obtain that the extremal mapping is a certain radial mapping. In fact, this extremal mapping generalizes the $ρ-$harmonic mapping and satisfies equation (2.7) obtained by means of variation for weighted combined energy. Meanwhile, we get a $ρ-$Nitsche type inequality. This extends the results of Kalaj (J. Differential Equations, 268(2020)) and YTF (Arch. Math., 122(2024)), where they considered the case $ρ=1$ and $ρ=\frac{1}{|h|^{2}}$, respectively. Moreover, in the course of proving the extremal problem for weighted combined energy we also investigate the extremal problem for the weighted combined distortion (see Theorem 4.1). This extends the result obtained by Kalaj (J. London Math. Soc., 93(2016)).

Submitted 19 June, 2024; originally announced June 2024.

Comments: 14 pages

MSC Class: 30C70

arXiv:2406.11788 [pdf, other]

Holographic Classical Shadow Tomography

Authors: Shuhan Zhang, Xiaozhou Feng, Matteo Ippoliti, Yi-Zhuang You

Abstract: We introduce "holographic shadows", a new class of randomized measurement schemes for classical shadow tomography that achieves the optimal scaling of sample complexity for learning geometrically local Pauli operators at any length scale, without the need for fine-tuning protocol parameters such as circuit depth or measurement rate. Our approach utilizes hierarchical quantum circuits, such as tree quantum circuits or holographic random tensor networks. Measurements within the holographic bulk correspond to measurements at different scales on the boundary (i.e. the physical system of interest), facilitating efficient quantum state estimation across observables at all scales. Considering the task of estimating string-like Pauli observables supported on contiguous intervals of $k$ sites in a 1D system, our method achieves an optimal sample complexity scaling of $\sim d^k\mathrm{poly}(k)$, with $d$ the local Hilbert space dimension. We present a holographic minimal cut framework to demonstrate the universality of this sample complexity scaling and validate it with numerical simulations, illustrating the efficacy of holographic shadows in enhancing quantum state learning capabilities.

Submitted 17 June, 2024; originally announced June 2024.

Comments: 20 pages, 9 figures

arXiv:2406.11253 [pdf, other]

Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space

Authors: Yuan Wang, Zhao Wang, Junhao Gong, Di Huang, Tong He, Wanli Ouyang, Jile Jiao, Xuetao Feng, Qi Dou, Shixiang Tang, Dan Xu

Abstract: In this paper, we introduce a novel path to $\textit{general}$ human motion generation by focusing on 2D space. Traditional methods have primarily generated human motions in 3D, which, while detailed and realistic, are often limited by the scope of available 3D motion data in terms of both the size and the diversity. To address these limitations, we exploit the extensive availability of 2D motion data. We present $\textbf{Holistic-Motion2D}$, the first comprehensive and large-scale benchmark for 2D whole-body motion generation, which includes over 1M in-the-wild motion sequences, each paired with high-quality whole-body/partial pose annotations and textual descriptions. Notably, Holistic-Motion2D is ten times larger than the previously largest 3D motion dataset. We also introduce a baseline method, featuring innovative $\textit{whole-body part-aware attention}$ and $\textit{confidence-aware modeling}$ techniques, tailored for 2D $\underline{\text T}$ext-driv$\underline{\text{EN}}$ whole-bo$\underline{\text D}$y motion gen$\underline{\text{ER}}$ation, namely $\textbf{Tender}$. Extensive experiments demonstrate the effectiveness of $\textbf{Holistic-Motion2D}$ and $\textbf{Tender}$ in generating expressive, diverse, and realistic human motions. We also highlight the utility of 2D motion for various downstream applications and its potential for lifting to 3D motion. The page link is: https://holistic-motion2d.github.io.

Submitted 17 June, 2024; originally announced June 2024.

Comments: 22 pages, 11 figures, 17 tables

arXiv:2406.11211 [pdf, other]

Quantized Andreev conductance in semiconductor nanowires

Authors: Yichun Gao, Wenyu Song, Yuhao Wang, Zuhan Geng, Zhan Cao, Zehao Yu, Shuai Yang, Jiaye Xu, Fangting Chen, Zonglin Li, Ruidong Li, Lining Yang, Zhaoyu Wang, Shan Zhang, Xiao Feng, Tiantian Wang, Yunyi Zang, Lin Li, Dong E. Liu, Runan Shang, Qi-Kun Xue, Ke He, Hao Zhang

Abstract: Clean one-dimensional electron systems can exhibit quantized conductance. The plateau conductance doubles if the transport is dominated by Andreev reflection. Here, we report quantized conductance observed in both Andreev and normal-state transports in PbTe-Pb and PbTe-In hybrid nanowires. The Andreev plateau is observed at $4e^2/h$, twice of the normal plateau value of $2e^2/h$. In comparison, Andreev conductance in the best-optimized III-V nanowires is non-quantized due to mode-mixing induced dips (a disorder effect), despite the quantization of normal-state transport. The negligible mode mixing in PbTe hybrids indicates an unprecedented low-disorder transport regime for nanowire devices, beneficial for Majorana researches.

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.10090 [pdf, other]

Over-parameterization and Adversarial Robustness in Neural Networks: An Overview and Empirical Analysis

Authors: Zhang Chen, Luca Demetrio, Srishti Gupta, Xiaoyi Feng, Zhaoqiang Xia, Antonio Emanuele Cinà, Maura Pintor, Luca Oneto, Ambra Demontis, Battista Biggio, Fabio Roli

Abstract: Thanks to their extensive capacity, over-parameterized neural networks exhibit superior predictive capabilities and generalization. However, having a large parameter space is considered one of the main suspects of the neural networks' vulnerability to adversarial example -- input samples crafted ad-hoc to induce a desired misclassification. Relevant literature has claimed contradictory remarks in support of and against the robustness of over-parameterized networks. These contradictory findings might be due to the failure of the attack employed to evaluate the networks' robustness. Previous research has demonstrated that depending on the considered model, the algorithm employed to generate adversarial examples may not function properly, leading to overestimating the model's robustness. In this work, we empirically study the robustness of over-parameterized networks against adversarial examples. However, unlike the previous works, we also evaluate the considered attack's reliability to support the results' veracity. Our results show that over-parameterized networks are robust against adversarial attacks as opposed to their under-parameterized counterparts.

Submitted 14 June, 2024; originally announced June 2024.

MSC Class: 68T10; ACM Class: I.5

arXiv:2406.08698 [pdf, other]

Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes of astrophysical $γ$-ray background while large amount of dark matter. By analyzing more than 700 days observational data at LHAASO, no significant dark matter signal from 1 TeV to 1 EeV is detected. Accordingly we derive the most stringent constraints on the ultra-heavy dark matter annihilation cross-section up to EeV. The constraints on the lifetime of dark matter in decay mode are also derived.

Submitted 12 June, 2024; originally announced June 2024.

Comments: 17 pages, 12 figures, accepted by PRL

arXiv:2406.08100 [pdf, other]

Multimodal Table Understanding

Authors: Mingyu Zheng, Xinwei Feng, Qingyi Si, Qiaoqiao She, Zheng Lin, Wenbin Jiang, Weiping Wang

Abstract: Although great progress has been made by previous table understanding methods including recent approaches based on large language models (LLMs), they rely heavily on the premise that given tables must be converted into a certain text sequence (such as Markdown or HTML) to serve as model input. However, it is difficult to access such high-quality textual table representations in some real-world scenarios, and table images are much more accessible. Therefore, how to directly understand tables using intuitive visual information is a crucial and urgent challenge for developing more practical applications. In this paper, we propose a new problem, multimodal table understanding, where the model needs to generate correct responses to various table-related requests based on the given table image. To facilitate both the model training and evaluation, we construct a large-scale dataset named MMTab, which covers a wide spectrum of table images, instructions and tasks. On this basis, we develop Table-LLaVA, a generalist tabular multimodal large language model (MLLM), which significantly outperforms recent open-source MLLM baselines on 23 benchmarks under held-in and held-out settings. The code and data is available at https://github.com/SpursGoZmy/Table-LLaVA

Submitted 12 June, 2024; originally announced June 2024.

Comments: 23 pages, 16 figures, ACL 2024 main conference, camera-ready version

arXiv:2406.08002 [pdf, other]

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning

Authors: Yizhe Huang, Anji Liu, Fanqi Kong, Yaodong Yang, Song-Chun Zhu, Xue Feng

Abstract: Despite the recent successes of multi-agent reinforcement learning (MARL) algorithms, efficiently adapting to co-players in mixed-motive environments remains a significant challenge. One feasible approach is to hierarchically model co-players' behavior based on inferring their characteristics. However, these methods often encounter difficulties in efficient reasoning and utilization of inferred information. To address these issues, we propose Hierarchical Opponent modeling and Planning (HOP), a novel multi-agent decision-making algorithm that enables few-shot adaptation to unseen policies in mixed-motive environments. HOP is hierarchically composed of two modules: an opponent modeling module that infers others' goals and learns corresponding goal-conditioned policies, and a planning module that employs Monte Carlo Tree Search (MCTS) to identify the best response. Our approach improves efficiency by updating beliefs about others' goals both across and within episodes and by using information from the opponent modeling module to guide planning. Experimental results demonstrate that in mixed-motive environments, HOP exhibits superior few-shot adaptation capabilities when interacting with various unseen agents, and excels in self-play scenarios. Furthermore, the emergence of social intelligence during our experiments underscores the potential of our approach in complex multi-agent environments.

Submitted 12 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

Comments: Accepted at ICML 2024

arXiv:2406.07300 [pdf, other]

Shadows, rings and optical appearance of a magnetically charged regular black hole illuminated by various accretion disks

Authors: Soroush Zare, Luis M. Nieto, Xing-Hui Feng, Shi-Hai Dong, Hassan Hassanabadi

Abstract: The Event Horizon Telescope (EHT) imaging of the supermassive black holes at the centers of Messier 87 galaxy and the Milky Way galaxy marks a significant step in observing the photon rings and central brightness depression that define the optical appearance of black holes with an accretion disk scenario. Inspired by this, we take into account a static and spherically symmetric magnetically charged regular black hole (MCRBH) metric characterized by its mass and an additional parameter q, which arises from the coupling of Einstein gravity and nonlinear electrodynamics (NLED) in the weak field approximation. This parameterized model offers a robust foundation for testing the coupling of Einstein gravity and NLED in the weak-field approximation, using the EHT observational results. In this study, we investigate the geodesic motion of particles around the solution, followed by a discussion of its fundamental geometrical characteristics such as scalar invariants. Using null geodesics, we examine how the model parameter influences the behavior of the photon sphere radius and the associated shadow silhouette. We seek constraints on q by applying the EHT results for supermassive black holes M87* and Sgr A*. Furthermore, it is observed that the geodesics of time-like particles are susceptible to variations in q, which can have an impact on the traits of the innermost stable circular orbit and the marginally bounded orbit.
Our primary objective is to probe how the free parameter q affects various aspects of the accretion disk surrounding the MCRBH using the thin-disk approximation. Next, we discuss the physical characteristics of the thin accretion disk as well as the observed shadows and rings of the MCRBH, along with its luminosity, across various accretion models. Ultimately, variations in accretion models and the parameter q yield distinct shadow images and optical appearances of the MCRBH.

Submitted 11 June, 2024; originally announced June 2024.

Comments: 31 pages, 2 tables, 16 figures

arXiv:2406.05862 [pdf, other]

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models

Authors: Ziqiang Liu, Feiteng Fang, Xi Feng, Xinrun Du, Chenhao Zhang, Zekun Wang, Yuelin Bai, Qixuan Zhao, Liyang Fan, Chengguang Gan, Hongquan Lin, Jiaming Li, Yuansheng Ni, Haihong Wu, Yaswanth Narsupalli, Zhigang Zheng, Chengming Li, Xiping Hu, Ruifeng Xu, Xiaojun Chen, Min Yang, Jiaheng Liu, Ruibo Liu, Wenhao Huang, Ge Zhang, et al. (1 additional author not shown)

Abstract: The rapid advancements in the development of multimodal large language models (MLLMs) have consistently led to new breakthroughs on various benchmarks. In response, numerous challenging and comprehensive benchmarks have been proposed to more accurately assess the capabilities of MLLMs. However, there is a dearth of exploration of the higher-order perceptual capabilities of MLLMs. To fill this gap, we propose the Image Implication understanding Benchmark, II-Bench, which aims to evaluate the model's higher-order perception of images. Through extensive experiments on II-Bench across multiple MLLMs, we have made significant findings. Initially, a substantial gap is observed between the performance of MLLMs and humans on II-Bench. The pinnacle accuracy of MLLMs attains 74.8%, whereas human accuracy averages 90%, peaking at an impressive 98%. Subsequently, MLLMs perform worse on abstract and complex images, suggesting limitations in their ability to understand high-level semantics and capture image details. Finally, it is observed that most models exhibit enhanced accuracy when image sentiment polarity hints are incorporated into the prompts. This observation underscores a notable deficiency in their inherent understanding of image sentiment. We believe that II-Bench will inspire the community to develop the next generation of MLLMs, advancing the journey towards expert artificial general intelligence (AGI). II-Bench is publicly available at https://huggingface.co/datasets/m-a-p/II-Bench.

Submitted 11 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

Comments: 100 pages, 82 figures, add citations

arXiv:2406.05676 [pdf]

Chern insulator phase realized in dual-gate-tuned MnBi2Te4 thin films grown by molecular beam epitaxy

Authors: Yunhe Bai, Yuanzhao Li, Ruixuan Liu, Jianli Luan, Yang Chen, Wenyu Song, Peng-Fei Ji, Cui Ding, Zongwei Gao, Qinghua Zhang, Fanqi Meng, Bingbing Tong, Lin Li, Tianchen Zhu, Lin Gu, Lili Wang, Jinsong Zhang, Yayu Wang, Qi-Kun Xue, Ke He, Yang Feng, Xiao Feng

Abstract: The intrinsic magnetic order, large topological-magnetic gap and rich topological phases make MnBi2Te4 a wonderful platform to study exotic topological quantum states such as axion insulator and Chern insulator. To realize and manipulate these topological phases in a MnBi2Te4 thin film, precise manipulation of the electric field across the film is essential, which requires a dual-gate structure. In this work, we achieve dual-gate tuning of MnBi2Te4 thin films grown with molecular beam epitaxy on SrTiO3(111) substrates by applying the substrate and an AlOx layer as the gate dielectrics of bottom and top gates, respectively. Under magnetic field of 9T and temperature of 20 mK, the Hall and longitudinal resistivities of the films show inversed gate-voltage dependence, for both top- and bottom-gates, signifying the existence of the dissipationless edge state contributed by Chern insulator phase in the ferromagnetic configuration. The maximum of the Hall resistivity only reaches 0.8 h/e2, even with dual-gate tuning, probably due to the high density of bulk carriers introduced by secondary phases. In the antiferromagnetic state under zero magnetic field, the films show normal insulator behavior. The dual-gated MnBi2Te4 thin films lay the foundation for developing devices based on electrically tunable topological quantum states.

Submitted 9 June, 2024; originally announced June 2024.

Comments: 24 pages, 4 figures

arXiv:2406.03511 [pdf, other]

MagiNet: Mask-Aware Graph Imputation Network for Incomplete Traffic Data

Authors: Jianping Zhou, Bin Lu, Zhanyu Liu, Siyu Pan, Xuejun Feng, Hua Wei, Guanjie Zheng, Xinbing Wang, Chenghu Zhou

Abstract: Due to detector malfunctions and communication failures, missing data is ubiquitous during the collection of traffic data. Therefore, it is of vital importance to impute the missing values to facilitate data analysis and decision-making for Intelligent Transportation System (ITS). However, existing imputation methods generally perform zero pre-filling techniques to initialize missing values, introducing inevitable noises. Moreover, we observe prevalent over-smoothing interpolations, falling short in revealing the intrinsic spatio-temporal correlations of incomplete traffic data. To this end, we propose Mask-Aware Graph imputation Network: MagiNet. Our method designs an adaptive mask spatio-temporal encoder to learn the latent representations of incomplete data, eliminating the reliance on pre-filling missing values. Furthermore, we devise a spatio-temporal decoder that stacks multiple blocks to capture the inherent spatial and temporal dependencies within incomplete traffic data, alleviating over-smoothing imputation. Extensive experiments demonstrate that our method outperforms state-of-the-art imputation methods on five real-world traffic datasets, yielding an average improvement of 4.31% in RMSE and 3.72% in MAPE.

Submitted 5 June, 2024; originally announced June 2024.

Comments: 19 pages, 7 figures

arXiv:2406.02101 [pdf, other]

Universal gravitational self-force for a point mass orbiting around a compact star

Authors: Xuefeng Feng, Huan Yang

Abstract: In this work, we study the gravitational back-reaction (i.e., the "self-force") of a point mass moving around a non-rotating, compact star on a circular orbit. We find that the additional self-force, comparing with the case with a point mass orbiting around a Schwarzschild black hole, can be well characterized by an universal frequency-dependent function multiplied by the (dynamical) tidal deformability of the compact star. This finding provides the foundation for building the waveform model for an extreme mass-ratio inspiral system around a star-like black hole mimicker, which is relevant for testing General Relativity and exotic compact objects with space-borne gravitational wave detectors.

Submitted 4 June, 2024; originally announced June 2024.

Comments: 18 pages, 13 figures, 1 table, Comments are welcome

arXiv:2406.01549 [pdf, other]

An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation

Authors: Kun Zhu, Xiaocheng Feng, Xiyuan Du, Yuxuan Gu, Weijiang Yu, Haotian Wang, Qianglong Chen, Zheng Chu, Jingchang Chen, Bing Qin

Abstract: Retrieval-augmented generation integrates the capabilities of large language models with relevant information retrieved from an extensive corpus, yet encounters challenges when confronted with real-world noisy data. One recent solution is to train a filter module to find relevant content but only achieve suboptimal noise compression. In this paper, we propose to introduce the information bottleneck theory into retrieval-augmented generation. Our approach involves the filtration of noise by simultaneously maximizing the mutual information between compression and ground output, while minimizing the mutual information between compression and retrieved passage. In addition, we derive the formula of information bottleneck to facilitate its application in novel comprehensive evaluations, the selection of supervised fine-tuning data, and the construction of reinforcement learning rewards. Experimental results demonstrate that our approach achieves significant improvements across various question answering datasets, not only in terms of the correctness of answer generation but also in the conciseness with $2.5\%$ compression rate.

Submitted 4 July, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: Accepted to ACL 2024

arXiv:2405.18966 [pdf, other]

svds-C: A Multi-Thread C Code for Computing Truncated Singular Value Decomposition

Authors: Xu Feng, Wenjian Yu, Yuyang Xie

Abstract: This article presents svds-C, an open-source and high-performance C program for accurately and robustly computing truncated SVD, e.g. computing several largest singular values and corresponding singular vectors. We have re-implemented the algorithm of svds in Matlab in C based on MKL or OpenBLAS and multi-thread computing to obtain the parallel program named svds-C. svds-C running on shared-memory computer consumes less time and memory than svds thanks to careful implementation of multi-thread parallelization and memory management. Numerical experiments on different test cases which are synthetically generated or directly from real world datasets show that, svds-C runs remarkably faster than svds with averagely 4.7X and at most 12X speedup for 16-thread parallel computing on a computer with Intel CPU, while preserving same accuracy and consuming about half memory space. Experimental results also demonstrate that svds-C has similar advantages over svds on the computer with AMD CPU, and outperforms other state-of-the-art algorithms for truncated SVD on computing time and robustness.

Submitted 29 May, 2024; originally announced May 2024.

Comments: 20 pages, accepted by SoftwareX

arXiv:2405.17987 [pdf, other]

BlueSWAT: A Lightweight State-Aware Security Framework for Bluetooth Low Energy

Authors: Xijia Che, Yi He, Xuewei Feng, Kun Sun, Ke Xu, Qi Li

Abstract: Bluetooth Low Energy (BLE) is a short-range wireless communication technology for resource-constrained IoT devices. Unfortunately, BLE is vulnerable to session-based attacks, where previous packets construct exploitable conditions for subsequent packets to compromise connections. Defending against session-based attacks is challenging because each step in the attack sequence is legitimate when inspected individually. In this paper, we present BlueSWAT, a lightweight state-aware security framework for protecting BLE devices. To perform inspection on the session level rather than individual packets, BlueSWAT leverages a finite state machine (FSM) to monitor sequential actions of connections at runtime. Patterns of session-based attacks are modeled as malicious transition paths in the FSM. To overcome the heterogeneous IoT environment, we develop a lightweight eBPF framework to facilitate universal patch distribution across different BLE architectures and stacks, without requiring device reboot. We implement BlueSWAT on 5 real-world devices with different chips and stacks to demonstrate its cross-device adaptability. On our dataset with 101 real-world BLE vulnerabilities, BlueSWAT can mitigate 76.1% of session-based attacks, outperforming other defense frameworks. In our end-to-end application evaluation, BlueSWAT patches introduce an average of 0.073% memory overhead and negligible latency.

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2405.17899 [pdf]

doi 10.1002/anie.202300186

Near IR bandgap semiconductive 2D conjugated metal-organic framework with rhombic lattice and high mobility

Authors: Lukas Sporrer, Guojun Zhou, Mingchao Wang, Vasileios Balos, Sergio Revuelta, Kamil Jastrzembski, Markus Loeffler, Petko Petkov, Thomas Heine, Angieszka Kuc, Enrique Canovas, Zhehao Huang, Xinliang Feng, Renhao Dong

Abstract: Two-dimensional conjugated metal-organic frameworks (2D c-MOFs) are emerging as a unique class of 2D electronic materials. However, intrinsically semiconducting 2D c-MOFs with gaps in the Vis-NIR and high charge carrier mobility have been rare. Most of the reported semiconducting 2D c-MOFs are metallic (i.e. gapless), which limits their use in applications where larger band gaps are needed for logic devices. Herein, we design a new D2h-geometric ligand, 2,3,6,7,11,12,15,16-octahydroxyphenanthro(9,10b)triphenylene (OHPTP), and synthesize the first example of a 2D c-MOF single crystal (OHPTP-Cu) with a rhombohedral pore geometry after coordination with copper. The continuous rotation electron diffraction (cRED) analysis unveils the orthorhombic crystal structure at the atomic level with a unique AB layer stacking. The resultant Cu2(OHPTP) is a p-type semiconductor with an indirect band gap of about 0.50 eV and exhibits high electrical conductivity of 0.10 S cm-1 and high charge carrier mobility of 10.0 cm2V-1s-1. Density-functional theory calculations underline the predominant role of the out-of-plane charge transport in this semiquinone-based 2D c-MOFs.

Submitted 28 May, 2024; originally announced May 2024.

Comments: 11 pages, 5 figures

Journal ref: Angew. Chem. Int. Ed. 2023, 62, e202300186

arXiv:2405.16042 [pdf, other]

Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention

Authors: Andrew Li, Xianle Feng, Siddhant Narang, Austin Peng, Tianle Cai, Raj Sanjay Shah, Sashank Varma

Abstract: When reading temporarily ambiguous garden-path sentences, misinterpretations sometimes linger past the point of disambiguation. This phenomenon has traditionally been studied in psycholinguistic experiments using online measures such as reading times and offline measures such as comprehension questions. Here, we investigate the processing of garden-path sentences and the fate of lingering misinterpretations using four large language models (LLMs): GPT-2, LLaMA-2, Flan-T5, and RoBERTa. The overall goal is to evaluate whether humans and LLMs are aligned in their processing of garden-path sentences and in the lingering misinterpretations past the point of disambiguation, especially when extra-syntactic information (e.g., a comma delimiting a clause boundary) is present to guide processing. We address this goal using 24 garden-path sentences that have optional transitive and reflexive verbs leading to temporary ambiguities. For each sentence, there are a pair of comprehension questions corresponding to the misinterpretation and the correct interpretation. In three experiments, we (1) measure the dynamic semantic interpretations of LLMs using the question-answering task; (2) track whether these models shift their implicit parse tree at the point of disambiguation (or by the end of the sentence); and (3) visualize the model components that attend to disambiguating information when processing the question probes.
These experiments show promising alignment between humans and LLMs in the processing of garden-path sentences, especially when extra-syntactic information is available to guide processing.

Submitted 24 May, 2024; originally announced May 2024.

Comments: Accepted by CogSci-24

arXiv:2405.15677 [pdf, other]

SMART: Scalable Multi-agent Real-time Simulation via Next-token Prediction

Authors: Wei Wu, Xiaoxin Feng, Ziyan Gao, Yuheng Kan

Abstract: Data-driven autonomous driving motion generation tasks are frequently impacted by the limitations of dataset size and the domain gap between datasets, which precludes their extensive application in real-world scenarios. To address this issue, we introduce SMART, a novel autonomous driving motion generation paradigm that models vectorized map and agent trajectory data into discrete sequence tokens. These tokens are then processed through a decoder-only transformer architecture to train for the next token prediction task across spatial-temporal series. This GPT-style method allows the model to learn the motion distribution in real driving scenarios. SMART achieves state-of-the-art performance across most of the metrics on the generative Sim Agents challenge, ranking 1st on the leaderboards of Waymo Open Motion Dataset (WOMD), demonstrating remarkable inference speed. Moreover, SMART represents the generative model in the autonomous driving motion domain, exhibiting zero-shot generalization capabilities: Using only the NuPlan dataset for training and WOMD for validation, SMART achieved a competitive score of 0.71 on the Sim Agents challenge. Lastly, we have collected over 1 billion motion tokens from multiple datasets, validating the model's scalability. These results suggest that SMART has initially emulated two important properties: scalability and zero-shot generalization, and preliminarily meets the needs of large-scale real-time simulation applications.
We have released all the code to promote the exploration of models for motion generation in the autonomous driving field.

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.15056 [pdf, other]

ElastoGen: 4D Generative Elastodynamics

Authors: Yutao Feng, Yintong Shang, Xiang Feng, Lei Lan, Shandian Zhe, Tianjia Shao, Hongzhi Wu, Kun Zhou, Hao Su, Chenfanfu Jiang, Yin Yang

Abstract: We present ElastoGen, a knowledge-driven model that generates physically accurate and coherent 4D elastodynamics. Instead of relying on petabyte-scale data-driven learning, ElastoGen leverages the principles of physics-in-the-loop and learns from established physical knowledge, such as partial differential equations and their numerical solutions. The core idea of ElastoGen is converting the global differential operator, corresponding to the nonlinear elastodynamic equations, into iterative local convolution-like operations, which naturally fit modern neural networks. Each network module is specifically designed to support this goal rather than functioning as a black box. As a result, ElastoGen is exceptionally lightweight in terms of both training requirements and network scale. Additionally, due to its alignment with physical procedures, ElastoGen efficiently generates accurate dynamics for a wide range of hyperelastic materials and can be easily integrated with upstream and downstream deep modules to enable end-to-end 4D generation.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.13894 [pdf, other]

Charge and Spin Sharpening Transitions on Dynamical Quantum Trees

Authors: Xiaozhou Feng, Nadezhda Fishchenko, Sarang Gopalakrishnan, Matteo Ippoliti

Abstract: The dynamics of monitored systems can exhibit a measurement-induced phase transition (MIPT) between entangling and disentangling phases, tuned by the measurement rate. When the dynamics obeys a continuous symmetry, the entangling phase further splits into a fuzzy phase and a sharp phase based on the scaling of fluctuations of the symmetry charge. While the sharpening transition for Abelian symmetries is well understood analytically, no such understanding exists for the non-Abelian case. In this work, building on a recent analytical solution of the MIPT on tree-like circuit architectures (where qubits are repeatedly added or removed from the system in a recursive pattern), we study entanglement and sharpening transitions in monitored dynamical quantum trees obeying U(1) and SU(2) symmetries. The recursive structure of tree tensor networks enables powerful analytical and numerical methods to determine the phase diagrams in both cases. In the U(1) case, we analytically derive a Fisher-KPP-like differential equation that allows us to locate the critical point and identify its properties. We find that the entanglement/purification and sharpening transitions generically occur at distinct measurement rates. In the SU(2) case, we find that the fuzzy phase is generic, and a sharp phase is possible only in the limit of maximal measurement rate. In this limit, we analytically solve the boundaries separating the fuzzy and sharp phases, and find them to be in agreement with exact numerical simulations.

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.12819 [pdf, other]

Large Language Models Meet NLP: A Survey

Authors: Libo Qin, Qiguang Chen, Xiachong Feng, Yang Wu, Yongheng Zhang, Yinghui Li, Min Li, Wanxiang Che, Philip S. Yu

Abstract: While large language models (LLMs) like ChatGPT have shown impressive capabilities in Natural Language Processing (NLP) tasks, a systematic investigation of their potential in this field remains largely unexplored. This study aims to address this gap by exploring the following questions: (1) How are LLMs currently applied to NLP tasks in the literature? (2) Have traditional NLP tasks already been solved with LLMs? (3) What is the future of the LLMs for NLP? To answer these questions, we take the first step to provide a comprehensive overview of LLMs in NLP. Specifically, we first introduce a unified taxonomy including (1) parameter-frozen application and (2) parameter-tuning application to offer a unified perspective for understanding the current progress of LLMs in NLP. Furthermore, we summarize the new frontiers and the associated challenges, aiming to inspire further groundbreaking advancements. We hope this work offers valuable insights into the potential and limitations of LLMs in NLP, while also serving as a practical guide for building effective LLMs in NLP.

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2405.12592 [pdf]

Spin-polarized p-wave superconductivity in the kagome material RbV$_3$Sb$_5$

Authors: Shuo Wang, Xilin Feng, Jing-Zhi Fang, Jia-Peng Peng, Zi-Ting Sun, Jia-Jie Yang, Jingchao Liu, Jia-Ji Zhao, Jian-Kun Wang, Xin-Jie Liu, Ze-Nan Wu, Shengbiao Sun, Ning Kang, Xiao-Song Wu, Zhensheng Zhang, Xuewen Fu, Kam Tuen Law, Ben-Chuan Lin, Dapeng Yu

Abstract: The study of kagome materials has attracted much attention in the past few years due to the presence of many electron-electron interaction-driven phases in a single material. In this work, we report the discovery of intrinsic spin-polarized p-wave superconductivity in the thin-flake kagome material RbV$_3$Sb$_5$. Firstly, when an in-plane magnetic field is swept in opposite directions, we observe a unique form of hysteresis in magnetoresistance which is different from the hysteresis induced by extrinsic mechanisms such as flux-trapping or superheating and supercooling effects. The unconventional hysteresis indicates the emergence of an intrinsic time-reversal symmetry-breaking superconducting phase. Strikingly, at a fixed magnetic field, the finite-resistance state can be quenched to the zero-resistance state by applying a large current. Secondly, at temperatures around 400 mK, the re-entrance of superconductivity occurs during an in-plane field-sweeping process with a fixed sweeping direction. This kind of re-entrance is asymmetric about the zero field axis and observed in all field directions for a fixed current direction, which is different from the re-entrance observed in conventional superconductors. Moreover, the angle-dependent in-plane critical field measurements reveal a two-fold symmetry that deviates from the original, centrosymmetric D$_{6h}$ point group symmetry of the crystal. These findings put very strong constraints on the possible superconducting pairing symmetry of RbV$_3$Sb$_5$. We point out that the pairing symmetry, which is consistent with the crystal symmetry and all the observed novel properties, is a time-reversal symmetry-breaking, p-wave pairing with net spin polarization. Importantly, this p-wave pairing gives rise to a nodal topological superconducting state with Majorana flat bands on the sample edges.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 21 pages, 4 figures

arXiv:2405.12434 [pdf, other]

Resolving Word Vagueness with Scenario-guided Adapter for Natural Language Inference

Authors: Yonghao Liu, Mengyu Li, Di Liang, Ximing Li, Fausto Giunchiglia, Lan Huang, Xiaoyue Feng, Renchu Guan

Abstract: Natural Language Inference (NLI) is a crucial task in natural language processing that involves determining the relationship between two sentences, typically referred to as the premise and the hypothesis. However, traditional NLI models solely rely on the semantic information inherent in independent sentences and lack relevant situational visual information, which can hinder a complete understanding of the intended meaning of the sentences due to the ambiguity and vagueness of language. To address this challenge, we propose an innovative ScenaFuse adapter that simultaneously integrates large-scale pre-trained linguistic knowledge and relevant visual information for NLI tasks. Specifically, we first design an image-sentence interaction module to incorporate visuals into the attention mechanism of the pre-trained model, allowing the two modalities to interact comprehensively. Furthermore, we introduce an image-sentence fusion module that can adaptively integrate visual information from images and semantic information from sentences. By incorporating relevant visual information and leveraging linguistic knowledge, our approach bridges the gap between language and vision, leading to improved understanding and inference capabilities in NLI tasks. Extensive benchmark experiments demonstrate that our proposed ScenaFuse, a scenario-guided approach, consistently boosts NLI performance.

Submitted 20 May, 2024; originally announced May 2024.

Comments: IJCAI24

arXiv:2405.12139 [pdf, other]

DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM

Authors: Xuchen Li, Xiaokun Feng, Shiyu Hu, Meiqi Wu, Dailing Zhang, Jing Zhang, Kaiqi Huang

Abstract: Visual Language Tracking (VLT) enhances single object tracking (SOT) by integrating natural language descriptions from a video, for the precise tracking of a specified object. By leveraging high-level semantic information, VLT guides object tracking, alleviating the constraints associated with relying on a visual modality. Nevertheless, most VLT benchmarks are annotated in a single granularity and lack a coherent semantic framework to provide scientific guidance. Moreover, coordinating human annotators for high-quality annotations is laborious and time-consuming. To address these challenges, we introduce DTLLM-VLT, which automatically generates extensive and multi-granularity text to enhance environmental diversity. (1) DTLLM-VLT generates scientific and multi-granularity text descriptions using a cohesive prompt framework. Its succinct and highly adaptable design allows seamless integration into various visual tracking benchmarks. (2) We select three prominent benchmarks to deploy our approach: short-term tracking, long-term tracking, and global instance tracking. We offer four granularity combinations for these benchmarks, considering the extent and density of semantic information, thereby showcasing the practicality and versatility of DTLLM-VLT. (3) We conduct comparative experiments on VLT benchmarks with different text granularities, evaluating and analyzing the impact of diverse text on tracking performance. In conclusion, this work leverages LLMs to provide multi-granularity semantic information for the VLT task from efficient and diverse perspectives, enabling fine-grained evaluation of multi-modal trackers. In the future, we believe this work can be extended to more datasets to support vision dataset understanding.

Submitted 20 May, 2024; originally announced May 2024.

Comments: Accepted by CVPR Workshop 2024, Oral Presentation

arXiv:2405.12000 [pdf, other]

Lifetime Characterization of Extreme Wave Localizations in Crossing Seas

Authors: Yuchen He, Jinghua Wang, Jingsong He, Ye Li, Xingya Feng, Amin Chabchoub

Abstract: Rogue waves (RWs) can form on the ocean surface due to quasi-four wave resonant interaction or superposition principle. Both mechanisms have been acutely studied. The first of the two is known as the nonlinear focusing mechanism and leads to an increased probability of rogue waves when wave conditions are favourable, i.e., when unidirectionality and high narrowband energy of the wave field are satisfied. This work delves into the dynamics of extreme wave focusing in crossing seas, revealing a distinct type of nonlinear RWs, characterized by a decisive longevity compared to those generated by the dispersive focusing mechanism. In fact, through fully nonlinear hydrodynamic numerical simulations, we show that the interactions between two crossing unidirectional wave beams can trigger fully localized and robust development of RWs. These coherent structures, characterized by a typical spectral broadening then spreading in the form of dual bimodality and recurrent wave group focusing, not only defy the weakening expectation of quasi-four wave resonant interaction in directionally spread wave fields, but also differ from classical focusing mechanisms already mentioned. This has been determined following a rigorous lifespan-based statistical analysis of extreme wave events in our fully nonlinear simulations. Utilizing the coupled nonlinear Schrödinger framework, we also show that such intrinsic focusing dynamics can also be captured by weakly nonlinear wave evolution equations. This opens new research avenues for further explorations of these complex and intriguing wave phenomena in hydrodynamics as well as other nonlinear and dispersive multi-wave systems.

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.11826 [pdf, other]

Data quality control system and long-term performance monitor of the LHAASO-KM2A

Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To ensure the reliability of the LHAASO-KM2A data, a three-level quality control system has been established. It is used to monitor the status of detector units, stability of reconstructed parameters and the performance of the array based on observations of the Crab Nebula and Moon shadow. This paper will introduce the control system and its application on the LHAASO-KM2A data collected from August 2021 to July 2023. During this period, the pointing and angular resolution of the array were stable. From the observations of the Moon shadow and Crab Nebula, the results achieved using the two methods are consistent with each other. According to the observation of the Crab Nebula at energies from 25 TeV to 100 TeV, the time averaged pointing errors are estimated to be $-0.003^{\circ} \pm 0.005^{\circ}$ and $0.001^{\circ} \pm 0.006^{\circ}$ in the R.A. and Dec directions, respectively.

Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: 15 pages, 9 figures

arXiv:2405.11524 [pdf, other]

Simple-Sampling and Hard-Mixup with Prototypes to Rebalance Contrastive Learning for Text Classification

Authors: Mengyu Li, Yonghao Liu, Fausto Giunchiglia, Xiaoyue Feng, Renchu Guan

Abstract: Text classification is a crucial and fundamental task in natural language processing. Compared with the previous learning paradigm of pre-training and fine-tuning by cross entropy loss, the recently proposed supervised contrastive learning approach has received tremendous attention due to its powerful feature learning capability and robustness. Although several studies have incorporated this technique for text classification, some limitations remain. First, many text datasets are imbalanced, and the learning mechanism of supervised contrastive learning is sensitive to data imbalance, which may harm the model performance. Moreover, these models leverage a separate classification branch with cross entropy and a supervised contrastive learning branch without explicit mutual guidance. To this end, we propose a novel model named SharpReCL for imbalanced text classification tasks. First, we obtain the prototype vector of each class in the balanced classification branch to act as a representation of each class. Then, by further explicitly leveraging the prototype vectors, we construct a proper and sufficient target sample set with the same size for each class to perform the supervised contrastive learning procedure. The empirical results show the effectiveness of our model, which even outperforms popular large language models across several datasets.

Submitted 19 May, 2024; originally announced May 2024.

Comments: 12 pages, 9 figures

arXiv:2405.09844 [pdf]

Electrically switchable $2^N$-channel wave-front control with N cascaded polarization-dependent metasurfaces

Authors: Zhiyao Ma, Tian Tian, Yuxuan Liao, Xue Feng, Yongzhuo Li, Kaiyu Cui, Fang Liu, Hao Sun, Wei Zhang, Yidong Huang

Abstract: Metasurfaces with tunable functionalities are greatly desired for modern optical systems and various applications. To increase the operating channels of polarization-multiplexed metasurfaces, we proposed a structure of N cascaded dual-channel metasurfaces to achieve $2^N$ electrically switchable functional channels without intrinsic noise or cross-talk. As a proof of principle, we have implemented a 3-layer setup to achieve 8 channels. We have successfully demonstrated two typical functionalities: vortex beam generation with a switchable topological charge of $l=-3$ to $+4$ or $l=-1$ to $-8$, and beam steering with the deflecting direction switchable in an 8×1 line or a 4×2 grid. We believe that our proposal would provide a practical way to significantly increase the scalability and extend the functionality of polarization-multiplexed metasurfaces, which have potential for applications in LiDAR, glasses-free 3D display, OAM (de)multiplexing, and varifocal meta-lenses.

Submitted 27 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.
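
The channel count quoted in the abstract above follows from simple counting: if each of the N cascaded layers can be switched between two states, the stack has $2^N$ distinct configurations. A minimal Python sketch of that counting, assuming each layer state is labelled 0 or 1 (the labels and the function name `channel_states` are illustrative and do not come from the paper):

```python
from itertools import product


def channel_states(n_layers):
    """Enumerate every configuration of n_layers two-state layers.

    Each configuration is a tuple of 0/1 labels, one per layer.
    The labels are hypothetical; they only stand for "one of two states".
    """
    return list(product((0, 1), repeat=n_layers))


# A 3-layer stack, matching the layer count mentioned in the abstract.
states = channel_states(3)
print(len(states))  # number of distinct configurations, 2**3
for state in states:
    print(state)
```

This only illustrates the combinatorics; it says nothing about how the optical functions of the individual layers combine.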

arXiv:2405.08326 [pdf, other]

doi 10.1093/mnras/stae1092

Periodic Activities of Fast Radio Burst Repeaters from Precessing Magnetars with Evolving Obliquity

Authors: Xin-Ming Feng, Yuan-Pei Yang, Qiao-Chu Li

Abstract: Fast radio bursts (FRBs) are cosmological radio transients with millisecond durations and extremely high brightness temperatures. One FRB repeater, FRB 180916.J0158+65 (FRB 180916B), was confirmed to show 16.35-day periodic activity with a 5-day activity window. Another FRB repeater, FRB 121102, and two soft gamma-ray repeaters (SGRs), SGR 1935+2154 and SGR 1806-20, also show possible periodic activities. These periodicities might originate from the precession process of young magnetars due to the anisotropic pressure from the inner magnetic fields as proposed in the literature. In this work, we analyze a self-consistent model for the rotation evolution of magnetars and obtain the evolutions of magnetar precession and obliquity. We find that if the FRB repeaters and the SGRs with (possible) periodic activities originate from the magnetar precession, their ages would be constrained to be hundreds to tens of thousands of years, which is consistent with the typical ages of magnetars. Assuming that the FRB emission is beaming in the magnetosphere as proposed in the literature, we calculate the evolution of the observable probability and the duty cycle of the active window period. We find that for a given magnetar the observable probability increases with the magnetar age in the early stage and decreases with the magnetar age in the later stage; meanwhile, there are one or two active windows in one precession period if the emission is not perfectly axisymmetric with respect to the deformation axis of a magnetar, which could be tested by future observations of repeating FRB sources.

Submitted 14 May, 2024; originally announced May 2024.

Comments: 10 pages, 9 figures. Accepted for publication in MNRAS. Comments welcome!

arXiv:2405.07691 [pdf, other]

Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

Abstract: The first source catalog of the Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source has been carried out. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) is compatible with NGC 4278 within $\sim0.03$ degree. Variation analysis shows an indication of the variability at a few months level in the TeV band, which is consistent with low frequency observations. Based on these observations, we report the detection of TeV $γ$-ray emissions from this low-luminosity AGN NGC 4278. The observation by LHAASO-WCDA during the active period has a significance level of 8.8\,$σ$ with best-fit photon spectral index $\varGamma=2.56\pm0.14$ and a flux $f_{1-10\,\rm{TeV}}=(7.0\pm1.1_{\rm{sta}}\pm0.35_{\rm{syst}})\times10^{-13}\,\rm{photons\,cm^{-2}\,s^{-1}}$, or approximately $5\%$ of the Crab Nebula. The discovery of VHE emission from NGC 4278 indicates that the compact, weak radio jet can efficiently accelerate particles and emit TeV photons.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 11 pages, 5 figures

arXiv:2405.04090 [pdf, ps, other]

Protecting quantum gates from arbitrary single- and two-qubit errors

Authors: Chunfeng Wu, Gangcheng Wang, Xun-Li Feng

Abstract: We explore the protection of quantum gates from arbitrary single- and two-qubit noises with properly designed dynamical decoupling pulses. The proposed dynamical decoupling method is a concatenation of a sequence of pulses formed by $σ_x$, $σ_xσ_x$ with another sequence constructed by $σ_z$, $σ_zσ_z$. The concatenation of the two sequences results in desired pulses to fight against any single- and two-qubit errors. The success of our method relies on the ability to adjust system parameters or interaction terms, which can be achieved in different physical systems, including trapped ions and superconducting qubits. We finally explore the performance of our method numerically with the above-mentioned errors that are changing at any moment and show the preferred protection offered by the method. Therefore, our method is a timely step forward in preserving quantum gates at the level of physical qubits.

Submitted 7 May, 2024; originally announced May 2024.

Comments: 5 pages

arXiv:2405.02933 [pdf, other]

Relay Decoding: Concatenating Large Language Models for Machine Translation

Authors: Chengpeng Fu, Xiaocheng Feng, Yichong Huang, Wenshuai Huo, Baohang Li, Hui Wang, Bin Qin, Ting Liu

Abstract: Leveraging large language models for machine translation has demonstrated promising results. However, it does require the large language models to possess the capability of handling both the source and target languages in machine translation. When it is challenging to find large models that support the desired languages, resorting to continuous learning methods becomes a costly endeavor. To mitigate these expenses, we propose an innovative approach called RD (Relay Decoding), which entails concatenating two distinct large models that individually support the source and target languages. By incorporating a simple mapping layer to facilitate the connection between these two models and utilizing a limited amount of parallel data for training, we successfully achieve superior results in the machine translation task. Experimental results conducted on the Multi30k and WikiMatrix datasets validate the effectiveness of our proposed method.

Submitted 5 May, 2024; originally announced May 2024.

Comments: Work in progress

arXiv:2405.02356 [pdf, other]

Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator

Authors: Xincheng Feng, Guodong Shen, Jianhao Hu, Meng Li, Ngai Wong

Abstract: Nonlinearities are crucial for capturing complex input-output relationships especially in deep neural networks. However, nonlinear functions often incur various hardware and compute overheads. Meanwhile, stochastic computing (SC) has emerged as a promising approach to tackle this challenge by trading output precision for hardware simplicity. To this end, this paper proposes a first-of-its-kind stochastic multivariate universal-radix finite-state machine (SMURF) that harnesses SC for hardware-simplistic multivariate nonlinear function generation at high accuracy. We present the finite-state machine (FSM) architecture for SMURF, as well as analytical derivations of sampling gate coefficients for accurately approximating generic nonlinear functions. Experiments demonstrate the superiority of SMURF, requiring only 16.07% area and 14.45% power consumption of Taylor-series approximation, and merely 2.22% area of look-up table (LUT) schemes.

Submitted 2 May, 2024; originally announced May 2024.
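
For readers unfamiliar with stochastic computing, the precision-for-simplicity trade mentioned in the abstract above can be seen in the textbook unipolar encoding: a value in [0, 1] is represented by a random bitstream whose fraction of ones equals that value, and a single AND gate multiplies two independent streams. The Python sketch below illustrates only this generic idea; it is not the SMURF finite-state machine, and all function names are illustrative assumptions.

```python
import random


def to_stream(p, length, rng):
    """Encode a probability p in [0, 1] as a list of random bits."""
    return [1 if rng.random() < p else 0 for _ in range(length)]


def from_stream(bits):
    """Decode a bitstream back to a value: the fraction of ones."""
    return sum(bits) / len(bits)


def sc_multiply(a, b, length=100_000, seed=0):
    """Approximate a * b by AND-ing two independent bitstreams.

    Longer streams give a more precise result, which is the
    precision/cost trade-off of stochastic computing.
    """
    rng = random.Random(seed)
    stream_a = to_stream(a, length, rng)
    stream_b = to_stream(b, length, rng)
    return from_stream([x & y for x, y in zip(stream_a, stream_b)])


print(sc_multiply(0.5, 0.4))  # close to 0.2, up to sampling noise
```

The result is only approximate; the error shrinks roughly with the square root of the stream length.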

arXiv:2405.01259 [pdf, other]

Identification of Entailment and Contradiction Relations between Natural Language Sentences: A Neurosymbolic Approach

Authors: Xuyao Feng, Anthony Hunter

Abstract: Natural language inference (NLI), also known as Recognizing Textual Entailment (RTE), is an important aspect of natural language understanding. Most research now uses machine learning and deep learning to perform this task on specific datasets, meaning their solution is neither explainable nor explicit. To address the need for an explainable approach to RTE, we propose a novel pipeline that is based on translating text into an Abstract Meaning Representation (AMR) graph. For this we use a pre-trained AMR parser. We then translate the AMR graph into propositional logic and use a SAT solver for automated reasoning. In text, often commonsense suggests that an entailment (or contradiction) relationship holds between a premise and a claim, but because different wordings are used, this is not identified from their logical representations. To address this, we introduce relaxation methods to allow replacement or forgetting of some propositions. Our experimental results show this pipeline performs well on four RTE datasets.

Submitted 2 May, 2024; originally announced May 2024.

ACM Class: I.2

arXiv:2404.19389 [pdf]

Electronic decoupling and hole-doping of graphene nanoribbons on metal substrates by chloride intercalation

Authors: Amogh Kinikar, Thorsten G. Englmann, Marco Di Giovannantonio, Nicolò Bassi, Feifei Xiang, Samuel Stolz, Roland Widmer, Gabriela Borin Barin, Elia Turco, Néstor Merino Díez, Kristjan Eimre, Andres Ortega Guerrero, Xinliang Feng, Oliver Gröning, Carlo A. Pignedoli, Roman Fasel, Pascal Ruffieux

Abstract: Atomically precise graphene nanoribbons (GNRs) have a wide range of electronic properties that depend sensitively on their chemical structure. Several types of GNRs have been synthesized on metal surfaces through selective surface-catalyzed reactions. The resulting GNRs are adsorbed on the metal surface, which may lead to hybridization between the GNR orbitals and those of the substrate. This makes investigation of the intrinsic electronic properties of GNRs more difficult, and also rules out capacitive gating. Here we demonstrate the formation of a dielectric gold chloride adlayer that can intercalate underneath GNRs on the Au(111) surface. The intercalated gold chloride adlayer electronically decouples the GNRs from the metal and leads to a substantial hole doping of the GNRs. Our results introduce an easily accessible tool in the in situ characterization of GNRs grown on Au(111) that allows for exploration of their electronic properties in a heavily hole-doped regime.

Submitted 30 April, 2024; originally announced April 2024.

Comments: Supporting information follows the main text in the same file

Showing 1–50 of 1,018 results for author: Feng, X