Showing 1–50 of 142 results for author: Duan, S

  1. arXiv:2407.08924  [pdf, other]

    cs.CR

    Disassembling Obfuscated Executables with LLM

    Authors: Huanyao Rong, Yue Duan, Hang Zhang, XiaoFeng Wang, Hongbo Chen, Shengchen Duan, Shen Wang

    Abstract: Disassembly is a challenging task, particularly for obfuscated executables containing junk bytes, which are designed to induce disassembly errors. Existing solutions rely on heuristics or leverage machine learning techniques, but achieve only limited success. Fundamentally, such obfuscation cannot be defeated without in-depth understanding of the binary executable's semantics, which is made possi…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.08353  [pdf]

    cond-mat.mtrl-sci

    One-dimensional flat bands in phosphorene nanoribbons with pentagonal nature

    Authors: Shuo Sun, Jing-Yang You, Zhihao Cai, Jie Su, Tong Yang, Xinnan Peng, Yihe Wang, Daiyu Geng, Jian Gou, Yuli Huang, Sisheng Duan, Lan Chen, Kehui Wu, Andrew T. S. Wee, Yuan Ping Feng, Jia Lin Zhang, Jiong Lu, Baojie Feng, Wei Chen

    Abstract: Materials with topological flat bands can serve as a promising platform to investigate strongly interacting phenomena. However, experimental realization of ideal flat bands is mostly limited to artificial lattices or moiré systems. Here we report a general way to construct one-dimensional (1D) flat bands in phosphorene nanoribbons (PNRs) with pentagonal nature: penta-hexa-PNRs and penta-dodeca-PNR…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 13 pages, 4 figures

  3. arXiv:2407.01183  [pdf, other]

    cs.DB

    TCSR-SQL: Towards Table Content-aware Text-to-SQL with Self-retrieval

    Authors: Wenbo Xu, Liang Yan, Peiyi Han, Haifeng Zhu, Chuanyi Liu, Shaoming Duan, Cuiyun Gao, Yingwei Liang

    Abstract: Large Language Model-based (LLM-based) Text-to-SQL methods have achieved important progress in generating SQL queries for real-world applications. When confronted with table content-aware questions in real-world scenarios, ambiguous data content keywords and non-existent database schema column names within the question lead to poor performance of existing methods. To solve this problem, we pr…

    Submitted 12 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2406.14549  [pdf, other]

    cs.CV cs.LG q-bio.NC

    Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models

    Authors: Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete

    Abstract: The proliferation of large language models has revolutionized natural language processing tasks, yet it raises profound concerns regarding data privacy and security. Language models are trained on extensive corpora including potentially sensitive or proprietary information, and the risk of data leakage -- where the model response reveals pieces of such information -- remains inadequately understoo…

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2406.12793  [pdf, other]

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM: Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang, et al. (32 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained…

    Submitted 18 June, 2024; originally announced June 2024.

  6. arXiv:2406.07436  [pdf, other]

    cs.PL

    McEval: Massively Multilingual Code Evaluation

    Authors: Linzheng Chai, Shukai Liu, Jian Yang, Yuwei Yin, Ke Jin, Jiaheng Liu, Tao Sun, Ge Zhang, Changyu Ren, Hongcheng Guo, Zekun Wang, Boyang Wang, Xianjie Wu, Bing Wang, Tongliang Li, Liqun Yang, Sufeng Duan, Zhoujun Li

    Abstract: Code large language models (LLMs) have shown remarkable advances in code understanding, completion, and generation tasks. Programming benchmarks, comprised of a selection of code challenges and corresponding test cases, serve as a standard to evaluate the capability of different LLMs in such tasks. However, most existing benchmarks primarily focus on Python and are still restricted to a limited nu…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 22 pages

  7. arXiv:2406.07032  [pdf, other]

    cs.CV

    RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks

    Authors: Zhechao Wang, Peirui Cheng, Pengju Tian, Yuchao Wang, Mingxin Chen, Shujing Duan, Zhirui Wang, Xinming Li, Xian Sun

    Abstract: Remote sensing lightweight foundation models have achieved notable success in online perception within remote sensing. However, their capabilities are restricted to performing online inference solely based on their own observations and models, thus lacking a comprehensive understanding of large-scale remote sensing scenarios. To overcome this limitation, we propose a Remote Sensing Distributed Fou…

    Submitted 11 June, 2024; originally announced June 2024.

  8. arXiv:2406.06305  [pdf, other]

    cs.CV cs.AI

    NeuroMoCo: A Neuromorphic Momentum Contrast Learning Method for Spiking Neural Networks

    Authors: Yuqi Ma, Huamin Wang, Hangchi Shen, Xuemei Chen, Shukai Duan, Shiping Wen

    Abstract: Recently, brain-inspired spiking neural networks (SNNs) have attracted great research attention owing to their inherent bio-interpretability, event-triggered properties and powerful perception of spatiotemporal information, which is beneficial to handling event-based neuromorphic datasets. In contrast to conventional static image datasets, event-based neuromorphic datasets present heightened compl…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 32 pages, 4 figures, 4 tables

  9. arXiv:2406.04422  [pdf, ps, other]

    math.AP

    Collapsing-ring blowup solutions for the nonlinear heat equation

    Authors: Senhao Duan, Nejla Nouaili, Hatem Zaag

    Abstract: In this paper, we construct a singular standing ring solution of the nonlinear heat equation in the radial case. We give a rigorous proof of the existence of a ring blow-up solution in finite time. This result was predicted formally by Baruch, Fibich and Gavish \cite{BFGpd10}. We also prove the stability of these dynamics among radially symmetric solutions.

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 25 pages

    MSC Class: 35B40; 35B44

  10. arXiv:2406.02629  [pdf, other]

    cs.CR cs.LG

    SSNet: A Lightweight Multi-Party Computation Scheme for Practical Privacy-Preserving Machine Learning Service in the Cloud

    Authors: Shijin Duan, Chenghong Wang, Hongwu Peng, Yukui Luo, Wujie Wen, Caiwen Ding, Xiaolin Xu

    Abstract: As privacy-preserving becomes a pivotal aspect of deep learning (DL) development, multi-party computation (MPC) has gained prominence for its efficiency and strong security. However, the practice of current MPC frameworks is limited, especially when dealing with large neural networks, exemplified by the prolonged execution time of 25.8 seconds for secure inference on ResNet-152. The primary challe…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 16 pages, 9 figures

  11. arXiv:2405.14185  [pdf, other]

    cs.LG cs.PF

    A structure-aware framework for learning device placements on computation graphs

    Authors: Shukai Duan, Heng Ping, Nikos Kanakaris, Xiongye Xiao, Peiyu Zhang, Panagiotis Kyriakis, Nesreen K. Ahmed, Guixiang Ma, Mihai Capota, Shahin Nazarian, Theodore L. Willke, Paul Bogdan

    Abstract: Existing approaches for device placement ignore the topological features of computation graphs and rely mostly on heuristic methods for graph partitioning. At the same time, they either follow a grouper-placer or an encoder-placer architecture, which requires understanding the interaction structure between code operations. To bridge the gap between encoder-placer and grouper-placer techniques, we…

    Submitted 23 May, 2024; originally announced May 2024.

  12. arXiv:2405.05542  [pdf, other]

    cs.RO cs.MA

    Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning

    Authors: Yuchen Shi, Shihong Duan, Cheng Xu, Ran Wang, Fangwen Ye, Chau Yuen

    Abstract: This work introduces a novel value decomposition algorithm, termed Dynamic Deep Factor Graphs (DDFG). Unlike traditional coordination graphs, DDFG leverages factor graphs to articulate the decomposition of value functions, offering enhanced flexibility and adaptability to complex value function structures. Central to DDFG is a graph structure generation policy that innovatively generates…

    Submitted 7 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE TPAMI

  13. arXiv:2404.04265  [pdf, other]

    cs.IR cs.LG

    Accelerating Matrix Factorization by Dynamic Pruning for Fast Recommendation

    Authors: Yining Wu, Shengyu Duan, Gaole Sai, Chenhong Cao, Guobing Zou

    Abstract: Matrix factorization (MF) is a widely used collaborative filtering (CF) algorithm for recommendation systems (RSs), due to its high prediction accuracy, great flexibility and high efficiency in big data processing. However, with the dramatically increased number of users/items in current RSs, the computational complexity of training an MF model increases considerably. Many existing works have accelerat…

    Submitted 18 March, 2024; originally announced April 2024.

  14. arXiv:2403.16228  [pdf, other]

    q-fin.MF q-fin.PM

    Rank-Dependent Predictable Forward Performance Processes

    Authors: Bahman Angoshtari, Shida Duan

    Abstract: Predictable forward performance processes (PFPPs) are stochastic optimal control frameworks for an agent who controls a randomly evolving system but can only prescribe the system dynamics for a short period ahead. This is a common scenario in which a controlling agent frequently re-calibrates her model. We introduce a new class of PFPPs based on rank-dependent utility, generalizing existing models…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 43 pages, 3 figures

    MSC Class: 91G10; 91G80; 60H30

  15. arXiv:2403.13844  [pdf, other]

    cs.LG cs.AI

    Scheduled Knowledge Acquisition on Lightweight Vector Symbolic Architectures for Brain-Computer Interfaces

    Authors: Yejia Liu, Shijin Duan, Xiaolin Xu, Shaolei Ren

    Abstract: Brain-computer interfaces (BCIs) are typically designed to be lightweight and responsive in real time to provide users timely feedback. Classical feature engineering is computationally efficient but has low accuracy, whereas recent deep neural networks (DNNs) improve accuracy but are computationally expensive and incur high latency. As a promising alternative, the low-dimensional computing (LDC) cl…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted as a full paper by the tinyML Research Symposium 2024

  16. arXiv:2403.11518  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el

    Optical manipulation of the topological phase in ZrTe5 revealed by time- and angle-resolved photoemission

    Authors: Chaozhi Huang, Chengyang Xu, Fengfeng Zhu, Shaofeng Duan, Jianzhe Liu, Lingxiao Gu, Shichong Wang, Haoran Liu, Dong Qian, Weidong Luo, Wentao Zhang

    Abstract: High-resolution time- and angle-resolved photoemission measurements were conducted on the topological insulator ZrTe5. With strong femtosecond photoexcitation, a possible ultrafast phase transition from a weak to a strong topological insulating phase was experimentally realized by recovering the energy gap inversion in a time scale that was shorter than 0.15 ps. This photoinduced transient strong…

    Submitted 18 March, 2024; originally announced March 2024.

    Journal ref: Chinese Physics B 33, 017901 (2024)

  17. arXiv:2403.06682  [pdf, other]

    cs.CL cs.CV cs.CY

    Restoring Ancient Ideograph: A Multimodal Multitask Neural Network Approach

    Authors: Siyu Duan, Jun Wang, Qi Su

    Abstract: Cultural heritage serves as the enduring record of human thought and history. Despite significant efforts dedicated to the preservation of cultural relics, many ancient artefacts have been ravaged irreversibly by natural deterioration and human actions. Deep learning technology has emerged as a valuable tool for restoring various kinds of cultural heritages, including ancient text restoration. Pre…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  18. arXiv:2403.04204  [pdf, other]

    cs.AI cs.CL

    On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models

    Authors: Xinpeng Wang, Shitong Duan, Xiaoyuan Yi, Jing Yao, Shanlin Zhou, Zhihua Wei, Peng Zhang, Dongkuan Xu, Maosong Sun, Xing Xie

    Abstract: Big models have achieved revolutionary breakthroughs in the field of AI, but they might also pose potential concerns. Addressing such concerns, alignment technologies were introduced to make these models conform to human preferences and values. Despite considerable advancements in the past year, various challenges lie in establishing the optimal alignment strategy, such as data cost and scalable o…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 23 pages, 7 figures

  19. arXiv:2403.03609  [pdf, ps, other]

    math.AC

    Powers of edge ideals of edge-weighted trees

    Authors: Jiaxin Li, Guangjun Zhu, Shiya Duan

    Abstract: This paper gives exact formulas for the regularity of edge ideals of edge-weighted integrally closed trees. In addition, we provide some linear upper bounds on the regularity of powers of such ideals.

    Submitted 6 March, 2024; originally announced March 2024.

    MSC Class: Primary 13A15; 13D02; Secondary 05E40

  20. arXiv:2403.03419  [pdf, other]

    cs.CL cs.AI

    Negating Negatives: Alignment without Human Positive Samples via Distributional Dispreference Optimization

    Authors: Shitong Duan, Xiaoyuan Yi, Peng Zhang, Tun Lu, Xing Xie, Ning Gu

    Abstract: Large language models (LLMs) have revolutionized the role of AI, yet also pose potential risks of propagating unethical content. Alignment technologies have been introduced to steer LLMs towards human preference, gaining increasing attention. Despite notable breakthroughs in this direction, existing methods heavily rely on high-quality positive-negative training pairs, suffering from noisy labels…

    Submitted 5 March, 2024; originally announced March 2024.

  21. arXiv:2402.09725  [pdf, other]

    cs.CL cs.AI

    Improving Non-autoregressive Machine Translation with Error Exposure and Consistency Regularization

    Authors: Xinran Chen, Sufeng Duan, Gongshen Liu

    Abstract: Being one of the IR-NAT (Iterative-refinement-based NAT) frameworks, the Conditional Masked Language Model (CMLM) adopts the mask-predict paradigm to re-predict the masked low-confidence tokens. However, CMLM suffers from the data distribution discrepancy between training and inference, where the observed tokens are generated differently in the two cases. In this paper, we address this problem wi…

    Submitted 15 February, 2024; originally announced February 2024.

  22. arXiv:2401.02111  [pdf, ps, other]

    math.AC

    Edge ideals of some edge-weighted graphs

    Authors: Guangjun Zhu, Shiya Duan, Yijun Cui, Jiaxin Li

    Abstract: This paper presents exact formulas for the regularity and depth of powers of edge ideals of an edge-weighted star graph. Additionally, we provide exact formulas for the regularity of powers of the edge ideal of an edge-weighted integrally closed path, as well as lower bounds on the depth of powers of such an edge ideal.

    Submitted 4 January, 2024; originally announced January 2024.

    MSC Class: Primary 13F20; 13C15; 05C22; Secondary 05E40

  23. arXiv:2401.00225  [pdf]

    eess.AS cs.AI eess.SP

    Enhancing dysarthria speech feature representation with empirical mode decomposition and Walsh-Hadamard transform

    Authors: Ting Zhu, Shufei Duan, Camille Dingam, Huizhi Liang, Wei Zhang

    Abstract: Dysarthria speech contains the pathological characteristics of vocal tract and vocal fold, but so far, they have not yet been included in traditional acoustic feature sets. Moreover, the nonlinearity and non-stationarity of speech have been ignored. In this paper, we propose a feature enhancement algorithm for dysarthria speech called WHFEMD. It combines empirical mode decomposition (EMD) and fast…

    Submitted 30 December, 2023; originally announced January 2024.

  24. arXiv:2312.08998  [pdf]

    eess.AS cs.AI cs.SD eess.SP

    Design, construction and evaluation of emotional multimodal pathological speech database

    Authors: Ting Zhu, Shufei Duan, Huizhi Liang, Wei Zhang

    Abstract: The lack of an available emotion pathology database is one of the key obstacles in studying the emotion expression status of patients with dysarthria. The first Chinese multimodal emotional pathological speech database containing multi-perspective information is constructed in this paper. It includes 29 controls and 39 patients with different degrees of motor dysarthria, expressing happy, sad, ang…

    Submitted 14 December, 2023; originally announced December 2023.

  25. arXiv:2312.05657  [pdf, other]

    cs.LG cs.AI cs.PL cs.SE

    Leveraging Reinforcement Learning and Large Language Models for Code Optimization

    Authors: Shukai Duan, Nikos Kanakaris, Xiongye Xiao, Heng Ping, Chenyu Zhou, Nesreen K. Ahmed, Guixiang Ma, Mihai Capota, Theodore L. Willke, Shahin Nazarian, Paul Bogdan

    Abstract: Code optimization is a daunting task that requires a significant level of expertise from experienced programmers. This level of expertise is not sufficient when compared to the rapid development of new hardware architectures. Towards advancing the whole code optimization process, recent approaches rely on machine learning and artificial intelligence techniques. This paper introduces a new framewor…

    Submitted 9 December, 2023; originally announced December 2023.

  26. arXiv:2312.00856  [pdf, other]

    cs.CV

    QAFE-Net: Quality Assessment of Facial Expressions with Landmark Heatmaps

    Authors: Shuchao Duan, Amirhossein Dadashzadeh, Alan Whone, Majid Mirmehdi

    Abstract: Facial expression recognition (FER) methods have made great inroads in categorising moods and feelings in humans. Beyond FER, pain estimation methods assess levels of intensity in pain expressions; however, assessing the quality of all facial expressions is of critical value in health-related applications. In this work, we address the quality of five different facial expressions in patients affecte…

    Submitted 12 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted to ELFA workshop at WACV 2024

  27. arXiv:2311.17819  [pdf, ps, other]

    astro-ph.SR physics.space-ph

    Weak Solar Radio Bursts from the Solar Wind Acceleration Region Observed by Parker Solar Probe and Its Probable Emission Mechanism

    Authors: Ling Chen, Bing Ma, Dejin Wu, Xiaowei Zhou, Marc Pulupa, PeiJin Zhang, Pietro Zucca, Stuart D. Bale, Justin C. Kasper, SuPing Duan

    Abstract: The Parker Solar Probe (PSP) provides unprecedentedly close observations of the Sun, and hence the possibility of directly understanding the "elementary process" that occurs at the kinetic scale of collective particle interactions in solar coronal plasmas. We report a kind of weak solar radio burst (SRB), detected by PSP when it passed a low-density magnetic channel…

    Submitted 29 November, 2023; originally announced November 2023.

  28. arXiv:2311.15179  [pdf, other]

    cs.SE

    Estimation of the User Contribution Rate by Leveraging Time Sequence in Pairwise Matching function-point between Users Feedback and App Updating Log

    Authors: Shiqi Duan, Jianxun Liu, Yong Xiao, Xiangping Zhang

    Abstract: Mobile applications have become an inseparable part of people's daily life. Nonetheless, the market competition is extremely fierce, and apps lacking recognition among most users are susceptible to market elimination. To this end, developers must swiftly and accurately apprehend the requirements of the wider user base to effectively strategize and promote their apps' orderly and healthy evolution.…

    Submitted 25 November, 2023; originally announced November 2023.

  29. arXiv:2311.09489  [pdf, other]

    cs.CR

    MirrorNet: A TEE-Friendly Framework for Secure On-device DNN Inference

    Authors: Ziyu Liu, Yukui Luo, Shijin Duan, Tong Zhou, Xiaolin Xu

    Abstract: Deep neural network (DNN) models have become prevalent in edge devices for real-time inference. However, they are vulnerable to model extraction attacks and require protection. Existing defense approaches either fail to fully safeguard model confidentiality or result in significant latency issues. To overcome these challenges, this paper presents MirrorNet, which leverages Trusted Execution Enviro…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted by ICCAD 2023

  30. arXiv:2311.07619  [pdf, other]

    cs.IR cs.AI

    Modeling User Viewing Flow Using Large Language Models for Article Recommendation

    Authors: Zhenghao Liu, Zulong Chen, Moufeng Zhang, Shaoyang Duan, Hong Wen, Liangyue Li, Nan Li, Yu Gu, Ge Yu

    Abstract: This paper proposes the User Viewing Flow Modeling (SINGLE) method for the article recommendation task, which models the user constant preference and instant interest from user-clicked articles. Specifically, we first employ a user constant viewing flow modeling method to summarize the user's general interest to recommend articles. In this case, we utilize Large Language Models (LLMs) to capture c…

    Submitted 7 March, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted by WebConf 2024

  31. arXiv:2311.07603  [pdf, other]

    cs.CV

    PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment

    Authors: Amirhossein Dadashzadeh, Shuchao Duan, Alan Whone, Majid Mirmehdi

    Abstract: The limited availability of labelled data in Action Quality Assessment (AQA) has forced previous works to fine-tune their models pretrained on large-scale domain-general datasets. This common approach results in weak generalisation, particularly when there is a significant domain shift. We propose a novel, parameter efficient, continual pretraining framework, PECoP, to reduce such domain shift vi…

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: Accepted to WACV 2024 (preprint)

  32. arXiv:2311.05608  [pdf, other]

    cs.CR cs.AI cs.CL

    FigStep: Jailbreaking Large Vision-language Models via Typographic Visual Prompts

    Authors: Yichen Gong, Delong Ran, Jinyuan Liu, Conglei Wang, Tianshuo Cong, Anyu Wang, Sisi Duan, Xiaoyun Wang

    Abstract: Ensuring the safety of artificial intelligence-generated content (AIGC) is a longstanding topic in the artificial intelligence (AI) community, and the safety concerns associated with Large Language Models (LLMs) have been widely investigated. Recently, large vision-language models (VLMs) represent an unprecedented revolution, as they are built upon LLMs but can incorporate additional modalities (e…

    Submitted 13 December, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Technical Report

  33. arXiv:2310.20290  [pdf, other]

    math.NA

    On Rayleigh Quotient Iteration for Dual Quaternion Hermitian Eigenvalue Problem

    Authors: Shan-Qi Duan, Qing-Wen Wang, Xue-Feng Duan

    Abstract: The application of eigenvalue theory to dual quaternion Hermitian matrices holds significance in the realm of multi-agent formation control. In this paper, we study the Rayleigh quotient iteration (RQI) for solving the right eigenpairs of dual quaternion Hermitian matrices. Combined with dual representation, the RQI algorithm can effectively compute the extreme eigenvalue along with the associated…

    Submitted 6 March, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2111.12211 by other authors

  34. Multi-grained Evidence Inference for Multi-choice Reading Comprehension

    Authors: Yilin Zhao, Hai Zhao, Sufeng Duan

    Abstract: Multi-choice Machine Reading Comprehension (MRC) is a major and challenging task for machines to answer questions according to provided options. Answers in multi-choice MRC cannot be directly extracted in the given passages, and essentially require machines capable of reasoning from accurate extracted evidence. However, the critical evidence may be as simple as just one word or phrase, while it is…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted by TASLP 2023, vol. 31, pp. 3896-3907

    Journal ref: in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 3896-3907, 2023

  35. arXiv:2310.11984  [pdf, other]

    cs.LG cs.CL

    From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers

    Authors: Shaoxiong Duan, Yining Shi, Wei Xu

    Abstract: In this paper, we investigate the inherent capabilities of transformer models in learning arithmetic algorithms, such as addition and parity. Through experiments and attention analysis, we identify a number of crucial factors for achieving optimal length generalization. We show that transformer models are able to generalize to long lengths with the help of targeted attention biasing. In particular…

    Submitted 10 May, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  36. arXiv:2310.11053  [pdf, other]

    cs.CL cs.AI cs.CY

    Denevil: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning

    Authors: Shitong Duan, Xiaoyuan Yi, Peng Zhang, Tun Lu, Xing Xie, Ning Gu

    Abstract: Large Language Models (LLMs) have made unprecedented breakthroughs, yet their increasing integration into everyday life might raise societal risks due to generated unethical content. Despite extensive study on specific issues like bias, the intrinsic values of LLMs remain largely unexplored from a moral philosophy perspective. This work delves into ethical values utilizing Moral Foundation Theory.…

    Submitted 4 March, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024

  37. arXiv:2310.07548  [pdf, other]

    cs.CV

    Attribute Localization and Revision Network for Zero-Shot Learning

    Authors: Junzhe Xu, Suling Duan, Chenwei Tang, Zhenan He, Jiancheng Lv

    Abstract: Zero-shot learning enables the model to recognize unseen categories with the aid of auxiliary semantic information such as attributes. Current works proposed to detect attributes from local image regions and align extracted features with class-level semantics. In this paper, we find that the choice between local and global features is not a zero-sum game; global features can also contribute to the…

    Submitted 11 October, 2023; originally announced October 2023.

  38. arXiv:2309.13568  [pdf, ps, other]

    math.AC

    The $\circ$ operation and $*$ operation of Cohen-Macaulay bipartite graphs

    Authors: Yulong Yang, Guangjun Zhu, Yijun Cui, Shiya Duan

    Abstract: Let $G$ be a finite simple graph with the vertex set $V$ and let $I_G$ be its edge ideal in the polynomial ring $S=\mathbb{K}[x_V]$. In this paper, we compute the depth and the Castelnuovo--Mumford regularity of $S/I_G$ when $G=G_1\circ G_2$ or $G=G_1* G_2$ is a graph obtained from Cohen-Macaulay bipartite graphs $G_1$, $G_2$ by $\circ$ operation or $*$ operation, respectively.

    Submitted 27 September, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2308.06010

    MSC Class: Primary 13C15; 13A15; 13D02; Secondary 05E40

  39. arXiv:2309.02230  [pdf, other]

    cs.CV cs.AI

    DCP-Net: A Distributed Collaborative Perception Network for Remote Sensing Semantic Segmentation

    Authors: Zhechao Wang, Peirui Cheng, Shujing Duan, Kaiqiang Chen, Zhirui Wang, Xinming Li, Xian Sun

    Abstract: Onboard intelligent processing is widely applied in emergency tasks in the field of remote sensing. However, it is predominantly confined to an individual platform with a limited observation range as well as susceptibility to interference, resulting in limited accuracy. Considering the current state of multi-platform collaborative observation, this article innovatively presents a distributed colla…

    Submitted 5 September, 2023; originally announced September 2023.

  40. arXiv:2308.07492  [pdf, ps, other]

    math.CA

    Generalized Radon transforms on fractal measures

    Authors: Shengze Duan

    Abstract: In the setting of a general Borel measure $μ$ on $R^d$ with the natural ball size condition $$μ[B(x,r)]\leq Cr^s,$$ we establish the $L^p(μ)$-$L^q(μ)$-estimate for the generalized Radon transform $$(Af)(x):=\int_{Φ(x,y)=0}(fμ)(y)ψ(x,y)dσ_x(y),$$ where $Φ$ is a smooth function away from the diagonal. Among other reasonable assumptions, an $L^2$-Sobolev bound on $A$ on $R^d$ is imposed. This bound…

    Submitted 14 August, 2023; originally announced August 2023.

  41. arXiv:2308.07139  [pdf, other]

    physics.optics

    Extremely thin perfect absorber by generalized multipole bianisotropic effect

    Authors: Hao Ma, Andrey B. Evlyukhin, Andrey E. Miroshnichenko, Fengjie Zhu, Siyu Duan, Jingbo Wu, Caihong Zhang, Jian Chen, Biao-Bing Jin, Willie J. Padilla, Kebin Fan

    Abstract: Symmetry breaking plays a crucial role in understanding the fundamental physics underlying numerous physical phenomena, including the electromagnetic response in resonators, giving rise to intriguing effects such as directional light scattering, supercavity lasing, and topologically protected states. In this work, we demonstrate that adding a small fraction of lossy metal (as low as…

    Submitted 14 August, 2023; originally announced August 2023.

  42. arXiv:2308.06016  [pdf, ps, other]

    math.AC

    Integral closure and normality of edge ideals of some edge-weighted graphs

    Authors: Shiya Duan, Guangjun Zhu, Yijun Cui, Jiaxin Li

    Abstract: Let $G_ω$ be an edge-weighted simple graph. In this paper, we give a complete characterization of the graph $G_ω$ whose edge ideal $I(G_ω)$ is integrally closed. We also show that if $G_ω$ is an edge-weighted star graph, a path or a cycle, and $I(G_ω)$ is integrally closed, then $I(G_ω)$ is normal.

    Submitted 11 August, 2023; originally announced August 2023.

    MSC Class: Primary 13B22; 13F20; Secondary 05C99; 05E40

  43. arXiv:2308.01469  [pdf, other]

    cs.LG cs.AI cs.CR

    VertexSerum: Poisoning Graph Neural Networks for Link Inference

    Authors: Ruyi Ding, Shijin Duan, Xiaolin Xu, Yunsi Fei

    Abstract: Graph neural networks (GNNs) have brought superb performance to various applications utilizing graph structural data, such as social analysis and fraud detection. The graph links, e.g., social relationships and transaction history, are sensitive and valuable information, which raises privacy concerns when using GNNs. To exploit these vulnerabilities, we propose VertexSerum, a novel graph poisoning…

    Submitted 2 August, 2023; originally announced August 2023.

  44. arXiv:2307.02751  [pdf, ps, other]

    cs.SD cs.CR eess.AS

    DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition

    Authors: Zhifeng Wang, Chunyan Zeng, Surong Duan, Hongjie Ouyang, Hongmin Xu

    Abstract: Speaker recognition is a biometric modality that utilizes the speaker's speech segments to recognize the identity, determining whether the test speaker belongs to one of the enrolled speakers. In order to improve the robustness of the i-vector framework on cross-channel conditions and explore a novel method for applying deep learning to speaker recognition, the Stacked Auto-encoders are used to g…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 12 pages, 3 figures

  45. arXiv:2306.15513  [pdf, other]

    cs.CR

    PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment

    Authors: Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding

    Abstract: Two-party computation (2PC) is promising to enable privacy-preserving deep learning (DL). However, the 2PC-based privacy-preserving DL implementation comes with high comparison protocol overhead from the non-linear operators. This work presents PASNet, a novel systematic framework that enables low latency, high energy efficiency & accuracy, and security-guaranteed 2PC-DL by integrating the hardwar…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: DAC 2023 accepted publication; a short version was published at the AAAI 2023 workshop on DL-Hardware Co-Design for AI Acceleration: RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference

    ACM Class: E.3; I.2; B.0

    Journal ref: DAC 2023

  46. arXiv:2306.00311  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Ultrafast Switching from the Charge Density Wave Phase to a Metastable Metallic State in 1T-TiSe$_2$

    Authors: Shaofeng Duan, Wei Xia, Chaozhi Huang, Shichong Wang, Lingxiao Gu, Haoran Liu, Dao Xiang, Dong Qian, Yanfeng Guo, Wentao Zhang

    Abstract: The ultrafast electronic structures of the charge density wave material 1T-TiSe$_2$ were investigated by high-resolution time- and angle-resolved photoemission spectroscopy. We found that the quasiparticle populations drove ultrafast electronic phase transitions in 1T-TiSe$_2$ within 100 fs after photoexcitation, and a metastable metallic state, which was significantly different from the equilibri…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 13 pages, 10 figures

    Journal ref: Phys. Rev. Lett. 130, 226501 (2023)

  47. arXiv:2304.08020  [pdf, other]

    stat.ME

    Sparse Positive-Definite Estimation for Covariance Matrices with Repeated Measurements

    Authors: Sunpeng Duan, Guo Yu, Juntao Duan, Yuedong Wang

    Abstract: Repeated measurements are common in many fields, where random variables are observed repeatedly across different subjects. Such data have an underlying hierarchical structure, and it is of interest to learn covariance/correlation at different levels. Most existing methods for sparse covariance/correlation matrix estimation assume independent samples. Ignoring the underlying hierarchical structure…

    Submitted 10 June, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  48. arXiv:2302.12506  [pdf]

    cs.IT

    Exploring the Enablers of Digital Transformation in Small and Medium-Sized Enterprise

    Authors: Sachithra Lokuge, Sophia Duan

    Abstract: Recently, digital transformation has attracted much attention from both academics and practitioners. With the advent of digital technologies, small-and-medium-sized enterprises (SMEs) have obtained the capacity to initiate digital transformation initiatives in a similar fashion to large-sized organizations. The innate characteristics of digital technologies also favor SMEs in promoting initiation of di…

    Submitted 24 February, 2023; originally announced February 2023.

  49. arXiv:2302.12347  [pdf, other]

    cs.LG

    MetaLDC: Meta Learning of Low-Dimensional Computing Classifiers for Fast On-Device Adaption

    Authors: Yejia Liu, Shijin Duan, Xiaolin Xu, Shaolei Ren

    Abstract: Fast model updates for unseen tasks on intelligent edge devices are crucial but also challenging due to the limited computational power. In this paper, we propose MetaLDC, which meta-trains brain-inspired ultra-efficient low-dimensional computing classifiers to enable fast adaptation on tiny devices with minimal computational costs. Concretely, during the meta-training stage, MetaLDC meta-trains a r…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted as a full paper by the TinyML Research Symposium 2023; 8 pages, 5 figures

  50. arXiv:2302.02292  [pdf, other]

    cs.CR cs.LG

    RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference

    Authors: Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Shaoyi Huang, Xi Xie, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding

    Abstract: The proliferation of deep learning (DL) has led to the emergence of privacy and security concerns. To address these issues, secure Two-party computation (2PC) has been proposed as a means of enabling privacy-preserving DL computation. However, in practice, 2PC methods often incur high computation and communication overhead, which can impede their use in large-scale systems. To address this challen…

    Submitted 22 February, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: This work is an updated version of arXiv:2209.09424; the original version has been withdrawn

    ACM Class: I.2