Skip to main content

Showing 1–50 of 369 results for author: Liang, L

  1. arXiv:2407.08903  [pdf, other

    cs.CR cs.AI cs.AR

    TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing

    Authors: Husheng Han, Xinyao Zheng, Yuanbo Wen, Yifan Hao, Erhu Feng, Ling Liang, Jianan Mu, Xiaqing Li, Tianyun Ma, Pengwei Jin, Xinkai Song, Zidong Du, Qi Guo, Xing Hu

    Abstract: Heterogeneous collaborative computing with NPU and CPU has received widespread attention due to its substantial performance benefits. To ensure data confidentiality and integrity during computing, Trusted Execution Environments (TEE) is considered a promising solution because of its comparatively lower overhead. However, existing heterogeneous TEE designs are inefficient for collaborative computin… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by ASPLOS 2024

  2. arXiv:2407.06042  [pdf, ps, other

    eess.SP cs.IT

    Near-Optimal MIMO Detection Using Gradient-Based MCMC in Discrete Spaces

    Authors: Xingyu Zhou, Le Liang, Jing Zhang, Chao-Kai Wen, Shi Jin

    Abstract: The discrete nature of transmitted symbols poses challenges for achieving optimal detection in multiple-input multiple-output (MIMO) systems associated with a large number of antennas. Recently, the combination of two powerful machine learning methods, Markov chain Monte Carlo (MCMC) sampling and gradient descent, has emerged as a highly efficient solution to address this issue. However, existing… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2407.03294  [pdf, ps, other

    math.OC cs.LG

    Vertex Exchange Method for a Class of Quadratic Programming Problems

    Authors: Ling Liang, Kim-Chuan Toh, Haizhao Yang

    Abstract: A vertex exchange method is proposed for solving the strongly convex quadratic program subject to the generalized simplex constraint. We conduct rigorous convergence analysis for the proposed algorithm and demonstrate its essential roles in solving some important classes of constrained convex optimization. To get a feasible initial point to execute the algorithm, we also present and analyze a high… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 32 pages, 5 tables

    MSC Class: 90C06; 90C22; 90C25

  4. arXiv:2407.03272  [pdf, other

    math.OC math.NA

    Nesterov's Accelerated Jacobi-Type Methods for Large-scale Symmetric Positive Semidefinite Linear Systems

    Authors: Ling Liang, Qiyuan Pang, Kim-Chuan Toh, Haizhao Yang

    Abstract: Solving symmetric positive semidefinite linear systems is an essential task in many scientific computing problems. While Jacobi-type methods, including the classical Jacobi method and the weighted Jacobi method, exhibit simplicity in their forms and friendliness to parallelization, they are not attractive either because of the potential convergence failure or their slow convergence rate. This pape… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 20 pages

    MSC Class: 90C06; 90C22; 90C25

  5. arXiv:2407.02779  [pdf, other

    cs.AI cs.LG

    Croppable Knowledge Graph Embedding

    Authors: Yushan Zhu, Wen Zhang, Zhiqiang Liu, Mingyang Chen, Lei Liang, Huajun Chen

    Abstract: Knowledge Graph Embedding (KGE) is a common method for Knowledge Graphs (KGs) to serve various artificial intelligence tasks. The suitable dimensions of the embeddings depend on the storage and computing conditions of the specific application scenarios. Once a new dimension is required, a new KGE model needs to be trained from scratch, which greatly increases the training cost and limits the effic… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  6. arXiv:2407.02674  [pdf, other

    astro-ph.CO astro-ph.GA

    Elevated UV luminosity density at Cosmic Dawn explained by non-evolving, weakly-mass dependent star formation efficiency

    Authors: Robert Feldmann, Michael Boylan-Kolchin, James S. Bullock, Onur Çatmabacak, Claude-André Faucher-Giguère, Christopher C. Hayward, Dušan Kereš, Alexandres Lazar, Lichen Liang, Jorge Moreno, Pascal A. Oesch, Eliot Quataert, Xuejian Shen, Guochao Sun

    Abstract: Recent observations with the James Webb Space Telescope (JWST) have uncovered unexpectedly high cosmic star formation activity in the early Universe, mere hundreds of millions of years after the Big Bang. These observations are often understood to reflect an evolutionary shift in star formation efficiency (SFE) caused by changing galactic conditions during these early epochs. We present FIREbox-HR… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 26 pages, 14 figures, 5 tables, submitted to MNRAS, comments welcome

  7. arXiv:2407.01425  [pdf, other

    cs.CV

    FORA: Fast-Forward Caching in Diffusion Transformer Acceleration

    Authors: Pratheba Selvaraju, Tianyu Ding, Tianyi Chen, Ilya Zharkov, Luming Liang

    Abstract: Diffusion transformers (DiT) have become the de facto choice for generating high-quality images and videos, largely due to their scalability, which enables the construction of larger models for enhanced performance. However, the increased size of these models leads to higher inference costs, making them less attractive for real-time applications. We present Fast-FORward CAching (FORA), a simple ye… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  8. arXiv:2406.18916  [pdf, other

    cs.CL cs.AI

    TrustUQA: A Trustful Framework for Unified Structured Data Question Answering

    Authors: Wen Zhang, Long Jin, Yushan Zhu, Jiaoyan Chen, Zhiwei Huang, Junjie Wang, Yin Hua, Lei Liang, Huajun Chen

    Abstract: Natural language question answering (QA) over structured data sources such as tables and knowledge graphs (KGs) have been widely investigated, for example with Large Language Models (LLMs). The main solutions include question to formal query parsing and retrieval-based answer generation. However, current methods of the former often suffer from weak generalization, failing to dealing with multiple… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  9. arXiv:2406.18345  [pdf, other

    cs.LG eess.SP

    EmT: A Novel Transformer for Generalized Cross-subject EEG Emotion Recognition

    Authors: Yi Ding, Chengxuan Tong, Shuailei Zhang, Muyun Jiang, Yong Li, Kevin Lim Jun Liang, Cuntai Guan

    Abstract: Integrating prior knowledge of neurophysiology into neural network architecture enhances the performance of emotion decoding. While numerous techniques emphasize learning spatial and short-term temporal patterns, there has been limited emphasis on capturing the vital long-term contextual information associated with emotional cognitive processes. In order to address this discrepancy, we introduce a… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  10. arXiv:2406.18050  [pdf, other

    cs.CV

    A Multi-Stage Goal-Driven Network for Pedestrian Trajectory Prediction

    Authors: Xiuen Wu, Tao Wang, Yuanzheng Cai, Lingyu Liang, George Papageorgiou

    Abstract: Pedestrian trajectory prediction plays a pivotal role in ensuring the safety and efficiency of various applications, including autonomous vehicles and traffic management systems. This paper proposes a novel method for pedestrian trajectory prediction, called multi-stage goal-driven network (MGNet). Diverging from prior approaches relying on stepwise recursive prediction and the singular forecastin… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Paper accepted by 5th International Conference on Computer Vision, Image and Deep Learning (CVIDL 2024)

  11. arXiv:2406.13927  [pdf, ps, other

    math.AP

    Fully Nonlinear Elliptic Equations With Periodic Data

    Authors: Dongsheng Li, Lichun Liang

    Abstract: In this paper, we study solutions $u$ of fully nonlinear elliptic equations of the form $F(D^2u)=f$ in $\mathbb{R}^n$, where $f$ is periodic. We establish the existence and Liouville type results for entire quadratic polynomial growth solutions, that is, the solution is a quadratic polynomial plus a periodic function. As a consequence, we consider applications to $k$-Hessian equations.

    Submitted 19 June, 2024; originally announced June 2024.

  12. arXiv:2406.11589  [pdf, other

    cs.SE cs.AI cs.IR

    CoSQA+: Enhancing Code Search Dataset with Matching Code

    Authors: Jing Gong, Yanghui Wu, Linxi Liang, Zibin Zheng, Yanlin Wang

    Abstract: Semantic code search, retrieving code that matches a given natural language query, is an important task to improve productivity in software engineering. Existing code search datasets are problematic: either using unrealistic queries, or with mismatched codes, and typically using one-to-one query-code pairing, which fails to reflect the reality that a query might have multiple valid code matches. T… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 11 pages, 4 figures, conference

    ACM Class: I.2.7; D.2.3

  13. arXiv:2406.10208  [pdf, other

    cs.CV

    Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering

    Authors: Zeyu Liu, Weicong Liang, Yiming Zhao, Bohan Chen, Lin Liang, Lijuan Wang, Ji Li, Yuhui Yuan

    Abstract: Recently, Glyph-ByT5 has achieved highly accurate visual text rendering performance in graphic design images. However, it still focuses solely on English and performs relatively poorly in terms of visual appeal. In this work, we address these two fundamental limitations by presenting Glyph-ByT5-v2 and Glyph-SDXL-v2, which not only support accurate visual text rendering for 10 different languages b… ▽ More

    Submitted 12 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Project page: https://glyph-byt5-v2.github.io/

  14. arXiv:2406.08806  [pdf, ps, other

    eess.SY

    Adaptive Cooperative Streaming of Holographic Video Over Wireless Networks: A Proximal Policy Optimization Solution

    Authors: Wanli Wen, Jiping Yan, Yulu Zhang, Zhen Huang, Liang Liang, Yunjian Jia

    Abstract: Adapting holographic video streaming to fluctuating wireless channels is essential to maintain consistent and satisfactory Quality of Experience (QoE) for users, which, however, is a challenging task due to the dynamic and uncertain characteristics of wireless networks. To address this issue, we propose a holographic video cooperative streaming framework designed for a generic wireless network in… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted for publication in IEEE Wireless Communications Letters

  15. arXiv:2406.05846  [pdf, other

    math.OC cs.RO

    Fast and Certifiable Trajectory Optimization

    Authors: Shucheng Kang, Xiaoyang Xu, Jay Sarva, Ling Liang, Heng Yang

    Abstract: We propose semidefinite trajectory optimization (STROM), a framework that computes fast and certifiably optimal solutions for nonconvex trajectory optimization problems defined by polynomial objectives and constraints. STROM employs sparse second-order Lasserre's hierarchy to generate semidefinite program (SDP) relaxations of trajectory optimization. Different from existing tools (e.g., YALMIP and… ▽ More

    Submitted 11 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  16. arXiv:2406.05799  [pdf, ps, other

    eess.SP

    Double-RIS-Assisted Orbital Angular Momentum Near-Field Secure Communications

    Authors: Liping Liang, Minmin Wang, Wenchi Cheng, Wei Zhang

    Abstract: To satisfy the various demands of growing devices and services, emerging high-frequency-based technologies promote near-field wireless communications. Therefore, near-field physical layer security has attracted much attention to facilitate the wireless information security against illegitimate eavesdropping. However, highly correlated channels between legitimate transceivers and eavesdroppers of e… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  17. arXiv:2406.05790  [pdf, ps, other

    eess.SP

    Integrated Sensing and Communication for Anti-Jamming with OAM

    Authors: Liping Liang, Wenchi Cheng, Wei Zhang, Zhuohui Yao

    Abstract: The spectrum share and open nature of wireless channels enable integrated sensing and communication (ISAC) susceptible to hostile jamming attacks. Due to the intrinsic orthogonality and rich azimuth angle information of orbital angular momentum (OAM), vortex electromagnetic waves with helical phase fronts have shown great potential to achieve high-resolution imaging and strong anti-jamming capabil… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  18. arXiv:2405.20652  [pdf, other

    cs.LG

    Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs

    Authors: Langzhang Liang, Sunwoo Kim, Kijung Shin, Zenglin Xu, Shirui Pan, Yuan Qi

    Abstract: Graph Neural Networks (GNNs) have gained significant attention as a powerful modeling and inference method, especially for homophilic graph-structured data. To empower GNNs in heterophilic graphs, where adjacent nodes exhibit dissimilar labels or features, Signed Message Passing (SMP) has been widely adopted. However, there is a lack of theoretical and empirical analysis regarding the limitations… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Published as a conference paper at ICML 2024

  19. arXiv:2405.19949  [pdf, other

    cs.CV

    Hyper-Transformer for Amodal Completion

    Authors: Jianxiong Gao, Xuelin Qian, Longfei Liang, Junwei Han, Yanwei Fu

    Abstract: Amodal object completion is a complex task that involves predicting the invisible parts of an object based on visible segments and background information. Learning shape priors is crucial for effective amodal completion, but traditional methods often rely on two-stage processes or additional information, leading to inefficiencies and potential error accumulation. To address these shortcomings, we… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  20. arXiv:2405.19893  [pdf, other

    cs.LG cs.AI cs.CL

    Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

    Authors: Chunjing Gan, Dan Yang, Binbin Hu, Hanxiao Zhang, Siyuan Li, Ziqi Liu, Yue Shen, Lin Ju, Zhiqiang Zhang, Jinjie Gu, Lei Liang, Jun Zhou

    Abstract: In recent years, large language models (LLMs) have made remarkable achievements in various domains. However, the untimeliness and cost of knowledge updates coupled with hallucination issues of LLMs have curtailed their applications in knowledge intensive tasks, where retrieval augmented generation (RAG) can be of help. Nevertheless, existing retrieval augmented models typically use similarity as a… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 12 pages

  21. arXiv:2405.14878  [pdf, other

    eess.IV cs.CV cs.LG stat.AP

    Improving and Evaluating Machine Learning Methods for Forensic Shoeprint Matching

    Authors: Divij Jain, Saatvik Kher, Lena Liang, Yufeng Wu, Ashley Zheng, Xizhen Cai, Anna Plantinga, Elizabeth Upton

    Abstract: We propose a machine learning pipeline for forensic shoeprint pattern matching that improves on the accuracy and generalisability of existing methods. We extract 2D coordinates from shoeprint scans using edge detection and align the two shoeprints with iterative closest point (ICP). We then extract similarity metrics to quantify how well the two prints match and use these metrics to train a random… ▽ More

    Submitted 2 April, 2024; originally announced May 2024.

  22. arXiv:2405.13085  [pdf, other

    cs.CL cs.AI

    Multi-domain Knowledge Graph Collaborative Pre-training and Prompt Tuning for Diverse Downstream Tasks

    Authors: Yichi Zhang, Binbin Hu, Zhuo Chen, Lingbing Guo, Ziqi Liu, Zhiqiang Zhang, Lei Liang, Huajun Chen, Wen Zhang

    Abstract: Knowledge graphs (KGs) provide reliable external knowledge for a wide variety of AI tasks in the form of structured triples. Knowledge graph pre-training (KGP) aims to pre-train neural networks on large-scale KGs and provide unified interfaces to enhance different downstream tasks, which is a key direction for KG management, maintenance, and applications. Existing works often focus on purely resea… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Work in progress. Code and data will be open-sourced at https://github.com/zjukg/MuDoK

  23. arXiv:2405.09507  [pdf, other

    cs.CL cs.AI

    QueryNER: Segmentation of E-commerce Queries

    Authors: Chester Palen-Michel, Lizzie Liang, Zhe Wu, Constantine Lignos

    Abstract: We present QueryNER, a manually-annotated dataset and accompanying model for e-commerce query segmentation. Prior work in sequence labeling for e-commerce has largely addressed aspect-value extraction which focuses on extracting portions of a product title or query for narrowly defined aspects. Our work instead focuses on the goal of dividing a query into meaningful chunks with broadly applicable… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted to LREC-COLING 2024

  24. arXiv:2405.00974  [pdf, other

    math.ST

    On Ridge Estimation in High-dimensional Rotationally Sparse Linear Regression

    Authors: Libin Liang, Zhiqiang Tan

    Abstract: Recently, deep neural networks have been found to nearly interpolate training data but still generalize well in various applications. To help understand such a phenomenon, it has been of interest to analyze the ridge estimator and its interpolation limit in high-dimensional regression models. For this motivation, we study the ridge estimator in a rotationally sparse setting of high-dimensional lin… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  25. arXiv:2405.00312  [pdf, ps, other

    math.CT math.RT

    Compatible weak factorization systems and model structures

    Authors: Zhenxing Di, Liping Li, Li Liang

    Abstract: In this paper the concept of compatible weak factorization systems in general categories is introduced as a counterpart of compatible complete cotorsion pairs in abelian categories. We describe a method to construct model structures on general categories via two compatible weak factorization systems satisfying certain conditions, and hence generalize a very useful result by Gillespie for abelian m… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 18 pages

  26. arXiv:2405.00033  [pdf

    physics.app-ph physics.optics

    One-way Valley-locked waveguide with large channel achieved by all-dielectric Photonic Crystals

    Authors: Li Liang, Xiao Zhang, Chuan Wang, Jie Liu, Longzhen Fan, Chengpeng Liang, Liang Liang, Feifei Li, Qi Wu, Yin Poo

    Abstract: Nonreciprocity, which denotes the asymmetric or even unidirectional transmission of light, constitutes the cornerstone of modern photonic circuits. In the realm of photonic devices, it has been widely utilized in isolators, circulators and so on. Recent topology in artificial materials, an unprecedented degree of freedom, has been proposed to solve the effect of impurities on nonreciprocal transmi… ▽ More

    Submitted 7 March, 2024; originally announced May 2024.

    Comments: 16 pages and 5 figures

  27. arXiv:2404.19711  [pdf, other

    physics.acc-ph

    Elevating electron energy gain and betatron X-ray emission in proton-driven wakefield acceleration

    Authors: Hossein Saberi, Guoxing Xia, Linbo Liang, John Patrick Farmer, Alexander Pukhov

    Abstract: The long proton beams present at CERN have the potential to evolve into a train of microbunches through the self-modulation instability process. The resonant wakefield generated by a periodic train of proton microbunches can establish a high acceleration field within the plasma, facilitating electron acceleration. This paper investigates the impact of plasma density on resonant wakefield excitatio… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  28. arXiv:2404.15125  [pdf, ps, other

    math.RT math.RA

    Representations of $\mathbb{N}^{\infty}$-type combinatorial categories

    Authors: Zhenxing Di, Liping Li, Li Liang

    Abstract: In this paper we consider representations of certain combinatorial categories, including the poset $\D$ of positive integers and division, the Young lattice $\mathscr{Y}$ of partitions of finite sets, the opposite category of the orbit category $\mathscr{Z}$ of $(\mathbb{Z}, +)$ with respect to nontrivial subgroups, and the category $\mathscr{CI}$ of finite cyclic groups and injective homomorphism… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  29. arXiv:2404.12903  [pdf, other

    cs.MM

    ConCLVD: Controllable Chinese Landscape Video Generation via Diffusion Model

    Authors: Dingming Liu, Shaowei Li, Ruoyan Zhou, Lili Liang, Yongguan Hong, Fei Chao, Rongrong Ji

    Abstract: Chinese landscape painting is a gem of Chinese cultural and artistic heritage that showcases the splendor of nature through the deep observations and imaginations of its painters. Limited by traditional techniques, these artworks were confined to static imagery in ancient times, leaving the dynamism of landscapes and the subtleties of artistic sentiment to the viewer's imagination. Recently, emerg… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  30. arXiv:2404.09686  [pdf, other

    cs.LG cs.DC

    AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster

    Authors: Siyuan Li, Youshao Xiao, Fanzhuang Meng, Lin Ju, Lei Liang, Lin Wang, Jun Zhou

    Abstract: Offline batch inference is a common task in the industry for deep learning applications, but it can be challenging to ensure stability and performance when dealing with large amounts of data and complicated inference pipelines. This paper demonstrated AntBatchInfer, an elastic batch inference framework, which is specially optimized for the non-dedicated cluster. AntBatchInfer addresses these chall… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  31. arXiv:2404.09679  [pdf, other

    cs.DC cs.LG

    AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes

    Authors: Youshao Xiao, Lin Ju, Zhenglei Zhou, Siyuan Li, Zhaoxin Huan, Dalong Zhang, Rujie Jiang, Lin Wang, Xiaolu Zhang, Lei Liang, Jun Zhou

    Abstract: Many distributed training techniques like Parameter Server and AllReduce have been proposed to take advantage of the increasingly large data and rich features. However, stragglers frequently occur in distributed training due to resource contention and hardware heterogeneity, which significantly hampers the training efficiency. Previous works only address part of the stragglers and could not adapti… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  32. arXiv:2404.08292  [pdf, other

    cs.CV cs.GR

    AdaContour: Adaptive Contour Descriptor with Hierarchical Representation

    Authors: Tianyu Ding, Jinxin Zhou, Tianyi Chen, Zhihui Zhu, Ilya Zharkov, Luming Liang

    Abstract: Existing angle-based contour descriptors suffer from lossy representation for non-starconvex shapes. By and large, this is the result of the shape being registered with a single global inner center and a set of radii corresponding to a polar coordinate parameterization. In this paper, we propose AdaContour, an adaptive contour descriptor that uses multiple local representations to desirably charac… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  33. arXiv:2404.08111  [pdf, other

    cs.CV cs.AI cs.CL

    S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing

    Authors: Guangzhi Wang, Tianyi Chen, Kamran Ghasedi, HsiangTao Wu, Tianyu Ding, Chris Nuesmeyer, Ilya Zharkov, Mohan Kankanhalli, Luming Liang

    Abstract: Face attribute editing plays a pivotal role in various applications. However, existing methods encounter challenges in achieving high-quality results while preserving identity, editing faithfulness, and temporal consistency. These challenges are rooted in issues related to the training pipeline, including limited supervision, architecture design, and optimization strategy. In this work, we introdu… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  34. arXiv:2404.04007  [pdf, other

    cs.CV

    Neural-Symbolic VideoQA: Learning Compositional Spatio-Temporal Reasoning for Real-world Video Question Answering

    Authors: Lili Liang, Guanglu Sun, Jin Qiu, Lizhong Zhang

    Abstract: Compositional spatio-temporal reasoning poses a significant challenge in the field of video question answering (VideoQA). Existing approaches struggle to establish effective symbolic reasoning structures, which are crucial for answering compositional spatio-temporal questions. To address this challenge, we propose a neural-symbolic framework called Neural-Symbolic VideoQA (NS-VideoQA), specificall… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  35. arXiv:2404.00231  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Attention-based Shape-Deformation Networks for Artifact-Free Geometry Reconstruction of Lumbar Spine from MR Images

    Authors: Linchen Qian, Jiasong Chen, Linhai Ma, Timur Urakov, Weiyong Gu, Liang Liang

    Abstract: Lumbar disc degeneration, a progressive structural wear and tear of lumbar intervertebral disc, is regarded as an essential role on low back pain, a significant global health concern. Automated lumbar spine geometry reconstruction from MR images will enable fast measurement of medical parameters to evaluate the lumbar status, in order to determine a suitable treatment. Existing image segmentation-… ▽ More

    Submitted 30 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

  36. arXiv:2403.19591  [pdf, other

    cs.LG cs.AR cs.NE

    Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers

    Authors: Pingcheng Dong, Yonghao Tan, Dong Zhang, Tianwei Ni, Xuejiao Liu, Yu Liu, Peng Luo, Luhong Liang, Shih-Yang Liu, Xijie Huang, Huaiyu Zhu, Yun Pan, Fengwei An, Kwang-Ting Cheng

    Abstract: Non-linear functions are prevalent in Transformers and their lightweight variants, incurring substantial and frequently underestimated hardware costs. Previous state-of-the-art works optimize these operations by piece-wise linear approximation and store the parameters in look-up tables (LUT), but most of them require unfriendly high-precision arithmetics such as FP/INT 32 and lack consideration of… ▽ More

    Submitted 29 March, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 61st ACM/IEEE Design Automation Conference (DAC) 2024

  37. arXiv:2403.14346  [pdf, other

    cs.CV

    Towards Efficient Information Fusion: Concentric Dual Fusion Attention Based Multiple Instance Learning for Whole Slide Images

    Authors: Yujian Liu, Ruoxuan Wu, Xinjie Shen, Zihuang Lu, Lingyu Liang, Haiyu Zhou, Shipu Xu, Shaoai Cai, Shidang Xu

    Abstract: In the realm of digital pathology, multi-magnification Multiple Instance Learning (multi-mag MIL) has proven effective in leveraging the hierarchical structure of Whole Slide Images (WSIs) to reduce information loss and redundant data. However, current methods fall short in bridging the domain gap between pretrained models and medical imaging, and often fail to account for spatial relationships ac… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 14 pages, 7 figures

  38. arXiv:2403.12649  [pdf, other

    cs.IR cs.AI

    InBox: Recommendation with Knowledge Graph using Interest Box Embedding

    Authors: Zezhong Xu, Yincen Qu, Wen Zhang, Lei Liang, Huajun Chen

    Abstract: Knowledge graphs (KGs) have become vitally important in modern recommender systems, effectively improving performance and interpretability. Fundamentally, recommender systems aim to identify user interests based on historical interactions and recommend suitable items. However, existing works overlook two key challenges: (1) an interest corresponds to a potentially large set of related items, and (… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: VLDB 2024 under submission

  39. arXiv:2403.12646  [pdf, other

    cs.LG

    Prompt-fused framework for Inductive Logical Query Answering

    Authors: Zezhong Xu, Peng Ye, Lei Liang, Huajun Chen, Wen Zhang

    Abstract: Answering logical queries on knowledge graphs (KG) poses a significant challenge for machine reasoning. The primary obstacle in this task stems from the inherent incompleteness of KGs. Existing research has predominantly focused on addressing the issue of missing edges in KGs, thereby neglecting another aspect of incompleteness: the emergence of new entities. Furthermore, most of the existing meth… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by COLING 2024

  40. arXiv:2403.07284  [pdf, other

    cs.CV

    SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection

    Authors: Hongcheng Zhang, Liu Liang, Pengxin Zeng, Xiao Song, Zhe Wang

    Abstract: Sparse 3D detectors have received significant attention since the query-based paradigm embraces low latency without explicit dense BEV feature construction. However, these detectors achieve worse performance than their dense counterparts. In this paper, we find the key to bridging the performance gap is to enhance the awareness of rich representations in two modalities. Here, we present a high-per… ▽ More

    Submitted 10 July, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: The 18th European Conference on Computer Vision ECCV 2024

  41. arXiv:2403.06259  [pdf, other

    cs.CL cs.AI cs.DB cs.IR cs.LG

    Editing Conceptual Knowledge for Large Language Models

    Authors: Xiaohan Wang, Shengyu Mao, Ningyu Zhang, Shumin Deng, Yunzhi Yao, Yue Shen, Lei Liang, Jinjie Gu, Huajun Chen

    Abstract: Recently, there has been a growing interest in knowledge editing for Large Language Models (LLMs). Current approaches and evaluations merely explore the instance-level editing, while whether LLMs possess the capability to modify concepts remains unclear. This paper pioneers the investigation of editing conceptual knowledge for LLMs, by constructing a novel benchmark dataset ConceptEdit and establi… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Work in progress. Code: https://github.com/zjunlp/EasyEdit Dataset: https://huggingface.co/datasets/zjunlp/ConceptEdit

  42. arXiv:2403.03101  [pdf, other

    cs.CL cs.AI cs.HC cs.LG cs.MA

    KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents

    Authors: Yuqi Zhu, Shuofei Qiao, Yixin Ou, Shumin Deng, Ningyu Zhang, Shiwei Lyu, Yue Shen, Lei Liang, Jinjie Gu, Huajun Chen

    Abstract: Large Language Models (LLMs) have demonstrated great potential in complex reasoning tasks, yet they fall short when tackling more sophisticated challenges, especially when interacting with environments through generating executable actions. This inadequacy primarily stems from the lack of built-in action knowledge in language agents, which fails to effectively guide the planning trajectories durin… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Work in progress. Project page: https://zjunlp.github.io/project/KnowAgent/ Code: https://github.com/zjunlp/KnowAgent

  43. arXiv:2403.02449  [pdf, other

    cs.CV

    Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging

    Authors: Mahmoud Afifi, Zhenhua Hu, Liang Liang

    Abstract: High dynamic range (HDR) imaging involves capturing a series of frames of the same scene, each with different exposure settings, to broaden the dynamic range of light. This can be achieved through burst capturing or using staggered HDR sensors that capture long and short exposures simultaneously in the camera image signal processor (ISP). Within camera ISP pipeline, illuminant estimation is a cruc… ▽ More

    Submitted 6 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  44. arXiv:2402.16280  [pdf, other

    cs.CV

    Few-Shot Learning for Annotation-Efficient Nucleus Instance Segmentation

    Authors: Yu Ming, Zihao Wu, Jie Yang, Danyi Li, Yuan Gao, Changxin Gao, Gui-Song Xia, Yuanqing Li, Li Liang, Jin-Gang Yu

    Abstract: Nucleus instance segmentation from histopathology images suffers from the extremely laborious and expert-dependent annotation of nucleus instances. As a promising solution to this task, annotation-efficient deep learning paradigms have recently attracted much research interest, such as weakly-/semi-supervised learning, generative adversarial learning, etc. In this paper, we propose to formulate an… ▽ More

    Submitted 27 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

  45. arXiv:2402.15444  [pdf, other

    cs.AI cs.CL cs.LG cs.MM

    Unleashing the Power of Imbalanced Modality Information for Multi-modal Knowledge Graph Completion

    Authors: Yichi Zhang, Zhuo Chen, Lei Liang, Huajun Chen, Wen Zhang

    Abstract: Multi-modal knowledge graph completion (MMKGC) aims to predict the missing triples in the multi-modal knowledge graphs by incorporating structural, visual, and textual information of entities into the discriminant models. The information from different modalities will work together to measure the triple plausibility. Existing MMKGC methods overlook the imbalance problem of modality information amo… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted by LREC-COLING 2024

  46. arXiv:2402.14710  [pdf, other

    cs.CL cs.AI cs.DB cs.IR cs.LG

    IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus

    Authors: Honghao Gui, Lin Yuan, Hongbin Ye, Ningyu Zhang, Mengshu Sun, Lei Liang, Huajun Chen

    Abstract: Large Language Models (LLMs) demonstrate remarkable potential across various domains; however, they exhibit a significant performance gap in Information Extraction (IE). Note that high-quality instruction data is the vital key for enhancing the specific capabilities of LLMs, while current IE datasets tend to be small in scale, fragmented, and lack standardized schema. To this end, we introduce IEP… ▽ More

    Submitted 26 May, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: ACL 2024 (short); 21 pages; Github: https://github.com/zjunlp/IEPile

  47. arXiv:2402.06033  [pdf, ps, other

    math.OC cs.LG

    An Inexact Halpern Iteration with Application to Distributionally Robust Optimization

    Authors: Ling Liang, Kim-Chuan Toh, Jia-Jie Zhu

    Abstract: The Halpern iteration for solving monotone inclusion problems has gained increasing interests in recent years due to its simple form and appealing convergence properties. In this paper, we investigate the inexact variants of the scheme in both deterministic and stochastic settings. We conduct extensive convergence analysis and show that by choosing the inexactness tolerances appropriately, the ine… ▽ More

    Submitted 12 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Correct a typo in the title and update authors' information

  48. arXiv:2402.03190  [pdf, other

    cs.CL cs.AI cs.IR cs.LG cs.MM

    Unified Hallucination Detection for Multimodal Large Language Models

    Authors: Xiang Chen, Chenxi Wang, Yida Xue, Ningyu Zhang, Xiaoyan Yang, Qiang Li, Yue Shen, Lei Liang, Jinjie Gu, Huajun Chen

    Abstract: Despite significant strides in multimodal tasks, Multimodal Large Language Models (MLLMs) are plagued by the critical issue of hallucination. The reliable detection of such hallucinations in MLLMs has, therefore, become a vital aspect of model evaluation and the safeguarding of practical application deployment. Prior research in this domain has been constrained by a narrow focus on singular tasks,… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 (main conference)

  49. arXiv:2402.00332  [pdf, other

    cs.LG stat.ML

    Comparing Spectral Bias and Robustness For Two-Layer Neural Networks: SGD vs Adaptive Random Fourier Features

    Authors: Aku Kammonen, Lisi Liang, Anamika Pandey, Raúl Tempone

    Abstract: We present experimental results highlighting two key differences resulting from the choice of training algorithm for two-layer neural networks. The spectral bias of neural networks is well known, while the spectral bias dependence on the choice of training algorithm is less studied. Our experiments demonstrate that an adaptive random Fourier features algorithm (ARFF) can yield a spectral bias clos… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: 6 Pages, 4 Figures; Accepted in the International Conference on Scientific Computing and Machine Learning

  50. arXiv:2401.17574  [pdf, other

    cs.CL cs.LG

    Scavenging Hyena: Distilling Transformers into Long Convolution Models

    Authors: Tokiniaina Raharison Ralambomihanta, Shahrad Mohammadzadeh, Mohammad Sami Nur Islam, Wassim Jabbour, Laurence Liang

    Abstract: The rapid evolution of Large Language Models (LLMs), epitomized by architectures like GPT-4, has reshaped the landscape of natural language processing. This paper introduces a pioneering approach to address the efficiency concerns associated with LLM pre-training, proposing the use of knowledge distillation for cross-architecture transfer. Leveraging insights from the efficient Hyena mechanism, ou… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 figures