Showing 1–50 of 148 results for author: Cheng, Q

  1. arXiv:2407.13757  [pdf, other]

    cs.CL cs.AI cs.CR

    Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models

    Authors: Zhuo Chen, Jiawei Liu, Haotan Liu, Qikai Cheng, Fan Zhang, Wei Lu, Xiaozhong Liu

    Abstract: Retrieval-Augmented Generation (RAG) is applied to solve hallucination problems and real-time constraints of large language models, but it also induces vulnerabilities against retrieval corruption attacks. Existing research mainly explores the unreliability of RAG in white-box and closed-domain QA tasks. In this paper, we aim to reveal the vulnerabilities of Retrieval-Enhanced Generative (RAG) mod…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 10 pages, 3 figures, under review

  2. arXiv:2407.12504  [pdf, other]

    cs.CL

    Case2Code: Learning Inductive Reasoning with Synthetic Data

    Authors: Yunfan Shao, Linyang Li, Yichuan Ma, Peiji Li, Demin Song, Qinyuan Cheng, Shimin Li, Xiaonan Li, Pengyu Wang, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin

    Abstract: Complex reasoning is an impressive ability shown by large language models (LLMs). Most LLMs are skilled in deductive reasoning, such as chain-of-thought prompting or iterative tool-using to solve challenging tasks step-by-step. In this paper, we hope to focus on evaluating and teaching LLMs to conduct inductive reasoning, that is, LLMs are supposed to infer underlying rules by observing examples o…

    Submitted 17 July, 2024; originally announced July 2024.

  3. arXiv:2407.12105  [pdf, other]

    cs.RO cs.HC

    AeroHaptix: A Wearable Vibrotactile Feedback System for Enhancing Collision Avoidance in UAV Teleoperation

    Authors: Bingjian Huang, Zhecheng Wang, Qilong Cheng, Siyi Ren, Hanfeng Cai, Antonio Alvarez Valdivia, Karthik Mahadevan, Daniel Wigdor

    Abstract: Haptic feedback enhances collision avoidance by providing directional obstacle information to operators in unmanned aerial vehicle (UAV) teleoperation. However, such feedback is often rendered via haptic joysticks, which are unfamiliar to UAV operators and limited to single-directional force feedback. Additionally, the direct coupling of the input device and the feedback method diminishes the oper…

    Submitted 16 July, 2024; originally announced July 2024.

  4. arXiv:2407.00600  [pdf, other]

    cs.CV cs.AI

    GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing

    Authors: Yisong Xiao, Aishan Liu, QianJia Cheng, Zhenfei Yin, Siyuan Liang, Jiapeng Li, Jing Shao, Xianglong Liu, Dacheng Tao

    Abstract: Large Vision-Language Models (LVLMs) have been widely adopted in various applications; however, they exhibit significant gender biases. Existing benchmarks primarily evaluate gender bias at the demographic group level, neglecting individual fairness, which emphasizes equal treatment of similar individuals. This research gap limits the detection of discriminatory behaviors, as individual fairness o…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 9 pages, 4 figures

  5. arXiv:2406.15720  [pdf, other]

    cs.CL

    Scaling Laws for Fact Memorization of Large Language Models

    Authors: Xingyu Lu, Xiaonan Li, Qinyuan Cheng, Kai Ding, Xuanjing Huang, Xipeng Qiu

    Abstract: Fact knowledge memorization is crucial for Large Language Models (LLMs) to generate factual and reliable responses. However, the behaviors of LLM fact memorization remain under-explored. In this paper, we analyze the scaling laws for LLMs' fact knowledge and LLMs' behaviors of memorizing different types of facts. We find that LLMs' fact knowledge capacity has a linear and negative exponential law r…

    Submitted 21 June, 2024; originally announced June 2024.

  6. arXiv:2406.15279  [pdf, other]

    cs.AI cs.CL

    Cross-Modality Safety Alignment

    Authors: Siyin Wang, Xingsong Ye, Qinyuan Cheng, Junwen Duan, Shimin Li, Jinlan Fu, Xipeng Qiu, Xuanjing Huang

    Abstract: As Artificial General Intelligence (AGI) becomes increasingly integrated into various facets of human life, ensuring the safety and ethical alignment of such systems is paramount. Previous studies primarily focus on single-modality threats, which may not suffice given the integrated and complex nature of cross-modality interactions. We introduce a novel safety alignment challenge called Safe Input…

    Submitted 21 June, 2024; originally announced June 2024.

  7. arXiv:2406.13990  [pdf, other]

    cs.CL

    Inference-Time Decontamination: Reusing Leaked Benchmarks for Large Language Model Evaluation

    Authors: Qin Zhu, Qingyuan Cheng, Runyu Peng, Xiaonan Li, Tengxiao Liu, Ru Peng, Xipeng Qiu, Xuanjing Huang

    Abstract: The training process of large language models (LLMs) often involves varying degrees of test data contamination. Although current LLMs are achieving increasingly better performance on various benchmarks, their performance in practical applications does not always match their benchmark results. Leakage of benchmarks can prevent the accurate assessment of LLMs' true performance. However, constructing…

    Submitted 23 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  8. arXiv:2406.13007  [pdf, other]

    cs.CV

    NTIRE 2024 Challenge on Night Photography Rendering

    Authors: Egor Ershov, Artyom Panshin, Oleg Karasev, Sergey Korchagin, Shepelev Lev, Alexandr Startsev, Daniil Vladimirov, Ekaterina Zaychenkova, Nikola Banić, Dmitrii Iarchuk, Maria Efimova, Radu Timofte, Arseniy Terekhin, Shuwei Yue, Yuyang Liu, Minchen Wei, Lu Xu, Chao Zhang, Yasi Wang, Furkan Kınlı, Doğa Yılmaz, Barış Özcan, Furkan Kıraç, Shuai Liu, Jingyuan Xiao, et al. (25 additional authors not shown)

    Abstract: This paper presents a review of the NTIRE 2024 challenge on night photography rendering. The goal of the challenge was to find solutions that process raw camera images taken in nighttime conditions, and thereby produce photo-quality output images in the standard RGB (sRGB) space. Unlike the previous year's competition, the challenge images were collected with a mobile phone and the speed of algo…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 10 figures

  9. arXiv:2406.12847  [pdf, other]

    cs.CV

    ChangeViT: Unleashing Plain Vision Transformers for Change Detection

    Authors: Duowang Zhu, Xiaohu Huang, Haiyan Huang, Zhenfeng Shao, Qimin Cheng

    Abstract: Change detection in remote sensing images is essential for tracking environmental changes on the Earth's surface. Despite the success of vision transformers (ViTs) as backbones in numerous computer vision applications, they remain underutilized in change detection, where convolutional neural networks (CNNs) continue to dominate due to their powerful feature extraction capabilities. In this paper,…

    Submitted 18 June, 2024; originally announced June 2024.

  10. arXiv:2406.12534  [pdf, other]

    cs.CL

    Unified Active Retrieval for Retrieval Augmented Generation

    Authors: Qinyuan Cheng, Xiaonan Li, Shimin Li, Qin Zhu, Zhangyue Yin, Yunfan Shao, Linyang Li, Tianxiang Sun, Hang Yan, Xipeng Qiu

    Abstract: In Retrieval-Augmented Generation (RAG), retrieval is not always helpful and applying it to every instruction is sub-optimal. Therefore, determining whether to retrieve is crucial for RAG, which is usually referred to as Active Retrieval. However, existing active retrieval methods face two challenges: 1. They usually rely on a single criterion, which struggles with handling various types of instru…

    Submitted 27 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  11. arXiv:2406.04419  [pdf, other]

    cs.LG

    TSCMamba: Mamba Meets Multi-View Learning for Time Series Classification

    Authors: Md Atik Ahamed, Qiang Cheng

    Abstract: Time series classification (TSC) on multivariate time series is a critical problem. We propose a novel multi-view approach integrating frequency-domain and time-domain features to provide complementary contexts for TSC. Our method fuses continuous wavelet transform spectral features with temporal convolutional or multilayer perceptron features. We leverage the Mamba state space model for efficient…

    Submitted 6 June, 2024; originally announced June 2024.

  12. arXiv:2406.04145  [pdf, other]

    cs.CL cs.AI

    Every Answer Matters: Evaluating Commonsense with Probabilistic Measures

    Authors: Qi Cheng, Michael Boratko, Pranay Kumar Yelugam, Tim O'Gorman, Nalini Singh, Andrew McCallum, Xiang Lorraine Li

    Abstract: Large language models have demonstrated impressive performance on commonsense tasks; however, these tasks are often posed as multiple-choice questions, allowing models to exploit systematic biases. Commonsense is also inherently probabilistic with multiple correct answers. The purpose of "boiling water" could be making tea and cooking, but it also could be killing germs. Existing tasks do not capt…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Camera Ready

  13. arXiv:2405.18458  [pdf]

    cs.LG physics.optics

    Asymmetrical estimator for training grey-box deep photonic neural networks

    Authors: Yizhi Wang, Minjia Chen, Chunhui Yao, Jie Ma, Ting Yan, Richard Penty, Qixiang Cheng

    Abstract: Physical neural networks (PNNs) are emerging paradigms for neural network acceleration due to their high-bandwidth, in-propagation analogue processing. Despite the advantages of PNN for inference, training remains a challenge. The imperfect information of the physical transformation means the failure of conventional gradient-based updates from backpropagation (BP). Here, we present the asymmetrica…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 17 pages, 5 figures

    MSC Class: 78-05

  14. arXiv:2405.13336  [pdf, other]

    cs.HC

    SIGGesture: Generalized Co-Speech Gesture Synthesis via Semantic Injection with Large-Scale Pre-Training Diffusion Models

    Authors: Qingrong Cheng, Xu Li, Xinghui Fu

    Abstract: The automated synthesis of high-quality 3D gestures from speech is of significant value in virtual humans and gaming. Previous methods focus on synthesizing gestures that are synchronized with speech rhythm, yet they frequently overlook the inclusion of semantic gestures. These are sparse and follow a long-tailed distribution across the gesture sequence, making them difficult to learn in an end-to…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 18 pages, 5 figures

    ACM Class: I.2.6

  15. arXiv:2405.12939  [pdf, other]

    cs.CL

    Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

    Authors: Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Xiaonan Li, Tianxiang Sun, Cheng Chang, Qinyuan Cheng, Ding Wang, Xiaofeng Mou, Xipeng Qiu, XuanJing Huang

    Abstract: Recent advancements in Chain-of-Thought prompting have facilitated significant breakthroughs for Large Language Models (LLMs) in complex reasoning tasks. Current research enhances the reasoning performance of LLMs by sampling multiple reasoning chains and ensembling based on the answer frequency. However, this approach fails in scenarios where the correct answers are in the minority. We identify t…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 17 pages, 14 figures, accepted by LREC-COLING 2024

  16. arXiv:2404.19534  [pdf, other]

    cs.CV

    MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Dafeng Zhang, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Peiqing Yang, Zhezhu Jin, Guanqun Liu, Chen Change Loy, Lize Zhang, Shuai Liu, Chaoyu Feng, Luyang Wang, Shuan Chen, Guangqi Shao, Xiaotao Wang, Lei Lei, Qirui Yang, Qihua Cheng, Zhiqiang Xu, Yihao Liu, Huanjing Yue, Jingyu Yang, et al. (38 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…

    Submitted 27 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  17. arXiv:2404.07108  [pdf, other]

    cs.CL cs.IR

    From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications

    Authors: Yongqiang Ma, Lizhi Qing, Jiawei Liu, Yangyang Kang, Yue Zhang, Wei Lu, Xiaozhong Liu, Qikai Cheng

    Abstract: Evaluating large language models (LLMs) is fundamental, particularly in the context of practical applications. Conventional evaluation methods, typically designed primarily for LLM development, yield numerical scores that ignore the user experience. Therefore, our study shifts the focus from model-centered to human-centered evaluation in the context of AI-powered writing assistance applications. O…

    Submitted 10 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 9 pages, 2 figures, under review

  18. arXiv:2404.01624  [pdf]

    cs.CE q-fin.CP

    Intelligent Optimization of Mine Environmental Damage Assessment and Repair Strategies Based on Deep Learning

    Authors: Qishuo Cheng

    Abstract: In recent decades, financial quantification has emerged and matured rapidly. For financial institutions such as funds, investment institutions are increasingly dissatisfied with the situation of passively constructing investment portfolios with average market returns, and are paying more and more attention to active quantitative strategy investment portfolios. This requires the introduction of act…

    Submitted 2 April, 2024; originally announced April 2024.

  19. arXiv:2403.09898  [pdf, other]

    cs.LG

    TimeMachine: A Time Series is Worth 4 Mambas for Long-term Forecasting

    Authors: Md Atik Ahamed, Qiang Cheng

    Abstract: Long-term time-series forecasting remains challenging due to the difficulty in capturing long-term dependencies, achieving linear scalability, and maintaining computational efficiency. We introduce TimeMachine, an innovative model that leverages Mamba, a state-space model, to capture long-term dependencies in multivariate time series data while maintaining linear scalability and small memory footp…

    Submitted 14 March, 2024; originally announced March 2024.

  20. arXiv:2403.02866  [pdf]

    physics.optics cs.ET

    Unlocking Electro-optic Resonant Phase Shifting for Multi-dimensional, Ultra-dynamic Photonic Switches

    Authors: Lingzhi Luo, Rui Ma, Richard V. Penty, Qixiang Cheng

    Abstract: Optical circuit switching is connection-oriented, being deterministic through the reservation of a complete wavelength channel or spatial path for a certain period. However, this comes at a trade-off against link dynamics, and overall capacity can thus be constrained by the time slot reservations, especially for switches with microsecond- to millisecond-scale reconfiguration times. For data-intens…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 10 pages

  21. arXiv:2403.02757  [pdf, other]

    cs.CL

    In-Memory Learning: A Declarative Learning Framework for Large Language Models

    Authors: Bo Wang, Tianxiang Sun, Hang Yan, Siyin Wang, Qingyuan Cheng, Xipeng Qiu

    Abstract: The exploration of whether agents can align with their environment without relying on human-labeled data presents an intriguing research topic. Drawing inspiration from the alignment process observed in intelligent organisms, where declarative memory plays a pivotal role in summarizing past experiences, we propose a novel learning framework. The agents adeptly distill insights from past experience…

    Submitted 5 March, 2024; originally announced March 2024.

  22. arXiv:2403.02232  [pdf]

    cs.CR cs.AI cs.LG

    Comprehensive evaluation of Mal-API-2019 dataset by machine learning in malware detection

    Authors: Zhenglin Li, Haibei Zhu, Houze Liu, Jintong Song, Qishuo Cheng

    Abstract: This study conducts a thorough examination of malware detection using machine learning techniques, focusing on the evaluation of various classification models using the Mal-API-2019 dataset. The aim is to advance cybersecurity capabilities by identifying and mitigating threats more effectively. Both ensemble and non-ensemble machine learning methods, such as Random Forest, XGBoost, K Nearest Neigh…

    Submitted 25 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Journal ref: International Journal of Computer Science and Information Technology, 2024, 2(1), 1-9

  23. arXiv:2403.01209  [pdf, other]

    cs.CV

    Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning

    Authors: Shuo Yang, Zirui Shang, Yongqi Wang, Derong Deng, Hongwei Chen, Qiyuan Cheng, Xinxiao Wu

    Abstract: This paper proposes a novel framework for multi-label image recognition without any training data, called the data-free framework, which uses the knowledge of a pre-trained Large Language Model (LLM) to learn prompts that adapt a pre-trained Vision-Language Model (VLM) like CLIP to multi-label classification. By asking the LLM well-designed questions, we acquire comprehensive knowledge about characteristics a…

    Submitted 2 March, 2024; originally announced March 2024.

  24. arXiv:2402.17194  [pdf]

    q-fin.TR cs.CE q-fin.PM

    The Random Forest Model for Analyzing and Forecasting the US Stock Market in the Context of Smart Finance

    Authors: Jiajian Zheng, Duan Xin, Qishuo Cheng, Miao Tian, Le Yang

    Abstract: The stock market is a crucial component of the financial market, playing a vital role in wealth accumulation for investors, financing costs for listed companies, and the stable development of the national macroeconomy. Significant fluctuations in the stock market can damage the interests of stock investors and cause an imbalance in the industrial structure, which can interfere with the macro level…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 10 pages, 8 figures

  25. arXiv:2402.17191  [pdf]

    cs.CR cs.AI cs.LG

    AI-Driven Anonymization: Protecting Personal Data Privacy While Leveraging Machine Learning

    Authors: Le Yang, Miao Tian, Duan Xin, Qishuo Cheng, Jiajian Zheng

    Abstract: The development of artificial intelligence has significantly transformed people's lives. However, it has also posed a significant threat to privacy and security, with numerous instances of personal information being exposed online and reports of criminal attacks and theft. Consequently, the need to achieve intelligent protection of personal information through machine learning algorithms has becom…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 9 pages, 6 figures

  26. arXiv:2402.15994  [pdf]

    q-fin.CP cs.CE cs.LG

    Optimizing Portfolio Management and Risk Assessment in Digital Assets Using Deep Learning for Predictive Analysis

    Authors: Qishuo Cheng, Le Yang, Jiajian Zheng, Miao Tian, Duan Xin

    Abstract: Portfolio management issues have been extensively studied in the field of artificial intelligence in recent years, but existing deep learning-based quantitative trading methods have some areas where they could be improved. First of all, the prediction mode of stocks is singular; often, only one trading expert is trained by a model, and the trading decision is solely based on the prediction results…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures

  27. arXiv:2402.13008  [pdf, other]

    cs.DS cs.DC

    Efficient Enumeration of Large Maximal k-Plexes

    Authors: Qihao Cheng, Da Yan, Tianhao Wu, Lyuheng Yuan, Ji Cheng, Zhongyi Huang, Yang Zhou

    Abstract: Finding cohesive subgraphs in a large graph has many important applications, such as community detection and biological network analysis. A clique is often too strict a cohesive structure, since communities or biological modules rarely form cliques, for reasons such as data noise. Therefore, the $k$-plex is introduced as a popular clique relaxation, which is a graph where every vertex is adjace…

    Submitted 10 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted by EDBT2025. Camera-ready version

  28. arXiv:2402.12201  [pdf, other]

    cs.LG

    Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT

    Authors: Zhengfu He, Xuyang Ge, Qiong Tang, Tianxiang Sun, Qinyuan Cheng, Xipeng Qiu

    Abstract: Sparse dictionary learning has been a rapidly growing technique in mechanistic interpretability to attack superposition and extract more human-understandable features from model activations. We ask a further question based on the extracted more monosemantic features: How do we recognize circuits connecting the enormous amount of dictionary features? We propose a circuit discovery framework alterna…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 24 pages, 13 figures. Not final version. Better dictionary training in progress

  29. arXiv:2402.11251  [pdf, other]

    cs.CL

    LLM can Achieve Self-Regulation via Hyperparameter Aware Generation

    Authors: Siyin Wang, Shimin Li, Tianxiang Sun, Jinlan Fu, Qinyuan Cheng, Jiasheng Ye, Junjie Ye, Xipeng Qiu, Xuanjing Huang

    Abstract: In the realm of Large Language Models (LLMs), users commonly employ diverse decoding strategies and adjust hyperparameters to control the generated text. However, a critical question emerges: Are LLMs conscious of the existence of these decoding strategies and capable of regulating themselves? The current decoding generation process often relies on empirical and heuristic manual adjustments to hyp…

    Submitted 17 February, 2024; originally announced February 2024.

  30. arXiv:2402.10738  [pdf, other]

    cs.CL

    Let's Learn Step by Step: Enhancing In-Context Learning Ability with Curriculum Learning

    Authors: Yinpeng Liu, Jiawei Liu, Xiang Shi, Qikai Cheng, Yong Huang, Wei Lu

    Abstract: Demonstration ordering, which is an important strategy for in-context learning (ICL), can significantly affect the performance of large language models (LLMs). However, most current ordering approaches incur high computational costs to introduce prior knowledge. In this paper, inspired by the human learning process, we propose a simple but effective demonstration ordering method…

    Submitted 16 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  31. arXiv:2402.07234  [pdf, other]

    cs.AI

    CPSDBench: A Large Language Model Evaluation Benchmark and Baseline for Chinese Public Security Domain

    Authors: Xin Tong, Bo Jin, Zhi Lin, Binjun Wang, Ting Yu, Qiang Cheng

    Abstract: Large Language Models (LLMs) have demonstrated significant potential and effectiveness across multiple application domains. To assess the performance of mainstream LLMs in public security tasks, this study aims to construct a specialized evaluation benchmark tailored to the Chinese public security domain--CPSDbench. CPSDbench integrates datasets related to public security collected from real-world…

    Submitted 21 March, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  32. arXiv:2402.01144  [pdf, ps, other]

    cs.IT cs.CR

    A Construction of Evolving $k$-threshold Secret Sharing Scheme over A Polynomial Ring

    Authors: Qi Cheng, Hongru Cao, Sian-Jheng Lin, Nenghai Yu

    Abstract: The threshold secret sharing scheme allows the dealer to distribute a share to every participant such that the secret is correctly recovered from a certain number of shares. The traditional $(k, n)$-threshold secret sharing scheme requires that the number of participants $n$ is known in advance. In contrast, the evolving secret sharing scheme allows $n$ to be uncertain and even ever-growin…

    Submitted 2 February, 2024; originally announced February 2024.

  33. arXiv:2401.13275  [pdf, other]

    cs.CL cs.AI

    Can AI Assistants Know What They Don't Know?

    Authors: Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, Xipeng Qiu

    Abstract: Recently, AI assistants based on large language models (LLMs) show surprising performance in many tasks, such as dialogue, solving math problems, writing code, and using tools. Although LLMs possess intensive world knowledge, they still make factual errors when facing some knowledge intensive tasks, like open-domain question answering. These untruthful responses from the AI assistant may cause sig…

    Submitted 28 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Work in progress

  34. arXiv:2401.08867  [pdf, other]

    cs.LG

    MambaTab: A Plug-and-Play Model for Learning Tabular Data

    Authors: Md Atik Ahamed, Qiang Cheng

    Abstract: Despite the prevalence of images and texts in machine learning, tabular data remains widely used across various domains. Existing deep learning models, such as convolutional neural networks and transformers, perform well but demand extensive preprocessing and tuning, limiting accessibility and scalability. This work introduces an innovative approach based on a structured state-space model (SSM)…

    Submitted 24 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE 7th International Conference on Multimedia Information Processing and Retrieval (MIPR), 2024

  35. arXiv:2401.04620  [pdf, other]

    cs.CL cs.AI

    Agent Alignment in Evolving Social Norms

    Authors: Shimin Li, Tianxiang Sun, Qinyuan Cheng, Xipeng Qiu

    Abstract: Agents based on Large Language Models (LLMs) are increasingly permeating various domains of human production and life, highlighting the importance of aligning them with human values. The current alignment of AI systems primarily focuses on passively aligning LLMs through human intervention. However, agents possess characteristics like receiving environmental feedback and self-evolution, rendering…

    Submitted 19 February, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: Work in progress

  36. arXiv:2312.10074  [pdf]

    cs.HC

    STAGER checklist: Standardized Testing and Assessment Guidelines for Evaluating Generative AI Reliability

    Authors: Jinghong Chen, Lingxuan Zhu, Weiming Mou, Zaoqu Liu, Quan Cheng, Anqi Lin, Jian Zhang, Peng Luo

    Abstract: Generative Artificial Intelligence (AI) holds immense potential in medical applications. Numerous studies have explored the efficacy of various generative AI models within healthcare contexts, but there is a lack of a comprehensive and systematic evaluation framework. Given that some studies evaluating the ability of generative AI for medical applications have deficiencies in their methodological…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 11 pages, 0 figures, 2 tables

  37. arXiv:2312.06723  [pdf, other]

    cs.CV

    Learning to See Low-Light Images via Feature Domain Adaptation

    Authors: Qirui Yang, Qihua Cheng, Huanjing Yue, Le Zhang, Yihao Liu, Jingyu Yang

    Abstract: Raw low light image enhancement (LLIE) has achieved much better performance than the sRGB domain enhancement methods due to the merits of raw data. However, the ambiguity between noisy to clean and raw to sRGB mappings may mislead the single-stage enhancement networks. The two-stage networks avoid ambiguity by decoupling the two mappings but usually have large computing complexity. To solve this p…

    Submitted 19 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

  38. arXiv:2311.18254  [pdf, other]

    cs.CV cs.AI

    Sketch Input Method Editor: A Comprehensive Dataset and Methodology for Systematic Input Recognition

    Authors: Guangming Zhu, Siyuan Wang, Qing Cheng, Kelong Wu, Hao Li, Liang Zhang

    Abstract: With the recent surge in the use of touchscreen devices, free-hand sketching has emerged as a promising modality for human-computer interaction. While previous research has focused on tasks such as recognition, retrieval, and generation of familiar everyday objects, this study aims to create a Sketch Input Method Editor (SketchIME) specifically designed for a professional C4I system. Within this s…

    Submitted 31 March, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: The paper has been accepted by ACM Multimedia 2023

  39. arXiv:2311.16603  [pdf, other]

    cs.DS cs.IR

    l2Match: Optimization Techniques on Subgraph Matching Algorithm using Label Pair, Neighboring Label Index, and Jump-Redo method

    Authors: C. Q. Cheng, K. S. Wong, L. K. Soon

    Abstract: A graph database is designed to store bidirectional relationships between objects and facilitate the traversal process to extract a subgraph. However, the subgraph matching process is an NP-Complete problem. Existing solutions to this problem usually employ a filter-and-verification framework and a divide-and-conquer method. The filter-and-verification framework minimizes the number of inputs to the…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: The short version of this article (6 pages) has been accepted by ICEIC 2024

    MSC Class: 05C60 (Primary); 05C30 (Secondary); 68R10

    ACM Class: G.4.1; H.3.3

  40. arXiv:2311.15643  [pdf, other]

    cs.RO

    A Survey on Monocular Re-Localization: From the Perspective of Scene Map Representation

    Authors: Jinyu Miao, Kun Jiang, Tuopu Wen, Yunlong Wang, Peijing Jia, Xuhe Zhao, Qian Cheng, Zhongyang Xiao, Jin Huang, Zhihua Zhong, Diange Yang

    Abstract: Monocular Re-Localization (MRL) is a critical component in autonomous applications, estimating 6 degree-of-freedom ego poses w.r.t. the scene map based on monocular images. In recent decades, significant progress has been made in the development of MRL techniques. Numerous algorithms have accomplished extraordinary success in terms of localization accuracy and robustness. In MRL, scene maps are re…

    Submitted 12 January, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 33 pages, 10 tables, 16 figures, under review

  41. arXiv:2311.14379  [pdf]

    cs.RO

    Robot Learning in the Era of Foundation Models: A Survey

    Authors: Xuan Xiao, Jiahang Liu, Zhipeng Wang, Yanmin Zhou, Yong Qi, Qian Cheng, Bin He, Shuo Jiang

    Abstract: The proliferation of Large Language Models (LLMs) has fueled a shift in robot learning from automation towards general embodied Artificial Intelligence (AI). Adopting foundation models together with traditional learning methods for robot learning has increasingly gained interest in the research community and shown potential for real-life application. However, there are few literatures comprehens…

    Submitted 24 November, 2023; originally announced November 2023.

  42. arXiv:2310.12443  [pdf, other]

    cs.IR cs.AI cs.CL

    Know Where to Go: Make LLM a Relevant, Responsible, and Trustworthy Searcher

    Authors: Xiang Shi, Jiawei Liu, Yinpeng Liu, Qikai Cheng, Wei Lu

    Abstract: The advent of Large Language Models (LLMs) has shown the potential to improve relevance and provide direct answers in web searches. However, challenges arise in validating the reliability of generated results and the credibility of contributing sources, due to the limitations of traditional information retrieval algorithms and the LLM hallucination problem. Aiming to create a "PageRank" for the LL…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 14 pages, 4 figures, under peer review

  43. arXiv:2310.11702  [pdf, other]

    cs.CV

    DPF-Nutrition: Food Nutrition Estimation via Depth Prediction and Fusion

    Authors: Yuzhe Han, Qimin Cheng, Wenjin Wu, Ziyang Huang

    Abstract: A reasonable and balanced diet is essential for maintaining good health. With the advancements in deep learning, automated nutrition estimation methods based on food images offer a promising solution for monitoring daily nutritional intake and promoting dietary health. While monocular image-based nutrition estimation is convenient, efficient, and economical, the challenge of limited accuracy remai…

    Submitted 18 October, 2023; originally announced October 2023.

  44. arXiv:2310.04787  [pdf, other]

    cs.RO cs.CV

    HI-SLAM: Monocular Real-time Dense Mapping with Hybrid Implicit Fields

    Authors: Wei Zhang, Tiecheng Sun, Sen Wang, Qing Cheng, Norbert Haala

    Abstract: In this letter, we present a neural field-based real-time monocular mapping framework for accurate and dense Simultaneous Localization and Mapping (SLAM). Recent neural mapping frameworks show promising results, but rely on RGB-D or pose inputs, or cannot run in real-time. To address these limitations, our approach integrates dense-SLAM with neural implicit fields. Specifically, our dense SLAM app…

    Submitted 15 December, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: Accepted by IEEE Robotics and Automation Letters

  45. arXiv:2310.03368  [pdf, other]

    cs.CL

    Evaluating Hallucinations in Chinese Large Language Models

    Authors: Qinyuan Cheng, Tianxiang Sun, Wenwei Zhang, Siyin Wang, Xiangyang Liu, Mozhi Zhang, Junliang He, Mianqiu Huang, Zhangyue Yin, Kai Chen, Xipeng Qiu

    Abstract: In this paper, we establish a benchmark named HalluQA (Chinese Hallucination Question-Answering) to measure the hallucination phenomenon in Chinese large language models. HalluQA contains 450 meticulously designed adversarial questions, spanning multiple domains, and takes into account Chinese historical culture, customs, and social phenomena. During the construction of HalluQA, we consider two ty…

    Submitted 25 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Work in progress

  46. arXiv:2309.08799  [pdf, other]

    cs.LG cs.AI

    SHAPNN: Shapley Value Regularized Tabular Neural Network

    Authors: Qisen Cheng, Shuhui Qu, Janghwan Lee

    Abstract: We present SHAPNN, a novel deep tabular data modeling architecture designed for supervised learning. Our approach leverages Shapley values, a well-established technique for explaining black-box models. Our neural network is trained using standard backward propagation optimization methods, and is regularized with realtime estimated Shapley values. Our method offers several advantages, including the…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 9 pages, 8 figures

  47. arXiv:2307.06527  [pdf, other]

    cs.CV

    Free-Form Composition Networks for Egocentric Action Recognition

    Authors: Haoran Wang, Qinghua Cheng, Baosheng Yu, Yibing Zhan, Dapeng Tao, Liang Ding, Haibin Ling

    Abstract: Egocentric action recognition is gaining significant attention in the field of human action recognition. In this paper, we address the data scarcity issue in egocentric action recognition from a compositional generalization perspective. To tackle this problem, we propose a free-form composition network (FFCN) that can simultaneously learn disentangled verb, preposition, and noun representations, and t…

    Submitted 14 October, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  48. arXiv:2306.00412  [pdf, ps, other]

    cs.IT eess.SP

    Beamforming Design for IRS-and-UAV-Aided Two-Way Amplify-and-Forward Relay Networks in Maritime IoT

    Authors: Xuehui Wang, Feng Shu, Yuanyuan Wu, Weiping Shi, Shihao Yan, Yifan Zhao, Qiankun Cheng, Zhongwen Sun, Jiangzhou Wang

    Abstract: In this paper, an intelligent reflecting surface (IRS)-and-unmanned aerial vehicle (UAV)-assisted two-way amplify-and-forward (AF) relay network in maritime Internet of Things (IoT) is proposed, where ship1 ($\text{S}_1$) and ship2 ($\text{S}_2$) can be viewed as data collecting centers. To enhance the message exchange rate between $\text{S}_1$ and $\text{S}_2$, a problem of maximizing minimum rat…

    Submitted 24 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  49. arXiv:2305.18548  [pdf]

    cs.ET physics.optics

    I/O-efficient iterative matrix inversion with photonic integrated circuits

    Authors: Minjia Chen, Yizhi Wang, Chunhui Yao, Adrian Wonfor, Shuai Yang, Richard Penty, Qixiang Cheng

    Abstract: Photonic integrated circuits have been extensively explored for optical processing with the aim of breaking the speed bottleneck of digital electronics. However, the input/output (IO) bottleneck remains one of the key barriers. Here we report a novel photonic iterative processor (PIP) for matrix-inversion-intensive applications. The direct reuse of inputted data in the optical domain unlocks the p…

    Submitted 22 May, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  50. arXiv:2305.07835  [pdf, other]

    cs.IT

    Multi-Scenario Broadband Channel Measurement and Modeling for Sub-6 GHz RIS-Assisted Wireless Communication Systems

    Authors: Jian Sang, Mingyong Zhou, Jifeng Lan, Boning Gao, Wankai Tang, Xiao Li, Shi Jin, Ertugrul Basar, Cen Li, Qiang Cheng, Tie Jun Cui

    Abstract: Reconfigurable intelligent surface (RIS)-empowered communication has been widely considered one of the revolutionary technologies for next generation networks. However, due to the novel propagation characteristics of RISs, underlying RIS channel modeling and measurement research is still in its infancy and not fully investigated. In this paper, we conduct multi-scenario broadband channel measu…

    Submitted 13 May, 2023; originally announced May 2023.