Showing 1–50 of 897 results for author: Lin, M

  1. arXiv:2407.09466  [pdf, other]

    cs.RO cs.GR

    TRAVERSE: Traffic-Responsive Autonomous Vehicle Experience & Rare-event Simulation for Enhanced safety

    Authors: Sandeep Thalapanane, Sandip Sharan Senthil Kumar, Guru Nandhan Appiya Dilipkumar Peethambari, Sourang SriHari, Laura Zheng, Julio Poveda, Ming C. Lin

    Abstract: Data for training learning-enabled self-driving cars in the physical world are typically collected in a safe, normal environment. Such data distribution often engenders a strong bias towards safe driving, making self-driving cars unprepared when encountering adversarial scenarios like unexpected accidents. Due to a dearth of such adverse data that is unrealistic for drivers to collect, autonomous…

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.09404  [pdf, other]

    math.OC eess.SY physics.soc-ph

    CAACS: A Carbon Aware Ant Colony System

    Authors: Marina Lin, Laura P. Schaposnik

    Abstract: In an era where sustainability is becoming increasingly crucial, we introduce a new Carbon-Aware Ant Colony System (CAACS) Algorithm that addresses the Generalized Traveling Salesman Problem (GTSP) while minimizing carbon emissions. This novel approach leverages the natural efficiency of ant colony pheromone trails to find optimal routes, balancing both environmental and economic objectives. By in…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 31 figures, 23 pages

  3. arXiv:2407.09089  [pdf]

    q-bio.MN

    Lomics: Generation of Pathways and Gene Sets using Large Language Models for Transcriptomic Analysis

    Authors: Chun-Ka Wong, Ali Choo, Eugene C. C. Cheng, Wing-Chun San, Kelvin Chak-Kong Cheng, Yee-Man Lau, Minqing Lin, Fei Li, Wei-Hao Liang, Song-Yan Liao, Kwong-Man Ng, Ivan Fan-Ngai Hung, Hung-Fat Tse, Jason Wing-Hon Wong

    Abstract: Interrogation of biological pathways is an integral part of omics data analysis. Large language models (LLMs) enable the generation of custom pathways and gene sets tailored to specific scientific questions. These targeted sets are significantly smaller than traditional pathway enrichment analysis libraries, reducing multiple hypothesis testing and potentially enhancing statistical power. Lomics (…

    Submitted 12 July, 2024; originally announced July 2024.

  4. arXiv:2407.09037  [pdf]

    physics.optics

    Photonic quasicrystal of spin angular momentum

    Authors: Min Lin, Xinxin Gou, Zhenwei Xie, Aiping Yang, Luping Du, Xiaocong Yuan

    Abstract: Quasicrystals, characterized by long-range order without translational symmetry, have catalyzed transformative advances in various fields, including optics in terms of field quasicrystals. Here, we present the first demonstration of photonic quasicrystals formed by spin angular momentum, unveiling novel spin-orbit coupling effects absent in traditional field quasicrystals. A de Bruijn tiling like theore…

    Submitted 12 July, 2024; originally announced July 2024.

  5. arXiv:2407.08032  [pdf, other]

    astro-ph.EP

    Rossby Wave Instability and Substructure Formation in 3D Non-Ideal MHD Wind-Launching Disks

    Authors: Chun-Yen Hsu, Zhi-Yun Li, Yisheng Tu, Xiao Hu, Min-Kai Lin

    Abstract: Rings and gaps are routinely observed in the dust continuum emission of protoplanetary discs (PPDs). How they form and evolve remains debated. Previous studies have demonstrated the possibility of spontaneous gas rings and gaps formation in wind-launching disks. Here, we show that such gas substructures are unstable to the Rossby Wave Instability (RWI) through numerical simulations. Specifically,…

    Submitted 10 July, 2024; originally announced July 2024.

  6. arXiv:2407.07330  [pdf]

    cs.CL cs.AI

    Interpretable Differential Diagnosis with Dual-Inference Large Language Models

    Authors: Shuang Zhou, Sirui Ding, Jiashuo Wang, Mingquan Lin, Genevieve B. Melton, Rui Zhang

    Abstract: Methodological advancements to automate the generation of differential diagnosis (DDx) to predict a list of potential diseases as differentials given patients' symptom descriptions are critical to clinical reasoning and applications such as decision support. However, providing reasoning or interpretation for these differential diagnoses is more meaningful. Fortunately, large language models (LLMs)…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 15 pages

  7. arXiv:2407.06184  [pdf, ps, other]

    math.AG

    Integral aspects of Fourier duality for abelian varieties

    Authors: Junaid Hasan, Hazem Hassan, Milton Lin, Marcella Manivel, Lily McBeath, Ben Moonen

    Abstract: We prove several results about integral versions of Fourier duality for abelian schemes, making use of Pappas's work on integral Grothendieck-Riemann-Roch. If $S$ is smooth quasi-projective of dimension $d$ over a field and $\pi\colon X\to S$ is a $g$-dimensional abelian scheme, we prove, under very mild assumptions on $X/S$, that all classical results about Fourier duality, including the existence…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 22 pages

    MSC Class: 14C15; 14K05

  8. arXiv:2407.06027  [pdf, other]

    cs.CL

    PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

    Authors: Miao Zheng, Hao Liang, Fan Yang, Haoze Sun, Tianpeng Li, Lingchu Xiong, Yan Zhang, Youzhen Wu, Kun Li, Yanjun Shen, Mingan Lin, Tao Zhang, Guosheng Dong, Yujing Qiao, Kun Fang, Weipeng Chen, Bin Cui, Wentao Zhang, Zenan Zhou

    Abstract: In recent years, the rise of Large Language Models (LLMs) has spurred a growing demand for plug-and-play AI systems. Among the various AI techniques, prompt engineering stands out as particularly significant. However, users often face challenges in writing prompts due to the steep learning curve and significant time investment, and existing automatic prompt engineering (APE) models can be difficul…

    Submitted 12 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  9. arXiv:2407.04241  [pdf, other]

    cs.CV cs.AI

    AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource

    Authors: Wengyi Zhan, Mingbao Lin, Chia-Wen Lin, Rongrong Ji

    Abstract: In an effort to improve the efficiency and scalability of single-image super-resolution (SISR) applications, we introduce AnySR, to rebuild existing arbitrary-scale SR methods into any-scale, any-resource implementation. As a contrast to off-the-shelf methods that solve SR tasks across various scales with the same computing costs, our AnySR innovates in: 1) building arbitrary-scale tasks as any-re…

    Submitted 5 July, 2024; originally announced July 2024.

  10. arXiv:2407.02764  [pdf, other]

    cs.OS

    Data-driven Software-based Power Estimation for Embedded Devices

    Authors: Haoyu Wang, Xinyi Li, Ti Zhou, Man Lin

    Abstract: Energy measurement of computer devices, which are widely used in the Internet of Things (IoT), is an important yet challenging task. Most of these IoT devices lack ready-to-use hardware or software for power measurement. A cost-effective solution is to use low-end consumer-grade power meters. However, these low-end power meters cannot provide accurate instantaneous power measurements. In this pape…

    Submitted 2 July, 2024; originally announced July 2024.

  11. arXiv:2407.02103  [pdf, ps, other]

    astro-ph.EP

    Rossby wave instability in weakly ionized protoplanetary disks. I. azimuthal or vertical B-fields

    Authors: Can Cui, Ashutosh Tripathi, Cong Yu, Min-Kai Lin, Andrew Youdin

    Abstract: Rossby wave instability (RWI) is considered the underlying mechanism to crescent-shaped azimuthal asymmetries, discovered in (sub-)millimeter dust continuum of many protoplanetary disks. Previous works on linear theory were conducted in the hydrodynamic limit. Nevertheless, protoplanetary disks are likely magnetized and weakly ionized. We examine the influence of magnetic fields and non-ideal magn…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 13 pages, 4 figures, submitted to MNRAS

  12. arXiv:2407.01492  [pdf, other]

    cs.CL cs.AI

    RegMix: Data Mixture as Regression for Language Model Pre-training

    Authors: Qian Liu, Xiaosen Zheng, Niklas Muennighoff, Guangtao Zeng, Longxu Dou, Tianyu Pang, Jing Jiang, Min Lin

    Abstract: The data mixture for large language model pre-training significantly impacts performance, yet how to determine an effective mixture remains unclear. We propose RegMix to automatically identify a high-performing data mixture by formulating it as a regression task. RegMix involves training a set of small models with diverse data mixtures and fitting a regression model to predict their performance gi…

    Submitted 1 July, 2024; originally announced July 2024.

  13. arXiv:2407.00631  [pdf, other]

    cs.LG cs.AI

    TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets

    Authors: Jintai Chen, Yaojun Hu, Yue Wang, Yingzhou Lu, Xu Cao, Miao Lin, Hongxia Xu, Jian Wu, Cao Xiao, Jimeng Sun, Lucas Glass, Kexin Huang, Marinka Zitnik, Tianfan Fu

    Abstract: Clinical trials are pivotal for developing new medical treatments, yet they typically pose some risks such as patient mortality, adverse events, and enrollment failure that waste immense efforts spanning over a decade. Applying artificial intelligence (AI) to forecast or simulate key events in clinical trials holds great potential for providing insights to guide trial designs. However, complex dat…

    Submitted 30 June, 2024; originally announced July 2024.

  14. arXiv:2407.00497  [pdf, other]

    cs.CL

    LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement

    Authors: Jiahao Ying, Mingbao Lin, Yixin Cao, Wei Tang, Bo Wang, Qianru Sun, Xuanjing Huang, Shuicheng Yan

    Abstract: This paper introduces the innovative "LLMs-as-Instructors" framework, which leverages the advanced Large Language Models (LLMs) to autonomously enhance the training of smaller target models. Inspired by the theory of "Learning from Errors", this framework employs an instructor LLM to meticulously analyze the specific errors within a target model, facilitating targeted and efficient training cycles…

    Submitted 29 June, 2024; originally announced July 2024.

  15. arXiv:2407.00474  [pdf, other]

    cs.LG cs.AI

    MH-pFLGB: Model Heterogeneous personalized Federated Learning via Global Bypass for Medical Image Analysis

    Authors: Luyuan Xie, Manqing Lin, ChenMing Xu, Tianyu Luan, Zhipeng Zeng, Wenjun Qian, Cong Li, Yuejian Fang, Qingni Shen, Zhonghai Wu

    Abstract: In the evolving application of medical artificial intelligence, federated learning is notable for its ability to protect training data privacy. Federated learning facilitates collaborative model development without the need to share local data from healthcare institutions. Yet, the statistical and system heterogeneity among these institutions poses substantial challenges, which affects the effecti…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2405.06822

  16. arXiv:2407.00462  [pdf, other]

    cs.CV cs.AI

    pFLFE: Cross-silo Personalized Federated Learning via Feature Enhancement on Medical Image Segmentation

    Authors: Luyuan Xie, Manqing Lin, Siyuan Liu, ChenMing Xu, Tianyu Luan, Cong Li, Yuejian Fang, Qingni Shen, Zhonghai Wu

    Abstract: In medical image segmentation, personalized cross-silo federated learning (FL) is becoming popular for utilizing varied data across healthcare settings to overcome data scarcity and privacy concerns. However, existing methods often suffer from client drift, leading to inconsistent performance and delayed training. We propose a new framework, Personalized Federated Learning via Feature Enhancement…

    Submitted 29 June, 2024; originally announced July 2024.

  17. arXiv:2406.19438  [pdf, other]

    astro-ph.EP

    Shoulder of Dust Rings Formed by Planet-disk Interactions

    Authors: Jiaqing Bi, Min-Kai Lin

    Abstract: Recent analyses of mm-wavelength protoplanetary disk observations have revealed several emission excesses on the previously identified dust rings, referred to as dust shoulders. The prevalence of dust shoulders suggests that they trace a common but unclear mechanism. In this work, we combine 3D, multifluid hydrodynamic simulations with radiative transfer calculations to explain the formation of du…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: accepted to ApJ

  18. arXiv:2406.18173  [pdf, other]

    cs.CL

    UIO-LLMs: Unbiased Incremental Optimization for Long-Context LLMs

    Authors: Wenhao Li, Mingbao Lin, Yunshan Zhong, Shuicheng Yan, Rongrong Ji

    Abstract: Managing long texts is challenging for large language models (LLMs) due to limited context window sizes. This study introduces UIO-LLMs, an unbiased incremental optimization approach for memory-enhanced transformers under long-context settings. We initially conceptualize the process as a streamlined encoder-decoder framework where the weights-shared encoder and decoder respectively encapsulate a c…

    Submitted 26 June, 2024; originally announced June 2024.

  19. arXiv:2406.17628  [pdf, other]

    cs.CV cs.CR

    Video Inpainting Localization with Contrastive Learning

    Authors: Zijie Lou, Gang Cao, Man Lin

    Abstract: Deep video inpainting is typically used as malicious manipulation to remove important objects for creating fake videos. It is significant to identify the inpainted regions blindly. This letter proposes a simple yet effective forensic scheme for Video Inpainting LOcalization with ContrAstive Learning (ViLocal). Specifically, a 3D Uniformer encoder is applied to the video noise residual for learning…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2406.13576

  20. arXiv:2406.17252  [pdf, other]

    quant-ph

    Resource-Optimized Grouping Shadow for Efficient Energy Estimation

    Authors: Min Li, Mao Lin, Matthew J. S. Beach

    Abstract: The accurate and efficient energy estimation of quantum Hamiltonians consisting of Pauli observables is an essential task in modern quantum computing. We introduce a Resource-Optimized Grouping Shadow (ROGS) algorithm, which optimally allocates measurement resources by minimizing the estimation error bound through a novel overlapped grouping strategy and convex optimization. Our numerical experime…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 22 pages, 5 figures

  21. arXiv:2406.14162  [pdf, other]

    cs.IR cs.AI cs.CL

    DIRAS: Efficient LLM-Assisted Annotation of Document Relevance in Retrieval Augmented Generation

    Authors: Jingwei Ni, Tobias Schimanski, Meihong Lin, Mrinmaya Sachan, Elliott Ash, Markus Leippold

    Abstract: Retrieval Augmented Generation (RAG) is widely employed to ground responses to queries on domain-specific documents. But do RAG implementations leave out important information or excessively include irrelevant information? To allay these concerns, it is necessary to annotate domain-specific benchmarks to evaluate information retrieval (IR) performance, as relevance definitions vary across queries…

    Submitted 20 June, 2024; originally announced June 2024.

  22. arXiv:2406.13576  [pdf, other]

    cs.CV cs.CR

    Trusted Video Inpainting Localization via Deep Attentive Noise Learning

    Authors: Zijie Lou, Gang Cao, Man Lin

    Abstract: Digital video inpainting techniques have been substantially improved with deep learning in recent years. Although inpainting is originally designed to repair damaged areas, it can also be used as malicious manipulation to remove important objects for creating false scenes and facts. As such it is significant to identify inpainted regions blindly. In this paper, we present a Trusted Video Inpaintin…

    Submitted 19 June, 2024; originally announced June 2024.

  23. arXiv:2406.13083  [pdf, other]

    physics.ins-det

    Design and Performance of a Magnetic Bottle Electron Spectrometer for High-Energy Photoelectron Spectroscopy

    Authors: Kurtis Borne, Jordan T ONeal, Jun Wang, Erk Isele, Razib Obaid, Nora Berrah, Xinxin Cheng, Philip H Bucksbaum, Justin James, Andri Kamalov, Kirk A Larsen, Xiang Li, Ming-Fu Lin, Yusong Liu, Agostino Marinelli, Adam Summers, Emily Thierstein, Thomas Wolf, Daniel Rolles, Peter Walter, James P Cryan, Taran Driver

    Abstract: We describe the design and performance of a magnetic bottle electron spectrometer (MBES) for high-energy electron spectroscopy. Our design features a $\sim 2$ m long electron drift tube and electrostatic retardation lens, achieving sub-electronvolt (eV) electron kinetic energy resolution for high energy (several hundred eV) electrons with close to $4\pi$ collection efficiency. A segmented anode…

    Submitted 4 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  24. arXiv:2406.10740  [pdf, other]

    cs.CV

    FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

    Authors: Zhikai Zhang, Yitang Li, Haofeng Huang, Mingxian Lin, Li Yi

    Abstract: Human motion synthesis is a fundamental task in computer animation. Despite recent progress in this field utilizing deep learning and motion capture data, existing methods are always limited to specific motion categories, environments, and styles. This poor generalizability can be partially attributed to the difficulty and expense of collecting large-scale and high-quality motion data. At the same…

    Submitted 21 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  25. arXiv:2406.09836  [pdf, other]

    cs.LG cs.CR

    Robustness-Inspired Defense Against Backdoor Attacks on Graph Neural Networks

    Authors: Zhiwei Zhang, Minhua Lin, Junjie Xu, Zongyu Wu, Enyan Dai, Suhang Wang

    Abstract: Graph Neural Networks (GNNs) have achieved promising results in tasks such as node classification and graph classification. However, recent studies reveal that GNNs are vulnerable to backdoor attacks, posing a significant threat to their real-world adoption. Despite initial efforts to defend against specific graph backdoor attacks, there is no work on defending against various types of backdoor at…

    Submitted 14 June, 2024; originally announced June 2024.

  26. arXiv:2406.09760  [pdf, other]

    cs.CL cs.LG

    Bootstrapping Language Models with DPO Implicit Rewards

    Authors: Changyu Chen, Zichen Liu, Chao Du, Tianyu Pang, Qian Liu, Arunesh Sinha, Pradeep Varakantham, Min Lin

    Abstract: Human alignment in large language models (LLMs) is an active area of research. A recent groundbreaking work, direct preference optimization (DPO), has greatly simplified the process from past work in reinforcement learning from human feedback (RLHF) by bypassing the reward learning stage in RLHF. DPO, after training, provides an implicit reward model. In this work, we make a novel observation that…

    Submitted 14 June, 2024; originally announced June 2024.

  27. arXiv:2406.09648  [pdf, other]

    cs.LG cs.CV

    An Intrinsic Vector Heat Network

    Authors: Alexander Gao, Maurice Chu, Mubbasir Kapadia, Ming C. Lin, Hsueh-Ti Derek Liu

    Abstract: Vector fields are widely used to represent and model flows for many science and engineering applications. This paper introduces a novel neural network architecture for learning tangent vector fields that are intrinsically defined on manifold surfaces embedded in 3D. Previous approaches to learning vector fields on surfaces treat vectors as multi-dimensional scalar fields, using traditional scalar-…

    Submitted 13 June, 2024; originally announced June 2024.

  28. arXiv:2406.09136  [pdf, other]

    cs.CL cs.LG

    Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

    Authors: Xuan Zhang, Chao Du, Tianyu Pang, Qian Liu, Wei Gao, Min Lin

    Abstract: The recent development of chain-of-thought (CoT) decoding has enabled large language models (LLMs) to generate explicit logical reasoning paths for complex problem-solving. However, research indicates that these paths are not always deliberate and optimal. The tree-of-thought (ToT) method employs tree-searching to extensively explore the reasoning space and find better reasoning paths that CoT dec…

    Submitted 13 June, 2024; originally announced June 2024.

  29. arXiv:2406.06971  [pdf, ps, other]

    astro-ph.EP

    Polar alignment of a dusty circumbinary disc -- I. Dust ring formation

    Authors: Jeremy L. Smallwood, Min-Kai Lin, Hossam Aly, Rebecca Nealon, Cristiano Longarini

    Abstract: We investigate the formation of dust traffic jams in polar-aligning circumbinary discs. We use 3D smoothed particle hydrodynamical simulations of both gas and dust to model an initially highly misaligned circumbinary disc around an eccentric binary. As the circumbinary disc evolves to a polar configuration (perpendicular to the binary orbital plane), the difference in the precession between the ga…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in MNRAS, 19 pages, 17 figures

  30. arXiv:2406.06038  [pdf, other]

    cs.RO

    Navigation and 3D Surface Reconstruction from Passive Whisker Sensing

    Authors: Michael A. Lin, Hao Li, Chengyi Xing, Mark R. Cutkosky

    Abstract: Whiskers provide a way to sense surfaces in the immediate environment without disturbing it. In this paper we present a method for using highly flexible, curved, passive whiskers mounted along a robot arm to gather sensory data as they brush past objects during normal robot motion. The information is useful both for guiding the robot in cluttered spaces and for reconstructing the exposed faces of…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2210.12387

  31. arXiv:2406.04313  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.CY

    Improving Alignment and Robustness with Circuit Breakers

    Authors: Andy Zou, Long Phan, Justin Wang, Derek Duenas, Maxwell Lin, Maksym Andriushchenko, Rowan Wang, Zico Kolter, Matt Fredrikson, Dan Hendrycks

    Abstract: AI systems can take harmful actions and are highly vulnerable to adversarial attacks. We present an approach, inspired by recent advances in representation engineering, that interrupts the models as they respond with harmful outputs with "circuit breakers." Existing techniques aimed at improving alignment, such as refusal training, are often bypassed. Techniques such as adversarial training try to…

    Submitted 12 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Code and models are available at https://github.com/GraySwanAI/circuit-breakers

  32. arXiv:2406.02859   

    eess.AS cs.SD

    ConPCO: Preserving Phoneme Characteristics for Automatic Pronunciation Assessment Leveraging Contrastive Ordinal Regularization

    Authors: Bi-Cheng Yan, Wei-Cheng Chao, Jiun-Ting Li, Yi-Cheng Wang, Hsin-Wei Wang, Meng-Shin Lin, Berlin Chen

    Abstract: Automatic pronunciation assessment (APA) manages to evaluate the pronunciation proficiency of a second language (L2) learner in a target language. Existing efforts typically draw on regression models for proficiency score prediction, where the models are trained to estimate target values without explicitly accounting for phoneme-awareness in the feature space. In this paper, we propose a contrasti…

    Submitted 8 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: This paper has been withdrawn because the authors aim to achieve better organization in writing and more detailed experimental analysis

  33. arXiv:2406.01602  [pdf, other]

    physics.data-an hep-ex nucl-ex

    Effectiveness of denoising diffusion probabilistic models for fast and high-fidelity whole-event simulation in high-energy heavy-ion experiments

    Authors: Yeonju Go, Dmitrii Torbunov, Timothy Rinn, Yi Huang, Haiwang Yu, Brett Viren, Meifeng Lin, Yihui Ren, Jin Huang

    Abstract: Artificial intelligence (AI) generative models, such as generative adversarial networks (GANs), variational auto-encoders, and normalizing flows, have been widely used and studied as efficient alternatives for traditional scientific simulations. However, they have several drawbacks, including training instability and inability to cover the entire data distribution, especially for regions where dat…

    Submitted 23 May, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures

  34. arXiv:2406.01431  [pdf, other]

    cs.RO

    Deep Stochastic Kinematic Models for Probabilistic Motion Forecasting in Traffic

    Authors: Laura Zheng, Sanghyun Son, Jing Liang, Xijun Wang, Brian Clipp, Ming C. Lin

    Abstract: Kinematic priors have shown to be helpful in boosting generalization and performance in prior work on trajectory forecasting. Specifically, kinematic priors have been applied such that models predict a set of actions instead of future output trajectories. By unrolling predicted trajectories via time integration and models of kinematic dynamics, predicted trajectories are not only kinematically fea…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 8 pages

  35. arXiv:2406.01425  [pdf, other]

    cs.CV

    Sensitivity-Informed Augmentation for Robust Segmentation

    Authors: Laura Zheng, Wenjie Wei, Tony Wu, Jacob Clements, Shreelekha Revankar, Andre Harrison, Yu Shen, Ming C. Lin

    Abstract: Segmentation is an integral module in many visual computing applications such as virtual try-on, medical imaging, autonomous driving, and agricultural automation. These applications often involve either widespread consumer use or highly variable environments, both of which can degrade the quality of visual sensor data, whether from a common mobile phone or an expensive satellite imaging camera. In…

    Submitted 16 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages

  36. arXiv:2406.01288  [pdf, other]

    cs.CL cs.AI cs.CR cs.LG

    Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses

    Authors: Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin

    Abstract: Recently, Anil et al. (2024) show that many-shot (up to hundreds of) demonstrations can jailbreak state-of-the-art LLMs by exploiting their long-context capability. Nevertheless, is it possible to use few-shot demonstrations to efficiently jailbreak LLMs within limited context sizes? While the vanilla few-shot jailbreaking may be inefficient, we propose improved techniques such as injecting specia…

    Submitted 3 June, 2024; originally announced June 2024.

  37. arXiv:2406.01032  [pdf, other]

    cs.LG cs.AI

    LLM and GNN are Complementary: Distilling LLM for Multimodal Graph Learning

    Authors: Junjie Xu, Zongyu Wu, Minhua Lin, Xiang Zhang, Suhang Wang

    Abstract: Recent progress in Graph Neural Networks (GNNs) has greatly enhanced the ability to model complex molecular structures for predicting properties. Nevertheless, molecular data encompasses more than just graph structures, including textual and visual information that GNNs do not handle well. To bridge this gap, we present an innovative framework that utilizes multimodal molecular data to extract ins…

    Submitted 3 June, 2024; originally announced June 2024.

  38. arXiv:2405.21018  [pdf, other]

    cs.LG cs.CL cs.CR

    Improved Techniques for Optimization-Based Jailbreaking on Large Language Models

    Authors: Xiaojun Jia, Tianyu Pang, Chao Du, Yihao Huang, Jindong Gu, Yang Liu, Xiaochun Cao, Min Lin

    Abstract: Large language models (LLMs) are being rapidly developed, and a key component of their widespread deployment is their safety-related alignment. Many red-teaming efforts aim to jailbreak LLMs, where among these efforts, the Greedy Coordinate Gradient (GCG) attack's success has led to a growing interest in the study of optimization-based jailbreaking techniques. Although GCG is a significant milesto…

    Submitted 5 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  39. arXiv:2405.19026  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR

    DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

    Authors: Andrew Zhao, Quentin Xu, Matthieu Lin, Shenzhi Wang, Yong-jin Liu, Zilong Zheng, Gao Huang

    Abstract: Recent advances in large language models (LLMs) have made them indispensable, raising significant concerns over managing their safety. Automated red teaming offers a promising alternative to the labor-intensive and error-prone manual probing for vulnerabilities, providing more consistent and scalable safety evaluations. However, existing approaches often compromise diversity by focusing on maximiz…

    Submitted 29 May, 2024; originally announced May 2024.

  40. arXiv:2405.18810  [pdf, other]

    cs.CV cs.AI

    UniPTS: A Unified Framework for Proficient Post-Training Sparsity

    Authors: Jingjing Xie, Yuxin Zhang, Mingbao Lin, Zhihang Lin, Liujuan Cao, Rongrong Ji

    Abstract: Post-training Sparsity (PTS) is a recently emerged avenue that chases efficient network sparsity with limited data in need. Existing PTS methods, however, undergo significant performance degradation compared with traditional methods that retrain the sparse networks via the whole dataset, especially at high sparsity ratios. In this paper, we attempt to reconcile this disparity by transposing three…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR2024

  41. arXiv:2405.18256  [pdf]

    cond-mat.mtrl-sci

    Electrical Control Grain Dimensionality with Multilevel Magnetic Anisotropy

    Authors: Shengyao Li, Sabpreet Bhatti, Siew Lang Teo, Ming Lin, Xinyue Pan, Zherui Yang, Peng Song, Wanghao Tian, Xinyu He, Jianwei Chai, Xian Jun Loh, Qiang Zhu, S. N. Piramanayagam, Xiao Renshaw Wang

    Abstract: In alignment with the increasing demand for larger storage capacity and longer data retention, electrical control of magnetic anisotropy has been a research focus in the realm of spintronics. Typically, magnetic anisotropy is determined by grain dimensionality, which is set during the fabrication of magnetic thin films. Despite the intrinsic correlation between magnetic anisotropy and grain dimens…

    Submitted 28 May, 2024; originally announced May 2024.

  42. arXiv:2405.16279  [pdf, other]

    physics.ins-det cs.AI

    AI-Assisted Detector Design for the EIC (AID(2)E)

    Authors: M. Diefenthaler, C. Fanelli, L. O. Gerlach, W. Guan, T. Horn, A. Jentsch, M. Lin, K. Nagai, H. Nayak, C. Pecar, K. Suresh, A. Vossen, T. Wang, T. Wenaus

    Abstract: Artificial Intelligence is poised to transform the design of complex, large-scale detectors like the ePIC at the future Electron Ion Collider. Featuring a central detector with additional detecting systems in the far forward and far backward regions, the ePIC experiment incorporates numerous design parameters and objectives, including performance, physics reach, and cost, constrained by mechanical…

    Submitted 28 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: 11 pages, 4 figures, AI4EIC 2023 proceeding

  43. arXiv:2405.15362  [pdf, other]

    cs.LG cs.CL cs.DC

    Pipeline Parallelism with Controllable Memory

    Authors: Penghui Qi, Xinyi Wan, Nyamdavaa Amar, Min Lin

    Abstract: Pipeline parallelism has been widely explored, but most existing schedules lack a systematic methodology. In this paper, we propose a framework to decompose pipeline schedules as repeating a building block and we show that the lifespan of the building block decides the peak activation memory of the pipeline schedule. Guided by the observations, we find that almost all existing pipeline schedules,…

    Submitted 10 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  44. arXiv:2405.14341  [pdf, other]

    cs.HC

    How do Observable Users Decompose D3 Code? An Exploratory Study

    Authors: Melissa Lin, Heer Patel, Medina Lamkin, Tukey Tu, Hannah Bako, Soham Raut, Leilani Battle

    Abstract: Users often struggle to program visualizations using complex toolkits like D3. Before we can design effective code assistants to support them, we must first understand how D3 users reason about their code. In this work, we explore users' understanding of D3 using an important gauge of code comprehension in CS education: code decomposition. We qualitatively analyze 560 D3 programs published on Obse…

    Submitted 23 May, 2024; originally announced May 2024.

  45. arXiv:2405.13685  [pdf, other]

    cs.CV

    Prompt Mixing in Diffusion Models using the Black Scholes Algorithm

    Authors: Divya Kothandaraman, Ming Lin, Dinesh Manocha

    Abstract: We introduce a novel approach for prompt mixing, aiming to generate images at the intersection of multiple text prompts using pre-trained text-to-image diffusion models. At each time step during diffusion denoising, our algorithm forecasts predictions w.r.t. the generated image and makes informed text conditioning decisions. To do so, we leverage the connection between diffusion models (rooted in…

    Submitted 22 May, 2024; originally announced May 2024.

  46. Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective

    Authors: Zhiwei Zhang, Minhua Lin, Enyan Dai, Suhang Wang

    Abstract: Graph Neural Networks (GNNs) have shown remarkable performance in various tasks. However, recent works reveal that GNNs are vulnerable to backdoor attacks. Generally, backdoor attack poisons the graph by attaching backdoor triggers and the target class label to a set of nodes in the training graph. A GNN trained on the poisoned graph will then be misled to predict test nodes attached with trigger…

    Submitted 11 July, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024

  47. arXiv:2405.08786  [pdf, other]

    cs.CV

    Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring

    Authors: Tiantian Zhang, Manxi Lin, Hongda Guo, Xiaofan Zhang, Ka Fung Peter Chiu, Aasa Feragen, Qi Dou

    Abstract: The Prostate Imaging Reporting and Data System (PI-RADS) is pivotal in the diagnosis of clinically significant prostate cancer through MRI imaging. Current deep learning-based PI-RADS scoring methods often lack the incorporation of common PI-RADS clinical guideline (PICG) utilized by radiologists, potentially compromising scoring accuracy. This paper introduces a novel approach that adapts a multi…

    Submitted 10 July, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  48. arXiv:2405.08780  [pdf]

    cs.CV cs.AI

    Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling

    Authors: Gregory Holste, Mingquan Lin, Ruiwen Zhou, Fei Wang, Lei Liu, Qi Yan, Sarah H. Van Tassel, Kyle Kovacs, Emily Y. Chew, Zhiyong Lu, Zhangyang Wang, Yifan Peng

    Abstract: Deep learning has enabled breakthroughs in automated diagnosis from medical imaging, with many successful applications in ophthalmology. However, standard medical image classification approaches only assess disease presence at the time of acquisition, neglecting the common clinical setting of longitudinal imaging. For slow, progressive eye diseases like age-related macular degeneration (AMD) and p…

    Submitted 14 May, 2024; originally announced May 2024.

  49. arXiv:2405.06944  [pdf, other]

    cs.CV

    Learning Monocular Depth from Focus with Event Focal Stack

    Authors: Chenxu Jiang, Mingyuan Lin, Chi Zhang, Zhenghai Wang, Lei Yu

    Abstract: Depth from Focus estimates depth by determining the moment of maximum focus from multiple shots at different focal distances, i.e. the Focal Stack. However, the limited sampling rate of conventional optical cameras makes it difficult to obtain sufficient focus cues during the focal sweep. Inspired by biological vision, the event camera records intensity changes over time in extremely low latency,…

    Submitted 11 May, 2024; originally announced May 2024.

  50. arXiv:2405.06918  [pdf, other]

    cs.CV

    Super-Resolving Blurry Images with Events

    Authors: Chi Zhang, Mingyuan Lin, Xiang Zhang, Chenxu Jiang, Lei Yu

    Abstract: Super-resolution from motion-blurred images poses a significant challenge due to the combined effects of motion blur and low spatial resolution. To address this challenge, this paper introduces an Event-based Blurry Super Resolution Network (EBSR-Net), which leverages the high temporal resolution of events to mitigate motion blur and improve high-resolution image prediction. Specifically, we propo…

    Submitted 11 May, 2024; originally announced May 2024.