Showing 1–50 of 742 results for author: Tan, S

  1. arXiv:2407.08975  [pdf, other]

    cs.AR cs.ET

    Hybrid Temporal Computing for Lower Power Hardware Accelerators

    Authors: Maliha Tasnim, Sachin Sachdeva, Yibo Liu, Sheldon X. -D. Tan

    Abstract: In this paper, we propose a new hybrid temporal computing (HTC) framework that leverages both pulse rate and temporal data encoding to design ultra-low energy hardware accelerators. Our approach is inspired by the recently proposed temporal computing, or race logic, which encodes data values as single delays, leading to significantly lower energy consumption due to minimized signal switching. Howe…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 7 pages, 8 figures and 3 tables

  2. arXiv:2406.19958  [pdf, other]

    stat.ML cs.LG math.ST

    The Computational Curse of Big Data for Bayesian Additive Regression Trees: A Hitting Time Analysis

    Authors: Yan Shuo Tan, Omer Ronen, Theo Saarinen, Bin Yu

    Abstract: Bayesian Additive Regression Trees (BART) is a popular Bayesian non-parametric regression model that is commonly used in causal inference and beyond. Its strong predictive performance is supported by theoretical guarantees that its posterior distribution concentrates around the true regression function at optimal rates under various data generative settings and for appropriate prior choices. In th…

    Submitted 28 June, 2024; originally announced June 2024.

    MSC Class: 62G08; 65C40

  3. arXiv:2406.18897  [pdf, other]

    quant-ph

    Resilience of the surface code to error bursts

    Authors: Shi Jie Samuel Tan, Christopher A. Pattison, Matt McEwen, John Preskill

    Abstract: Quantum error correction works effectively only if the error rate of gate operations is sufficiently low. However, some rare physical mechanisms can cause a temporary increase in the error rate that affects many qubits; examples include ionizing radiation in superconducting hardware and large deviations in the global control of atomic systems. We refer to such rare transient spikes in the gate err…

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.13124  [pdf, other]

    cs.CL

    Learning to Generate Answers with Citations via Factual Consistency Models

    Authors: Rami Aly, Zhiqiang Tang, Samson Tan, George Karypis

    Abstract: Large Language Models (LLMs) frequently hallucinate, impeding their reliability in mission-critical situations. One approach to address this issue is to provide citations to relevant sources alongside generated content, enhancing the verifiability of generations. However, citing passages accurately in answers remains a substantial challenge. This paper proposes a weakly-supervised fine-tuning meth…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024. Code release will follow

  5. arXiv:2406.12800  [pdf, other]

    cs.CR

    Supporting Human Raters with the Detection of Harmful Content using Large Language Models

    Authors: Kurt Thomas, Patrick Gage Kelley, David Tao, Sarah Meiklejohn, Owen Vallis, Shunwen Tan, Blaž Bratanič, Felipe Tiengo Ferreira, Vijay Kumar Eranti, Elie Bursztein

    Abstract: In this paper, we explore the feasibility of leveraging large language models (LLMs) to automate or otherwise assist human raters with identifying harmful content including hate speech, harassment, violent extremism, and election misinformation. Using a dataset of 50,000 comments, we demonstrate that LLMs can achieve 90% accuracy when compared to human verdicts. We explore how to best leverage the…

    Submitted 18 June, 2024; originally announced June 2024.

  6. arXiv:2406.12649  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

    Authors: Hengyi Wang, Shiwei Tan, Hao Wang

    Abstract: Vision transformers (ViTs) have emerged as a significant area of focus, particularly for their capacity to be jointly trained with large language models and to serve as robust vision foundation models. Yet, the development of trustworthy explanation methods for ViTs has lagged, particularly in the context of post-hoc interpretations of ViT predictions. Existing sub-image selection approaches, such…

    Submitted 18 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024

  7. arXiv:2406.12313  [pdf]

    cs.DB

    A framework for developing a knowledge management platform

    Authors: Marie Lisandra Zepeda Mendoza, Sonali Agarwal, James A. Blackshaw, Vanesa Bol, Audrey Fazzi, Filippo Fiorini, Amy Louise Foreman, Nancy George, Brett R. Johnson, Brian Martin, Dave McComb, Euphemia Mutasa-Gottgens, Helen Parkinson, Martin Romacker, Rolf Russell, Valérien Ségard, Shawn Zheng Kai Tan, Wei Kheng Teh, F. P. Winstanley, Benedict Wong, Adrian M. Smith

    Abstract: Knowledge management (KM) involves collecting, organizing, storing, and disseminating information to improve decision-making, innovation, and performance. Implementing KM at scale has become essential for organizations to effectively leverage vast accessible data. This paper is a compilation of concepts that emerged from KM workshops hosted by EMBL-EBI, attended by SMEs and industry. We provide gu…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 18 pages, 1 figure

  8. arXiv:2406.11230  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

    Authors: Hengyi Wang, Haizhou Shi, Shiwei Tan, Weiyi Qin, Wenyuan Wang, Tunyu Zhang, Akshay Nambi, Tanuja Ganu, Hao Wang

    Abstract: Multimodal Large Language Models (MLLMs) have shown significant promise in various applications, leading to broad interest from researchers and practitioners alike. However, a comprehensive evaluation of their long-context capabilities remains underexplored. To address these gaps, we introduce the MultiModal Needle-in-a-haystack (MMNeedle) benchmark, specifically designed to assess the long-contex…

    Submitted 17 June, 2024; originally announced June 2024.

  9. arXiv:2406.10290  [pdf, other]

    cs.CL cs.AI cs.LG

    MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

    Authors: Rithesh Murthy, Liangwei Yang, Juntao Tan, Tulika Manoj Awalgaonkar, Yilun Zhou, Shelby Heinecke, Sachin Desai, Jason Wu, Ran Xu, Sarah Tan, Jianguo Zhang, Zhiwei Liu, Shirley Kokane, Zuxin Liu, Ming Zhu, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: The deployment of Large Language Models (LLMs) and Large Multimodal Models (LMMs) on mobile devices has gained significant attention due to the benefits of enhanced privacy, stability, and personalization. However, the hardware constraints of mobile devices necessitate the use of models with fewer parameters and model compression techniques like quantization. Currently, there is limited understand…

    Submitted 12 June, 2024; originally announced June 2024.

  10. arXiv:2406.07866  [pdf, other]

    cs.LG math.OC

    Asymptotically Optimal Regret for Black-Box Predict-then-Optimize

    Authors: Samuel Tan, Peter I. Frazier

    Abstract: We consider the predict-then-optimize paradigm for decision-making in which a practitioner (1) trains a supervised learning model on historical data of decisions, contexts, and rewards, and then (2) uses the resulting model to make future binary decisions for new contexts by finding the decision that maximizes the model's predicted reward. This approach is common in industry. Past analysis assumes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 15 pages, 2 figures, 3 tables

  11. arXiv:2405.16003  [pdf, other]

    cs.AI cs.CY cs.LG

    Disentangling Heterogeneous Knowledge Concept Embedding for Cognitive Diagnosis on Untested Knowledge

    Authors: Kui Xiao, Runtian Xing, Miao Zhang, Shunfeng Tan, Ziming Wang, Xiaolian Zhu

    Abstract: Cognitive diagnosis is a fundamental and critical task in learning assessment, which aims to infer students' proficiency on knowledge concepts from their response logs. Current works assume each knowledge concept will certainly be tested and covered by multiple exercises. However, whether online or offline courses, it's hardly feasible to completely cover all knowledge concepts in several exercise…

    Submitted 24 May, 2024; originally announced May 2024.

  12. arXiv:2405.14782  [pdf, other]

    cs.CL

    Lessons from the Trenches on Reproducible Evaluation of Language Models

    Authors: Stella Biderman, Hailey Schoelkopf, Lintang Sutawika, Leo Gao, Jonathan Tow, Baber Abbasi, Alham Fikri Aji, Pawan Sasanka Ammanamanchi, Sidney Black, Jordan Clive, Anthony DiPofi, Julen Etxaniz, Benjamin Fattori, Jessica Zosa Forde, Charles Foster, Jeffrey Hsu, Mimansa Jaiswal, Wilson Y. Lee, Haonan Li, Charles Lovering, Niklas Muennighoff, Ellie Pavlick, Jason Phang, Aviya Skowron, Samson Tan, et al. (5 additional authors not shown)

    Abstract: Effective evaluation of language models remains an open challenge in NLP. Researchers and engineers face methodological issues such as the sensitivity of models to evaluation setup, difficulty of proper comparisons across methods, and the lack of reproducibility and transparency. In this paper we draw on three years of experience in evaluating large language models to provide guidance and lessons…

    Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  13. arXiv:2405.12462   

    cs.LG cs.AI

    Boosting X-formers with Structured Matrix for Long Sequence Time Series Forecasting

    Authors: Zhicheng Zhang, Yong Wang, Shaoqi Tan, Bowei Xia, Yujie Luo

    Abstract: Transformer-based models for long sequence time series forecasting (LSTF) problems have gained significant attention due to their exceptional forecasting precision. As the cornerstone of these models, the self-attention mechanism poses a challenge to efficient training and inference due to its quadratic time complexity. In this article, we propose a novel architectural design for Transformer-based…

    Submitted 22 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: We believe this work is premature and requires further study

  14. arXiv:2405.09386  [pdf, ps, other]

    math.QA

    Quantum vertex algebra associated to quantum toroidal $\mathfrak{gl}_N$

    Authors: Fulin Chen, Xin Huang, Fei Kong, Shaobin Tan

    Abstract: In this paper, we associate the quantum toroidal algebra $\mathcal{E}_N$ of type $\mathfrak{gl}_N$ with quantum vertex algebra through equivariant $φ$-coordinated quasi modules. More precisely, for every $\ell\in \mathbb{C}$, by deforming the universal affine vertex algebra of $\mathfrak{sl}_\infty$, we construct an $\hbar$-adic quantum $\mathbb{Z}$-vertex algebra…

    Submitted 15 May, 2024; originally announced May 2024.

  15. arXiv:2405.05413  [pdf]

    cs.DB

    Digital Evolution: Novo Nordisk's Shift to Ontology-Based Data Management

    Authors: Shawn Zheng Kai Tan, Shounak Baksi, Thomas Gade Bjerregaard, Preethi Elangovan, Thrishna Kuttikattu Gopalakrishnan, Darko Hric, Joffrey Joumaa, Beidi Li, Kashif Rabbani, Santhosh Kannan Venkatesan, Joshua Daniel Valdez, Saritha Vettikunnel Kuriakose

    Abstract: Biomedical data is growing exponentially, and managing it is increasingly challenging. While Findable, Accessible, Interoperable and Reusable (FAIR) data principles provide guidance, their adoption has proven difficult, especially in larger enterprises like pharmaceutical companies. In this manuscript, we describe how we leverage an Ontology-Based Data Management (OBDM) strategy for digital transf…

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 14 pages, 2 figures

  16. arXiv:2405.02213  [pdf, other]

    cs.SE cs.AI cs.LG

    Automatic Programming: Large Language Models and Beyond

    Authors: Michael R. Lyu, Baishakhi Ray, Abhik Roychoudhury, Shin Hwei Tan, Patanamon Thongtanunam

    Abstract: Automatic programming has seen increasing popularity due to the emergence of tools like GitHub Copilot which rely on Large Language Models (LLMs). At the same time, automatically generated code faces challenges during deployment due to concerns around quality and trust. In this article, we study automated coding in a general sense and study the concerns around code quality, security and related is…

    Submitted 15 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  17. arXiv:2405.01548  [pdf]

    physics.app-ph physics.optics

    Foundry's perspective on laser and SOA module integration with silicon photonics

    Authors: James Y. S. Tan, Shawn Xie Wu, Salih Yanikgonul, Chao Li, Patrick Guo-Qiang Lo

    Abstract: Silicon photonic integrated circuit (PIC) builds on the demand for a low cost approach from established silicon-based manufacturing infrastructure traditionally built for electronics. Besides its natural abundance, silicon has desirable properties such as optically low loss (at certain critical wavelengths), and small form factor to enable high density scaled-up optical on-chip circuitry. However,…

    Submitted 20 February, 2024; originally announced May 2024.

    Comments: 14 pages

    Journal ref: IEEE J Lightwave Technol. vol. 42, no. 3, pp. 1062-1074, 2024

  18. arXiv:2405.01350  [pdf, other]

    cs.LG cs.SI

    Community-Invariant Graph Contrastive Learning

    Authors: Shiyin Tan, Dongyuan Li, Renhe Jiang, Ying Zhang, Manabu Okumura

    Abstract: Graph augmentation has received great attention in recent years for graph contrastive learning (GCL) to learn well-generalized node/graph representations. However, mainstream GCL methods often favor randomly disrupting graphs for augmentation, which shows limited generalization and inevitably leads to the corruption of high-level graph information, i.e., the graph community. Moreover, current know…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by ICML-2024

  19. arXiv:2404.19179  [pdf, other]

    astro-ph.SR

    On the Determining Physical Factor of Jet-Related Coronal Mass Ejection's Morphology in the High Corona

    Authors: Yadan Duan, Yuandeng Shen, Zehao Tang, Chenrui Zhou, Song Tan

    Abstract: A solar jet can often cause coronal mass ejections (CMEs) with different morphologies in the high corona, for example, jet-like CMEs, bubble-like CMEs, and so-called twin CMEs that include a pair of simultaneous jet-like and bubble-like CMEs. However, what determines the morphology of a jet-related CME is still an open question. Using high spatiotemporal resolution stereoscopic observations taken…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 15 pages, 15 figures, 2 tables, accepted by ApJ

  20. arXiv:2404.18391  [pdf, other]

    astro-ph.SR

    Broad and Bi-directional narrow quasi-periodic fast-propagating wave trains associated with a filament-driven halo CME on 2023 April 21

    Authors: Xinping Zhou, Yuandeng Shen, Yihua Yan, Ke Yu, Zhining Qu, Ahmed Ahmed Ibrahim, Zehao Tang, Chengrui Zhou, Song Tan, Ye Qiu, Hongfei Liang

    Abstract: This paper presents three distinct wave trains that occurred on 2023 April 21: a broad quasi-periodic fast-propagating (QFP) wave train and a bi-directional narrow QFP wave train. The broad QFP wave train expands outward in a circular wavefront, while bi-directional narrow QFP wave trains propagate in the northward and southward directions, respectively. The concurrent presence of the wave trains…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 11 pages, 5 figures, accepted by ApJ

  21. arXiv:2404.17126  [pdf, other]

    cs.LG cs.AI eess.IV physics.med-ph

    Deep Evidential Learning for Dose Prediction

    Authors: Hai Siong Tan, Kuancheng Wang, Rafe Mcbeth

    Abstract: In this work, we present a novel application of an uncertainty-quantification framework called Deep Evidential Learning in the domain of radiotherapy dose prediction. Using medical images of the Open Knowledge-Based Planning Challenge dataset, we found that this model can be effectively harnessed to yield uncertainty estimates that inherited correlations with prediction errors upon completion of n…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 24 pages, 8 figures

  22. arXiv:2404.15163  [pdf, other]

    cs.CV eess.IV

    Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment

    Authors: Tianwei Zhou, Songbai Tan, Wei Zhou, Yu Luo, Yuan-Gen Wang, Guanghui Yue

    Abstract: With the increasing maturity of the text-to-image and image-to-image generative models, AI-generated images (AGIs) have shown great application potential in advertisement, entertainment, education, social media, etc. Although remarkable advancements have been achieved in generative models, very few efforts have been paid to design relevant quality assessment models. In this paper, we propose a nov…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: IEEE Transactions on Broadcasting (TBC)

  23. arXiv:2404.13818  [pdf, other]

    q-fin.GN

    Joint Liability Model with Adaptation to Climate Change

    Authors: Jiayue Zhang, Ken Seng Tan, Tony S. Wirjanto, Lysa Porth

    Abstract: This paper extends the application of ESG score assessment methodologies from large corporations to individual farmers' production, within the context of climate change. Our proposal involves the integration of crucial agricultural sustainability variables into conventional personal credit evaluation frameworks, culminating in the formulation of a holistic sustainable credit rating referred to as…

    Submitted 21 April, 2024; originally announced April 2024.

  24. arXiv:2404.11201  [pdf, other]

    cs.CL

    Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation

    Authors: Shaomu Tan, Di Wu, Christof Monz

    Abstract: Training a unified multilingual model promotes knowledge transfer but inevitably introduces negative interference. Language-specific modeling methods show promise in reducing interference. However, they often rely on heuristics to distribute capacity and struggle to foster cross-lingual transfer via isolated modules. In this paper, we explore intrinsic task modularity within multilingual networks…

    Submitted 17 April, 2024; originally announced April 2024.

  25. arXiv:2404.08877  [pdf, other]

    cs.SE cs.CL cs.LG

    Aligning LLMs for FL-free Program Repair

    Authors: Junjielong Xu, Ying Fu, Shin Hwei Tan, Pinjia He

    Abstract: Large language models (LLMs) have achieved decent results on automated program repair (APR). However, the next token prediction training objective of decoder-only LLMs (e.g., GPT-4) is misaligned with the masked span prediction objective of current infilling-style methods, which impedes LLMs from fully leveraging pre-trained knowledge for program repair. In addition, while some LLMs are capable of…

    Submitted 12 April, 2024; originally announced April 2024.

  26. arXiv:2404.07979  [pdf, other]

    cs.CL cs.AI cs.LG

    LLoCO: Learning Long Contexts Offline

    Authors: Sijun Tan, Xiuyu Li, Shishir Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Ada Popa

    Abstract: Processing long contexts remains a challenge for large language models (LLMs) due to the quadratic computational and memory overhead of the self-attention mechanism and the substantial KV cache sizes during generation. We propose a novel approach to address this problem by learning contexts offline through context compression and in-domain parameter-efficient finetuning. Our method enables an LLM…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: The first two authors contributed equally to this work

  27. arXiv:2404.05200  [pdf]

    cond-mat.mtrl-sci cond-mat.soft

    Quasicrystal bulk and surface energies from density functional theory

    Authors: Woohyeon Baek, Sambit Das, Shibo Tan, Vikram Gavini, Wenhao Sun

    Abstract: Are quasicrystals stable or metastable? Density functional theory (DFT) is often used to evaluate thermodynamic stability, but quasicrystals are long-range aperiodic and their energies cannot be calculated using conventional ab initio methods. Here, we perform first-principles calculations on quasicrystal nanoparticles of increasing sizes, from which we can directly extrapolate their bulk and surf…

    Submitted 8 April, 2024; originally announced April 2024.

  28. arXiv:2404.01647  [pdf, other]

    cs.CV

    EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis

    Authors: Shuai Tan, Bin Ji, Mengxiao Bi, Ye Pan

    Abstract: Achieving disentangled control over multiple facial motions and accommodating diverse input modalities greatly enhances the application and entertainment of the talking head generation. This necessitates a deep exploration of the decoupling space for facial features, ensuring that they a) operate independently without mutual interference and b) can be preserved to share with different modal input,…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 22 pages, 15 figures

  29. arXiv:2403.18927  [pdf, other]

    quant-ph math-ph

    Optimal Coherent Quantum Phase Estimation via Tapering

    Authors: Dhrumil Patel, Shi Jie Samuel Tan, Yigit Subasi, Andrew T. Sornborger

    Abstract: Quantum phase estimation is one of the fundamental primitives that underpins many quantum algorithms, including quantum amplitude estimation, the HHL algorithm for solving linear systems of equations, and quantum principal component analysis. Due to its significance as a subroutine, in this work, we study the coherent version of the phase estimation problem, where given an arbitrary input state an…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 23 pages, 6 figures

    Report number: LA-UR-23-30410

  30. arXiv:2403.15132  [pdf, other]

    cs.CV eess.IV

    Transfer CLIP for Generalizable Image Denoising

    Authors: Jun Cheng, Dong Liang, Shan Tan

    Abstract: Image denoising is a fundamental task in computer vision. While prevailing deep learning-based supervised and self-supervised methods have excelled in eliminating in-distribution noise, their susceptibility to out-of-distribution (OOD) noise remains a significant challenge. The recent emergence of contrastive language-image pre-training (CLIP) model has showcased exceptional capabilities in open-w…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  31. arXiv:2403.09917  [pdf]

    astro-ph.EP astro-ph.IM

    The Equilibrium Vapor Pressures of Ammonia and Oxygen Ices at Outer Solar System Temperatures

    Authors: B. P. Blakley, Will M. Grundy, Jordan K. Steckloff, Sugata P. Tan, Jennifer Hanley, Anna E. Engle, Stephen C. Tegler, Gerrick E. Lindberg, Shae M. Raposa, Kendall J. Koga, Cecilia L. Thieberger

    Abstract: Few laboratory studies have investigated the vapor pressures of the volatiles that may be present as ices in the outer solar system; even fewer studies have investigated these species at the temperatures and pressures suitable to the surfaces of icy bodies in the Saturnian and Uranian systems ($\lt$100 K, $\lt10^{-9}$ bar). This study adds to the work of Grundy et al. (2024) in extending the known…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 29 pages, 9 figures, to be published in Planetary and Space Science

  32. arXiv:2403.08245  [pdf, other]

    cs.LG cs.DC

    Scattered Mixture-of-Experts Implementation

    Authors: Shawn Tan, Yikang Shen, Rameswar Panda, Aaron Courville

    Abstract: We present ScatterMoE, an implementation of Sparse Mixture-of-Experts (SMoE) on GPUs. ScatterMoE builds upon existing implementations and overcomes some of their limitations to improve inference and training speed and memory footprint. It achieves this by avoiding padding and excessive copies of the input. We introduce ParallelLinear, the main component we use to build our…

    Submitted 13 March, 2024; originally announced March 2024.

  33. arXiv:2403.06375  [pdf, other]

    cs.CV

    FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization

    Authors: Shuai Tan, Bin Ji, Ye Pan

    Abstract: Generating emotional talking faces is a practical yet challenging endeavor. To create a lifelike avatar, we draw upon two critical insights from a human perspective: 1) The connection between audio and the non-deterministic facial dynamics, encompassing expressions, blinks, poses, should exhibit synchronous and one-to-many mapping. 2) Vibrant expressions are often accompanied by emotion-aware high…

    Submitted 22 April, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 11 pages, 11 figures, conference

  34. arXiv:2403.06365  [pdf, other]

    cs.CV

    Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style

    Authors: Shuai Tan, Bin Ji, Ye Pan

    Abstract: Although automatically animating audio-driven talking heads has recently received growing interest, previous efforts have mainly concentrated on achieving lip synchronization with the audio, neglecting two crucial elements for generating expressive videos: emotion style and art style. In this paper, we present an innovative audio-driven talking face generation method called Style2Talker. It involv…

    Submitted 11 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 9 pages, 5 figures, conference

  35. arXiv:2403.06363  [pdf, other]

    cs.CV

    Say Anything with Any Style

    Authors: Shuai Tan, Bin Ji, Yu Ding, Ye Pan

    Abstract: Generating stylized talking head with diverse head motions is crucial for achieving natural-looking videos but still remains challenging. Previous works either adopt a regressive method to capture the speaking style, resulting in a coarse style that is averaged across all training data, or employ a universal network to synthesize videos with different styles which causes suboptimal performance. To…

    Submitted 12 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 9 pages, 5 figures, conference

  36. arXiv:2403.06217  [pdf, ps, other]

    math.AG

    Non-existence of Shimura curves of Mumford type generically in the non-hyperelliptic locus

    Authors: Xin Lu, Shengli Tan, Kang Zuo

    Abstract: We show that there does not exist any Shimura curve with strictly maximal Higgs field generically in the Torelli locus of non-hyperelliptic curves of genus $g\geq 4$. In particular, Shimura curves of Mumford type are not generically in the Torelli locus of non-hyperelliptic curves of genus $g\geq 4$.

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Any comment is welcome

    MSC Class: 14J10; 14E30

  37. arXiv:2403.04446  [pdf, other]

    hep-th cond-mat.str-el math-ph math.QA quant-ph

    Weak Hopf symmetry and tube algebra of the generalized multifusion string-net model

    Authors: Zhian Jia, Sheng Tan, Dagomir Kaszlikowski

    Abstract: We investigate the multifusion generalization of string-net ground states and lattice Hamiltonians, delving into its associated weak Hopf symmetry. For the multifusion string-net, the gauge symmetry manifests as a general weak Hopf algebra, leading to a reducible vacuum string label; the charge symmetry, serving as a quantum double of gauge symmetry, constitutes a connected weak Hopf algebra. This…

    Submitted 14 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: v1: 64 pages

  38. arXiv:2403.04133  [pdf, other]

    cs.CV cs.RO

    Towards learning-based planning: The nuPlan benchmark for real-world autonomous driving

    Authors: Napat Karnchanachari, Dimitris Geromichalos, Kok Seang Tan, Nanxiang Li, Christopher Eriksen, Shakiba Yaghoubi, Noushin Mehdipour, Gianmarco Bernasconi, Whye Kit Fong, Yiluan Guo, Holger Caesar

    Abstract: Machine Learning (ML) has replaced traditional handcrafted methods for perception and prediction in autonomous vehicles. Yet for the equally important planning task, the adoption of ML-based techniques is slow. We present nuPlan, the world's first real-world autonomous driving dataset, and benchmark. The benchmark is designed to test the ability of ML-based planners to handle diverse driving situa…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: ICRA 2024 camera ready incl. supplementary material

  39. arXiv:2403.02593  [pdf, ps, other]

    math.CO

    The Ramsey numbers for trees of order $n$ with maximum degree at least $n-5$ versus the wheel graph of order nine

    Authors: Zhi Yee Chng, Thomas Britz, Ta Sheng Tan, Kok Bin Wong

    Abstract: The Ramsey numbers $R(T_n,W_8)$ are determined for each tree graph $T_n$ of order $n\geq 7$ and maximum degree $Δ(T_n)$ equal to either $n-4$ or $n-5$. These numbers indicate strong support for the conjecture, due to Chen, Zhang and Zhang and to Hafidh and Baskoro, that $R(T_n,W_m) = 2n-1$ for each tree graph $T_n$ of order $n\geq m-1$ with $Δ(T_n)\leq n-m+2$ when $m\geq 4$ is even.

    Submitted 4 March, 2024; originally announced March 2024.

    MSC Class: 05C55; 05D10

  40. arXiv:2403.01229  [pdf, other]

    cs.CV cs.AI cs.LG eess.SP

    REWIND Dataset: Privacy-preserving Speaking Status Segmentation from Multimodal Body Movement Signals in the Wild

    Authors: Jose Vargas Quiros, Chirag Raman, Stephanie Tan, Ekin Gedik, Laura Cabrera-Quiros, Hayley Hung

    Abstract: Recognizing speaking in humans is a central task towards understanding social interactions. Ideally, speaking would be detected from individual voice recordings, as done previously for meeting scenarios. However, individual voice recordings are hard to obtain in the wild, especially in crowded mingling scenarios due to cost, logistics, and privacy concerns. As an alternative, machine learning mode…

    Submitted 2 March, 2024; originally announced March 2024.

  41. arXiv:2402.18600  [pdf]

    eess.IV cs.AI q-bio.TO

    Artificial Intelligence and Diabetes Mellitus: An Inside Look Through the Retina

    Authors: Yasin Sadeghi Bazargani, Majid Mirzaei, Navid Sobhi, Mirsaeed Abdollahi, Ali Jafarizadeh, Siamak Pedrammehr, Roohallah Alizadehsani, Ru San Tan, Sheikh Mohammed Shariful Islam, U. Rajendra Acharya

    Abstract: Diabetes mellitus (DM) predisposes patients to vascular complications. Retinal images and vasculature reflect the body's micro- and macrovascular health. They can be used to diagnose DM complications, including diabetic retinopathy (DR), neuropathy, nephropathy, and atherosclerotic cardiovascular disease, as well as forecast the risk of cardiovascular events. Artificial intelligence (AI)-enabled s…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 44 Pages, 6 figures, 1 table, 166 references

    ACM Class: J.3.2; J.3.3

  42. arXiv:2402.18592  [pdf, other]

    cs.AR cs.PF

    A$^3$PIM: An Automated, Analytic and Accurate Processing-in-Memory Offloader

    Authors: Qingcai Jiang, Shaojie Tan, Junshi Chen, Hong An

    Abstract: The performance gap between memory and processor has grown rapidly. Consequently, the energy and wall-clock time costs associated with moving data between the CPU and main memory predominate the overall computational cost. The Processing-in-Memory (PIM) paradigm emerges as a promising architecture that mitigates the need for extensive data movements by strategically positioning computing units pro…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures, accepted for presentation at Design, Automation and Test in Europe Conference | The European Event for Electronic System Design & Test (DATE 2024), conference to be held in March 2024

  43. arXiv:2402.17509  [pdf, other]

    cs.CL

    Extreme Miscalibration and the Illusion of Adversarial Robustness

    Authors: Vyas Raina, Samson Tan, Volkan Cevher, Aditya Rawal, Sheng Zha, George Karypis

    Abstract: Deep learning-based Natural Language Processing (NLP) models are vulnerable to adversarial attacks, where small perturbations can cause a model to misclassify. Adversarial Training (AT) is often used to increase model robustness. However, we have discovered an intriguing phenomenon: deliberately or accidentally miscalibrating models masks gradients in a way that interferes with adversarial attack…

    Submitted 30 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  44. arXiv:2402.14366  [pdf, other]

    cs.SE

    Understanding and Detecting Annotation-Induced Faults of Static Analyzers

    Authors: Huaien Zhang, Yu Pei, Shuyun Liang, Shin Hwei Tan

    Abstract: Static analyzers can reason about the properties and behaviors of programs and detect various issues without executing them. Hence, they should extract the necessary information to understand the analyzed program well. Annotation has been a widely used feature for different purposes in Java since the introduction of Java 5. Annotations can change program structures and convey semantics information…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 23 pages, 16 figures

  45. arXiv:2402.10551  [pdf, other]

    cs.LG q-bio.QM

    Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Information

    Authors: Aishwarya Jayagopal, Hansheng Xue, Ziyang He, Robert J. Walsh, Krishna Kumar Hariprasannan, David Shao Peng Tan, Tuan Zea Tan, Jason J. Pitt, Anand D. Jeyasekharan, Vaibhav Rajan

    Abstract: Cancer remains a global challenge due to its growing clinical and economic burden. Its uniquely personal manifestation, which makes treatment difficult, has fuelled the quest for personalized treatment strategies. Thus, genomic profiling is increasingly becoming part of clinical diagnostic panels. Effective use of such panels requires accurate drug response prediction (DRP) models, which are chall…

    Submitted 16 February, 2024; originally announced February 2024.

  46. arXiv:2402.04983  [pdf, ps, other]

    quant-ph physics.optics

    Broadband squeezed light field by magnetostriction in an opto-magnomechanical

    Authors: Ke Di, Shuai Tan, Anyu Cheng, Yinxue Zhao, Yu Liu, Jiajia Du

    Abstract: We present a novel mechanism for generating a wide bandwidth squeezed optical output field in an opto-magnomechanical system. In this system, the magnon (mechanical) mode in the yttrium-iron-garnet crystal is coupled to the microwave field (optical field) through magnetic dipole (radiation pressure) interaction. The magnetostrictive force induced by the yttrium-iron-garnet crystal causes a mechani…

    Submitted 7 February, 2024; originally announced February 2024.

  47. arXiv:2402.02478  [pdf, other]

    cs.LG cs.AI

    Why are hyperbolic neural networks effective? A study on hierarchical representation capability

    Authors: Shicheng Tan, Huanjing Zhao, Shu Zhao, Yanping Zhang

    Abstract: Hyperbolic Neural Networks (HNNs), operating in hyperbolic space, have been widely applied in recent years, motivated by the existence of an optimal embedding in hyperbolic space that can preserve data hierarchical relationships (termed Hierarchical Representation Capability, HRC) more accurately than Euclidean space. However, there is no evidence to suggest that HNNs can achieve this theoretical…

    Submitted 4 February, 2024; originally announced February 2024.

  48. arXiv:2402.02202  [pdf, other]

    cond-mat.quant-gas nucl-th physics.atom-ph quant-ph

    Three-body scattering area for particles with infinite or zero scattering length in two dimensions

    Authors: Junjie Liang, Shina Tan

    Abstract: We derive the asymptotic expansions of the wave function of three particles having equal mass with finite-range interactions and infinite or zero two-dimensional scattering length colliding at zero energy and zero orbital angular momentum, from which a three-body parameter $D$ is defined. The dimension of $D$ is length squared, and we call $D$ three-body scattering area. We find that the ground st…

    Submitted 28 April, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  49. arXiv:2402.01150  [pdf]

    quant-ph

    Entanglement enhancement of two different magnon modes via nonlinear effect in cavity magnomechanics

    Authors: Ke Di, Xi Wang, Shuai Tan, Yinxue Zhao, Yu Liu, Anyu Cheng, Jiajia Du

    Abstract: We present a scheme to enhance the entanglement of two different magnon modes in cavity magnomechanics via nonlinear effects. The scheme demonstrates that nonlinear effects enhance the entanglement of the two magnon modes. Moreover, the entanglement of the two magnon modes is also significantly enhanced by microwave parametric amplification (PA) and magnon self-Kerr nonlinearity. Not only does nonlinear effec…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:1903.00221 by other authors

  50. arXiv:2401.15234  [pdf, other]

    cs.SE

    Moving beyond Deletions: Program Simplification via Diverse Program Transformations

    Authors: Haibo Wang, Zezhong Xing, Zheng Wang, Chengnian Sun, Shin Hwei Tan

    Abstract: To reduce the complexity of software, developers manually simplify programs (known as developer-induced program simplification in this paper) to reduce code size while preserving functionality, but manual simplification is time-consuming and error-prone. To reduce manual effort, rule-based approaches (e.g., refactoring) and deletion-based approaches (e.g., delta debugging) can be potentially a…

    Submitted 26 January, 2024; originally announced January 2024.