Showing 1–50 of 1,438 results for author: Guo, Z

  1. arXiv:2407.08739  [pdf, other]

    cs.CV

    MAVIS: Mathematical Visual Instruction Tuning

    Authors: Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li

    Abstract: Multi-modal Large Language Models (MLLMs) have recently emerged as a significant focus in academia and industry. Despite their proficiency in general multi-modal scenarios, the mathematical problem-solving capabilities in visual contexts remain insufficiently explored. We identify three key areas within MLLMs that need to be improved: visual encoding of math diagrams, diagram-language alignment, a…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Work in progress. Data and Models are released at https://github.com/ZrrSkywalker/MAVIS

  2. arXiv:2407.08680  [pdf, other]

    cs.CV

    Generalizable Implicit Motion Modeling for Video Frame Interpolation

    Authors: Zujin Guo, Wei Li, Chen Change Loy

    Abstract: Motion modeling is critical in flow-based Video Frame Interpolation (VFI). Existing paradigms either consider linear combinations of bidirectional flows or directly predict bilateral flows for given timestamps without exploring favorable motion priors, thus lacking the capability of effectively modeling spatiotemporal dynamics in real-world videos. To address this limitation, in this study, we int…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Project Page: https://gseancdat.github.io/projects/GIMMVFI

  3. arXiv:2407.08615  [pdf, other]

    math.NA

    MgFNO: Multi-grid Architecture Fourier Neural Operator for Parametric Partial Differential Equations

    Authors: Zi-Hao Guo, Hou-Biao Li

    Abstract: In science and engineering, there is often a need to repeatedly solve large-scale and high-resolution partial differential equations (PDEs). Neural operators are a new type of models that can map between function spaces, allowing trained models to emulate the solution operators of PDEs. This paper introduces a novel Fourier neural operator with a multigrid architecture (MgFNO). The MgFNO combines…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 29 pages, 15 figures, 4 tables

    MSC Class: 65M55; 65M22 ACM Class: G.1.8; G.1.3

  4. arXiv:2407.07670  [pdf, ps, other]

    stat.ML cs.LG

    Stochastic Gradient Descent for Two-layer Neural Networks

    Authors: Dinghao Cao, Zheng-Chu Guo, Lei Shi

    Abstract: This paper presents a comprehensive study on the convergence rates of the stochastic gradient descent (SGD) algorithm when applied to overparameterized two-layer neural networks. Our approach combines the Neural Tangent Kernel (NTK) approximation with convergence analysis in the Reproducing Kernel Hilbert Space (RKHS) generated by NTK, aiming to provide a deep understanding of the convergence beha…

    Submitted 10 July, 2024; originally announced July 2024.

  5. arXiv:2407.07299  [pdf, ps, other]

    cs.IT cs.DS math.CO

    Random Reed-Solomon Codes Achieve the Half-Singleton Bound for Insertions and Deletions over Linear-Sized Alphabets

    Authors: Roni Con, Zeyu Guo, Ray Li, Zihan Zhang

    Abstract: In this paper, we prove that with high probability, random Reed-Solomon codes approach the half-Singleton bound - the optimal rate versus error tradeoff for linear insdel codes - with linear-sized alphabets. More precisely, we prove that, for any $ε>0$ and positive integers $n$ and $k$, with high probability, random Reed--Solomon codes of length $n$ and dimension $k$ can correct…

    Submitted 9 July, 2024; originally announced July 2024.

  6. arXiv:2407.06815  [pdf, other]

    hep-ph astro-ph.HE

    Searching Accretion-Enhanced Dark Matter Annihilation Signals in the Galactic Centre

    Authors: Mei-Wen Yang, Zhi-Qi Guo, Xiao-Yi Luo, Zhao-Qiang Shen, Zi-Qing Xia, Chih-Ting Lu, Yue-Lin Sming Tsai, Yi-Zhong Fan

    Abstract: This study reanalyzes the detection prospects of dark matter (DM) annihilation signals in the Galactic Center, focusing on velocity-dependent dynamics within a spike density near the supermassive black hole (Sgr~A$^{\star}$). We investigate three annihilation processes -- $p$-wave, resonance, and forbidden annihilation -- under semi-relativistic velocities, leveraging gamma-ray data from Fermi and…

    Submitted 9 July, 2024; originally announced July 2024.

  7. arXiv:2407.06754  [pdf, other]

    cs.DC cs.AI

    Threats and Defenses in Federated Learning Life Cycle: A Comprehensive Survey and Challenges

    Authors: Yanli Li, Zhongliang Guo, Nan Yang, Huaming Chen, Dong Yuan, Weiping Ding

    Abstract: Federated Learning (FL) offers innovative solutions for privacy-preserving collaborative machine learning (ML). Despite its promising potential, FL is vulnerable to various attacks due to its distributed nature, affecting the entire life cycle of FL services. These threats can harm the model's utility or compromise participants' privacy, either directly or indirectly. In response, numerous defense…

    Submitted 11 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  8. arXiv:2407.06297  [pdf, other]

    cs.CV

    SGOR: Outlier Removal by Leveraging Semantic and Geometric Information for Robust Point Cloud Registration

    Authors: Guiyu Zhao, Zhentao Guo, Hongbin Ma

    Abstract: In this paper, we introduce a new outlier removal method that fully leverages geometric and semantic information, to achieve robust registration. Current semantic-based registration methods only use semantics for point-to-point or instance semantic correspondence generation, which has two problems. First, these methods are highly dependent on the correctness of semantics. They perform poorly in sc…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted by IROS 2024

  9. arXiv:2407.06115  [pdf, other]

    cs.CV cs.AI cs.CL

    Infer Induced Sentiment of Comment Response to Video: A New Task, Dataset and Baseline

    Authors: Qi Jia, Baoyu Fan, Cong Xu, Lu Liu, Liang Jin, Guoguang Du, Zhenhua Guo, Yaqian Zhao, Xuanjing Huang, Rengang Li

    Abstract: Existing video multi-modal sentiment analysis mainly focuses on the sentiment expression of people within the video, yet often neglects the induced sentiment of viewers while watching the videos. Induced sentiment of viewers is essential for inferring the public response to videos, has broad application in analyzing public societal sentiment, effectiveness of advertising and other areas. The micro…

    Submitted 15 May, 2024; originally announced July 2024.

  10. arXiv:2407.05374  [pdf, other]

    cs.CL cs.CV

    Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition

    Authors: Zirun Guo, Tao Jin, Zhou Zhao

    Abstract: The development of multimodal models has significantly advanced multimodal sentiment analysis and emotion recognition. However, in real-world applications, the presence of various missing modality cases often leads to a degradation in the model's performance. In this work, we propose a novel multimodal Transformer framework using prompt learning to address the issue of missing modalities. Our meth…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2024 Main

  11. arXiv:2407.05283  [pdf, other]

    cs.CV

    SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning

    Authors: Yi Feng, Zizhan Guo, Qijun Chen, Rui Fan

    Abstract: Unsupervised monocular depth estimation frameworks have shown promising performance in autonomous driving. However, existing solutions primarily rely on a simple convolutional neural network for ego-motion recovery, which struggles to estimate precise camera poses in dynamic, complicated real-world scenarios. These inaccurately estimated camera poses can inevitably deteriorate the photometric reco…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted by IEEE Transactions on Intelligent Vehicles. Code is available at https://mias.group/SCIPaD

  12. arXiv:2407.05198  [pdf, other]

    cs.SI

    Medfluencer: A Network Representation of Medical Influencers' Identities and Discourse on Social Media

    Authors: Zhijin Guo, Edwin Simpson, Roberta Bernardi

    Abstract: In our study, we first constructed a dataset from the tweets of the top 100 medical influencers with the highest Influencer Score during the COVID-19 pandemic. This dataset was then used to construct a socio-semantic network, mapping both their identities and key topics, which are crucial for understanding their impact on public health discourse. To achieve this, we developed a few-shot multi-labe…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: ACM SIGKDD 2024 Workshop epiDAMIK 2024: The 7th International Workshop on Epidemiology meets Data Mining and Knowledge Discovery

  13. arXiv:2407.04347  [pdf, other]

    math.AP math.NA

    On a nonlinear nonlocal reaction-diffusion system applied to image restoration

    Authors: Yuhang Li, Zhichang Guo, Jingfeng Shao, Boying Wu

    Abstract: This paper deals with a novel nonlinear coupled nonlocal reaction-diffusion system proposed for image restoration, characterized by the advantages of preserving low gray level features and textures. The gray level indicator in the proposed model is regularized using a new method based on porous media type equations, which is suitable for recovering noisy blurred images. The well-posedness, regulari…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 28 pages, 7 figures

  14. arXiv:2407.04284  [pdf, other]

    cs.MM

    TSC-PCAC: Voxel Transformer and Sparse Convolution Based Point Cloud Attribute Compression for 3D Broadcasting

    Authors: Zixi Guo, Yun Zhang, Linwei Zhu, Hanli Wang, Gangyi Jiang

    Abstract: Point cloud has been the mainstream representation for advanced 3D applications, such as virtual reality and augmented reality. However, the massive data amounts of point clouds is one of the most challenging issues for transmission and storage. In this paper, we propose an end-to-end voxel Transformer and Sparse Convolution based Point Cloud Attribute Compression (TSC-PCAC) for 3D broadcasting. F…

    Submitted 5 July, 2024; originally announced July 2024.

  15. arXiv:2407.03907  [pdf, other]

    gr-qc

    The pseudospectrum and transient of Kaluza-Klein black holes in Einstein-Gauss-Bonnet gravity

    Authors: Jia-Ning Chen, Liang-Bi Wu, Zong-Kuan Guo

    Abstract: The spectrum and dynamical instability, as well as the transient effect of the tensor perturbation for the so-called Maeda-Dadhich black hole, a type of Kaluza-Klein black hole, in Einstein-Gauss-Bonnet gravity have been investigated in framework of pseudospectrum. We cast the problem of solving quasinormal modes (QNMs) in AdS-like spacetime as the linear evolution problem of the non-normal operat…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 22 pages, 14 figures, 1 table

  16. arXiv:2407.02873  [pdf, other]

    cs.RO

    Robot Shape and Location Retention in Video Generation Using Diffusion Models

    Authors: Peng Wang, Zhihao Guo, Abdul Latheef Sait, Minh Huy Pham

    Abstract: Diffusion models have marked a significant milestone in the enhancement of image and video generation technologies. However, generating videos that precisely retain the shape and location of moving objects such as robots remains a challenge. This paper presents diffusion models specifically tailored to generate videos that accurately maintain the shape and location of mobile robots. This developme…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 10 figures

  17. arXiv:2407.01306  [pdf, other]

    cs.LG cs.CR

    Unveiling the Unseen: Exploring Whitebox Membership Inference through the Lens of Explainability

    Authors: Chenxi Li, Abhinav Kumar, Zhen Guo, Jie Hou, Reza Tourani

    Abstract: The increasing prominence of deep learning applications and reliance on personalized data underscore the urgent need to address privacy vulnerabilities, particularly Membership Inference Attacks (MIAs). Despite numerous MIA studies, significant knowledge gaps persist, particularly regarding the impact of hidden features (in isolation) on attack efficacy and insufficient justification for the root…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 20 pages, 10 figures, 4 tables

  18. arXiv:2407.00468  [pdf, other]

    cs.CV cs.AI cs.CL

    MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

    Authors: Jinsheng Huang, Liang Chen, Taian Guo, Fu Zeng, Yusheng Zhao, Bohan Wu, Ye Yuan, Haozhe Zhao, Zhihui Guo, Yichi Zhang, Jingyang Yuan, Wei Ju, Luchen Liu, Tianyu Liu, Baobao Chang, Ming Zhang

    Abstract: Large Multimodal Models (LMMs) exhibit impressive cross-modal understanding and reasoning abilities, often assessed through multiple-choice questions (MCQs) that include an image, a question, and several options. However, many benchmarks used for such evaluations suffer from systematic biases. Remarkably, Large Language Models (LLMs) without any visual perception capabilities achieve non-trivial p…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 21 pages, code released at https://github.com/chenllliang/MMEvalPro, Homepage at https://mmevalpro.github.io/

  19. arXiv:2406.18082  [pdf, other]

    cs.CL cs.HC

    Octo-planner: On-device Language Model for Planner-Action Agents

    Authors: Wei Chen, Zhiyuan Li, Zhen Guo, Yikang Shen

    Abstract: AI agents have become increasingly significant in various domains, enabling autonomous decision-making and problem-solving. To function effectively, these agents require a planning process that determines the best course of action and then executes the planned actions. In this paper, we present an efficient on-device Planner-Action framework that separates planning and action execution into two di…

    Submitted 26 June, 2024; originally announced June 2024.

  20. arXiv:2406.16646  [pdf, other]

    astro-ph.GA astro-ph.SR

    The VISTA Variables in the Vía Láctea eXtended (VVVX) ESO public survey: Completion of the observations and legacy

    Authors: R. K. Saito, M. Hempel, J. Alonso-García, P. W. Lucas, D. Minniti, S. Alonso, L. Baravalle, J. Borissova, C. Caceres, A. N. Chené, N. J. G. Cross, F. Duplancic, E. R. Garro, M. Gómez, V. D. Ivanov, R. Kurtev, A. Luna, D. Majaess, M. G. Navarro, J. B. Pullen, M. Rejkuba, J. L. Sanders, L. C. Smith, P. H. C. Albino, M. V. Alonso, et al. (121 additional authors not shown)

    Abstract: The ESO public survey VISTA Variables in the Vía Láctea (VVV) surveyed the inner Galactic bulge and the adjacent southern Galactic disk from $2009-2015$. Upon its conclusion, the complementary VVV eXtended (VVVX) survey has expanded both the temporal as well as spatial coverage of the original VVV area, widening it from $562$ to $1700$ sq. deg., as well as providing additional epochs in…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 17 pages, 11 figures (+ appendix). Accepted for publication in Astronomy and Astrophysics in section 14: Catalogs and data

  21. arXiv:2406.16062  [pdf, other]

    cs.NE

    Towards Biologically Plausible Computing: A Comprehensive Comparison

    Authors: Changze Lv, Yufei Gu, Zhengkang Guo, Zhibo Xu, Yixin Wu, Feiran Zhang, Tianyuan Shi, Zhenghua Wang, Ruicheng Yin, Yu Shang, Siqi Zhong, Xiaohua Wang, Muling Wu, Wenhao Liu, Tianlong Li, Jianhao Zhu, Cenyuan Zhang, Zixuan Ling, Xiaoqing Zheng

    Abstract: Backpropagation is a cornerstone algorithm in training neural networks for supervised learning, which uses a gradient descent method to update network weights by minimizing the discrepancy between actual and desired outputs. Despite its pivotal role in propelling deep learning advancements, the biological plausibility of backpropagation is questioned due to its requirements for weight symmetry, gl…

    Submitted 23 June, 2024; originally announced June 2024.

  22. arXiv:2406.15678  [pdf, other]

    astro-ph.SR

    Oscillation Frequencies of Moderately Rotating Delta Scuti Stars: Asymmetric Mode Splittings Due to Non-spherical Distortion

    Authors: Zhao Guo, Timothy R. Bedding, A. A. Pamyatnykh, Donald W. Kurtz, Gang Li, Anuj Gautam, Simon J. Murphy, Conny Aerts

    Abstract: We find that the observed pressure-mode rotational splittings of slowly/moderately rotating Delta Scuti stars and Beta Cephei stars mostly have a positive asymmetry. That is, the left frequency spacing is larger than the right spacing in the dipole mode splitting triplets and the $l=2$ mode splitting multiplets (considering $m=1, 0, -1$ modes only). This is in agreement with the second-order pertu…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: MNRAS, submitted

  23. arXiv:2406.14653  [pdf, other]

    cs.RO cs.AI

    LLM Granularity for On-the-Fly Robot Control

    Authors: Peng Wang, Mattia Robbiani, Zhihao Guo

    Abstract: Assistive robots have attracted significant attention due to their potential to enhance the quality of life for vulnerable individuals like the elderly. The convergence of computer vision, large language models, and robotics has introduced the `visuolinguomotor' mode for assistive robots, where visuals and linguistics are incorporated into assistive robots to enable proactive and interactive assis…

    Submitted 20 June, 2024; originally announced June 2024.

  24. arXiv:2406.13975  [pdf, other]

    cs.CL cs.AI

    MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models

    Authors: Zhongshen Zeng, Yinhong Liu, Yingjia Wan, Jingyao Li, Pengguang Chen, Jianbo Dai, Yuxuan Yao, Rongwu Xu, Zehan Qi, Wanru Zhao, Linling Shen, Jianqiao Lu, Haochen Tan, Yukang Chen, Hao Zhang, Zhan Shi, Bailin Wang, Zhijiang Guo, Jiaya Jia

    Abstract: Large language models (LLMs) have shown increasing capability in problem-solving and decision-making, largely based on the step-by-step chain-of-thought reasoning processes. However, it has been increasingly challenging to evaluate the reasoning capability of LLMs. Concretely, existing outcome-based benchmarks begin to saturate and become less sufficient to monitor the progress. To this end, we pr…

    Submitted 19 June, 2024; originally announced June 2024.

  25. arXiv:2406.13958  [pdf]

    physics.app-ph

    Symmetry engineering in 2D bioelectronics facilitating augmented biosensing interfaces

    Authors: Yizhang Wu, Yihan Liu, Yuan Li, Ziquan Wei, Sicheng Xing, Yunlang Wang, Dashuai Zhu, Ziheng Guo, Anran Zhang, Gongkai Yuan, Zhibo Zhang, Ke Huang, Yong Wang, Guorong Wu, Ke Cheng, Wubin Bai

    Abstract: Symmetry lies at the heart of 2D bioelectronics, determining material properties at the fundamental level. Breaking the symmetry allows emergent functionalities and effects. However, symmetry modulation in 2D bioelectronics and the resultant applications have been largely overlooked. Here we devise an oxidized architectural MXene, referred as OXene, that couples orbit symmetric breaking with inver…

    Submitted 19 June, 2024; originally announced June 2024.

  26. arXiv:2406.12822  [pdf, other]

    cs.CL cs.AI

    Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?

    Authors: Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow

    Abstract: Large language models, particularly multilingual ones, are designed, claimed, and expected to cater to native speakers of varied languages. We hypothesise that the current practices of fine-tuning and evaluating these models may not perfectly align with this objective owing to a heavy reliance on translation, which can introduce translation artefacts and defects. It remains unknown whether the nat…

    Submitted 11 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  27. arXiv:2406.12335  [pdf, other]

    cs.CL cs.LG

    Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters

    Authors: Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Scaling the context size of large language models (LLMs) enables them to perform various new tasks, e.g., book summarization. However, the memory cost of the Key and Value (KV) cache in attention significantly limits the practical applications of LLMs. Recent works have explored token pruning for KV cache reduction in LLMs, relying solely on attention scores as a token importance indicator. Howeve…

    Submitted 18 June, 2024; originally announced June 2024.

  28. arXiv:2406.11933  [pdf, other]

    cs.CV

    Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset

    Authors: Fengxiang Wang, Hongzhen Wang, Di Wang, Zonghao Guo, Zhenyu Zhong, Long Lan, Jing Zhang, Zhiyuan Liu, Maosong Sun

    Abstract: Masked Image Modeling (MIM) has emerged as a pivotal approach for developing foundational visual models in the field of remote sensing (RS). However, current RS datasets are limited in volume and diversity, which significantly constrains the capacity of MIM methods to learn generalizable representations. In this study, we introduce RS-4M, a large-scale dataset designed to enable highly ef…

    Submitted 17 June, 2024; originally announced June 2024.

  29. arXiv:2406.11138  [pdf, other]

    cs.CV cs.AI

    Diffusion Models in Low-Level Vision: A Survey

    Authors: Chunming He, Yuqi Shen, Chengyu Fang, Fengyang Xiao, Longxiang Tang, Yulun Zhang, Wangmeng Zuo, Zhenhua Guo, Xiu Li

    Abstract: Deep generative models have garnered significant attention in low-level vision tasks due to their generative capabilities. Among them, diffusion model-based solutions, characterized by a forward diffusion process and a reverse denoising process, have emerged as widely acclaimed for their ability to produce samples of superior quality and diversity. This ensures the generation of visually compellin…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 20 pages, 23 figures, 4 tables

  30. HiFGL: A Hierarchical Framework for Cross-silo Cross-device Federated Graph Learning

    Authors: Zhuoning Guo, Duanyi Yao, Qiang Yang, Hao Liu

    Abstract: Federated Graph Learning (FGL) has emerged as a promising way to learn high-quality representations from distributed graph data with privacy preservation. Despite considerable efforts have been made for FGL under either cross-device or cross-silo paradigm, how to effectively capture graph knowledge in a more complicated cross-silo cross-device environment remains an under-explored problem. However…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted by SIGKDD 2024

  31. arXiv:2406.10593  [pdf, other]

    cs.AI cs.DB cs.IR

    QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL

    Authors: Yinggang Sun, Ziming Guo, Haining Yu, Chuanyi Liu, Xiang Li, Bingxuan Wang, Xiangzhan Yu, Tiancheng Zhao

    Abstract: Fine-tuning large language models (LLMs) for specific domain tasks has achieved great success in Text-to-SQL tasks. However, these fine-tuned models often face challenges with multi-turn Text-to-SQL tasks caused by ambiguous or unanswerable questions. It is desired to enhance LLMs to handle multiple types of questions in multi-turn Text-to-SQL tasks. To address this, we propose a novel data augmen…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 13 pages, 7 figures

  32. arXiv:2406.10457  [pdf, other]

    quant-ph

    Noise-induced quantum synchronization and maximally entangled mixed states in superconducting circuits

    Authors: Ziyu Tao, Finn Schmolke, Chang-Kang Hu, Wenhui Huang, Yuxuan Zhou, Jiawei Zhang, Ji Chu, Libo Zhang, Xuandong Sun, Zecheng Guo, Jingjing Niu, Wenle Weng, Song Liu, Youpeng Zhong, Dian Tan, Dapeng Yu, Eric Lutz

    Abstract: Random fluctuations can lead to cooperative effects in complex systems. We here report the experimental observation of noise-induced quantum synchronization in a chain of superconducting transmon qubits with nearest-neighbor interactions. The application of Gaussian white noise to a single site leads to synchronous oscillations in the entire chain. We show that the two synchronized end qubits are…

    Submitted 14 June, 2024; originally announced June 2024.

  33. arXiv:2406.09842  [pdf, other]

    hep-lat hep-ph

    Charmoniumlike Channels $1^{+}$ with Isospin $1$ from Lattice and Effective Field Theory

    Authors: Mitja Sadl, Sara Collins, Zhi-Hui Guo, M. Padmanath, Sasa Prelovsek, Lin-Wan Yan

    Abstract: Many exotic charmoniumlike mesons have already been discovered experimentally, of which the $Z_c$ mesons with isospin 1 are prominent examples. We investigate $J^{PC}=1^{+\pm}$ states with flavor $\bar cc \bar qq$ ($q=u,d$) in isospin 1 using lattice QCD. This is the first study of these mesons employing more than one volume and involving frames with nonzero total momentum. We utilize two…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 23 pages plus appendices, 24 figures

  34. arXiv:2406.08968  [pdf, other]

    stat.ME

    Covariate Selection for Optimizing Balance with Covariate-Adjusted Response-Adaptive Randomization

    Authors: Ziqing Guo, Yang Liu, Lucy Xia

    Abstract: Balancing influential covariates is crucial for valid treatment comparisons in clinical studies. While covariate-adaptive randomization is commonly used to achieve balance, its performance can be inadequate when the number of baseline covariates is large. It is therefore essential to identify the influential factors associated with the outcome and ensure balance among these critical covariates. In…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 54 pages, 4 figures

  35. arXiv:2406.08845  [pdf, other]

    cs.CV

    Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality

    Authors: Tianle Zhang, Langtian Ma, Yuchen Yan, Yuchen Zhang, Kai Wang, Yue Yang, Ziyao Guo, Wenqi Shao, Yang You, Yu Qiao, Ping Luo, Kaipeng Zhang

    Abstract: Recent text-to-video (T2V) technology advancements, as demonstrated by models such as Gen2, Pika, and Sora, have significantly broadened its applicability and popularity. Despite these strides, evaluating these models poses substantial challenges. Primarily, due to the limitations inherent in automatic metrics, manual evaluation is often considered a superior method for assessing T2V generation. H…

    Submitted 13 June, 2024; originally announced June 2024.

  36. arXiv:2406.08571  [pdf, other]

    astro-ph.IM

    The verification of periodicity with the use of recurrent neural networks

    Authors: Niall Miller, Philip Lucas, Yi Sun, Zhen Guo, Calum Morris, William Cooper

    Abstract: The ability to automatically and robustly self-verify periodicity present in time-series astronomical data is becoming more important as data sets rapidly increase in size. The age of large astronomical surveys has rendered manual inspection of time-series data less practical. Previous efforts in generating a false alarm probability to verify the periodicity of stars have been aimed towards the an…

    Submitted 12 June, 2024; originally announced June 2024.

  37. arXiv:2406.08314  [pdf, other]

    physics.chem-ph cond-mat.str-el physics.comp-ph quant-ph

    GPU-accelerated Auxiliary-field quantum Monte Carlo with multi-Slater determinant trial states

    Authors: Yifei Huang, Zhen Guo, Hung Q. Pham, Dingshun Lv

    Abstract: The accuracy of phaseless auxiliary-field quantum Monte Carlo (ph-AFQMC) can be systematically improved with better trial states. Using multi-Slater determinant trial states, ph-AFQMC has the potential to faithfully treat strongly correlated systems, while balancing the static and dynamical correlations on an equal footing. This preprint presents an implementation and application of graphics proce…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures

  38. arXiv:2406.07806  [pdf, other]

    astro-ph.HE astro-ph.SR

    Probing the Shock Breakout Signal of SN 2024ggi from the Transformation of Early Flash Spectroscopy

    Authors: Jujia Zhang, Luc Dessart, Xiaofeng Wang, Qian Zhai, Yi Yang, Liping Li, Han Lin, Giorgio Valerin, Yongzhi Cai, Zhen Guo, Lingzhi Wang, Zeyi Zhao, Zhenyu Wang, Shengyu Yan

    Abstract: We present early-time, hour-to-day cadence spectroscopy of the nearby type II supernova (SN II) 2024ggi, which was discovered at a phase when the SN shock just emerged from the red-supergiant (RSG) progenitor star. Over the first few days after the first light, SN 2024ggi exhibited prominent narrow emission lines formed through intense and persistent photoionization of the nearby circumstellar mat…

    Submitted 29 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 10 pages and 5 figures in the main text (16 pages and 9 figures in total). Accepted for publication in ApJL

  39. arXiv:2406.06852  [pdf, other]

    cs.CR cs.AI cs.CL

    A Survey of Backdoor Attacks and Defenses on Large Language Models: Implications for Security Measures

    Authors: Shuai Zhao, Meihuizi Jia, Zhongliang Guo, Leilei Gan, Jie Fu, Yichao Feng, Fengjun Pan, Luu Anh Tuan

    Abstract: The large language models (LLMs), which bridge the gap between human language understanding and complex problem-solving, achieve state-of-the-art performance on several NLP tasks, particularly in few-shot and zero-shot settings. Despite the demonstrable efficacy of LLMs, due to constraints on computational resources, users have to engage with open-source language models or outsource the entire tra…

    Submitted 13 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  40. arXiv:2406.06777  [pdf, other]

    cs.CV cs.AI

    MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension

    Authors: Khiem Le, Zhichun Guo, Kaiwen Dong, Xiaobao Huang, Bozhao Nan, Roshni Iyer, Xiangliang Zhang, Olaf Wiest, Wei Wang, Nitesh V. Chawla

    Abstract: Recently, Large Language Models (LLMs) with their strong task-handling capabilities have shown remarkable advancements across a spectrum of fields, moving beyond natural language understanding. However, their proficiency within the chemistry domain remains restricted, especially in solving professional molecule-related tasks. This challenge is attributed to their inherent limitations in comprehend…

    Submitted 27 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  41. arXiv:2406.06478  [pdf]

    cs.RO cs.AI

    High-precision surgical navigation using speckle structured light-based thoracoabdominal puncture robot

    Authors: Zezhao Guo, Yanzhong Guo, Zhanfang Zhao

    Abstract: Background: During percutaneous puncture robotic surgical navigation, the needle insertion point is positioned on the patient's chest and abdomen body surface. By locating any point on the soft skin tissue, it is difficult to apply the traditional reflective ball tracking method. The patient's chest and abdomen body surface has fluctuations in breathing and appears irregular. The chest a…

    Submitted 6 May, 2024; originally announced June 2024.

    Comments: 17 pages, 7 figures

  42. arXiv:2406.05535  [pdf, other]

    cs.LG cs.AI cs.CR

    Perturbation Towards Easy Samples Improves Targeted Adversarial Transferability

    Authors: Junqi Gao, Biqing Qi, Yao Li, Zhichang Guo, Dong Li, Yuming Xing, Dazhi Zhang

    Abstract: The transferability of adversarial perturbations provides an effective shortcut for black-box attacks. Targeted perturbations have greater practicality but are more difficult to transfer between models. In this paper, we experimentally and theoretically demonstrated that neural networks trained on the same dataset have more consistent performance in High-Sample-Density-Regions (HSDR) of each class…

    Submitted 8 June, 2024; originally announced June 2024.

    Journal ref: Advances in Neural Information Processing Systems 36, 2023

  43. arXiv:2406.03702  [pdf, other]

    cs.CV

    DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation

    Authors: Zilu Guo, Liuyang Bian, Xuan Huang, Hu Wei, Jingyu Li, Huasheng Ni

    Abstract: Atrous convolutions are employed as a method to increase the receptive field in semantic segmentation tasks. However, in previous works on semantic segmentation, they were rarely employed in the shallow layers of the model. We revisit the design of atrous convolutions in modern convolutional neural networks (CNNs), and demonstrate that the concept of using large kernels to apply atrous convolutions c…

    Submitted 5 June, 2024; originally announced June 2024.

  44. arXiv:2406.03222  [pdf, other]

    quant-ph

    Solving Sharp Bounded-error Quantum Polynomial Time Problem by Evolution methods

    Authors: Zhen Guo, Li You

    Abstract: Counting ground state degeneracy of a $k$-local Hamiltonian is important in many fields of physics. Its complexity belongs to the problem of sharp bounded-error quantum polynomial time (#BQP) class and few methods are known for its solution. Finding ground states of a $k$-local Hamiltonian, on the other hand, is an easier problem of Quantum Merlin Arthur (QMA) class, for which many efficient metho…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures

  45. arXiv:2406.02737  [pdf, other]

    cs.CR cs.SE

    CAMP: Compiler and Allocator-based Heap Memory Protection

    Authors: Zhenpeng Lin, Zheng Yu, Ziyi Guo, Simone Campanoni, Peter Dinda, Xinyu Xing

    Abstract: The heap is a critical and widely used component of many applications. Due to its dynamic nature, combined with the complexity of heap management algorithms, it is also a frequent target for security exploits. To enhance the heap's security, various heap protection techniques have been introduced, but they either introduce significant runtime overhead or have limited protection. We present CAMP,…

    Submitted 4 June, 2024; originally announced June 2024.

  46. arXiv:2406.02624  [pdf, other]

    cs.CR cs.SE

    Take a Step Further: Understanding Page Spray in Linux Kernel Exploitation

    Authors: Ziyi Guo, Dang K Le, Zhenpeng Lin, Kyle Zeng, Ruoyu Wang, Tiffany Bao, Yan Shoshitaishvili, Adam Doupé, Xinyu Xing

    Abstract: Recently, a novel method known as Page Spray has emerged, focusing on page-level exploitation for kernel vulnerabilities. Despite the advantages it offers in terms of exploitability, stability, and compatibility, comprehensive research on Page Spray remains scarce. Questions regarding its root causes, exploitation model, comparative benefits over other exploitation techniques, and possible mitigation…

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  47. arXiv:2406.02035  [pdf, other]

    cs.LG cs.AI

    A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning

    Authors: Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Avila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana Borsa, Arthur Guez, Will Dabney

    Abstract: Learning a good representation is a crucial challenge for Reinforcement Learning (RL) agents. Self-predictive learning provides a means to jointly learn a latent representation and dynamics model by bootstrapping from future latent representations (BYOL). Recent work has developed theoretical insights into these algorithms by studying a continuous-time ODE model for self-predictive representation le…

    Submitted 4 June, 2024; originally announced June 2024.

  48. arXiv:2406.01940  [pdf, other]

    cs.CL cs.LG cs.LO

    Process-Driven Autoformalization in Lean 4

    Authors: Jianqiao Lu, Zhengying Liu, Yingjia Wan, Yinya Huang, Haiming Wang, Zhicheng Yang, Jing Tang, Zhijiang Guo

    Abstract: Autoformalization, the conversion of natural language mathematics into formal languages, offers significant potential for advancing mathematical reasoning. However, existing efforts are limited to formal languages with substantial online corpora and struggle to keep pace with rapidly evolving languages like Lean 4. To bridge this gap, we propose a new benchmark \textbf{Form}alization for \textbf{L…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 22 pages, 1 figure, 11 tables

  49. arXiv:2406.01043  [pdf, ps, other]

    math.AP

    Generalized Young Measure Solutions for a Class of Quasilinear Parabolic Equations with Linear Growth

    Authors: Jingfeng Shao, Zhichang Guo, Chao Zhang

    Abstract: Using the generalized Young measure theory, we extend the theory of Young measure solutions to a class of quasilinear parabolic equations with linear growth, and introduce the concept of generalized Young measure solutions. We prove the existence and uniqueness of the generalized Young measure solutions. In addition, for the gradient flow of a convex parabolic variational integral, we show that the…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: This paper extends the theory of Young measure solutions to a class of quasilinear parabolic equations with linear growth, and introduces a concept of generalized Young measure solutions

    MSC Class: 35C99; 35D99; 35K59

  50. arXiv:2406.01007  [pdf, other]

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive…

    Submitted 3 June, 2024; originally announced June 2024.