Skip to main content

Showing 1–50 of 1,678 results for author: Shen, Z

  1. arXiv:2407.08800  [pdf, other

    cs.CV cs.LG

    Local Clustering for Lung Cancer Image Classification via Sparse Solution Technique

    Authors: Jackson Hamel, Ming-Jun Lai, Zhaiming Shen, Ye Tian

    Abstract: In this work, we propose to use a local clustering approach based on the sparse solution technique to study the medical image, especially the lung cancer image classification task. We view images as the vertices in a weighted graph and the similarity between a pair of images as the edges in the graph. The vertices within the same cluster can be assumed to share similar features and properties, thu… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.07272  [pdf, ps, other

    math.DG

    On the Berwald-Weyl Curvature

    Authors: Zhongmin Shen, Liling Sun

    Abstract: In this paper, we study the Berwald-Weyl curvature which is defined for a spray/Finsler metric with a volume form. We obtain some expressions for the Berwald-Weyl curvature. This quantity is a projective invariant with respect to a fixed volume form. We prove that for any spray of scalar curvature on a manifold of dimension $n\geq 3$, the Berwald-Weyl curvature vanishes with respect to any volume… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2407.07093  [pdf, other

    cs.CL cs.AI cs.LG

    FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation

    Authors: Liqun Ma, Mingjie Sun, Zhiqiang Shen

    Abstract: This work presents a Fully BInarized Large Language Model (FBI-LLM), demonstrating for the first time how to train a large-scale binary language model from scratch (not the partial binary or ternary LLM like BitNet b1.58) to match the performance of its full-precision counterparts (e.g., FP16 or BF16) in transformer-based LLMs. It achieves this by employing an autoregressive distillation (AD) loss… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Github at https://github.com/LiqunMa/FBI-LLM

  4. arXiv:2407.06815  [pdf, other

    hep-ph astro-ph.HE

    Searching Accretion-Enhanced Dark Matter Annihilation Signals in the Galactic Centre

    Authors: Mei-Wen Yang, Zhi-Qi Guo, Xiao-Yi Luo, Zhao-Qiang Shen, Zi-Qing Xia, Chih-Ting Lu, Yue-Lin Sming Tsai, Yi-Zhong Fan

    Abstract: This study reanalyzes the detection prospects of dark matter (DM) annihilation signals in the Galactic Center, focusing on velocity-dependent dynamics within a spike density near the supermassive black hole (Sgr~A$^{\star}$). We investigate three annihilation processes -- $p$-wave, resonance, and forbidden annihilation -- under semi-relativistic velocities, leveraging gamma-ray data from Fermi and… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  5. arXiv:2407.05796  [pdf, other

    eess.IV cs.CV

    Poisson Ordinal Network for Gleason Group Estimation Using Bi-Parametric MRI

    Authors: Yinsong Xu, Yipei Wang, Ziyi Shen, Iani J. M. B. Gayo, Natasha Thorley, Shonit Punwani, Aidong Men, Dean Barratt, Qingchao Chen, Yipeng Hu

    Abstract: The Gleason groups serve as the primary histological grading system for prostate cancer, providing crucial insights into the cancer's potential for growth and metastasis. In clinical practice, pathologists determine the Gleason groups based on specimens obtained from ultrasound-guided biopsies. In this study, we investigate the feasibility of directly estimating the Gleason groups from MRI scans t… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024

  6. arXiv:2407.05767  [pdf, other

    eess.IV cs.CV

    Nonrigid Reconstruction of Freehand Ultrasound without a Tracker

    Authors: Qi Li, Ziyi Shen, Qianye Yang, Dean C. Barratt, Matthew J. Clarkson, Tom Vercauteren, Yipeng Hu

    Abstract: Reconstructing 2D freehand Ultrasound (US) frames into 3D space without using a tracker has recently seen advances with deep learning. Predicting good frame-to-frame rigid transformations is often accepted as the learning objective, especially when the ground-truth labels from spatial tracking devices are inherently rigid transformations. Motivated by a) the observed nonrigid deformation due to so… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted at MICCAI 2024

  7. arXiv:2407.05248  [pdf, other

    cs.CV

    Self-Paced Sample Selection for Barely-Supervised Medical Image Segmentation

    Authors: Junming Su, Zhiqiang Shen, Peng Cao, Jinzhu Yang, Osmar R. Zaiane

    Abstract: The existing barely-supervised medical image segmentation (BSS) methods, adopting a registration-segmentation paradigm, aim to learn from data with very few annotations to mitigate the extreme label scarcity problem. However, this paradigm poses a challenge: pseudo-labels generated by image registration come with significant noise. To address this issue, we propose a self-paced sample selection fr… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted to MICCAI 2024

  8. arXiv:2407.05200  [pdf, other

    astro-ph.GA

    First Results from the Dragonfly Ultrawide Survey: the Largest Eleven Quenched Diffuse Dwarf Galaxies in 3100 deg$^2$ with Spectroscopic Confirmation

    Authors: Zili Shen, William P. Bowman, Pieter van Dokkum, Roberto G. Abraham, Imad Pasha, Michael A. Keim, Qing Liu, Deborah M. Lokhorst, Steven R. Janssens, Seery Chen

    Abstract: The Dragonfly Telephoto Array employs a unique design to detect very large and diffuse galaxies, which might be missed with conventional telescopes. The Dragonfly Ultrawide Survey (DFUWS) is a new wide-field survey which will cover 10,000 deg$^2$ of the northern sky, and it provides an ideal dataset to find these large diffuse galaxies. From 3100 deg$^2$ of DFUWS data, we identified eleven large,… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Submitted to ApJ

  9. arXiv:2407.03993  [pdf, other

    cs.CL

    A Survey on Natural Language Counterfactual Generation

    Authors: Yongjie Wang, Xiaoqi Qiu, Yu Yue, Xu Guo, Zhiwei Zeng, Yuhong Feng, Zhiqi Shen

    Abstract: Natural Language Counterfactual generation aims to minimally modify a given text such that the modified text will be classified into a different class. The generated counterfactuals provide insight into the reasoning behind a model's predictions by highlighting which words significantly influence the outcomes. Additionally, they can be used to detect model fairness issues or augment the training d… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: A survey paper

    MSC Class: 68T50 ACM Class: I.2.7

  10. arXiv:2407.03546  [pdf, other

    math.PR

    Exponential Euler method for stiff SDEs driven by fractional Brownian motion

    Authors: Haozhe Chen, Zhaotong Shen, Qian Yu

    Abstract: In a recent paper by Kamrani et al. (2024), exponential Euler method for stiff stochastic differential equations with additive fractional Brownian noise was discussed, and the convergence order close to the Hurst parameter H was proved. Utilizing the technique of Malliavin derivative, we prove the exponential Euler scheme and obtain a convergence order of one, which is the optimal rate in numerica… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  11. arXiv:2407.03227  [pdf, other

    cs.CL cs.AI cs.DB

    Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning

    Authors: Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan

    Abstract: We focus on Text-to-SQL semantic parsing from the perspective of Large Language Models. Motivated by challenges related to the size of commercial database schemata and the deployability of business intelligence solutions, we propose an approach that dynamically retrieves input database information and uses abstract syntax trees to select few-shot examples for in-context learning. Furthermore, we… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  12. arXiv:2407.02600  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Macroscopic uniform 2D moiré superlattices with controllable angles

    Authors: Gregory Zaborski Jr., Paulina E. Majchrzak, Samuel Lai, Amalya C. Johnson, Ashley P. Saunders, Ziyan Zhu, Yujun Deng, Donghui Lu, Makoto Hashimoto, Z-X Shen, Fang Liu

    Abstract: Moiré superlattices, engineered through precise stacking of van der Waals (vdW) layers, hold immense promise for exploring strongly correlated and topological phenomena. However, these applications have been held back by the common preparation method: tear-and-stack of Scotch tape exfoliated monolayers. It has low efficiency and reproducibility, along with challenges of twist angle inhomogeneity,… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 16 pages, 4 figures

  13. arXiv:2406.20098  [pdf, other

    cs.CV cs.AI cs.CL

    Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

    Authors: Sukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen

    Abstract: Multimodal large language models (MLLMs) have shown impressive success across modalities such as image, video, and audio in a variety of understanding and generation tasks. However, current MLLMs are surprisingly poor at understanding webpage screenshots and generating their corresponding HTML code. To address this problem, we propose Web2Code, a benchmark consisting of a new large-scale webpage-t… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Website at https://mbzuai-llm.github.io/webpage2code/

  14. arXiv:2406.19711  [pdf, other

    cs.LG

    CHASE: A Causal Heterogeneous Graph based Framework for Root Cause Analysis in Multimodal Microservice Systems

    Authors: Ziming Zhao, Tiehua Zhang, Zhishu Shen, Hai Dong, Xingjun Ma, Xianhui Liu, Yun Yang

    Abstract: In recent years, the widespread adoption of distributed microservice architectures within the industry has significantly increased the demand for enhanced system availability and robustness. Due to the complex service invocation paths and dependencies at enterprise-level microservice systems, it is challenging to locate the anomalies promptly during service invocations, thus causing intractable is… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  15. arXiv:2406.19644  [pdf, other

    cs.AI

    Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs

    Authors: Zichao Shen, Tianchen Zhu, Qingyun Sun, Shiqi Gao, Jianxin Li

    Abstract: Reinforcement learning (RL) faces challenges in evaluating policy trajectories within intricate game tasks due to the difficulty in designing comprehensive and precise reward functions. This inherent difficulty curtails the broader application of RL within game environments characterized by diverse constraints. Preference-based reinforcement learning (PbRL) presents a pioneering framework that cap… ▽ More

    Submitted 30 June, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: accepted by IJCAI 2024 GAAMAL

  16. arXiv:2406.18962  [pdf, other

    cs.IR

    Multi-modal Food Recommendation using Clustering and Self-supervised Learning

    Authors: Yixin Zhang, Xin Zhou, Qianwen Meng, Fanglin Zhu, Yonghui Xu, Zhiqi Shen, Lizhen Cui

    Abstract: Food recommendation systems serve as pivotal components in the realm of digital lifestyle services, designed to assist users in discovering recipes and food items that resonate with their unique dietary predilections. Typically, multi-modal descriptions offer an exhaustive profile for each recipe, thereby ensuring recommendations that are both personalized and accurate. Our preliminary investigati… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Working paper

  17. arXiv:2406.18769  [pdf

    cond-mat.mes-hall cond-mat.other

    Subharmonic oscillations in the Floquet circuit with the frequency-synthesis dimension

    Authors: Bo Lv, Shiyun Xia, Ye Tian, Ting Liu, Hongyang Mu, Zhichao Shen, Sijie Wang, Zheng Zhu, Huibin Tao, Fanyi Meng, Jinhui Shi

    Abstract: The period-doubling oscillation emerges with the coexistence between zero and π modes in Floquet topological insulator. Here, utilized the flexibility of the circuit, we construct the Floquet circuit with frequency-synthetic dimension and find the topological-protected deeply-subharmonic oscillations with the period extensively exceeding the doubling-driven period. In the construction framework, t… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 31 pages, 10 figures

  18. arXiv:2406.17991  [pdf, other

    astro-ph.CO

    Tele-Correlation: Calibrating Shear-Shear Correlation with Real Data

    Authors: Zhi Shen, Jun Zhang, Cong Liu, Hekun Li, Haoran Wang, Zhenjie Liu, Jiarui Sun

    Abstract: Tele-correlation refers to the correlation of galaxy shapes with large angular separations (e.g., $>100$ degrees). Since there are no astrophysical reasons causing such a correlation on cosmological scales, any detected tele-correlation could disclose systematic effects in shear-shear correlation measurement. If the shear estimators are measured on single exposures, we show that the field distorti… ▽ More

    Submitted 27 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  19. arXiv:2406.17979  [pdf, other

    astro-ph.IM

    Realizing the potential of the Dragonfly Spectral Line Mapper: Calibration methods and on-sky performance

    Authors: Deborah M. Lokhorst, Seery Chen, Imad Pasha, Victoria Purcell, William P. Bowman, Qing Liu, Zili Shen, Aidan MacNichol, Evgeni I. Malakhov, Roberto G. Abraham, Pieter van Dokkum

    Abstract: The Dragonfly Spectral Line Mapper is an innovative all-refracting telescope designed to carry out ultra-low surface brightness wide-field mapping of visible wavelength line emission. Equipped with ultranarrowband (0.8 nm bandwidth) filters mounted in Dragonfly Filter-Tilter instrumentation, the Dragonfly Spectral Line Mapper maps H$α$, [NII]$λ$6583, and [OIII]$λ$5007 line emission produced by str… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures, SPIE Astronomical Telescopes and Instrumentation 2024 Proceedings

  20. arXiv:2406.17006  [pdf, other

    hep-ex

    Probing the nature of the $χ_{c1}(3872)$ state using radiative decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1094 additional authors not shown)

    Abstract: The radiative decays $χ_{c1}(3872)\rightarrowψ(2S)γ$ and $χ_{c1}(3872)\rightarrow J/ψγ$ are used to probe the~nature of the~$χ_{c1}(3872)$ state using proton-proton collision data collected with the LHCb detector, corresponding to an~integrated luminosity of~9fb$^{-1}$. Using the~$B^+\rightarrow χ_{c1}(3872)K^+$decay, the $χ_{c1}(3872)\rightarrow ψ(2S)γ$ process is observed for the first time and… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 31 pages, 2 figures. All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-015.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-015, CERN-EP-2025-157

  21. arXiv:2406.16137  [pdf, other

    cs.CV

    MLPHand: Real Time Multi-View 3D Hand Mesh Reconstruction via MLP Modeling

    Authors: Jian Yang, Jiakun Li, Guoming Li, Zhen Shen, Huai-Yu Wu, Zhaoxin Fan, Heng Huang

    Abstract: Multi-view hand mesh reconstruction is a critical task for applications in virtual reality and human-computer interaction, but it remains a formidable challenge. Although existing multi-view hand reconstruction methods achieve remarkable accuracy, they typically come with an intensive computational burden that hinders real-time inference. To this end, we propose MLPHand, a novel method designed fo… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  22. arXiv:2406.15786  [pdf, other

    cs.LG cs.AI cs.CL

    What Matters in Transformers? Not All Attention is Needed

    Authors: Shwai He, Guoheng Sun, Zheyu Shen, Ang Li

    Abstract: Scaling Transformer-based large language models (LLMs) has demonstrated promising performance across various tasks. However, this scaling also introduces redundant structures, posing challenges for real-world deployment. Despite some recognition of redundancy in LLMs, the variability of redundancy across different structures, such as MLP and Attention layers, is under-explored. In this work, we in… ▽ More

    Submitted 7 July, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: 15 pages, 13 figures, 6 tables

  23. arXiv:2406.15301  [pdf, other

    astro-ph.IM astro-ph.GA

    Software infrastructure for the highly-distributed semi-autonomous Dragonfly Spectral Line Mapper

    Authors: Imad Pasha, Seery Chen, Deborah Lokhorst, William P. Bowman, Zili Shen, Qing Liu, Evgeni I. Malakhov, Roberto Abraham, Pieter G. van Dokkum

    Abstract: The Dragonfly Spectral Line Mapper (DSLM) is a semi-autonomous, distributed-aperture based telescope design, featuring a modular setup of 120 Canon telephoto lenses, and equal numbers of ultra-narrowband filters, detectors, and other peripherals. Here we introduce the observatory software stack for this highly-distributed system. Its core is the Dragonfly Communication Protocol (DCP), a pure-Pytho… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 18 pages, presented at the SPIE Astronomical Telescopes and Instrumentation conference in Yokohama, Japan

  24. arXiv:2406.15101  [pdf, other

    astro-ph.IM

    The Dragonfly Spectral Line Mapper: Completion of the 120-lens array

    Authors: Seery Chen, Deborah M. Lokhorst, Imad Pasha, William P. Bowman, Qing Liu, Zili Shen, Aidan MacNichol, Evgeni I. Malakhov, Roberto G. Abraham, Pieter van Dokkum

    Abstract: The Dragonfly Spectral Line Mapper is a mosaic telescope comprising 120 Canon telephoto lenses, based on the design of the Dragonfly Telephoto Array. With a wide field of view, and the addition of the "Dragonfly Filter-Tilter" instrumentation holding ultra narrow bandpass filters in front of each lens, the Dragonfly Spectral Line mapper is optimized for ultra low surface brightness imaging of visi… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 12 pages, 9 figures, SPIE conference proceedings

  25. arXiv:2406.14318  [pdf, other

    cs.CR cs.AI cs.CL

    The Fire Thief Is Also the Keeper: Balancing Usability and Privacy in Prompts

    Authors: Zhili Shen, Zihang Xi, Ying He, Wei Tong, Jingyu Hua, Sheng Zhong

    Abstract: The rapid adoption of online chatbots represents a significant advancement in artificial intelligence. However, this convenience brings considerable privacy concerns, as prompts can inadvertently contain sensitive information exposed to large language models (LLMs). Limited by high computational costs, reduced task usability, and excessive system modifications, previous works based on local deploy… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  26. arXiv:2406.13225  [pdf, other

    cs.LG cs.AI cs.IR

    Communication-Efficient Federated Knowledge Graph Embedding with Entity-Wise Top-K Sparsification

    Authors: Xiaoxiong Zhang, Zhiwei Zeng, Xin Zhou, Dusit Niyato, Zhiqi Shen

    Abstract: Federated Knowledge Graphs Embedding learning (FKGE) encounters challenges in communication efficiency stemming from the considerable size of parameters and extensive communication rounds. However, existing FKGE methods only focus on reducing communication rounds by conducting multiple rounds of local training in each communication round, and ignore reducing the size of parameters transmitted with… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  27. arXiv:2406.12111  [pdf, other

    hep-ex

    Precision measurement of the $Ξ^-_b$ baryon lifetime

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1064 additional authors not shown)

    Abstract: A sample of $pp$ collision data, corresponding to an integrated luminosity of 5.5 fb$^{-1}$ and collected by the LHCb experiment during Run 2, is used to measure the ratio of the lifetime of the $Ξ^-_b$ baryon to that of the $Λ^0_b$ baryon, $r_τ\equivτ_{Ξ^-_b}/τ_{Λ^0_b}$. The value ${r_τ^{\rm Run\,2}=1.076\pm0.013\pm0.006}$ is obtained, where the first uncertainty is statistical and the second sys… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures. All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2014-010.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-010, CERN-EP-2024-139

  28. arXiv:2406.11943  [pdf, other

    cs.IR cs.AI

    Personalized Federated Knowledge Graph Embedding with Client-Wise Relation Graph

    Authors: Xiaoxiong Zhang, Zhiwei Zeng, Xin Zhou, Dusit Niyato, Zhiqi Shen

    Abstract: Federated Knowledge Graph Embedding (FKGE) has recently garnered considerable interest due to its capacity to extract expressive representations from distributed knowledge graphs, while concurrently safeguarding the privacy of individual clients. Existing FKGE methods typically harness the arithmetic mean of entity embeddings from all clients as the global supplementary knowledge, and learn a repl… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  29. arXiv:2406.11418  [pdf, other

    cs.CL

    BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM

    Authors: Zhewen Shen, Aditya Joshi, Ruey-Cheng Chen

    Abstract: Children from bilingual backgrounds benefit from interactions with parents and teachers to re-acquire their heritage language. In this paper, we investigate how this insight from behavioral study can be incorporated into the learning of small-scale language models. We introduce BAMBINO-LM, a continual pre-training strategy for BabyLM that uses a novel combination of alternation and PPO-based perpl… ▽ More

    Submitted 9 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 5 pages + references; Selected at CMCL Workshop@ACL 2024

  30. arXiv:2406.09133  [pdf

    cs.CL

    RH-SQL: Refined Schema and Hardness Prompt for Text-to-SQL

    Authors: Jiawen Yi, Guo Chen, Zixiang Shen

    Abstract: Text-to-SQL is a technology that converts natural language queries into the structured query language SQL. A novel research approach that has recently gained attention focuses on methods based on the complexity of SQL queries, achieving notable performance improvements. However, existing methods entail significant storage and training costs, which hampers their practical application. To address th… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 4 pages, 2 figures, 2024 6th International Conference on Electronic Engineering and Informatics (EEI 2024)

  31. arXiv:2406.09084  [pdf, other

    stat.ML cs.LG

    Operator-informed score matching for Markov diffusion models

    Authors: Zheyang Shen, Chris J. Oates

    Abstract: Diffusion models are typically trained using score matching, yet score matching is agnostic to the particular forward process that defines the model. This paper argues that Markov diffusion models enjoy an advantage over other types of diffusion model, as their associated operators can be exploited to improve the training process. In particular, (i) there exists an explicit formal solution to the… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Preprint; 19 pages, 5 figures

  32. arXiv:2406.07835  [pdf, other

    cs.CL cs.AI

    SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

    Authors: David Wadden, Kejian Shi, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Shannon Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan

    Abstract: We present SciRIFF (Scientific Resource for Instruction-Following and Finetuning), a dataset of 137K instruction-following demonstrations for 54 tasks covering five essential scientific literature understanding capabilities: information extraction, summarization, question answering, claim verification, and classification. SciRIFF demonstrations are notable for their long input contexts, detailed t… ▽ More

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Submitted to NeurIPS Datasets and Benchmarks 2024

  33. arXiv:2406.07545  [pdf, other

    cs.CL cs.AI

    Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena

    Authors: Aidar Myrzakhan, Sondos Mahmoud Bsharat, Zhiqiang Shen

    Abstract: Multiple-choice questions (MCQ) are frequently used to assess large language models (LLMs). Typically, an LLM is given a question and selects the answer deemed most probable after adjustments for factors like length. Unfortunately, LLMs may inherently favor certain answer choice IDs, such as A/B/C/D, due to inherent biases of priori unbalanced probabilities, influencing the prediction of answers b… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Code and dataset are available at https://github.com/VILA-Lab/Open-LLM-Leaderboard

  34. arXiv:2406.05562  [pdf, ps, other

    math.KT math.AG

    $G_0$ of affine, simplicial toric varieties

    Authors: Zeyu Shen

    Abstract: Let $X$ be an affine, simplicial toric variety over a field. Let $G_0$ denote the Grothendieck group of coherent sheaves on a Noetherian scheme and let $F^1G_0$ denote the first step of the filtration on $G_0$ by codimension of support. Then $G_0(X)\cong\mathbb{Z}\oplus F^1G_0(X)$ and $F^1G_0(X)$ is a finite abelian group. In dimension 2, we show that $F^1G_0(X)$ is a finite cyclic group and deter… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 10 pages

    MSC Class: 19A99; 14C35; 19E08

  35. arXiv:2406.04689  [pdf, other

    cs.CV

    CDeFuse: Continuous Decomposition for Infrared and Visible Image Fusion

    Authors: Haolong Ma, Hui Li, Chunyang Cheng, Xiaoning Song, Zhongwei Shen

    Abstract: As a common image processing technique, image decomposition is often used to extract complementary information between modalities. In current decomposition-based image fusion methods, typically, source images are decomposed into three parts at single scale (i.e., visible-exclusive part, infrared-exclusive part, and common part) and lacking interaction between modalities during the decomposition pr… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  36. arXiv:2406.03933  [pdf, other

    cs.CR cs.IR

    Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation

    Authors: Honglei Zhang, Haoxuan Li, Jundong Chen, Sen Cui, Kunda Yan, Abudukelimu Wuerkaixi, Xin Zhou, Zhiqi Shen, Yidong Li

    Abstract: Federated recommendation aims to collect global knowledge by aggregating local models from massive devices, to provide recommendations while ensuring privacy. Current methods mainly leverage aggregation functions invented by federated vision community to aggregate parameters from similar clients, e.g., clustering aggregation. Despite considerable performance, we argue that it is suboptimal to appl… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  37. arXiv:2406.03387  [pdf, other

    hep-ex

    Measurement of the branching fraction ratios $R(D^{+})$ and $R(D^{*+})$ using muonic $τ$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1063 additional authors not shown)

    Abstract: The branching fraction ratios of $\overline{B}^0\to D^+τ^-\overlineν_τ$ and $\overline{B}^0\to D^{*+}τ^-\overlineν_τ$ decays are measured with respect to their muonic counterparts, using a data sample corresponding to an integrated luminosity of 2.0 fb$^{-1}$ collected by the LHCb experiment in proton-proton collisions at $\sqrt{s} = 13$ TeV. The reconstructed final states are formed by combining… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lhcbproject.web.cern.ch/Publications/LHCbProjectPublic/LHCb-PAPER-2024-007.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-007, CERN-EP-2024-125

  38. arXiv:2406.03156  [pdf, other

    hep-ex

    Observation of new charmonium(-like) states in $B^+ \to D^{*\pm} D^{\mp} K^+$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1062 additional authors not shown)

    Abstract: A study of resonant structures in $B^{+}\rightarrow{D^{\ast+}D^{-}K^{+}}$ and $B^{+}\rightarrow{D^{\ast-}D^{+}K^{+}}$ decays is performed, using proton-proton collision data at centre-of-mass energies of $\sqrt{s}=7, 8$, and $13$ TeV recorded by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. A simultaneous amplitude fit is performed to the two channels with contribu… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-047.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-047, CERN-EP-2024-096

  39. arXiv:2406.02618  [pdf, other

    q-bio.QM cs.AI eess.IV

    Immunocto: a massive immune cell database auto-generated for histopathology

    Authors: Mikaël Simard, Zhuoyan Shen, Maria A. Hawkins, Charles-Antoine Collins-Fekete

    Abstract: With the advent of novel cancer treatment options such as immunotherapy, studying the tumour immune micro-environment is crucial to inform on prognosis and understand response to therapeutic agents. A key approach to characterising the tumour immune micro-environment may be through combining (1) digitised microscopic high-resolution optical images of hematoxylin and eosin (H&E) stained tissue sect… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  40. arXiv:2406.00435  [pdf, other

    math.OC q-fin.MF

    Modelling Non-monotone Risk Aversion and Convex Compensation in Incomplete Markets

    Authors: Yang Liu, Zhenyu Shen

    Abstract: In hedge funds, convex compensation schemes are popular to stimulate a high-profit performance for portfolio managers. In economics, non-monotone risk aversion is proposed to argue that individuals may not be risk-averse when the wealth level is low. Combining these two ingredients, we study the optimal control strategy of the manager in incomplete markets. Generally, we propose a wide class of ut… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    MSC Class: 91B16; 91G10

  41. arXiv:2406.00235  [pdf, other

    hep-ex

    Amplitude analysis of the radiative decay $B^0_s\to K^+K^-γ$

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1061 additional authors not shown)

    Abstract: A search for radiative decay of $B^0_s$ mesons to orbitally excited $K^+K^-$ states is performed using proton proton collisions recorded by the \mbox{LHCb}\xspace experiment, corresponding to an integrated luminosity of 9~fb$^{-1}$. The dikaon spectrum in the mass range $m_{KK}<2400$~{\ensuremath{\,\text{Me\kern -0.1em V\!/}c^2}\xspace} is dominated by the $φ(1020)$ resonance that accounts for alm… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-002.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-002, CERN-EP-2024-115

  42. arXiv:2406.00165  [pdf, ps, other

    math-ph math.DS math.PR

    Mesoscopic and Macroscopic Entropy Balance Equations in a Stochastic Dynamics and Its Deterministic Limit

    Authors: Hong Qian, Zhongwei Shen

    Abstract: Entropy, its production, and its change in a dynamical system can be understood from either a fully stochastic dynamic description or from a deterministic dynamics exhibiting chaotic behavior. By taking the former approach based on the general diffusion process with diffusion $\tfrac{1}α{\bf D}(\bf x)$ and drift $\bf b(\bf x)$, where $α$ represents the ``size parameter'' of a system, we show that… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 19 pages

  43. arXiv:2405.20764  [pdf, other

    cs.CV

    CoMoFusion: Fast and High-quality Fusion of Infrared and Visible Image with Consistency Model

    Authors: Zhiming Meng, Hui Li, Zeyang Zhang, Zhongwei Shen, Yunlong Yu, Xiaoning Song, Xiaojun Wu

    Abstract: Generative models are widely utilized to model the distribution of fused images in the field of infrared and visible image fusion. However, current generative models based fusion methods often suffer from unstable training and slow inference speed. To tackle this problem, a novel fusion method based on consistency model is proposed, termed as CoMoFusion, which can generate the high-quality images… ▽ More

    Submitted 11 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  44. arXiv:2405.18373  [pdf, other

    stat.ML cs.LG math.OC

    A Hessian-Aware Stochastic Differential Equation for Modelling SGD

    Authors: Xiang Li, Zebang Shen, Liang Zhang, Niao He

    Abstract: Continuous-time approximation of Stochastic Gradient Descent (SGD) is a crucial tool to study its escaping behaviors from stationary points. However, existing stochastic differential equation (SDE) models fail to fully capture these behaviors, even for simple quadratic objectives. Built on a novel stochastic backward error analysis framework, we derive the Hessian-Aware Stochastic Modified Equatio… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  45. arXiv:2405.17347  [pdf, other

    hep-ex

    Comprehensive analysis of local and nonlocal amplitudes in the $B^0\rightarrow K^{*0}μ^+μ^-$ decay

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1070 additional authors not shown)

    Abstract: A comprehensive study of the local and nonlocal amplitudes contributing to the decay $B^0\rightarrow K^{*0}(\to K^+π^-) μ^+μ^-$ is performed by analysing the phase-space distribution of the decay products. The analysis is based on \proton\proton collision data corresponding to an integrated luminosity of 8.4fb$^{-1}$ collected by the LHCb experiment. This measurement employs for the first time a m… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-011.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-011, CERN-EP-2024-122

  46. arXiv:2405.16432  [pdf, other

    cond-mat.mes-hall

    Revealing the hidden Dirac gap in a topological antiferromagnet using Floquet-Bloch manipulation

    Authors: Nina Bielinski, Rajas Chari, Julian May-Mann, Soyeun Kim, Jack Zwettler, Yujun Deng, Anuva Aishwarya, Subhajit Roychowdhury, Chandra Shekhar, Makoto Hashimoto, Donghui Lu, Jiaqiang Yan, Claudia Felser, Vidya Madhavan, Zhi-Xun Shen, Taylor L. Hughes, Fahad Mahmood

    Abstract: Manipulating solids using the time-periodic drive of a laser pulse is a promising route to generate new phases of matter. Whether such `Floquet-Bloch' manipulation can be achieved in topological magnetic systems with disorder has so far been unclear. In this work, we realize Floquet-Bloch manipulation of the Dirac surface-state mass of the topological antiferromagnet (AFM) MnBi$_2$Te$_4$. Using ti… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  47. arXiv:2405.16395  [pdf, other

    cs.LG

    Daily Physical Activity Monitoring -- Adaptive Learning from Multi-source Motion Sensor Data

    Authors: Haoting Zhang, Donglin Zhan, Yunduan Lin, Jinghai He, Qing Zhu, Zuo-Jun Max Shen, Zeyu Zheng

    Abstract: In healthcare applications, there is a growing need to develop machine learning models that use data from a single source, such as that from a wrist wearable device, to monitor physical activities, assess health risks, and provide immediate health recommendations or interventions. However, the limitation of using single-source data often compromises the model's accuracy, as it fails to capture the… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  48. arXiv:2405.15843  [pdf, other

    cs.CV cs.AI

    SpotNet: An Image Centric, Lidar Anchored Approach To Long Range Perception

    Authors: Louis Foucard, Samar Khanna, Yi Shi, Chi-Kuei Liu, Quinn Z Shen, Thuyen Ngo, Zi-Xiang Xia

    Abstract: In this paper, we propose SpotNet: a fast, single stage, image-centric but LiDAR anchored approach for long range 3D object detection. We demonstrate that our approach to LiDAR/image sensor fusion, combined with the joint learning of 2D and 3D detection tasks, can lead to accurate 3D object detection with very sparse LiDAR support. Unlike more recent bird's-eye-view (BEV) sensor-fusion methods whi… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  49. arXiv:2405.14778  [pdf, ps, other

    stat.ML cs.LG

    Optimal Rates for Vector-Valued Spectral Regularization Learning Algorithms

    Authors: Dimitri Meunier, Zikai Shen, Mattes Mollenhauer, Arthur Gretton, Zhu Li

    Abstract: We study theoretical properties of a broad class of regularized algorithms with vector-valued output. These spectral algorithms include kernel ridge regression, kernel principal component regression, various implementations of gradient descent and many more. Our contributions are twofold. First, we rigorously confirm the so-called saturation effect for ridge regression with vector-valued output by… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  50. arXiv:2405.13103  [pdf, other

    hep-ex

    Search for the lepton-flavor violating decay $B^0_s\toφμ^\pmτ^\mp$

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1062 additional authors not shown)

    Abstract: A search for the lepton-flavor violating decays $B^0_s\toφμ^\pmτ^\mp$ is presented, using a sample of proton-proton collisions at center-of-mass energies of 7, 8, and 13 TeV, collected with the LHCb detector and corresponding to a total integrated luminosity of $9\,\text{fb}^{-1}$. The $τ$ leptons are selected using decays with three charged pions. No significant excess is observed, and an upper l… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-006.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-006, CERN-EP-2024-114