Skip to main content

Showing 1–50 of 604 results for author: Peng, W

  1. Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification

    Authors: Wenshuo Peng, Kaipeng Zhang, Yue Yang, Hao Zhang, Yu Qiao

    Abstract: Vision-language foundation models have been incredibly successful in a wide range of downstream computer vision tasks using adaptation methods. However, due to the high cost of obtaining pre-training datasets, pairs with weak image-text correlation in the data exist in large numbers. We call them weak-paired samples. Due to the limitations of these weak-paired samples, the pre-training model are u… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 9 pages,4 figures

  2. arXiv:2407.08085  [pdf, other

    hep-ex astro-ph.CO physics.ins-det

    Light Dark Matter Constraints from SuperCDMS HVeV Detectors Operated Underground with an Anticoincidence Event Selection

    Authors: SuperCDMS Collaboration, M. F. Albakry, I. Alkhatib, D. Alonso-González, D. W. P. Amaral, J. Anczarski, T. Aralis, T. Aramaki, I. J. Arnquist, I. Ataee Langroudy, E. Azadbakht, C. Bathurst, R. Bhattacharyya, A. J. Biffl, P. L. Brink, M. Buchanan, R. Bunker, B. Cabrera, R. Calkins, R. A. Cameron, C. Cartaro, D. G. Cerdeño, Y. -Y. Chang, M. Chaudhuri, J. -H. Chen , et al. (115 additional authors not shown)

    Abstract: This article presents constraints on dark-matter-electron interactions obtained from the first underground data-taking campaign with multiple SuperCDMS HVeV detectors operated in the same housing. An exposure of 7.63 g-days is used to set upper limits on the dark-matter-electron scattering cross section for dark matter masses between 0.5 and 1000 MeV/$c^2$, as well as upper limits on dark photon k… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 7 pages + title and references, 4 figures, and 1 table

  3. arXiv:2407.03245  [pdf, other

    cs.RO cs.AI eess.SY

    TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach

    Authors: Weikun Peng, Jun Lv, Yuwei Zeng, Haonan Chen, Siheng Zhao, Jichen Sun, Cewu Lu, Lin Shao

    Abstract: The tie-knotting task is highly challenging due to the tie's high deformation and long-horizon manipulation actions. This work presents TieBot, a Real-to-Sim-to-Real learning from visual demonstration system for the robots to learn to knot a tie. We introduce the Hierarchical Feature Matching approach to estimate a sequence of tie's meshes from the demonstration video. With these estimated meshes… ▽ More

    Submitted 3 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: fix few typos

  4. arXiv:2406.18129  [pdf, other

    cs.CV cs.LG

    CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection

    Authors: Meiying Zhang, Weiyuan Peng, Guangyao Ding, Chenyang Lei, Chunlin Ji, Qi Hao

    Abstract: Simulation data can be accurately labeled and have been expected to improve the performance of data-driven algorithms, including object detection. However, due to the various domain inconsistencies from simulation to reality (sim-to-real), cross-domain object detection algorithms usually suffer from dramatic performance drops. While numerous unsupervised domain adaptation (UDA) methods have been d… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  5. arXiv:2406.18116  [pdf, other

    cs.CL cs.AI cs.HC

    BADGE: BADminton report Generation and Evaluation with LLM

    Authors: Shang-Hsuan Chiang, Lin-Wei Chao, Kuang-Da Wang, Chih-Chuan Wang, Wen-Chih Peng

    Abstract: Badminton enjoys widespread popularity, and reports on matches generally include details such as player names, game scores, and ball types, providing audiences with a comprehensive view of the games. However, writing these reports can be a time-consuming task. This challenge led us to explore whether a Large Language Model (LLM) could automate the generation and evaluation of badminton reports. We… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted by IJCAI 2024 Workshop: The 2nd International Workshop on Intelligent Technologies for Precision Sports Science (IT4PSS)

  6. arXiv:2406.11483  [pdf

    cs.CE

    Analysis of water injection heat recovery potential of abandoned oil wells to geothermal wells in northern Shaanxi

    Authors: Yu Huagui, Liu Shi, Pang Yanyan, Wang Peng, Gao Qian

    Abstract: The Chang 2 bottom water reservoir area in the western part of northern Shaanxi is one of the core oil-producing areas in the Ordos Basin.One of the main reservoirs is the Chang 2 reservoir of the Triassic Yanchang Formation, which has good physical conditions, active edge and bottom water, and high geothermal gradient. In this paper, the reservoir numerical simulation software CMG is used to simu… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Journal ref: Modern Electric Power, 2023, 1-9

  7. arXiv:2406.11176  [pdf, other

    cs.CL cs.AI

    Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement

    Authors: Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li

    Abstract: Large language model agents have exhibited exceptional performance across a range of complex interactive tasks. Recent approaches have utilized tuning with expert trajectories to enhance agent performance, yet they primarily concentrate on outcome rewards, which may lead to errors or suboptimal actions due to the absence of process supervision signals. In this paper, we introduce the Iterative ste… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  8. arXiv:2406.10744  [pdf, other

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, Cong Li, Senyan Xu , et al. (75 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a… ▽ More

    Submitted 12 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 PBDL Challenges: https://pbdl-ws.github.io/pbdl2024/challenge/index.html

  9. arXiv:2406.09265  [pdf, other

    cs.CL

    Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs

    Authors: Weixuan Wang, Barry Haddow, Wei Peng, Alexandra Birch

    Abstract: Multilingual large language models (LLMs) have greatly increased the ceiling of performance on non-English tasks. However the mechanisms behind multilingualism in these LLMs are poorly understood. Of particular interest is the degree to which internal representations are shared between languages. Recent work on neuron analysis of LLMs has focused on the monolingual case, and the limited work on th… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2405.15299  [pdf, other

    cs.CV

    Transparent Object Depth Completion

    Authors: Yifan Zhou, Wanli Peng, Zhongyu Yang, He Liu, Yi Sun

    Abstract: The perception of transparent objects for grasp and manipulation remains a major challenge, because existing robotic grasp methods which heavily rely on depth maps are not suitable for transparent objects due to their unique visual properties. These properties lead to gaps and inaccuracies in the depth maps of the transparent objects captured by depth sensors. To address this issue, we propose an… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  11. arXiv:2405.10305  [pdf, other

    cs.CV cs.AI

    4D Panoptic Scene Graph Generation

    Authors: Jingkang Yang, Jun Cen, Wenxuan Peng, Shuai Liu, Fangzhou Hong, Xiangtai Li, Kaiyang Zhou, Qifeng Chen, Ziwei Liu

    Abstract: We are living in a three-dimensional space while moving forward through a fourth dimension: time. To allow artificial intelligence to develop a comprehensive understanding of such a 4D environment, we introduce 4D Panoptic Scene Graph (PSG-4D), a new representation that bridges the raw visual data perceived in a dynamic 4D world and high-level visual understanding. Specifically, PSG-4D abstracts r… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted as NeurIPS 2023. Code: https://github.com/Jingkang50/PSG4D Previous Series: PSG https://github.com/Jingkang50/OpenPSG and PVSG https://github.com/Jingkang50/OpenPVSG

  12. arXiv:2405.07464  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Atomic-scale tunable phonon transport at tailored grain boundaries

    Authors: Xiaowang Wang, Chaitanya A. Gadre, Runqing Yang, Wanjuan Zou, Xing Bin, Christopher Addiego, Toshihiro Aoki, Yujie Quan, Wei-Tao Peng, Yifeng Huang, Chaojie Du, Mingjie Xu, Xingxu Yan, Ruqian Wu, Shyue Ping Ong, Bolin Liao, Penghui Cao, Xiaoqing Pan

    Abstract: Manipulating thermal properties in materials has been of fundamental importance for advancing innovative technologies. Heat carriers such as phonons are impeded by breaking crystal symmetry or periodicity. Notable methods of impeding the phonon propagation include varying the density of defects, interfaces, and nanostructures, as well as changing composition. However, a robust link between the ind… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  13. arXiv:2405.06964  [pdf, other

    cs.RO cs.AI

    ManiFoundation Model for General-Purpose Robotic Manipulation of Contact Synthesis with Arbitrary Objects and Robots

    Authors: Zhixuan Xu, Chongkai Gao, Zixuan Liu, Gang Yang, Chenrui Tie, Haozhuo Zheng, Haoyu Zhou, Weikun Peng, Debang Wang, Tianyi Chen, Zhouliang Yu, Lin Shao

    Abstract: To substantially enhance robot intelligence, there is a pressing need to develop a large model that enables general-purpose robots to proficiently undertake a broad spectrum of manipulation tasks, akin to the versatile task-planning ability exhibited by LLMs. The vast diversity in objects, robots, and manipulation tasks presents huge challenges. Our work introduces a comprehensive framework to dev… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  14. arXiv:2404.18527  [pdf

    cs.LG cs.AI cs.CR stat.AP

    Bridging Data Barriers among Participants: Assessing the Potential of Geoenergy through Federated Learning

    Authors: Weike Peng, Jiaxin Gao, Yuntian Chen, Shengwei Wang

    Abstract: Machine learning algorithms emerge as a promising approach in energy fields, but its practical is hindered by data barriers, stemming from high collection costs and privacy concerns. This study introduces a novel federated learning (FL) framework based on XGBoost models, enabling safe collaborative modeling with accessible yet concealed data from multiple parties. Hyperparameter tuning of the mode… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  15. arXiv:2404.16469  [pdf, ps, other

    cond-mat.supr-con

    From weak to strong-coupling superconductivity tuned by substrate in TiN films

    Authors: Yixin Liu, Zulei Xu, Aobo Yu, Xiaoni Wang, Wei Peng, Yu Wu, Gang Mu, Zhi-Rong Lin

    Abstract: The interplay between substrates and superconducting thin films has attracted increasing attention. Here, we report an in-depth investigation on superconducting properties of the epitaxial TiN thin films grown on two different substrates by dc reactive magnetron sputtering. The TiN films grown on (0001) sapphire exhibit (111) crystal orientation, while that grown on (100) Si substrates exhibit (10… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 6 pages, 5 figures

  16. arXiv:2404.10413  [pdf, other

    cs.DB cs.LG cs.PF

    VDTuner: Automated Performance Tuning for Vector Data Management Systems

    Authors: Tiannuo Yang, Wen Hu, Wangqi Peng, Yusen Li, Jianguo Li, Gang Wang, Xiaoguang Liu

    Abstract: Vector data management systems (VDMSs) have become an indispensable cornerstone in large-scale information retrieval and machine learning systems like large language models. To enhance the efficiency and flexibility of similarity search, VDMS exposes many tunable index parameters and system parameters for users to specify. However, due to the inherent characteristics of VDMS, automatic performance… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted by ICDE 2024

  17. arXiv:2404.10229  [pdf, other

    cs.CL

    Generative Text Steganography with Large Language Model

    Authors: Jiaxuan Wu, Zhengxian Wu, Yiming Xue, Juan Wen, Wanli Peng

    Abstract: Recent advances in large language models (LLMs) have blurred the boundary of high-quality text generation between humans and machines, which is favorable for generative text steganography. While, current advanced steganographic mapping is not suitable for LLMs since most users are restricted to accessing only the black-box API or user interface of the LLMs, thereby lacking access to the training v… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  18. arXiv:2404.07200  [pdf, other

    cs.LG

    Toward a Better Understanding of Fourier Neural Operators: Analysis and Improvement from a Spectral Perspective

    Authors: Shaoxiang Qin, Fuyuan Lyu, Wenhui Peng, Dingyang Geng, Ju Wang, Naiping Gao, Xue Liu, Liangzhu Leon Wang

    Abstract: In solving partial differential equations (PDEs), Fourier Neural Operators (FNOs) have exhibited notable effectiveness compared to Convolutional Neural Networks (CNNs). This paper presents clear empirical evidence through spectral analysis to elucidate the superiority of FNO over CNNs: FNO is significantly more capable of learning low-frequencies. This empirical evidence also unveils FNO's distinc… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  19. arXiv:2404.05834  [pdf, other

    physics.flu-dyn

    Fourier neural operator for large eddy simulation of compressible Rayleigh-Taylor turbulence

    Authors: Tengfei Luo, Zhijie Li, Zelong Yuan, Wenhui Peng, Tianyuan Liu, Liangzhu, Wang, Jianchun Wang

    Abstract: The Fourier neural operator (FNO) framework is applied to the large eddy simulation (LES) of three-dimensional compressible Rayleigh-Taylor (RT) turbulence with miscible fluids at Atwood number $A_t=0.5$, stratification parameter $Sr=1.0$, and Reynolds numbers $Re=10000$ and 30000. The FNO model is first used for predicting three-dimensional compressible turbulence. The different magnitudes of phy… ▽ More

    Submitted 2 July, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  20. arXiv:2403.16235  [pdf, other

    astro-ph.GA

    The Next Generation Virgo Cluster Survey (NGVS). III. A Catalog of Surface Brightness Fluctuation Distances and the Three-Dimensional Distribution of Galaxies in the Virgo Cluster

    Authors: Michele Cantiello, John P. Blakeslee, Patrick Côté, Gabriella Raimondo, Jean-Charles Cuillandre, Patrick R. Durrell, Stephen Gwyn, Nandini Hazra, Eric W. Peng, Joel C. Roediger, Rúben Sánchez-Janssen, Max Kurzner

    Abstract: The surface brightness fluctuation (SBF) method is a robust and efficient way of measuring distances to galaxies containing evolved stellar populations. Although many recent applications of the method have used space-based imaging, SBF remains a powerful technique for ground-based telescopes. Deep, wide-field imaging surveys with subarsecond seeing enable SBF measurements for numerous nearby galax… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 30 pages, 15 figures, Acccepted for publication on the ApJ

  21. arXiv:2403.16026  [pdf, other

    physics.flu-dyn nlin.CD

    A transformer-based neural operator for large-eddy simulation of turbulence

    Authors: Zhijie Li, Tianyuan Liu, Wenhui Peng, Zelong Yuan, Jianchun Wang

    Abstract: Predicting the large-scale dynamics of three-dimensional (3D) turbulence is challenging for machine learning approaches. This paper introduces a transformer-based neural operator (TNO) to achieve precise and efficient predictions in the large-eddy simulation (LES) of 3D turbulence. The performance of the proposed TNO model is systematically tested and compared with LES using classical sub-grid sca… ▽ More

    Submitted 6 June, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: 45 pages, 21 figures. arXiv admin note: text overlap with arXiv:2305.10215

  22. Observation of spectral lines in the exceptional GRB 221009A

    Authors: Yan-Qiu Zhang, Shao-Lin Xiong, Ji-Rong Mao, Shuang-Nan Zhang, Wang-Chen Xue, Chao Zheng, Jia-Cong Liu, Zhen Zhang, Xi-Lu Wang, Ming-Yu Ge, Shu-Xu Yi, Li-Ming Song, Zheng-Hua An, Ce Cai, Xin-Qiao Li, Wen-Xi Peng, Wen-Jun Tan, Chen-Wei Wang, Xiang-Yang Wen, Yue Wang, Shuo Xiao, Fan Zhang, Peng Zhang, Shi-Jie Zheng

    Abstract: As the brightest gamma-ray burst ever observed, GRB 221009A provided a precious opportunity to explore spectral line features. In this paper, we performed a comprehensive spectroscopy analysis of GRB 221009A jointly with GECAM-C and Fermi/GBM data to search for emission and absorption lines. For the first time we investigated the line feature throughout this GRB including the most bright part wher… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by SCIENCE CHINA Physics, Mechanics & Astronomy (SCPMA)

    Journal ref: Observation of spectral lines in the exceptional GRB 221009A. Sci. China-Phys. Mech. Astron. 67, 289511 (2024)

  23. arXiv:2403.12406  [pdf, other

    cs.AI cs.LG

    Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion

    Authors: Kuang-Da Wang, Wei-Yao Wang, Ping-Chun Hsieh, Wen-Chih Peng

    Abstract: In the dynamic and rapid tactic involvements of turn-based sports, badminton stands out as an intrinsic paradigm that requires alter-dependent decision-making of players. While the advancement of learning from offline expert data in sequential decision-making has been witnessed in various domains, how to rally-wise imitate the behaviors of human players from offline badminton matches has remained… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Preprint

  24. arXiv:2403.10281  [pdf, other

    cs.CL cs.AI cs.LG

    Team Trifecta at Factify5WQA: Setting the Standard in Fact Verification with Fine-Tuning

    Authors: Shang-Hsuan Chiang, Ming-Chih Lo, Lin-Wei Chao, Wen-Chih Peng

    Abstract: In this paper, we present Pre-CoFactv3, a comprehensive framework comprised of Question Answering and Text Classification components for fact verification. Leveraging In-Context Learning, Fine-tuned Large Language Models (LLMs), and the FakeNet model, we address the challenges of fact verification. Our experiments explore diverse approaches, comparing different Pre-trained LLMs, introducing FakeNe… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI 2024 Workshop: FACTIFY 3.0 - Workshop Series on Multimodal Fact-Checking and Hate Speech Detection

  25. arXiv:2403.09926  [pdf, other

    astro-ph.GA

    The Next Generation Virgo Cluster Survey (NGVS). XXVII.The Size and Structure of Globular Cluster Systems and their Connection to Dark Matter Halos

    Authors: Sungsoon Lim, Eric W. Peng, Patrick Côté, Laura Ferrarese, Joel C. Roediger, Chengze Liu, Chelsea Spengler, Elisabeth Sola, Pierre-Alain Duc, Laura V. Sales, John P. Blakeslee, Jean-Charles Cuillandre, Patrick R. Durrell, Eric Emsellem, Stephen D. J. Gwyn, Ariane Lançon, Francine R. Marleau, J. Christopher Mihos, Oliver Müller, Thomas H. Puzia, Rubén Sánchez-Janssen

    Abstract: We study the size and structure of globular clusters (GC) systems of 118 early-type galaxies from the NGVS, MATLAS, and ACSVCS surveys. Fitting Sérsic profiles, we investigate the relationship between effective radii of GC systems ($R_{e, \rm gc}$) and galaxy properties. GC systems are 2--4 times more extended than host galaxies across the entire stellar mass range of our sample (… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 28 pages, 18 Figures, 3 tables, accepted for publication in ApJ

  26. arXiv:2403.04785  [pdf, other

    cs.CL cs.AI

    Large Language Multimodal Models for 5-Year Chronic Disease Cohort Prediction Using EHR Data

    Authors: Jun-En Ding, Phan Nguyen Minh Thao, Wen-Chih Peng, Jian-Zhe Wang, Chun-Cheng Chug, Min-Chen Hsieh, Yun-Chien Tseng, Ling Chen, Dongsheng Luo, Chi-Te Wang, Pei-fu Chen, Feng Liu, Fang-Ming Hung

    Abstract: Chronic diseases such as diabetes are the leading causes of morbidity and mortality worldwide. Numerous research studies have been attempted with various deep learning models in diagnosis. However, most previous studies had certain limitations, including using publicly available datasets (e.g. MIMIC), and imbalanced data. In this study, we collected five-year electronic health records (EHRs) from… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  27. arXiv:2403.03051  [pdf, other

    physics.flu-dyn

    Prediction of turbulent channel flow using Fourier neural operator-based machine-learning strategy

    Authors: Yunpeng Wang, Zhijie Li, Zelong Yuan, Wenhui Peng, Tianyuan Liu, Jianchun Wang

    Abstract: Fast and accurate predictions of turbulent flows are of great importance in the science and engineering field. In this paper, we investigate the implicit U-Net enhanced Fourier neural operator (IUFNO) in the stable prediction of long-time dynamics of three-dimensional (3D) turbulent channel flows. The trained IUFNO models are tested in the large-eddy simulations (LES) at coarse grids for three fri… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  28. arXiv:2402.17271  [pdf, other

    physics.ins-det nucl-ex

    Capacitive coupling study of the HERD SCD prototype: preliminary results

    Authors: Ruo-Si Lu, Rui Qiao, Ke Gong, Wen-Xi Peng, Wei-Shuai Zhang, Dong-Ya Guo, Jia-Ju Wei, Yi-Ming Hu, Jian-Hua Guo, Qi Wu, Peng Hu, Xuan Liu, Bing Lu, Yi-Rong Zhang

    Abstract: The Silicon Charge Detector (SCD) is a subdetector of the High Energy Cosmic Radiation Detection payload. The dynamic range of the silicon microstrip detector can be extended by the capacitive coupling effect, which is related to the interstrip capacitance and the coupling capacitance. A detector prototype with several sets of parameters was designed and tested in the ion beams at the CERN Super P… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  29. arXiv:2402.15741  [pdf, other

    cond-mat.mtrl-sci

    Observation of the In-plane Anomalous Hall Effect induced by Octupole in Magnetization Space

    Authors: Wenzhi Peng, Zheng Liu, Haolin Pan, Peng Wang, Yulong Chen, Jiachen Zhang, Xuhao Yu, Jinhui Shen, Mingmin Yang, Qian Niu, Yang Gao, Dazhi Hou

    Abstract: The Anomalous Hall Effect (AHE) manifests as a transverse voltage proportional to magnetization in ferromagnetic materials under the application of a charge current, being an indispensable tool for probing magnetism, especially in nanoscale devices. However, the AHE primarily sensitizes to out-of-plane magnetization, thereby hindering its capacity to discern the in-plane magnetization, a character… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  30. arXiv:2402.12888  [pdf, other

    eess.IV

    Transformer-based Learned Image Compression for Joint Decoding and Denoising

    Authors: Yi-Hsin Chen, Kuan-Wei Ho, Shiau-Rung Tsai, Guan-Hsun Lin, Alessandro Gnutti, Wen-Hsiao Peng, Riccardo Leonardi

    Abstract: This work introduces a Transformer-based image compression system. It has the flexibility to switch between the standard image reconstruction and the denoising reconstruction from a single compressed bitstream. Instead of training separate decoders for these tasks, we incorporate two add-on modules to adapt a pre-trained image decoder from performing the standard image reconstruction to joint deco… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted to PCS 2024

  31. arXiv:2402.12816  [pdf, other

    eess.IV

    OMRA: Online Motion Resolution Adaptation to Remedy Domain Shift in Learned Hierarchical B-frame Coding

    Authors: Zong-Lin Gao, Sang NguyenQuang, Wen-Hsiao Peng, Xiem HoangVan

    Abstract: Learned hierarchical B-frame coding aims to leverage bi-directional reference frames for better coding efficiency. However, the domain shift between training and test scenarios due to dataset limitations poses a challenge. This issue arises from training the codec with small groups of pictures (GOP) but testing it on large GOPs. Specifically, the motion estimation network, when trained on small GO… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 7 pages, submitted to IEEE ICIP 2024

  32. arXiv:2402.05418  [pdf, other

    astro-ph.GA

    The Next Generation Virgo Cluster Survey. XXXVII. Distant RR Lyrae Stars and the Milky Way Stellar Halo out to 300 kpc

    Authors: Yuting Feng, Puragra Guhathakurta, Eric W. Peng, Stephen D. J. Gwyn, Laura Ferrarese, Patrick Côté, Jean-Charles Cuillandre, Jeffrey Munsell, Manjima Talukdar

    Abstract: RR Lyrae stars are standard candles with characteristic photometric variability and serve as powerful tracers of Galactic structure, substructure, accretion history, and dark matter content. Here we report the discovery of distant RR Lyrae stars, including some of the most distant stars known in the Milky Way halo, with Galactocentric distances of approximately 300 kpc. We use time-series u*g'i'z'… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by ApJ

  33. arXiv:2402.01204  [pdf, other

    cs.LG cs.AI

    A Survey on Self-Supervised Learning for Non-Sequential Tabular Data

    Authors: Wei-Yao Wang, Wei-Wei Du, Derek Xu, Wei Wang, Wen-Chih Peng

    Abstract: Self-supervised learning (SSL) has been incorporated into many state-of-the-art models in various domains, where SSL defines pretext tasks based on unlabeled datasets to learn contextualized and robust representations. Recently, SSL has been a new trend in exploring the representation learning capability in the realm of tabular data, which is more challenging due to not having explicit relations f… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: The paper list can be found at https://github.com/wwweiwei/awesome-self-supervised-learning-for-tabular-data

  34. arXiv:2402.01140  [pdf, other

    cs.LG cs.AI cs.DC

    Root Cause Analysis In Microservice Using Neural Granger Causal Discovery

    Authors: Cheng-Ming Lin, Ching Chang, Wei-Yao Wang, Kuang-Da Wang, Wen-Chih Peng

    Abstract: In recent years, microservices have gained widespread adoption in IT operations due to their scalability, maintenance, and flexibility. However, it becomes challenging for site reliability engineers (SREs) to pinpoint the root cause due to the complex relationships in microservices when facing system malfunctions. Previous research employed structured learning methods (e.g., PC-algorithm) to estab… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: AAAI 2024 Main Track

  35. arXiv:2402.00253  [pdf, other

    cs.CV cs.CL cs.LG

    A Survey on Hallucination in Large Vision-Language Models

    Authors: Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng

    Abstract: Recent development of Large Vision-Language Models (LVLMs) has attracted growing attention within the AI landscape for its practical implementation potential. However, ``hallucination'', or more specifically, the misalignment between factual visual content and corresponding textual generation, poses a significant challenge of utilizing LVLMs. In this comprehensive survey, we dissect LVLM-related h… ▽ More

    Submitted 5 May, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  36. arXiv:2401.15509  [pdf, other

    cs.CL cs.AI cs.SI

    Style-News: Incorporating Stylized News Generation and Adversarial Verification for Neural Fake News Detection

    Authors: Wei-Yao Wang, Yu-Chieh Chang, Wen-Chih Peng

    Abstract: With the improvements in generative models, the issues of producing hallucinations in various domains (e.g., law, writing) have been brought to people's attention due to concerns about misinformation. In this paper, we focus on neural fake news, which refers to content generated by neural networks aiming to mimic the style of real news to deceive people. To prevent harmful disinformation spreading… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: EACL 2024 Main Track

  37. arXiv:2401.09025  [pdf, other

    cs.HC cs.CY

    Exploring the Diversity of Music Experiences for Deaf and Hard of Hearing People

    Authors: Kyrie Zhixuan Zhou, Weirui Peng, Yuhan Liu, Rachel F. Adler

    Abstract: Sensory substitution or enhancement techniques have been proposed to enable deaf or hard of hearing (DHH) people to listen to and even compose music. However, little is known about how such techniques enhance DHH people's music experience. Since deafness is a spectrum -- as are DHH people's preferences and perceptions of music -- a more situated understanding of their interaction with music is nee… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  38. arXiv:2401.08053  [pdf, other

    cs.CV

    SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation

    Authors: Zhixuan Liu, Peter Schaldenbrand, Beverley-Claire Okogwu, Wenxuan Peng, Youngsik Yun, Andrew Hundt, Jihie Kim, Jean Oh

    Abstract: Accurate representation in media is known to improve the well-being of the people who consume it. Generative image models trained on large web-crawled datasets such as LAION are known to produce images with harmful stereotypes and misrepresentations of cultures. We improve inclusive representation in generated images by (1) engaging with communities to collect a culturally representative dataset t… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  39. arXiv:2401.06775  [pdf, other

    cs.CL cs.AI

    Large language models in healthcare and medical domain: A review

    Authors: Zabir Al Nazi, Wei Peng

    Abstract: The deployment of large language models (LLMs) within the healthcare sector has sparked both enthusiasm and apprehension. These models exhibit the remarkable capability to provide proficient responses to free-text queries, demonstrating a nuanced understanding of professional medical knowledge. This comprehensive survey delves into the functionalities of existing LLMs designed for healthcare appli… ▽ More

    Submitted 8 July, 2024; v1 submitted 12 December, 2023; originally announced January 2024.

  40. arXiv:2401.06517  [pdf, other

    eess.IV

    LiDAR Depth Map Guided Image Compression Model

    Authors: Alessandro Gnutti, Stefano Della Fiore, Mattia Savardi, Yi-Hsin Chen, Riccardo Leonardi, Wen-Hsiao Peng

    Abstract: The incorporation of LiDAR technology into some high-end smartphones has unlocked numerous possibilities across various applications, including photography, image restoration, augmented reality, and more. In this paper, we introduce a novel direction that harnesses LiDAR depth maps to enhance the compression of the corresponding RGB camera images. To the best of our knowledge, this represents the… ▽ More

    Submitted 27 June, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  41. arXiv:2401.02074  [pdf, other

    math.DS

    On the boundary of the central quadratic hyperbolic component

    Authors: Guizhen Cui, Wenjuan Peng

    Abstract: We give a concrete description for the boundary of the central quadratic hyperbolic component. The connectedness of the Julia sets of the boundary maps are also considered.

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 9 pages,3 figures

    MSC Class: 37F10; 37F20

  42. arXiv:2401.00652  [pdf, other

    cs.CV

    From Covert Hiding to Visual Editing: Robust Generative Video Steganography

    Authors: Xueying Mao, Xiaoxiao Hu, Wanli Peng, Zhenliang Gan, Qichao Ying, Zhenxing Qian, Sheng Li, Xinpeng Zhang

    Abstract: Traditional video steganography methods are based on modifying the covert space for embedding, whereas we propose an innovative approach that embeds secret message within semantic feature for steganography during the video editing process. Although existing traditional video steganography methods display a certain level of security and embedding capacity, they lack adequate robustness against comm… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Under Review

  43. arXiv:2312.17617  [pdf, other

    cs.CL

    Large Language Models for Generative Information Extraction: A Survey

    Authors: Derong Xu, Wei Chen, Wenjun Peng, Chao Zhang, Tong Xu, Xiangyu Zhao, Xian Wu, Yefeng Zheng, Yang Wang, Enhong Chen

    Abstract: Information extraction (IE) aims to extract structural knowledge (such as entities, relations, and events) from plain natural language texts. Recently, generative Large Language Models (LLMs) have demonstrated remarkable capabilities in text understanding and generation, allowing for generalization across various domains and tasks. As a result, numerous works have been proposed to harness abilitie… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: v2: Updated 100+ new papers, 5 technical categories

  44. arXiv:2312.15829  [pdf, other

    eess.IV

    MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression

    Authors: Yi-Hsin Chen, Hong-Sheng Xie, Cheng-Wei Chen, Zong-Lin Gao, Martin Benjak, Wen-Hsiao Peng, Jörn Ostermann

    Abstract: Conditional coding has lately emerged as the mainstream approach to learned video compression. However, a recent study shows that it may perform worse than residual coding when the information bottleneck arises. Conditional residual coding was thus proposed, creating a new school of thought to improve on conditional coding. Notably, conditional residual coding relies heavily on the assumption that… ▽ More

    Submitted 10 July, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted for Publication in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  45. arXiv:2312.13629  [pdf, other

    physics.comp-ph nlin.PS

    $PT$ Symmetric PINN for integrable nonlocal equations: Forward and inverse problems

    Authors: Wei-Qi Peng, Yong Chen

    Abstract: Since the $PT$-symmetric nonlocal equations contain the physical information of the $PT$-symmetric, it is very appropriate to embed the physical information of the $PT$-symmetric into the loss function of PINN, named PTS-PINN. For general $PT$-symmetric nonlocal equations, especially those equations involving the derivation of nonlocal terms, due to the existence of nonlocal terms, directly using… ▽ More

    Submitted 11 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  46. arXiv:2312.11887  [pdf, other

    astro-ph.HE astro-ph.GA

    Searching for Intermediate Mass Black Holes in Globular Clusters Through Tidal Disruption Events

    Authors: Vivian L. Tang, Piero Madau, Elisa Bortolas, Eric W. Peng

    Abstract: Intermediate mass black holes (IMBHs) may be the link between stellar mass holes and the supermassive variety in the nuclei of galaxies, and globular clusters (GCs) may be one of the most promising environments for their formation. Here we carry out a pilot study of the observability of tidal disruption events (TDEs) from 10^3 Msun < M_BH < 10^5 Msun IMBHs embedded in stellar cusps at the center o… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 12 pages, 9 figures, submitted for publication in The Astrophysical Journal

  47. arXiv:2312.11553  [pdf, other

    cs.SI cs.AI cs.CL cs.LG

    SeGA: Preference-Aware Self-Contrastive Learning with Prompts for Anomalous User Detection on Twitter

    Authors: Ying-Ying Chang, Wei-Yao Wang, Wen-Chih Peng

    Abstract: In the dynamic and rapidly evolving world of social media, detecting anomalous users has become a crucial task to address malicious activities such as misinformation and cyberbullying. As the increasing number of anomalous users improves the ability to mimic normal users and evade detection, existing methods only focusing on bot detection are ineffective in terms of capturing subtle distinctions b… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: AAAI 2024 Main Track

  48. arXiv:2312.10942  [pdf, other

    cs.AI cs.LG

    ShuttleSHAP: A Turn-Based Feature Attribution Approach for Analyzing Forecasting Models in Badminton

    Authors: Wei-Yao Wang, Wen-Chih Peng, Wei Wang, Philip S. Yu

    Abstract: Agent forecasting systems have been explored to investigate agent patterns and improve decision-making in various domains, e.g., pedestrian predictions and marketing bidding. Badminton represents a fascinating example of a multifaceted turn-based sport, requiring both sophisticated tactic developments and alternate-dependent decision-making. Recent deep learning approaches for player tactic foreca… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: Preprint

  49. Identified charged-hadron production in $p$$+$Al, $^3$He$+$Au, and Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and in U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, J. Alexander, M. Alfred, V. Andrieux, K. Aoki, N. Apadula, H. Asano, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, X. Bai, N. S. Bandara, B. Bannier, K. N. Barish, S. Bathe, V. Baublis , et al. (456 additional authors not shown)

    Abstract: The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interprete… ▽ More

    Submitted 22 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 480 authors from 78 institutions, 18 pages, 6 tables, 16 figures. v2 is version accepted for publication in Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

    Journal ref: Phys. Rev. C 109, 054910 (2024)

  50. arXiv:2312.06372  [pdf, other

    cs.CV

    Ternary Spike: Learning Ternary Spikes for Spiking Neural Networks

    Authors: Yufei Guo, Yuanpei Chen, Xiaode Liu, Weihang Peng, Yuhan Zhang, Xuhui Huang, Zhe Ma

    Abstract: The Spiking Neural Network (SNN), as one of the biologically inspired neural network infrastructures, has drawn increasing attention recently. It adopts binary spike activations to transmit information, thus the multiplications of activations and weights can be substituted by additions, which brings high energy efficiency. However, in the paper, we theoretically and experimentally prove that the b… ▽ More

    Submitted 16 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024