Skip to main content

Showing 1–50 of 367 results for author: Gan, W

  1. arXiv:2407.09021  [pdf, other

    eess.AS

    Squeeze-and-Excite ResNet-Conformers for Sound Event Localization, Detection, and Distance Estimation for DCASE 2024 Challenge

    Authors: Jun Wei Yeow, Ee-Leng Tan, Jisheng Bai, Santi Peksi, Woon-Seng Gan

    Abstract: This technical report details our systems submitted for Task 3 of the DCASE 2024 Challenge: Audio and Audiovisual Sound Event Localization and Detection (SELD) with Source Distance Estimation (SDE). We address only the audio-only SELD with SDE (SELDDE) task in this report. We propose to improve the existing ResNet-Conformer architectures with Squeeze-and-Excitation blocks in order to introduce add… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Technical report for DCASE 2024 Challenge Task 3

  2. arXiv:2407.08420  [pdf

    cond-mat.mtrl-sci physics.optics

    Skin Effect of Nonlinear Optical Responses in Antiferromagnets

    Authors: Hang Zhou, Rui-Chun Xiao, Shu-Hui Zhang, Wei Gan, Hui Han, Hong-Miao Zhao, Wenjian Lu, Changjin Zhang, Yuping Sun, Hui Li, Ding-Fu Shao

    Abstract: Nonlinear optics plays important roles in the research of fundamental physics and the applications of highperformance optoelectronic devices. The bulk nonlinear optical responses arise from the uniform light absorption in noncentrosymmetric crystals, and hence are usually considered to be the collective phenomena of all atoms. Here we show, in contrast to this common expectation, the nonlinear opt… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  3. arXiv:2407.05744  [pdf, other

    eess.AS cs.SD

    Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas

    Authors: Bhan Lam, Zhen-Ting Ong, Kenneth Ooi, Wen-Hui Ong, Trevor Wong, Karn N. Watcharasupat, Vanessa Boey, Irene Lee, Joo Young Hong, Jian Kang, Kar Fye Alvin Lee, Georgios Christopoulos, Woon-Seng Gan

    Abstract: Formalized in ISO 12913, the "soundscape" approach is a paradigmatic shift towards perception-based urban sound management, aiming to alleviate the substantial socioeconomic costs of noise pollution to advance the United Nations Sustainable Development Goals. Focusing on traffic-exposed outdoor residential sites, we implemented an automatic masker selection system (AMSS) utilizing natural sounds t… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 41 pages, 4 figures. Preprint submitted to an Elsevier journal

  4. arXiv:2406.11602  [pdf, other

    astro-ph.SR

    Association between a Failed Prominence Eruption and the Drainage of Mass from Another Prominence

    Authors: Jianchao Xue, Li Feng, Hui Li, Ping Zhang, Jun Chen, Guanglu Shi, Kaifan Ji, Ye Qiu, Chuan Li, Lei Lu, Beili Ying, Ying Li, Yu Huang, Youping Li, Jingwei Li, Jie Zhao, Dechao Song, Shuting Li, Zhengyuan Tian, Yingna Su, Qingmin Zhang, Yunyi Ge, Jiahui Shan, Qiao Li, Gen Li , et al. (9 additional authors not shown)

    Abstract: Sympathetic eruptions of solar prominences have been studied for decades, however, it is usually difficult to identify their causal links. Here we present two failed prominence eruptions on 26 October 2022 and explore their connections. Using stereoscopic observations, the south prominence (PRO-S) erupts with untwisting motions, flare ribbons occur underneath, and new connections are formed during… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 15 pages, 7 figures, has been accepted by Solar Physics

  5. Parameter effects on the total intensity of H I Lyα line for a modelled coronal mass ejection and its driven shock

    Authors: Beili Ying, Guanglu Shi, Li Feng, Lei Lu, Jianchao Xue, Shuting Li, Weiqun Gan, Hui Li

    Abstract: The combination of the H I Lyα (121.6 nm) line formation mechanism with ultraviolet (UV) Lyα and white-light (WL) observations provides an effective method for determining the electron temperature of coronal mass ejections (CMEs). A key to ensuring the accuracy of this diagnostic technique is the precise calculation of theoretical Lyα intensities. This study performs a modelled CME and its driven… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 22 pages, 9 figures, accepted by Solar Physics

  6. arXiv:2406.05070  [pdf, other

    cs.DB

    Targeted Mining Precise-positioning Episode Rules

    Authors: Jian Zhu, Xiaoye Chen, Wensheng Gan, Zefeng Chen, Philip S. Yu

    Abstract: The era characterized by an exponential increase in data has led to the widespread adoption of data intelligence as a crucial task. Within the field of data mining, frequent episode mining has emerged as an effective tool for extracting valuable and essential information from event sequences. Various algorithms have been developed to discover frequent episodes and subsequently derive episode rules… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: IEEE TETCI, 14 pages

  7. arXiv:2406.02783  [pdf, other

    astro-ph.SR

    High-resolution Observation of Blowout Jets Regulated by Sunspot Rotation

    Authors: Tingyu Gou, Rui Liu, Yang Su, Astrid M. Veronig, Hanya Pan, Runbin Luo, Weiqun Gan

    Abstract: Coronal jets are believed to be the miniature version of large-scale solar eruptions. In particular, the eruption of a mini-filament inside the base arch is suggested to be the trigger and even driver of blowout jets. Here we propose an alternative triggering mechanism, based on high-resolution H-alpha observations of a blowout jet associated with a mini-filament and an M1.2-class flare. The mini-… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 16 pages, 10 figures, accepted in Solar Physics

  8. arXiv:2405.18665  [pdf, other

    astro-ph.SR physics.space-ph

    Refinement of global coronal and interplanetary magnetic field extrapolations constrained by remote-sensing and in-situ observations at the solar minimum

    Authors: Guanglu Shi, Li Feng, Beili Ying, Shuting Li, Weiqun Gan

    Abstract: Solar magnetic fields are closely related to various physical phenomena on the sun, which can be extrapolated with different models from photospheric magnetograms. However, the Open Flux Problem (OFP), the underestimation of the magnetic field derived from the extrapolated model, is still unsolved. To minimize the impact of the OFP, we propose three evaluation parameters to quantitatively evaluate… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 21 pages, 15 figures, accepted for publication in ApJ

  9. arXiv:2405.16457  [pdf, other

    gr-qc

    Entanglement island and Page curve for one-sided charged black hole

    Authors: Yun-Feng Qu, Yi-Ling Lan, Hongwei Yu, Wen-Cong Gan, Fu-Wen Shu

    Abstract: In this paper, we extend the method of calculating the entanglement entropy of Hawking radiation of black holes using the "in" vacuum state, which describes one-sided asymptotically flat neutral black hole formed by gravitational collapse, to dynamic charged black holes. We explore the influence of charge on the position of the boundary of island $\partial I$ and the Page time. Due to their distin… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  10. arXiv:2405.14158  [pdf, other

    eess.SP

    Computation-efficient Virtual Sensing Approach with Multichannel Adjoint Least Mean Square Algorithm

    Authors: Boxiang Wang, Junwei Ji, Xiaoyi Shen, Dongyuan Shi, Woon-Seng Gan

    Abstract: Multichannel active noise control (ANC) systems are designed to create a large zone of quietness (ZoQ) around the error microphones, however, the placement of these microphones often presents challenges due to physical limitations. Virtual sensing technique that effectively suppresses the noise far from the physical error microphones is one of the most promising solutions. Nevertheless, the conven… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  11. arXiv:2405.13055  [pdf, other

    cs.CL cs.AI cs.CY

    Large Language Models for Medicine: A Survey

    Authors: Yanxin Zheng, Wensheng Gan, Zefeng Chen, Zhenlian Qi, Qian Liang, Philip S. Yu

    Abstract: To address challenges in the digital economy's landscape of digital intelligence, large language models (LLMs) have been developed. Improvements in computational power and available resources have significantly advanced LLMs, allowing their integration into diverse domains for human life. Medical LLMs are essential application tools with potential across various medical scenarios. In this paper, w… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Preprint. 5 figures,5 tables

  12. arXiv:2405.13001  [pdf, other

    cs.CL cs.AI cs.CY

    Large Language Models for Education: A Survey

    Authors: Hanyi Xu, Wensheng Gan, Zhenlian Qi, Jiayang Wu, Philip S. Yu

    Abstract: Artificial intelligence (AI) has a profound impact on traditional education. In recent years, large language models (LLMs) have been increasingly used in various applications such as natural language processing, computer vision, speech recognition, and autonomous driving. LLMs have also been applied in many fields, including recommendation, finance, government, education, legal affairs, and financ… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Journal of Machine Learning and Cybernetics. 4 tables, 6 figures

  13. arXiv:2405.12996  [pdf, other

    eess.IV

    Dose-aware Diffusion Model for 3D Low-dose PET: Multi-institutional Validation with Reader Study and Real Low-dose Data

    Authors: Huidong Xie, Weijie Gan, Bo Zhou, Ming-Kai Chen, Michal Kulon, Annemarie Boustani, Benjamin A. Spencer, Reimund Bayerlein, Xiongchao Chen, Qiong Liu, Xueqi Guo, Menghua Xia, Yinchi Zhou, Hui Liu, Liang Guo, Hongyu An, Ulugbek S. Kamilov, Hanzhong Wang, Biao Li, Axel Rominger, Kuangyu Shi, Ge Wang, Ramsey D. Badawi, Chi Liu

    Abstract: As PET imaging is accompanied by radiation exposure and potentially increased cancer risk, reducing radiation dose in PET scans without compromising the image quality is an important topic. Deep learning (DL) techniques have been investigated for low-dose PET imaging. However, existing models have often resulted in compromised image quality when achieving low-dose PET and have limited generalizabi… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 16 Pages, 15 Figures, 4 Tables. Paper under review. arXiv admin note: substantial text overlap with arXiv:2311.04248

  14. arXiv:2405.12496  [pdf, other

    eess.AS cs.NI cs.SD eess.SP

    A Survey of Integrating Wireless Technology into Active Noise Control

    Authors: Xiaoyi Shen, Dongyuan Shi, Zhengding Luo, Junwei Ji, Woon-Seng Gan

    Abstract: Active Noise Control (ANC) is a widely adopted technology for reducing environmental noise across various scenarios. This paper focuses on enhancing noise reduction performance, particularly through the refinement of signal quality fed into ANC systems. We discuss the main wireless technique integrated into the ANC system, equipped with some innovative algorithms, in diverse environments. Instead… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  15. arXiv:2405.07536  [pdf, other

    cs.RO eess.SY

    Multi-AUV Kinematic Task Assignment based on Self-organizing Map Neural Network and Dubins Path Generator

    Authors: Xin Li, Wenyang Gan, Pang Wen, Daqi Zhu

    Abstract: To deal with the task assignment problem of multi-AUV systems under kinematic constraints, which means steering capability constraints for underactuated AUVs or other vehicles likely, an improved task assignment algorithm is proposed combining the Dubins Path algorithm with improved SOM neural network algorithm. At first, the aimed tasks are assigned to the AUVs by improved SOM neural network meth… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  16. arXiv:2405.07485  [pdf, other

    astro-ph.HE astro-ph.SR

    The Energy Sources, the Physical Properties, and the Mass-loss History of SN 2017dio

    Authors: Deng-Wang Shi, Shan-Qin Wang, Wen-Pei Gan, En-Wei Liang

    Abstract: We study the energy sources, the physical properties of the ejecta and the circumstellar medium (CSM), as well as the mass-loss history of the progenitor of SN 2017dio which is a broad-lined Ic (Ic-BL) supernova (SN) having unusual light curves (LCs) and signatures of hydrogen-rich CSM in its early spectrum. We find that the temperature of SN 2017dio began to increase linearly about 20 days after… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in ApJ, 17 pages, 4 figures, 3 tables

  17. arXiv:2405.01308  [pdf, ps, other

    astro-ph.SR

    Spectral and Imaging Observations of a C2.3 White-Light Flare from the Advanced Space-Based Solar Observatory (ASO-S) and the Chinese H$α$ Solar Explorer (CHASE)

    Authors: Qiao Li, Ying Li, Yang Su, Dechao Song, Hui Li, Li Feng, Yu Huang, Youping Li, Jingwei Li, Jie Zhao, Lei Lu, Beili Ying, Jianchao Xue, Ping Zhang, Jun Tian, Xiaofeng Liu, Gen Li, Zhichen Jing, Shuting Li, Guanglu Shi, Zhengyuan Tian, Wei Chen, Yingna Su, Qingmin Zhang, Dong Li , et al. (5 additional authors not shown)

    Abstract: Solar white-light flares are characterized by an enhancement in the optical continuum, which are usually large flares (say X- and M-class flares). Here we report a small C2.3 white-light flare (SOL2022-12-20T04:10) observed by the \emph{Advanced Space-based Solar Observatory} and the \emph{Chinese H$α$ Solar Explorer}. This flare exhibits an increase of $\approx$6.4\% in the photospheric Fe \texts… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 23 pages, 6 figures, accepted by Solar Physics

  18. arXiv:2404.18428  [pdf, other

    cs.DB

    Geospatial Big Data: Survey and Challenges

    Authors: Jiayang Wu, Wensheng Gan, Han-Chieh Chao, Philip S. Yu

    Abstract: In recent years, geospatial big data (GBD) has obtained attention across various disciplines, categorized into big earth observation data and big human behavior data. Identifying geospatial patterns from GBD has been a vital research focus in the fields of urban management and environmental sustainability. This paper reviews the evolution of GBD mining and its integration with advanced artificial… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: IEEE JSTARS. 14 pages, 5 figures

  19. arXiv:2403.18139  [pdf, other

    eess.IV cs.CV

    Pseudo-MRI-Guided PET Image Reconstruction Method Based on a Diffusion Probabilistic Model

    Authors: Weijie Gan, Huidong Xie, Carl von Gall, Günther Platsch, Michael T. Jurkiewicz, Andrea Andrade, Udunna C. Anazodo, Ulugbek S. Kamilov, Hongyu An, Jorge Cabello

    Abstract: Anatomically guided PET reconstruction using MRI information has been shown to have the potential to improve PET image quality. However, these improvements are limited to PET scans with paired MRI information. In this work we employed a diffusion probabilistic model (DPM) to infer T1-weighted-MRI (deep-MRI) images from FDG-PET brain images. We then use the DPM-generated T1w-MRI to guide the PET re… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  20. arXiv:2403.01686  [pdf, other

    astro-ph.HE astro-ph.GA

    AT2023lli: A Tidal Disruption Event with Prominent Optical Early Bump and Delayed Episodic X-ray Emission

    Authors: Shifeng Huang, Ning Jiang, Jiazheng Zhu, Yibo Wang, Tinggui Wang, Shan-Qin Wang, Wen-Pei Gan, En-Wei Liang, Yu-Jing Qin, Zheyu Lin, Lin-Na Xu, Min-Xuan Cai, Ji-An Jiang, Xu Kong, Jiaxun Li, Long Li, Jian-Guo Wang, Ze-Lin Xu, Yongquan Xue, Ye-Fei Yuan, Jingquan Cheng, Lulu Fan, Jie Gao, Lei Hu, Weida Hu , et al. (20 additional authors not shown)

    Abstract: High-cadence, multiwavelength observations have continuously revealed the diversity of tidal disruption events (TDEs), thus greatly advancing our knowledge and understanding of TDEs. In this work, we conducted an intensive optical-UV and X-ray follow-up campaign of TDE AT2023lli, and found a remarkable month-long bump in its UV/optical light curve nearly two months prior to maximum brightness. The… ▽ More

    Submitted 26 March, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures,accepted for publication by ApJL

  21. Unsupervised learning based end-to-end delayless generative fixed-filter active noise control

    Authors: Zhengding Luo, Dongyuan Shi, Xiaoyi Shen, Woon-Seng Gan

    Abstract: Delayless noise control is achieved by our earlier generative fixed-filter active noise control (GFANC) framework through efficient coordination between the co-processor and real-time controller. However, the one-dimensional convolutional neural network (1D CNN) in the co-processor requires initial training using labelled noise datasets. Labelling noise data can be resource-intensive and may intro… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  22. arXiv:2402.07374  [pdf, ps, other

    astro-ph.SR

    The White-light Emissions in Two X-class Flares Observed by ASO-S and CHASE

    Authors: Ying Li, Zhichen Jing, De-Chao Song, Qiao Li, Jun Tian, Xiaofeng Liu, Ya Wang, M. D. Ding, Andrea Francesco Battaglia, Li Feng, Hui Li, Weiqun Gan

    Abstract: The white-light continuum emissions in solar flares (i.e., white-light flares) are usually observed on the solar disk but, in a few cases, off the limb. Here we present on-disk as well as off-limb continuum emissions at 3600 Å (in the Balmer continuum) in an X2.1 flare (SOL2023-03-03T17:52) and an X1.5 flare (SOL2023-08-07T20:46), respectively, observed by the White-light Solar Telescope (WST) on… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: 13 pages, 1 table, 4 figures, accepted for publication in ApJL

  23. arXiv:2402.02694  [pdf, other

    eess.AS cs.LG cs.SD

    Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

    Authors: Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang, Yutong Du, Dongzhe Zhang, Dongyuan Shi, Woon-Seng Gan, Mark D. Plumbley, Susanto Rahardja, Bin Xiang, Jianfeng Chen

    Abstract: Acoustic scene classification (ASC) is a crucial research problem in computational auditory scene analysis, and it aims to recognize the unique acoustic characteristics of an environment. One of the challenges of the ASC task is the domain shift between training and testing data. Since 2018, ASC challenges have focused on the generalization of ASC models across different recording devices. Althoug… ▽ More

    Submitted 28 February, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  24. arXiv:2401.13998  [pdf, other

    eess.IV cs.CV

    WAL-Net: Weakly supervised auxiliary task learning network for carotid plaques classification

    Authors: Haitao Gan, Lingchao Fu, Ran Zhou, Weiyan Gan, Furong Wang, Xiaoyan Wu, Zhi Yang, Zhongwei Huang

    Abstract: The classification of carotid artery ultrasound images is a crucial means for diagnosing carotid plaques, holding significant clinical relevance for predicting the risk of stroke. Recent research suggests that utilizing plaque segmentation as an auxiliary task for classification can enhance performance by leveraging the correlation between segmentation and classification tasks. However, this appro… ▽ More

    Submitted 27 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  25. arXiv:2401.08678  [pdf, other

    eess.AS cs.SD

    Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music

    Authors: Han Yin, Mou Wang, Jisheng Bai, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen

    Abstract: This paper presents a detailed description of our proposed methods for the ICASSP 2024 Cadenza Challenge. Experimental results show that the proposed system can achieve better performance than official baselines.

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Submitted to ICASSP 2024

  26. arXiv:2401.07275  [pdf, ps, other

    astro-ph.SR

    A Statistical Study of Solar White-Light Flares Observed by the White-light Solar Telescope of the Lyman-alpha Solar Telescope on the Advanced Space-based Solar Observatory (ASO-S/LST/WST) at 360 nm

    Authors: Zhichen Jing, Ying Li, Li Feng, Hui Li, Yu Huang, Youping Li, Yang Su, Wei Chen, Jun Tian, Dechao Song, Jingwei Li, Jianchao Xue, Jie Zhao, Lei Lu, Beili Ying, Ping Zhang, Yingna Su, Qingmin Zhang, Dong Li, Yunyi Ge, Shuting Li, Qiao Li, Gen Li, Xiaofeng Liu, Guanglu Shi , et al. (4 additional authors not shown)

    Abstract: Solar white-light flares (WLFs) are those accompanied by brightenings in the optical continuum or integrated light. The White-light Solar Telescope (WST), as an instrument of the Lyman-alpha Solar Telescope (LST) on the Advanced Space-based Solar Observatory (ASO-S), provides continuous solar full-disk images at 360 nm, which can be used to study WLFs. We analyze 205 major flares above M1.0 from O… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  27. arXiv:2401.06624  [pdf, ps, other

    math.NT math.RT

    Generalised Whittaker models as instances of relative Langlands duality II: Plancherel density and global periods

    Authors: Wee Teck Gan, Bryan Wang Peng Jun

    Abstract: In an earlier paper of the authors, a general family of instances of the relative Langlands duality of Ben-Zvi-Sakellaridis-Venkatesh [BZSV] were proposed and studied in the setting of branching problems for smooth representations. In this paper, we show the numerical conjectures of [BZSV] for the local Plancherel density, as well as an application to their conjectures on global periods, for this… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    MSC Class: 22E50; 11F70

  28. arXiv:2401.01599  [pdf, other

    cs.LG math.ST

    Generalization Error Curves for Analytic Spectral Algorithms under Power-law Decay

    Authors: Yicheng Li, Weiye Gan, Zuoqiang Shi, Qian Lin

    Abstract: The generalization error curve of certain kernel regression method aims at determining the exact order of generalization error with various source condition, noise level and choice of the regularization parameter rather than the minimax rate. In this work, under mild assumptions, we rigorously provide a full characterization of the generalization error curves of the kernel gradient descent method… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  29. arXiv:2312.14560  [pdf

    physics.optics cond-mat.mtrl-sci physics.app-ph

    Optical wood with switchable solar transmittance for all-round thermal management

    Authors: He Gao, Ying Li, Yanjun Xie, Daxin Liang, Jian Li, Yonggui Wang, Zefang Xiao, Haigang Wang, Wentao Gan, Lorenzo Pattelli, Hongbo Xu

    Abstract: Technologies enabling passive daytime radiative cooling and daylight harvesting are highly relevant for energy-efficient buildings. Despite recent progress demonstrated with passively cooling polymer coatings, however, it remains challenging to combine also a passive heat gain mechanism into a single substrate for all-round thermal management. Herein, we developed an optical wood (OW) with switcha… ▽ More

    Submitted 11 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: accepted version of the manuscript published on Composites Part B: Engineering

    Journal ref: Composites Part B: Engineering, Volume 275, 2024, 111287, ISSN 1879-1069

  30. arXiv:2312.10073  [pdf, other

    cs.IR cs.AI

    Data Scarcity in Recommendation Systems: A Survey

    Authors: Zefeng Chen, Wensheng Gan, Jiayang Wu, Kaixia Hu, Hong Lin

    Abstract: The prevalence of online content has led to the widespread adoption of recommendation systems (RSs), which serve diverse purposes such as news, advertisements, and e-commerce recommendations. Despite their significance, data scarcity issues have significantly impaired the effectiveness of existing RS models and hindered their progress. To address this challenge, the concept of knowledge transfer,… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: ACM Transactions on Recommender Systems, 32 pages

  31. arXiv:2312.03718  [pdf, other

    cs.CL cs.AI

    Large Language Models in Law: A Survey

    Authors: Jinqi Lai, Wensheng Gan, Jiayang Wu, Zhenlian Qi, Philip S. Yu

    Abstract: The advent of artificial intelligence (AI) has significantly impacted the traditional judicial industry. Moreover, recently, with the development of AI-generated content (AIGC), AI and law have found applications in various domains, including image recognition, automatic text generation, and interactive chat. With the rapid emergence and growing popularity of large models, it is evident that AI wi… ▽ More

    Submitted 25 November, 2023; originally announced December 2023.

    Comments: Preprint

  32. arXiv:2311.18810  [pdf, other

    cs.CV

    Convergence of Nonconvex PnP-ADMM with MMSE Denoisers

    Authors: Chicago Park, Shirin Shoushtari, Weijie Gan, Ulugbek S. Kamilov

    Abstract: Plug-and-Play Alternating Direction Method of Multipliers (PnP-ADMM) is a widely-used algorithm for solving inverse problems by integrating physical measurement models and convolutional neural network (CNN) priors. PnP-ADMM has been theoretically proven to converge for convex data-fidelity terms and nonexpansive CNNs. It has however been observed that PnP-ADMM often empirically converges even for… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  33. arXiv:2311.18073  [pdf, other

    eess.IV

    DiffGEPCI: 3D MRI Synthesis from mGRE Signals using 2.5D Diffusion Model

    Authors: Yuyang Hu, Satya V. V. N. Kothapalli, Weijie Gan, Alexander L. Sukstanskii, Gregory F. Wu, Manu Goyal, Dmitriy A. Yablonskiy, Ulugbek S. Kamilov

    Abstract: We introduce a new framework called DiffGEPCI for cross-modality generation in magnetic resonance imaging (MRI) using a 2.5D conditional diffusion model. DiffGEPCI can synthesize high-quality Fluid Attenuated Inversion Recovery (FLAIR) and Magnetization Prepared-Rapid Gradient Echo (MPRAGE) images, without acquiring corresponding measurements, by leveraging multi-Gradient-Recalled Echo (mGRE) MRI… ▽ More

    Submitted 18 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  34. arXiv:2311.15445  [pdf, other

    cs.CV eess.IV

    FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration

    Authors: Zihao Zou, Jiaming Liu, Shirin Shoushtari, Yubo Wang, Weijie Gan, Ulugbek S. Kamilov

    Abstract: Face video restoration (FVR) is a challenging but important problem where one seeks to recover a perceptually realistic face videos from a low-quality input. While diffusion probabilistic models (DPMs) have been shown to achieve remarkable performance for face image restoration, they often fail to preserve temporally coherent, high-quality videos, compromising the fidelity of reconstructed faces.… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 32 pages, 27 figures

  35. arXiv:2311.14068  [pdf, other

    eess.AS

    Interactive Dual-Conformer with Scene-Inspired Mask for Soft Sound Event Detection

    Authors: Han Yin, Jisheng Bai, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen

    Abstract: Traditional binary hard labels for sound event detection (SED) lack details about the complexity and variability of sound event distributions. Recently, a novel annotation workflow is proposed to generate fine-grained non-binary soft labels, resulting in a new real-life dataset named MAESTRO Real for SED. In this paper, we first propose an interactive dual-conformer (IDC) module, in which a cross-… ▽ More

    Submitted 7 December, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: to be improved (unfinished)

  36. arXiv:2311.13165  [pdf, other

    cs.AI

    Multimodal Large Language Models: A Survey

    Authors: Jiayang Wu, Wensheng Gan, Zefeng Chen, Shicheng Wan, Philip S. Yu

    Abstract: The exploration of multimodal language models integrates multiple data types, such as images, text, language, audio, and other heterogeneity. While the latest large language models excel in text-based tasks, they often struggle to understand and process other data types. Multimodal models address this limitation by combining various modalities, enabling a more comprehensive understanding of divers… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: IEEE BigData 2023. 10 pages

  37. arXiv:2311.13160  [pdf, other

    cs.AI

    Large Language Models in Education: Vision and Opportunities

    Authors: Wensheng Gan, Zhenlian Qi, Jiayang Wu, Jerry Chun-Wei Lin

    Abstract: With the rapid development of artificial intelligence technology, large language models (LLMs) have become a hot research topic. Education plays an important role in human social development and progress. Traditional education faces challenges such as individual student differences, insufficient allocation of teaching resources, and assessment of teaching effectiveness. Therefore, the applications… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: IEEE BigData 2023. 10 pages

  38. arXiv:2311.12371  [pdf, other

    eess.AS

    AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning

    Authors: Jisheng Bai, Han Yin, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen, Susanto Rahardja

    Abstract: Previous studies in automated audio captioning have faced difficulties in accurately capturing the complete temporal details of acoustic scenes and events within long audio sequences. This paper presents AudioLog, a large language models (LLMs)-powered audio logging system with hybrid token-semantic contrastive learning. Specifically, we propose to fine-tune the pre-trained hierarchical token-sema… ▽ More

    Submitted 4 January, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  39. arXiv:2311.10945  [pdf, other

    cs.CL cs.AI

    An Empirical Bayes Framework for Open-Domain Dialogue Generation

    Authors: Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan

    Abstract: To engage human users in meaningful conversation, open-domain dialogue agents are required to generate diverse and contextually coherent dialogue. Despite recent advancements, which can be attributed to the usage of pretrained language models, the generation of diverse and coherent dialogue remains an open research problem. A popular approach to address this issue involves the adaptation of variat… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  40. arXiv:2311.10943  [pdf, other

    cs.CL

    Partially Randomizing Transformer Weights for Dialogue Response Diversity

    Authors: Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan

    Abstract: Despite recent progress in generative open-domain dialogue, the issue of low response diversity persists. Prior works have addressed this issue via either novel objective functions, alternative learning approaches such as variational frameworks, or architectural extensions such as the Randomized Link (RL) Transformer. However, these approaches typically entail either additional difficulties during… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  41. arXiv:2311.07226  [pdf, other

    cs.RO cs.AI

    Large Language Models for Robotics: A Survey

    Authors: Fanlong Zeng, Wensheng Gan, Yongheng Wang, Ning Liu, Philip S. Yu

    Abstract: The human ability to learn, generalize, and control complex manipulation tasks through multi-modality feedback suggests a unique capability, which we refer to as dexterity intelligence. Understanding and assessing this intelligence is a complex task. Amidst the swift progress and extensive proliferation of large language models (LLMs), their applications in the field of robotics have garnered incr… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Preprint. 4 figures, 3 tables

  42. arXiv:2311.05804  [pdf, other

    cs.AI

    Model-as-a-Service (MaaS): A Survey

    Authors: Wensheng Gan, Shicheng Wan, Philip S. Yu

    Abstract: Due to the increased number of parameters and data in the pre-trained model exceeding a certain level, a foundation model (e.g., a large language model) can significantly improve downstream task performance and emerge with some novel special abilities (e.g., deep learning, complex reasoning, and human alignment) that were not present before. Foundation models are a form of generative artificial in… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Preprint. 3 figures, 1 tables

  43. arXiv:2311.04248  [pdf, other

    eess.IV

    DDPET-3D: Dose-aware Diffusion Model for 3D Ultra Low-dose PET Imaging

    Authors: Huidong Xie, Weijie Gan, Bo Zhou, Xiongchao Chen, Qiong Liu, Xueqi Guo, Liang Guo, Hongyu An, Ulugbek S. Kamilov, Ge Wang, Chi Liu

    Abstract: As PET imaging is accompanied by substantial radiation exposure and cancer risk, reducing radiation dose in PET scans is an important topic. Recently, diffusion models have emerged as the new state-of-the-art generative model to generate high-quality samples and have demonstrated strong potential for various tasks in medical imaging. However, it is difficult to extend diffusion models for 3D image… ▽ More

    Submitted 28 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Paper under review. 16 pages, 11 figures, 4 tables

  44. arXiv:2311.02121  [pdf, other

    cs.CV

    Enhancing Monocular Height Estimation from Aerial Images with Street-view Images

    Authors: Xiaomou Hou, Wanshui Gan, Naoto Yokoya

    Abstract: Accurate height estimation from monocular aerial imagery presents a significant challenge due to its inherently ill-posed nature. This limitation is rooted in the absence of adequate geometric constraints available to the model when training with monocular imagery. Without additional geometric information to supplement the monocular image data, the model's ability to provide reliable estimations i… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  45. arXiv:2311.02003  [pdf, other

    eess.IV cs.CV

    A Structured Pruning Algorithm for Model-based Deep Learning

    Authors: Chicago Park, Weijie Gan, Zihao Zou, Yuyang Hu, Zhixin Sun, Ulugbek S. Kamilov

    Abstract: There is a growing interest in model-based deep learning (MBDL) for solving imaging inverse problems. MBDL networks can be seen as iterative algorithms that estimate the desired image using a physical measurement model and a learned image prior specified using a convolutional neural net (CNNs). The iterative nature of MBDL networks increases the test-time computational complexity, which limits the… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  46. arXiv:2311.00456  [pdf, ps, other

    astro-ph.SR

    Partial Eruption of Solar Filaments. I. Configuration and Formation of Double-decker Filaments

    Authors: Yijun Hou, Chuan Li, Ting Li, Jiangtao Su, Ye Qiu, Shuhong Yang, Liheng Yang, Leping Li, Yilin Guo, Zhengyong Hou, Qiao Song, Xianyong Bai, Guiping Zhou, Mingde Ding, Weiqun Gan, Yuanyong Deng

    Abstract: Partial eruptions of solar filaments are the typical representative of solar eruptive behavior diversity. Here we investigate a typical filament partial eruption event and present integrated evidence for configuration of the pre-eruption filament and its formation. The CHASE H$α$ observations reveal structured Doppler velocity distribution within the pre-eruption filament, where distinct redshift… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 16 pages, 8 figures, 1 table, accepted for publication in ApJ as part of the Focus Issue "Early results from the Chinese Ha Solar Explorer (CHASE)"

  47. arXiv:2311.00230  [pdf, other

    cs.CV

    DINO-Mix: Enhancing Visual Place Recognition with Foundational Vision Model and Feature Mixing

    Authors: Gaoshuang Huang, Yang Zhou, Xiaofei Hu, Chenglong Zhang, Luying Zhao, Wenjian Gan, Mingbo Hou

    Abstract: Utilizing visual place recognition (VPR) technology to ascertain the geographical location of publicly available images is a pressing issue for real-world VPR applications. Although most current VPR methods achieve favorable results under ideal conditions, their performance in complex environments, characterized by lighting variations, seasonal changes, and occlusions caused by moving objects, is… ▽ More

    Submitted 5 December, 2023; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Under review / Open source code

  48. arXiv:2310.13699  [pdf, other

    cs.HC cs.ET

    Interaction in Metaverse: A Survey

    Authors: Hong Lin, Zirun Gan, Wensheng Gan, Zhenlian Qi, Yuehua Wang, Philip S. Yu

    Abstract: Human-computer interaction (HCI) emerged with the birth of the computer and has been upgraded through decades of development. Metaverse has attracted a lot of interest with its immersive experience, and HCI is the entrance to the Metaverse for people. It is predictable that HCI will determine the immersion of the Metaverse. However, the technologies of HCI in Metaverse are not mature enough. There… ▽ More

    Submitted 27 September, 2023; originally announced October 2023.

    Comments: Preprint. 3 figures, 3 tables

  49. arXiv:2310.07504  [pdf, other

    eess.IV cs.CV

    PtychoDV: Vision Transformer-Based Deep Unrolling Network for Ptychographic Image Reconstruction

    Authors: Weijie Gan, Qiuchen Zhai, Michael Thompson McCann, Cristina Garcia Cardona, Ulugbek S. Kamilov, Brendt Wohlberg

    Abstract: Ptychography is an imaging technique that captures multiple overlapping snapshots of a sample, illuminated coherently by a moving localized probe. The image recovery from ptychographic data is generally achieved via an iterative algorithm that solves a nonlinear phase retrieval problem derived from measured diffraction patterns. However, these iterative approaches have high computational cost. In… ▽ More

    Submitted 6 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  50. arXiv:2310.04297  [pdf, other

    eess.IV

    A Plug-and-Play Image Registration Network

    Authors: Junhao Hu, Weijie Gan, Zhixin Sun, Hongyu An, Ulugbek S. Kamilov

    Abstract: Deformable image registration (DIR) is an active research topic in biomedical imaging. There is a growing interest in developing DIR methods based on deep learning (DL). A traditional DL approach to DIR is based on training a convolutional neural network (CNN) to estimate the registration field between two input images. While conceptually simple, this approach comes with a limitation that it exclu… ▽ More

    Submitted 19 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.