LANSCE-mQ: Dedicated search for milli/fractionally charged particles at LANL

Authors: Yu-Dai Tsai, Insung Hwang, Ryan Schmitz, Matthew Citron, Kranti Gunthoti, Jacob Steenis, Hoyong Jeong, Hyunki Moon, Jae Hyeok Yoo, Ming Xiong Liu

Abstract: In this paper, we propose an experiment, LANSCE-mQ, aiming to detect fractionally charged and millicharged particles (mCP) using an 800 MeV proton beam fixed target at the Los Alamos Neutron Science Center (LANSCE) facility. This search can shed new light on numerous fundamental questions, including charge quantization, the predictions of string theories and grand unification theories, the gauge symmetry of the Standard Model, dark sector models, and the tests of cosmic reheating. We propose to install two-layer scintillation detectors made of plastic (such as EJ-200) or CeBr3 to search for mCPs. Dedicated Geant4 detector simulations and in situ measurements have been conducted to obtain a preliminary determination of the background rate. The dominant backgrounds are beam-induced neutrons and coincident dark current signals from the photomultiplier tubes, while beam-induced gammas and cosmic muons are subdominant. We determined that LANSCE-mQ, the dedicated mCP experiment, has the leading mCP sensitivity for masses between ~1 MeV and 300 MeV.

Submitted 9 July, 2024; originally announced July 2024.

Comments: 8 pages, 8 figures

Report number: FERMILAB-PUB-24-0357-T-V

arXiv:2407.06842 [pdf, other]

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Authors: Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai, Yi Yang, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang

Abstract: Recent work on image content manipulation based on vision-language pre-training models has been effectively extended to text-driven 3D scene editing. However, existing schemes for 3D scene editing still exhibit certain shortcomings, hindering their further interactive design. Such schemes typically adhere to fixed input patterns, limiting users' flexibility in text input. Moreover, their editing capabilities are constrained by a single or a few 2D visual models and require intricate pipeline design to integrate these models into 3D reconstruction processes. To address the aforementioned issues, we propose a dialogue-based 3D scene editing approach, termed CE3D, which is centered around a large language model that allows for arbitrary textual input from users and interprets their intentions, subsequently facilitating the autonomous invocation of the corresponding visual expert models. Furthermore, we design a scheme utilizing Hash-Atlas to represent 3D scene views, which transfers the editing of 3D scenes onto 2D atlas images. This design achieves complete decoupling between the 2D editing and 3D reconstruction processes, enabling CE3D to flexibly integrate a wide range of existing 2D or 3D visual models without necessitating intricate fusion designs. Experimental results demonstrate that CE3D effectively integrates multiple visual models to achieve diverse editing visual effects, possessing strong scene comprehension and multi-round dialog capabilities. The code is available at https://sk-fun.fun/CE3D.

Submitted 9 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

Comments: Accepted by ECCV2024; Project Website: https://sk-fun.fun/CE3D

arXiv:2407.06815 [pdf, other]

Searching Accretion-Enhanced Dark Matter Annihilation Signals in the Galactic Centre

Authors: Mei-Wen Yang, Zhi-Qi Guo, Xiao-Yi Luo, Zhao-Qiang Shen, Zi-Qing Xia, Chih-Ting Lu, Yue-Lin Sming Tsai, Yi-Zhong Fan

Abstract: This study reanalyzes the detection prospects of dark matter (DM) annihilation signals in the Galactic Center, focusing on velocity-dependent dynamics within a spike density near the supermassive black hole (Sgr~A$^{\star}$). We investigate three annihilation processes -- $p$-wave, resonance, and forbidden annihilation -- under semi-relativistic velocities, leveraging gamma-ray data from Fermi and DAMPE telescopes. Our analysis integrates a fermionic DM model with an electroweak axion-like particle (ALP) portal, exploring annihilation into two or four photons. Employing a comprehensive six-dimensional integration, we precisely calculate DM-induced gamma-ray fluxes near Sgr~A$^{\star}$, incorporating velocity and positional dependencies in the annihilation cross-section and photon yield spectra. Our findings highlight scenarios of resonance and forbidden annihilation, where the larger ALP-DM-DM coupling constant $C_{aχχ}$ can affect spike density, potentially yielding detectable gamma-ray line spectra within Fermi and DAMPE energy resolution. We set upper limits for $C_{aχχ}$ across these scenarios, offering insights into the detectability and spectral characteristics of DM annihilation signals from the Galactic Center.

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.05040 [pdf, other]

Code Less, Align More: Efficient LLM Fine-tuning for Code Generation with Data Pruning

Authors: Yun-Da Tsai, Mingjie Liu, Haoxing Ren

Abstract: Recent work targeting large language models (LLMs) for code generation demonstrated that increasing the amount of training data through synthetic code generation often leads to exceptional performance. In this paper we explore data pruning methods aimed at enhancing the efficiency of model training specifically for code LLMs. We present techniques that integrate various clustering and pruning metrics to selectively reduce training data without compromising the accuracy and functionality of the generated code. We observe significant redundancies in synthetic training data generation, where our experiments demonstrate that benchmark performance can be largely preserved by training on only 10% of the data. Moreover, we observe consistent improvements in benchmark results through moderate pruning of the training data. Our experiments show that these pruning strategies not only reduce the computational resources needed but also enhance the overall quality of code generation.

Submitted 6 July, 2024; originally announced July 2024.

arXiv:2407.05036 [pdf, other]

Enhance the Robustness of Text-Centric Multimodal Alignments

Authors: Ting-Yu Yen, Yun-Da Tsai, Keng-Te Liao, Shou-De Lin

Abstract: Converting different modalities into general text, serving as input prompts for large language models (LLMs), is a common method to align multimodal models when there is limited pairwise data. This text-centric approach leverages the unique properties of text as a modality space, transforming diverse inputs into a unified textual representation. This enables downstream models to effectively interpret various modal inputs. This study assesses the quality and robustness of multimodal representations in the presence of missing entries, noise, or absent modalities, revealing that current text-centric alignment methods compromise downstream robustness. To address this issue, we propose a new text-centric approach that achieves superior robustness compared to previous methods across various modalities in different settings. Our findings highlight the potential of this approach to enhance the robustness and adaptability of multimodal representations, offering a promising solution for dynamic and real-world applications.

Submitted 6 July, 2024; originally announced July 2024.

arXiv:2407.01675 [pdf, other]

Hawking Radiation of Nonrelativistic Scalars: Applications to Pion and Axion Production

Authors: Hao-Ran Cui, Yuhsin Tsai, Tao Xu

Abstract: In studying secondary gamma-ray emissions from Primordial Black Holes (PBHs), the production of scalar particles like pions and axion-like particles (ALPs) via Hawking radiation is crucial. While previous analyses assumed relativistic production, asteroid-mass PBHs, relevant to upcoming experiments like AMEGO-X, likely produce pions and ALPs non-relativistically when their masses exceed 10 MeV. To account for mass dependence in Hawking radiation, we revisit the greybody factors for massive scalars from Schwarzschild black holes, revealing significant mass corrections to particle production rates compared to the projected AMEGO-X sensitivity. We highlight the importance of considering non-relativistic $π^0$ production in interpreting PBH gamma-ray signals, essential for determining PBH properties. Additionally, we comment on the potential suppression of pion production due to form factor effects when producing extended objects via Hawking radiation. We also provide an example code for calculating the Hawking radiation spectrum of massive scalar particles.

Submitted 1 July, 2024; originally announced July 2024.

Comments: 16+2 pages, 8 figures. The numerical code is available at https://github.com/Haoran-Brook/HoRNS

arXiv:2406.13806 [pdf, other]

First detection of coherent elastic neutrino-nucleus scattering on germanium

Authors: S. Adamski, M. Ahn, P. S. Barbeau, V. Belov, I. Bernardi, C. Bock, A. Bolozdynya, R. Bouabid, J. Browning, B. Cabrera-Palmer, N. Cedarblade-Jones, J. Colón Rivera, E. Conley, V. da Silva, J. Daughhetee, J. Detwiler, K. Ding, M. R. Durand, Y. Efremenko, S. R. Elliott, A. Erlandson, L. Fabris, A. Galindo-Uribarri, M. P. Green, J. Hakenmüller, et al. (62 additional authors not shown)

Abstract: We report the first detection of coherent elastic neutrino-nucleus scattering (CEvNS) on germanium, measured at the Spallation Neutron Source at Oak Ridge National Laboratory. The Ge-Mini detector of the COHERENT collaboration employs large-mass, low-noise, high-purity germanium spectrometers, enabling excellent energy resolution, and an analysis threshold of 1.5 keV electron-equivalent ionization energy. We observe an on-beam excess of $20.6^{+7.1}_{-6.3}$ counts with a total exposure of 10.22 GWhkg and we reject the no-CEvNS hypothesis with 3.9 sigma significance. The result agrees with the predicted standard model of particle physics signal rate within 2 sigma.

Submitted 19 June, 2024; originally announced June 2024.

Comments: 7 pages, 5 figures

arXiv:2406.10583 [pdf, other]

Demonstration of neutron identification in neutrino interactions in the MicroBooNE liquid argon time projection chamber

Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, et al. (165 additional authors not shown)

Abstract: A significant challenge in measurements of neutrino oscillations is reconstructing the incoming neutrino energies. While modern fully-active tracking calorimeters such as liquid argon time projection chambers in principle allow the measurement of all final state particles above some detection threshold, undetected neutrons remain a considerable source of missing energy with little to no data constraining their production rates and kinematics. We present the first demonstration of tagging neutrino-induced neutrons in liquid argon time projection chambers using secondary protons emitted from neutron-argon interactions in the MicroBooNE detector. We describe the method developed to identify neutrino-induced neutrons and demonstrate its performance using neutrons produced in muon-neutrino charged current interactions. The method is validated using a small subset of MicroBooNE's total dataset. The selection yields a sample with $60\%$ of selected tracks corresponding to neutron-induced secondary protons.

Submitted 15 June, 2024; originally announced June 2024.

Report number: FERMILAB-PUB-24-0301

arXiv:2406.10280 [pdf, other]

Transferable Embedding Inversion Attack: Uncovering Privacy Risks in Text Embeddings without Model Queries

Authors: Yu-Hsiang Huang, Yuche Tsai, Hsiang Hsiao, Hong-Yi Lin, Shou-De Lin

Abstract: This study investigates the privacy risks associated with text embeddings, focusing on the scenario where attackers cannot access the original embedding model. Contrary to previous research requiring direct model access, we explore a more realistic threat model by developing a transfer attack method. This approach uses a surrogate model to mimic the victim model's behavior, allowing the attacker to infer sensitive information from text embeddings without direct access. Our experiments across various embedding models and a clinical dataset demonstrate that our transfer attack significantly outperforms traditional methods, revealing the potential privacy vulnerabilities in embedding technologies and emphasizing the need for enhanced security measures.

Submitted 12 June, 2024; originally announced June 2024.

Comments: Accepted at ACL 2024 Main Conference

arXiv:2406.10123 [pdf, other]

Improving neutrino energy estimation of charged-current interaction events with recurrent neural networks in MicroBooNE

Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, et al. (164 additional authors not shown)

Abstract: We present a deep learning-based method for estimating the neutrino energy of charged-current neutrino-argon interactions. We employ a recurrent neural network (RNN) architecture for neutrino energy estimation in the MicroBooNE experiment, utilizing liquid argon time projection chamber (LArTPC) detector technology. Traditional energy estimation approaches in LArTPCs, which largely rely on reconstructing and summing visible energies, often experience sizable biases and resolution smearing because of the complex nature of neutrino interactions and the detector response. The estimation of neutrino energy can be improved after considering the kinematics information of reconstructed final-state particles. Utilizing kinematic information of reconstructed particles, the deep learning-based approach shows improved resolution and reduced bias for the muon neutrino Monte Carlo simulation sample compared to the traditional approach. In order to address the common concern about the effectiveness of this method on experimental data, the RNN-based energy estimator is further examined and validated with dedicated data-simulation consistency tests using MicroBooNE data. We also assess its potential impact on a neutrino oscillation study after accounting for all statistical and systematic uncertainties and show that it enhances physics sensitivity. This method has good potential to improve the performance of other physics analyses.

Submitted 14 June, 2024; originally announced June 2024.

Report number: FERMILAB-PUB-24-0287

arXiv:2406.09601 [pdf, other]

Turns Out I'm Not Real: Towards Robust Detection of AI-Generated Videos

Authors: Qingyuan Liu, Pengyuan Shi, Yun-Yun Tsai, Chengzhi Mao, Junfeng Yang

Abstract: The impressive achievements of generative models in creating high-quality videos have raised concerns about digital integrity and privacy vulnerabilities. Recent works to combat Deepfake videos have developed detectors that are highly accurate at identifying GAN-generated samples. However, the robustness of these detectors on diffusion-generated videos produced by video creation tools (e.g., SORA by OpenAI, Runway Gen-2, and Pika) is still unexplored. In this paper, we propose a novel framework for detecting videos synthesized from multiple state-of-the-art (SOTA) generative models, such as Stable Video Diffusion. We find that the SOTA methods for detecting diffusion-generated images lack robustness in identifying diffusion-generated videos. Our analysis reveals that the effectiveness of these detectors diminishes when applied to out-of-domain videos, primarily because they struggle to track the temporal features and dynamic variations between frames. To address the above-mentioned challenge, we collect a new benchmark video dataset for diffusion-generated videos using SOTA video creation tools. We extract representation within explicit knowledge from the diffusion model for video frames and train our detector with a CNN + LSTM architecture. The evaluation shows that our framework captures the temporal features between frames well, achieves 93.7% detection accuracy for in-domain videos, and improves the accuracy on out-of-domain videos by up to 16 points.

Submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.01355 [pdf, other]

Differentially Private Fine-Tuning of Diffusion Models

Authors: Yu-Lin Tsai, Yizhe Li, Zekai Chen, Po-Yu Chen, Chia-Mu Yu, Xuebin Ren, Francois Buet-Golfouse

Abstract: The integration of Differential Privacy (DP) with diffusion models (DMs) presents a promising yet challenging frontier, particularly due to the substantial memorization capabilities of DMs that pose significant privacy risks. Differential privacy offers a rigorous framework for safeguarding individual data points during model training, with Differential Privacy Stochastic Gradient Descent (DP-SGD) being a prominent implementation. The diffusion method decomposes image generation into iterative steps, theoretically aligning well with DP's incremental noise addition. Despite the natural fit, the unique architecture of DMs necessitates tailored approaches to effectively balance the privacy-utility trade-off. Recent developments in this field have highlighted the potential for generating high-quality synthetic data by pre-training on public data (e.g., ImageNet) and fine-tuning on private data. However, there is a pronounced gap in research on optimizing the trade-offs involved in DP settings, particularly concerning parameter efficiency and model scalability. Our work addresses this by proposing a parameter-efficient fine-tuning strategy optimized for private diffusion models, which minimizes the number of trainable parameters to enhance the privacy-utility trade-off. We empirically demonstrate that our method achieves state-of-the-art performance in DP synthesis, significantly surpassing previous benchmarks on widely studied datasets (e.g., with only 0.47M trainable parameters, achieving a more than 35% improvement over the previous state-of-the-art with a small privacy budget on the CelebA-64 dataset). Anonymous code is available at https://anonymous.4open.science/r/DP-LORA-F02F.

Submitted 3 June, 2024; originally announced June 2024.

Comments: 16 pages, 5 figures, 11 tables

arXiv:2405.17496 [pdf, other]

UU-Mamba: Uncertainty-aware U-Mamba for Cardiac Image Segmentation

Authors: Ting Yu Tsai, Li Lin, Shu Hu, Ming-Ching Chang, Hongtu Zhu, Xin Wang

Abstract: Biomedical image segmentation is critical for accurate identification and analysis of anatomical structures in medical imaging, particularly in cardiac MRI. Manual segmentation is labor-intensive, time-consuming, and prone to errors, highlighting the need for automated methods. However, current machine learning approaches face challenges like overfitting and data demands. To tackle these issues, we propose a new UU-Mamba model, integrating the U-Mamba model with the Sharpness-Aware Minimization (SAM) optimizer and an uncertainty-aware loss function. SAM enhances generalization by locating flat minima in the loss landscape, thus reducing overfitting. The uncertainty-aware loss combines region-based, distribution-based, and pixel-based loss designs to improve segmentation accuracy and robustness. Evaluation of our method is performed on the ACDC cardiac dataset, outperforming state-of-the-art models including TransUNet, Swin-Unet, nnUNet, and nnFormer. Our approach achieves Dice Similarity Coefficient (DSC) and Mean Squared Error (MSE) scores, demonstrating its effectiveness in cardiac MRI segmentation.

Submitted 4 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.16833 [pdf, other]

Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models

Authors: Chia-Yi Hsu, Yu-Lin Tsai, Chih-Hsun Lin, Pin-Yu Chen, Chia-Mu Yu, Chun-Ying Huang

Abstract: While large language models (LLMs) such as Llama-2 or GPT-4 have shown impressive zero-shot performance, fine-tuning is still necessary to enhance their performance for customized datasets, domain-specific tasks, or other private needs. However, fine-tuning all parameters of LLMs requires significant hardware resources, which can be impractical for typical users. Therefore, parameter-efficient fine-tuning methods such as LoRA have emerged, allowing users to fine-tune LLMs without the need for considerable computing resources, with little performance degradation compared to fine-tuning all parameters. Unfortunately, recent studies indicate that fine-tuning can increase the risk to the safety of LLMs, even when data does not contain malicious content. To address this challenge, we propose Safe LoRA, a simple one-liner patch to the original LoRA implementation by introducing the projection of LoRA weights from selected layers to the safety-aligned subspace, effectively reducing the safety risks in LLM fine-tuning while maintaining utility. It is worth noting that Safe LoRA is a training-free and data-free approach, as it only requires the knowledge of the weights from the base and aligned LLMs. Our extensive experiments demonstrate that when fine-tuning on purely malicious data, Safe LoRA retains similar safety performance as the original aligned model. Moreover, when the fine-tuning dataset contains a mixture of both benign and malicious data, Safe LoRA mitigates the negative effect made by malicious data while preserving performance on downstream tasks.

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.13194 [pdf, other]

KPConvX: Modernizing Kernel Point Convolution with Kernel Attention

Authors: Hugues Thomas, Yao-Hung Hubert Tsai, Timothy D. Barfoot, Jian Zhang

Abstract: In the field of deep point cloud understanding, KPConv is a unique architecture that uses kernel points to locate convolutional weights in space, instead of relying on Multi-Layer Perceptron (MLP) encodings. While it initially achieved success, it has since been surpassed by recent MLP networks that employ updated designs and training strategies. Building upon the kernel point principle, we present two novel designs: KPConvD (depthwise KPConv), a lighter design that enables the use of deeper architectures, and KPConvX, an innovative design that scales the depthwise convolutional weights of KPConvD with kernel attention values. Using KPConvX with a modern architecture and training strategy, we are able to outperform current state-of-the-art approaches on the ScanObjectNN, Scannetv2, and S3DIS datasets. We validate our design choices through ablation studies and release our code and models.

Submitted 21 May, 2024; originally announced May 2024.

Comments: CVPR 2024

arXiv:2405.09096 [pdf, other]

Optimizing Sensor Network Design for Multiple Coverage

Authors: Lukas Taus, Yen-Hsi Richard Tsai

Abstract: Sensor placement optimization methods have been studied extensively. They can be applied to a wide range of applications, including surveillance of known environments, optimal locations for 5G towers, and placement of missile defense systems. However, few works explore the robustness and efficiency of the resulting sensor network concerning sensor failure or adversarial attacks. This paper addresses this issue by optimizing for the least number of sensors to achieve multiple coverage of non-simply connected domains by a prescribed number of sensors. We introduce a new objective function for the greedy (next-best-view) algorithm to design efficient and robust sensor networks and derive theoretical bounds on the network's optimality. We further introduce a Deep Learning model to accelerate the algorithm for near real-time computations. The Deep Learning model requires the generation of training examples. Correspondingly, we show that understanding the geometric properties of the training data set provides important insights into the performance and training process of deep learning techniques. Finally, we demonstrate that a simple parallel version of the greedy approach using a simpler objective can be highly competitive.

Submitted 20 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

arXiv:2405.08064 [pdf, other]

Dark Matter-Radiation Scattering Enhances CMB Phase Shift through Dark Matter-loading

Authors: Subhajit Ghosh, Daven Wei Ren Ho, Yuhsin Tsai

Abstract: A phase shift in the acoustic oscillations of cosmic microwave background (CMB) spectra is a characteristic signature for the presence of non-photon radiation propagating differently from photons, even when the radiation couples to the Standard Model particles solely gravitationally. It is well-established that compared to the presence of free-streaming radiation, CMB spectra shift to higher $\ell$-modes in the presence of self-interacting non-photon radiation such as neutrinos and dark radiation. In this study, we further demonstrate that the scattering of non-photon radiation with dark matter can further amplify this phase shift. We show that when the energy density of the interacting radiation surpasses that of interacting dark matter around matter-radiation equality, the phase shift enhancement is proportional to the interacting dark matter abundance and remains insensitive to the radiation energy density. Given the presence of dark matter-radiation interaction, this additional phase shift emerges as a generic signature of models featuring an interacting dark sector or neutrino-dark matter scattering. Using neutrino-dark matter scattering as an example, we numerically calculate the amplified phase shift and offer an analytical interpretation of the result by modeling photon and neutrino perturbations with coupled harmonic oscillators. This framework also explains the phase shift contrast between self-interacting and free-streaming neutrinos. Fitting models with neutrino-dark matter or dark radiation-dark matter interactions to CMB and large-scale structure data, we validate the presence of the enhanced phase shift, affirmed by the linear dependence observed between the preferred regions of the sound horizon angle $θ_s$ and interacting dark matter abundance.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 54 pages, 22 figures, 5 tables

Report number: UT-WI-17-2024

arXiv:2405.01136 [pdf, other]

doi 10.1109/TVT.2023.3346791

Achievable Rate Analysis of Intelligent Omni-Surface Assisted NOMA Holographic MIMO Systems

Authors: Qingchao Li, Mohammed El-Hajjar, Yanshi Sun, Ibrahim Hemadeh, Yingming Tsai, Arman Shojaeifard, Lajos Hanzo

Abstract: An intelligent omni-surface (IOS) assisted holographic multiple-input and multiple-output architecture is conceived for $360^\circ$ full-space coverage at a low energy consumption. The theoretical ergodic rate lower bound of our non-orthogonal multiple access (NOMA) scheme is derived based on the moment matching approximation method, while considering the signal distortion at transceivers imposed by hardware impairments (HWIs). Furthermore, the asymptotically ergodic rate lower bound is derived both for an infinite number of IOS elements and for continuous aperture surfaces. Both the theoretical analysis and the simulation results show that the achievable rate of the NOMA scheme is higher than that of its orthogonal multiple access counterpart. Furthermore, owing to the HWIs at the transceivers, the achievable rate saturates at high signal-to-noise ratio region, instead of reaching its theoretical maximum.

Submitted 2 May, 2024; originally announced May 2024.

Comments: 6 pages, 3 figures. IEEE Transactions on Vehicular Technology, 2024

arXiv:2404.12019 [pdf, other]

The relic density and temperature evolution of light dark sector

Authors: Xin-Chen Duan, Raymundo Ramos, Yue-Lin Sming Tsai

Abstract: We have developed a set of four fully coupled Boltzmann equations to precisely determine the relic density and temperature of dark matter by including three distinct sectors: dark matter, light scalar, and standard model sectors. The intricacies of heat transfer between DM and the SM sector through a light scalar particle are explored, inspired by stringent experimental constraints on the scalar-Higgs mixing angle and the DM-scalar coupling. Three distinct sectors emerge prior to DM freeze-out, requiring fully coupled Boltzmann equations to accurately compute relic density. Investigation of forbidden, resonance, and secluded DM scenarios demonstrates significant deviations between established methods and the novel approach with fully coupled Boltzmann equations. Despite increased computational demands, this emphasizes the need for improved precision in relic density calculations, underlining the importance of incorporating these equations in comprehensive analyses.

Submitted 18 April, 2024; originally announced April 2024.

Comments: 38 pages, 10 figures

arXiv:2404.10948 [pdf, other]

First double-differential cross section measurement of neutral-current $π^0$ production in neutrino-argon scattering in the MicroBooNE detector

Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, et al. (166 additional authors not shown)

Abstract: We report the first double-differential cross section measurement of neutral-current neutral pion (NC$π^0$) production in neutrino-argon scattering, as well as single-differential measurements of the same channel in terms of final states with and without protons. The kinematic variables of interest for these measurements are the $π^0$ momentum and the $π^0$ scattering angle with respect to the neutrino beam. A total of 4971 candidate NC$π^0$ events fully-contained within the MicroBooNE detector are selected using data collected at a mean neutrino energy of $\sim 0.8$ GeV from $6.4\times10^{20}$ protons on target from the Booster Neutrino Beam at the Fermi National Accelerator Laboratory. After extensive data-driven model validation to ensure unbiased unfolding, the Wiener-SVD method is used to extract nominal flux-averaged cross sections. The results are compared to predictions from commonly used neutrino event generators, which tend to overpredict the measured NC$π^0$ cross section, especially in the 0.2-0.5 GeV/c $π^0$ momentum range, at forward scattering angles, and when at least one proton is present in the final state. These measurements show sensitivity to a variety of features that complicate the description of NC$π^0$ production including the form factors describing the elementary neutrino interaction and the final state interactions of the outgoing particles in the residual argon nucleus.
This data will help improve the modeling of NC$π^0$ production, which represents a major background in measurements of charge-parity violation in the neutrino sector and in searches for new physics beyond the Standard Model.

Submitted 16 April, 2024; originally announced April 2024.

Report number: FERMILAB-PUB-24-0125

arXiv:2404.09993 [pdf, other]

No More Ambiguity in 360° Room Layout via Bi-Layout Estimation

Authors: Yu-Ju Tsai, Jin-Cheng Jhang, Jingjing Zheng, Wei Wang, Albert Y. C. Chen, Min Sun, Cheng-Hao Kuo, Ming-Hsuan Yang

Abstract: Inherent ambiguity in layout annotations poses significant challenges to developing accurate 360° room layout estimation models. To address this issue, we propose a novel Bi-Layout model capable of predicting two distinct layout types. One stops at ambiguous regions, while the other extends to encompass all visible areas. Our model employs two global context embeddings, where each embedding is designed to capture specific contextual information for each layout type. With our novel feature guidance module, the image feature retrieves relevant context from these embeddings, generating layout-aware features for precise bi-layout predictions. A unique property of our Bi-Layout model is its ability to inherently detect ambiguous regions by comparing the two predictions. To circumvent the need for manual correction of ambiguous annotations during testing, we also introduce a new metric for disambiguating ground truth layouts. Our method demonstrates superior performance on benchmark datasets, notably outperforming leading approaches. Specifically, on the MatterportLayout dataset, it improves 3DIoU from 81.70% to 82.57% across the full test set and notably from 54.80% to 59.97% in subsets with significant ambiguity. Project page: https://liagm.github.io/Bi_Layout/

Submitted 15 April, 2024; originally announced April 2024.

Comments: CVPR 2024, Project page: https://liagm.github.io/Bi_Layout/

arXiv:2404.09949 [pdf, other]

Measurement of the differential cross section for neutral pion production in charged-current muon neutrino interactions on argon with the MicroBooNE detector

Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, G. Barr, D. Barrow, J. Barrow, V. Basque, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, M. B. Brunetti, L. Camilleri, et al. (163 additional authors not shown)

Abstract: We present a measurement of neutral pion production in charged-current interactions using data recorded with the MicroBooNE detector exposed to Fermilab's booster neutrino beam. The signal comprises one muon, one neutral pion, any number of nucleons, and no charged pions. Studying neutral pion production in the MicroBooNE detector provides an opportunity to better understand neutrino-argon interactions, and is crucial for future accelerator-based neutrino oscillation experiments. Using a dataset corresponding to $6.86 \times 10^{20}$ protons on target, we present single-differential cross sections in muon and neutral pion momenta, scattering angles with respect to the beam for the outgoing muon and neutral pion, as well as the opening angle between the muon and neutral pion. Data extracted cross sections are compared to generator predictions. We report good agreement between the data and the models for scattering angles, except for an over-prediction by generators at muon forward angles. Similarly, the agreement between data and the models as a function of momentum is good, except for an underprediction by generators in the medium momentum ranges, $200-400$ MeV for muons and $100-200$ MeV for pions.

Submitted 6 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

Report number: FERMILAB-PUB-24-0142-CSAID-PPD

arXiv:2404.07977 [pdf, other]

Gaga: Group Any Gaussians via 3D-aware Memory Bank

Authors: Weijie Lyu, Xueting Li, Abhijit Kundu, Yi-Hsuan Tsai, Ming-Hsuan Yang

Abstract: We introduce Gaga, a framework that reconstructs and segments open-world 3D scenes by leveraging inconsistent 2D masks predicted by zero-shot segmentation models. Contrasted to prior 3D scene segmentation approaches that heavily rely on video object tracking, Gaga utilizes spatial information and effectively associates object masks across diverse camera poses. By eliminating the assumption of continuous view changes in training images, Gaga demonstrates robustness to variations in camera poses, particularly beneficial for sparsely sampled images, ensuring precise mask label consistency. Furthermore, Gaga accommodates 2D segmentation masks from diverse sources and demonstrates robust performance with different open-world zero-shot segmentation models, enhancing its versatility. Extensive qualitative and quantitative evaluations demonstrate that Gaga performs favorably against state-of-the-art methods, emphasizing its potential for real-world applications such as scene understanding and manipulation.

Submitted 11 April, 2024; originally announced April 2024.

Comments: Project Page: https://www.gaga.gallery

arXiv:2404.00095 [pdf, other]

GDA: Generalized Diffusion for Robust Test-time Adaptation

Authors: Yun-Yun Tsai, Fu-Chen Chen, Albert Y. C. Chen, Junfeng Yang, Che-Chun Su, Min Sun, Cheng-Hao Kuo

Abstract: Machine learning models struggle with generalization when encountering out-of-distribution (OOD) samples with unexpected distribution shifts. For vision tasks, recent studies have shown that test-time adaptation employing diffusion models can achieve state-of-the-art accuracy improvements on OOD samples by generating new samples that align with the model's domain without the need to modify the model's weights. Unfortunately, those studies have primarily focused on pixel-level corruptions, thereby lacking the generalization to adapt to a broader range of OOD types. We introduce Generalized Diffusion Adaptation (GDA), a novel diffusion-based test-time adaptation method robust against diverse OOD types. Specifically, GDA iteratively guides the diffusion by applying a marginal entropy loss derived from the model, in conjunction with style and content preservation losses during the reverse sampling process. In other words, GDA considers the model's output behavior with the semantic information of the samples as a whole, which can reduce ambiguity in downstream tasks during the generation process. Evaluation across various popular model architectures and OOD benchmarks shows that GDA consistently outperforms prior work on diffusion-driven adaptation. Notably, it achieves the highest classification accuracy improvements, ranging from 4.4\% to 5.02\% on ImageNet-C and 2.5\% to 7.4\% on Rendition, Sketch, and Stylized benchmarks. This performance highlights GDA's generalization to a broader range of OOD benchmarks.

Submitted 2 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

arXiv:2403.19574 [pdf, other]

Measurement of double-differential cross sections for mesonless charged-current muon neutrino interactions on argon with final-state protons using the MicroBooNE detector

Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, G. Barr, D. Barrow, J. Barrow, V. Basque, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, M. B. Brunetti, L. Camilleri, et al. (163 additional authors not shown)

Abstract: Charged-current neutrino interactions with final states containing zero mesons and at least one proton are of high interest for current and future accelerator-based neutrino oscillation experiments. Using the Booster Neutrino Beam and the MicroBooNE detector at Fermi National Accelerator Laboratory, we have obtained the first double-differential cross section measurements of this channel for muon neutrino scattering on an argon target with a proton momentum threshold of 0.25 GeV/c. We also report a flux-averaged total cross section of $σ= (11.8 \pm 1.2) \times 10^{-38}$ cm$^2$ / Ar and several single-differential measurements which extend and improve upon previous results. Statistical and systematic uncertainties are quantified with a full treatment of correlations across 359 kinematic bins, including correlations between distributions describing different observables. The resulting data set provides the most detailed information obtained to date for testing models of mesonless neutrino-argon scattering.

Submitted 16 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

Comments: 83 pages, 67 figures (including supplemental material). For v2, added oversized files in extended data release

Report number: FERMILAB-PUB-24-0120-AD-CSAID-LBNF-PPD-TD

arXiv:2403.13334 [pdf]

Hyacinth6B: A large language model for Traditional Chinese

Authors: Chih-Wei Song, Yin-Te Tsai

Abstract: The primary motivation of this study is to address the high hardware and computational demands typically associated with LLMs. Therefore, our goal is to find a balance between model lightness and performance, striving to maximize performance while using a comparatively lightweight model. Hyacinth6B was developed with this objective in mind, aiming to fully leverage the core capabilities of LLMs without incurring substantial resource costs, effectively pushing the boundaries of smaller models' performance. The training approach involves parameter-efficient finetuning using the LoRA method.

Submitted 26 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

Comments: 14 pages

arXiv:2403.10596 [pdf, other]

Neural Erosion: Emulating Controlled Neurodegeneration and Aging in AI Systems

Authors: Antonios Alexos, Yu-Dai Tsai, Ian Domingo, Maryam Pishgar, Pierre Baldi

Abstract: Creating controlled methods to simulate neurodegeneration in artificial intelligence (AI) is crucial for applications that emulate brain function decline and cognitive disorders. We use IQ tests performed by Large Language Models (LLMs) and, more specifically, the LLaMA 2 to introduce the concept of "neural erosion." This deliberate erosion involves ablating synapses or neurons, or adding Gaussian noise during or after training, resulting in a controlled progressive decline in the LLMs' performance. We are able to describe the neurodegeneration in the IQ tests and show that the LLM first loses its mathematical abilities and then its linguistic abilities, while further losing its ability to understand the questions. To the best of our knowledge, this is the first work that models neurodegeneration with text data, compared to other works that operate in the computer vision domain. Finally, we draw similarities between our study and cognitive decline clinical studies involving test subjects. We find that with the application of neurodegenerative methods, LLMs lose abstract thinking abilities, followed by mathematical degradation, and ultimately, a loss in linguistic ability, responding to prompts incoherently. These findings are in accordance with human studies.

Submitted 15 March, 2024; originally announced March 2024.

Comments: 19 pages, 6 figures in the main text, 5 figures in the Appendix

arXiv:2403.06230 [pdf, other]

LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem

Authors: Yun-Ang Wu, Yun-Da Tsai, Shou-De Lin

Abstract: In this study, we delve into the Thresholding Linear Bandit (TLB) problem, a nuanced domain within stochastic Multi-Armed Bandit (MAB) problems, focusing on maximizing decision accuracy against a linearly defined threshold under resource constraints. We present LinearAPT, a novel algorithm designed for the fixed budget setting of TLB, providing an efficient solution to optimize sequential decision-making. This algorithm not only offers a theoretical upper bound for estimated loss but also showcases robust performance on both synthetic and real-world datasets. Our contributions highlight the adaptability, simplicity, and computational efficiency of LinearAPT, making it a valuable addition to the toolkit for addressing complex sequential decision-making challenges.

Submitted 10 March, 2024; originally announced March 2024.

arXiv:2403.03212 [pdf, other]

Performance of a modular ton-scale pixel-readout liquid argon time projection chamber

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, et al. (1340 additional authors not shown)

Abstract: The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations.

Submitted 5 March, 2024; originally announced March 2024.

Comments: 47 pages, 41 figures

Report number: FERMILAB-PUB-24-0073-LBNF

arXiv:2403.02721 [pdf, other]

Light Thermal Dark Matter Beyond $p$-Wave Annihilation in Minimal Higgs Portal Model

Authors: Yu-Tong Chen, Shigeki Matsumoto, Tian-Peng Tang, Yue-Lin Sming Tsai, Lei Wu

Abstract: This study explores a minimal renormalizable dark matter (DM) model, incorporating a sub-GeV Majorana DM and a singlet scalar particle $φ$. Using scalar and pseudo-scalar interactions (couplings $c_s$ and $c_p$), we investigate implications for DM detection, considering $s$-wave, $p$-wave, and combined ($s$+$p$ wave) contributions in DM annihilation cross-section, as well as loop-correction contributions to DM-nucleon elastic scattering. Identifying a broad parameter space ($10 \,\rm{MeV} < m_χ\lesssim m_φ$) within the $2σ$ allowed region, we explore scenarios ($\left|c_s\right|\gg \left|c_p\right|$, $\left|c_s\right|\ll \left|c_p\right|$, and $\left|c_s\right|\approx \left|c_p\right|$). We find that (i) a non-zero pseudo-scalar coupling alleviates direct detection constraints as a comparison with the previous pure scalar coupling case; (ii) CMB observations set stringent limits on pseudo-scalar interaction dominant cases, making $s$-wave annihilation viable only for $m_χ>1\,\rm{GeV}$; (iii) the preferred $φ$-resonance region can be tested in the future indirect detection experiments, such as e-ASTROGAM.

Submitted 30 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

Comments: 35 pages, 4 figures

arXiv:2402.19281 [pdf, other]

First simultaneous measurement of differential muon-neutrino charged-current cross sections on argon for final states with and without protons using MicroBooNE data

Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, G. Barr, D. Barrow, J. Barrow, V. Basque, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, M. B. Brunetti, L. Camilleri, et al. (163 additional authors not shown)

Abstract: We report the first double-differential neutrino-argon cross section measurement made simultaneously for final states with and without protons for the inclusive muon neutrino charged-current interaction channel. The proton kinematics of this channel are further explored with a differential cross section measurement as a function of the leading proton's kinetic energy that extends across the detection threshold. These measurements utilize data collected using the MicroBooNE detector from 6.4$\times10^{20}$ protons on target from the Fermilab Booster Neutrino Beam with a mean neutrino energy of $\sim$0.8 GeV. Extensive data-driven model validation utilizing the conditional constraint formalism is employed. This motivates enlarging the uncertainties with an empirical reweighting approach to minimize the possibility of extracting biased cross section results. The extracted nominal flux-averaged cross sections are compared to widely used event generator predictions revealing severe mismodeling of final states without protons for muon neutrino charged-current interactions, possibly from insufficient treatment of final state interactions. These measurements provide a wealth of new information useful for improving event generators which will enhance the sensitivity of precision measurements in neutrino experiments.

Submitted 29 February, 2024; originally announced February 2024.

Report number: FERMILAB-PUB-24-0045

arXiv:2402.19216 [pdf, other]

Inclusive cross section measurements in final states with and without protons for charged-current $ν_μ$-Ar scattering in MicroBooNE

Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, G. Barr, D. Barrow, J. Barrow, V. Basque, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, M. B. Brunetti, L. Camilleri, et al. (164 additional authors not shown)

Abstract: A detailed understanding of inclusive muon neutrino charged-current interactions on argon is crucial to the study of neutrino oscillations in current and future experiments using liquid argon time projection chambers. To that end, we report a comprehensive set of differential cross section measurements for this channel that simultaneously probe the leptonic and hadronic systems by dividing the channel into final states with and without protons. Measurements of the proton kinematics and proton multiplicity of the final state are also presented. For these measurements, we utilize data collected with the MicroBooNE detector from 6.4$\times10^{20}$ protons on target from the Fermilab Booster Neutrino Beam at a mean neutrino energy of approximately 0.8 GeV. We present in detail the cross section extraction procedure, including the unfolding, and model validation that uses data to model comparisons and the conditional constraint formalism to detect mismodeling that may introduce biases to extracted cross sections that are larger than their uncertainties. The validation exposes insufficiencies in the overall model, motivating the inclusion of an additional data-driven reweighting systematic to ensure the accuracy of the unfolding. The extracted results are compared to a number of event generators and their performance is discussed with a focus on the regions of phase-space that indicate the greatest need for modeling improvements.

Submitted 29 February, 2024; originally announced February 2024.

Report number: FERMILAB-PUB-24-0044

arXiv:2402.19071 [pdf, other]

FATE in MMLA: A Student-Centred Exploration of Fairness, Accountability, Transparency, and Ethics in Multimodal Learning Analytics

Authors: Yueqiao Jin, Vanessa Echeverria, Lixiang Yan, Linxuan Zhao, Riordan Alfredo, Yi-Shan Tsai, Dragan Gašević, Roberto Martinez-Maldonado

Abstract: Multimodal Learning Analytics (MMLA) integrates novel sensing technologies and artificial intelligence algorithms, providing opportunities to enhance student reflection during complex, collaborative learning experiences. Although recent advancements in MMLA have shown its capability to generate insights into diverse learning behaviours across various learning settings, little research has been conducted to evaluate these systems in authentic learning contexts, particularly regarding students' perceived fairness, accountability, transparency, and ethics (FATE). Understanding these perceptions is essential to using MMLA effectively without introducing ethical complications or negatively affecting how students learn. This study aimed to address this gap by assessing the FATE of MMLA in an authentic, collaborative learning context. We conducted semi-structured interviews with 14 undergraduate students who used MMLA visualisations for post-activity reflection. The findings highlighted the significance of accurate and comprehensive data representation to ensure visualisation fairness, the need for different levels of data access to foster accountability, the imperative of measuring and cultivating transparency with students, and the necessity of transforming informed consent from dichotomous to continuous and measurable scales. While students value the benefits of MMLA, they also emphasise the importance of ethical considerations, highlighting a pressing need for the LA and MMLA community to investigate and address FATE issues actively.

Submitted 29 February, 2024; originally announced February 2024.

Comments: 16 pages, 1 figure

arXiv:2402.18880 [pdf, other]

Weak Lensing Constraints on Dark Matter-Baryon Interactions with $N$-Body Simulations and Machine Learning

Authors: Chi Zhang, Lei Zu, Hou-Zun Chen, Yue-Lin Sming Tsai, Yi-Zhong Fan

Abstract: We investigate the elastic scattering cross section between dark matter and protons using the DES Year 3 weak lensing data. This scattering induces a dark acoustic oscillation structure in the matter power spectra. To address non-linear effects at low redshift, we utilize principal component analysis alongside a limited set of $N$-body simulations, improving the reliability of our matter power spectrum prediction. We further perform a robust Markov Chain Monte Carlo analysis to derive the upper bounds on the DM-proton elastic scattering cross-section, assuming different velocity dependencies. Our results, presented as the first Frequentist upper limits, are compared with the ones obtained by a Bayesian approach. Compared with the upper limits derived from the Planck cosmic microwave background data, our findings from DES Year 3 data exhibit improvements of up to a factor of five. In addition, we forecast the future sensitivities of the China Space Station Telescope, whose upcoming capabilities could improve the current limits by approximately one order of magnitude.

Submitted 29 February, 2024; originally announced February 2024.

Comments: 10 pages, 5 figures

arXiv:2402.17600 [pdf]

Sustained Robust Exciton Emission in Suspended Monolayer WSe_2 within the Low Carrier Density Regime for Quantum Emitter Applications

Authors: Zheng-Zhe Chen, Chiao-Yun Chang, Ya-Ting Tsai, Po-Cheng Tsai, Shih-Yen Lin, Min-Hsiung Shih

Abstract: The development of semiconductor optoelectronic devices is moving toward low power consumption and miniaturization, especially for high-efficiency quantum emitters. However, most of these quantum sources work in the low carrier density regime, where Shockley-Read-Hall (SRH) recombination may dominate and seriously reduce the emission efficiency. To diminish the effect of carrier trapping and sustain strong photoluminescence (PL) emission under low-power pumping, we investigated the influence of suspending monolayer tungsten diselenide (WSe_2), a novel two-dimensional quantum material. Not only the PL intensity but also the fundamental photoluminescence quantum yield (PLQY) exhibited a large, order-of-magnitude enhancement through suspending. Surprisingly, the PLQY improvement was far more significant under small pumping power and showed an exponentially increasing tendency toward even lower carrier densities. With its strong excitonic effect, suspended WSe_2 offers a way to reduce carrier trapping and participation in non-radiative processes. Moreover, in the low-power range where SRH recombination dominates, suspended WSe_2 exhibited a remarkably higher percentage of excitonic radiation than contacted WSe_2. Herein, we quantitatively demonstrate the significance of the suspended WSe_2 monolayer in the low carrier density regime, highlighting its potential for developing compact, low-power quantum emitters in the future.

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.08086 [pdf, other]

Text-centric Alignment for Multi-Modality Learning

Authors: Yun-Da Tsai, Ting-Yu Yen, Pei-Fu Guo, Zhe-Yan Li, Shou-De Lin

Abstract: This research paper addresses the challenge of modality mismatch in multimodal learning, where the modalities available during inference differ from those available at training. We propose the Text-centric Alignment for Multi-Modality Learning (TAMML) approach, an innovative method that utilizes Large Language Models (LLMs) with in-context learning and foundation models to enhance the generalizability of multimodal systems under these conditions. By leveraging the unique properties of text as a unified semantic space, TAMML demonstrates significant improvements in handling unseen, diverse, and unpredictable modality combinations. TAMML not only adapts to varying modalities but also maintains robust performance, showcasing the potential of foundation models in overcoming the limitations of traditional fixed-modality frameworks in embedding representations. This study contributes to the field by offering a flexible, effective solution for real-world applications where modality availability is dynamic and uncertain.

Submitted 20 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

arXiv:2402.03021 [pdf, other]

Data-induced multiscale losses and efficient multirate gradient descent schemes

Authors: Juncai He, Liangchen Liu, Yen-Hsi Richard Tsai

Abstract: This paper investigates the impact of multiscale data on machine learning algorithms, particularly in the context of deep learning. A dataset is multiscale if its distribution shows large variations in scale across different directions. This paper reveals multiscale structures in the loss landscape, including its gradients and Hessians inherited from the data. Correspondingly, it introduces a novel gradient descent approach, drawing inspiration from multiscale algorithms used in scientific computing. This approach seeks to transcend empirical learning rate selection, offering a more systematic, data-informed strategy to enhance training efficiency, especially in the later stages.

Submitted 6 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

Comments: 28 pages, 4 figures, submitted under review

MSC Class: 65F10; 65F45; 68T07

ACM Class: G.1.6; I.2.6

arXiv:2402.01568 [pdf, other]

Doping Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar Es-sghir, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, et al. (1300 additional authors not shown)

Abstract: Doping of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first doping test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 770 t of total liquid argon mass with 410 t of fiducial mass. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected. We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon doping can substantially recover light losses due to contamination of the liquid argon by nitrogen.

Submitted 9 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: 35 pages, 20 figures

Report number: CERN-EP-2024-024; FERMILAB-PUB-23-0819-LBNF

arXiv:2402.00902 [pdf, other]

GAMPix: a novel fine-grained, low-noise and ultra-low power pixelated charge readout for TPCs

Authors: Tom Shutt, Bahrudin Trbalic, Aldo Pena-Perez, Steffen Luitz, Mark Convery, Angelo Dragone, Lorenzo Rota, Dietrich R. Freytag, Dionisio Doering, Filippo Mele, Miriam Moore, Hiro Tanaka, Yun-Tse Tsai

Abstract: We report on the development of a novel pixel charge readout system, Grid Activated Multi-scale pixel readout (GAMPix), which is under development for use in the GammaTPC gamma ray instrument concept. GammaTPC is being developed to optimize the use of liquid argon time projection chamber technology for gamma ray astrophysics, for which a fine grained low power charge readout is essential. GAMPix uses a new architecture with coarse and fine scale instrumented electrodes to solve the twin problems of loss of measured charge after diffusion, and high readout power. Fundamentally, it enables low noise and ultra low power charge readout at the spatial scale limited by diffusion in a time projection chamber, and has other possible applications, including future DUNE modules.

Submitted 31 January, 2024; originally announced February 2024.

Comments: 23 pages, 18 figures

arXiv:2402.00251 [pdf, other]

Efficient Non-Parametric Uncertainty Quantification for Black-Box Large Language Models and Decision Planning

Authors: Yao-Hung Hubert Tsai, Walter Talbott, Jian Zhang

Abstract: Step-by-step decision planning with large language models (LLMs) is gaining attention in AI agent development. This paper focuses on decision planning with uncertainty estimation to address the hallucination problem in language models. Existing approaches are either white-box or computationally demanding, limiting use of black-box proprietary LLMs within budgets. The paper's first contribution is a non-parametric uncertainty quantification method for LLMs, efficiently estimating point-wise dependencies between input-decision on the fly with a single inference, without access to token logits. This estimator informs the statistical interpretation of decision trustworthiness. The second contribution outlines a systematic design for a decision-making agent, generating actions like "turn on the bathroom light" based on user prompts such as "take a bath". Users will be asked to provide preferences when more than one action has high estimated point-wise dependencies. In conclusion, our uncertainty estimation and decision-making agent design offer a cost-efficient approach for AI agent development.

Submitted 31 January, 2024; originally announced February 2024.

arXiv:2401.15879 [pdf, other]

lil'HDoC: An Algorithm for Good Arm Identification under Small Threshold Gap

Authors: Tzu-Hsien Tsai, Yun-Da Tsai, Shou-De Lin

Abstract: Good arm identification (GAI) is a pure-exploration bandit problem in which a single learner outputs an arm as soon as it is identified as a good arm. A good arm is defined as an arm with an expected reward greater than or equal to a given threshold. This paper focuses on the GAI problem under a small threshold gap, which refers to the distance between the expected rewards of arms and the given threshold. We propose a new algorithm called lil'HDoC to significantly improve the total sample complexity of the HDoC algorithm. We demonstrate that the sample complexity of the first $λ$ output arm in lil'HDoC is bounded by the original HDoC algorithm, except for one negligible term, when the distance between the expected reward and threshold is small. Extensive experiments confirm that our algorithm outperforms the state-of-the-art algorithms in both synthetic and real-world datasets.

Submitted 12 March, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

arXiv:2401.14285 [pdf, other]

POUR-Net: A Population-Prior-Aided Over-Under-Representation Network for Low-Count PET Attenuation Map Generation

Authors: Bo Zhou, Jun Hou, Tianqi Chen, Yinchi Zhou, Xiongchao Chen, Huidong Xie, Qiong Liu, Xueqi Guo, Yu-Jung Tsai, Vladimir Y. Panin, Takuya Toyonaga, James S. Duncan, Chi Liu

Abstract: Low-dose PET offers a valuable means of minimizing radiation exposure in PET imaging. However, the prevalent practice of employing additional CT scans for generating attenuation maps ($μ$-maps) for PET attenuation correction significantly elevates radiation doses. To address this concern and further mitigate radiation exposure in low-dose PET exams, we propose POUR-Net, an innovative population-prior-aided over-under-representation network that aims for high-quality attenuation map generation from low-dose PET. First, POUR-Net incorporates an over-under-representation network (OUR-Net) to facilitate efficient feature extraction, encompassing both low-resolution abstracted and fine-detail features, for assisting deep generation on the full-resolution level. Second, complementing OUR-Net, a population prior generation machine (PPGM) utilizing a comprehensive CT-derived $μ$-map dataset provides additional prior information to aid OUR-Net generation. The integration of OUR-Net and PPGM within a cascade framework enables iterative refinement of $μ$-map generation, resulting in the production of high-quality $μ$-maps. Experimental results underscore the effectiveness of POUR-Net, showing it as a promising solution for accurate CT-free low-count PET attenuation correction, which also surpasses the performance of previous baseline methods.

Submitted 25 January, 2024; originally announced January 2024.

Comments: 10 pages, 5 figures

arXiv:2401.11944 [pdf, other]

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

Authors: Ge Zhang, Xinrun Du, Bei Chen, Yiming Liang, Tongxu Luo, Tianyu Zheng, Kang Zhu, Yuyang Cheng, Chunpu Xu, Shuyue Guo, Haoran Zhang, Xingwei Qu, Junjie Wang, Ruibin Yuan, Yizhi Li, Zekun Wang, Yudong Liu, Yu-Hsuan Tsai, Fengji Zhang, Chenghua Lin, Wenhao Huang, Wenhu Chen, Jie Fu

Abstract: As the capabilities of large multimodal models (LMMs) continue to advance, evaluating the performance of LMMs emerges as an increasing need. Additionally, there is an even larger gap in evaluating the advanced knowledge and reasoning abilities of LMMs in non-English contexts such as Chinese. We introduce CMMMU, a new Chinese Massive Multi-discipline Multimodal Understanding benchmark designed to evaluate LMMs on tasks demanding college-level subject knowledge and deliberate reasoning in a Chinese context. CMMMU is inspired by and strictly follows the annotation and analysis pattern of MMMU. CMMMU includes 12k manually collected multimodal questions from college exams, quizzes, and textbooks, covering six core disciplines: Art & Design, Business, Science, Health & Medicine, Humanities & Social Science, and Tech & Engineering, like its companion, MMMU. These questions span 30 subjects and comprise 39 highly heterogeneous image types, such as charts, diagrams, maps, tables, music sheets, and chemical structures. CMMMU focuses on complex perception and reasoning with domain-specific knowledge in the Chinese context. We evaluate 11 open-source LLMs and one proprietary GPT-4V(ision). Even GPT-4V only achieves an accuracy of 42%, indicating a large space for improvement. CMMMU will boost the community to build the next-generation LMMs towards expert artificial intelligence and promote the democratization of LMMs by providing diverse language contexts.

Submitted 18 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.09222 [pdf, other]

Local Trajectory Variation Exponent (LTVE) for Visualizing Dynamical Systems

Authors: Yun Chen Tsai, Shingyu Leung

Abstract: The identification and visualization of Lagrangian structures in flows play a crucial role in the study of dynamic systems and fluid dynamics. The Finite Time Lyapunov Exponent (FTLE) has been widely used for this purpose; however, it only approximates the flow by considering the positions of particles at the initial and final times, ignoring the actual trajectory of the particle. To overcome this limitation, we propose a novel quantity that extends and generalizes the FTLE by incorporating trajectory metrics as a measure of similarity between trajectories. Our proposed method utilizes trajectory metrics to quantify the distance between trajectories, providing a more robust and accurate measure of the Lagrangian coherent structures (LCS). By incorporating trajectory metrics, we can capture the actual path of the particle and account for its behavior over time, resulting in a more comprehensive analysis of the flow. Our approach extends the traditional FTLE approach to include trajectory metrics as a means of capturing the complexity of the flow.

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2401.02143 [pdf, other]

Graph Neural Networks for Tabular Data Learning: A Survey with Taxonomy and Directions

Authors: Cheng-Te Li, Yu-Che Tsai, Chih-Yao Chen, Jay Chiehen Liao

Abstract: In this survey, we dive into Tabular Data Learning (TDL) using Graph Neural Networks (GNNs), a domain where deep learning-based approaches have increasingly shown superior performance in both classification and regression tasks compared to traditional methods. The survey highlights a critical gap in deep neural TDL methods: the underrepresentation of latent correlations among data instances and feature values. GNNs, with their innate capability to model intricate relationships and interactions between diverse elements of tabular data, have garnered significant interest and application across various TDL domains. Our survey provides a systematic review of the methods involved in designing and implementing GNNs for TDL (GNN4TDL). It encompasses a detailed investigation into the foundational aspects and an overview of GNN-based TDL methods, offering insights into their evolving landscape. We present a comprehensive taxonomy focused on constructing graph structures and representation learning within GNN-based TDL methods. In addition, the survey examines various training plans, emphasizing the integration of auxiliary tasks to enhance the effectiveness of instance representations. A critical part of our discussion is dedicated to the practical application of GNNs across a spectrum of GNN4TDL scenarios, demonstrating their versatility and impact. Lastly, we discuss the limitations and propose future research directions, aiming to spur advancements in GNN4TDL. This survey serves as a resource for researchers and practitioners, offering a thorough understanding of GNNs' role in revolutionizing TDL and pointing towards future innovations in this promising area.

Submitted 4 January, 2024; originally announced January 2024.

Comments: Under review, ongoing work, Github page: https://github.com/Roytsai27/awesome-GNN4TDL

arXiv:2312.13945 [pdf, other]

First search for dark-trident processes using the MicroBooNE detector

Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, G. Barr, D. Barrow, J. Barrow, V. Basque, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book, M. B. Brunetti, L. Camilleri, et al. (163 additional authors not shown)

Abstract: We present a first search for dark-trident scattering in a neutrino beam using a data set corresponding to $7.2 \times 10^{20}$ protons on target taken with the MicroBooNE detector at Fermilab. Proton interactions in the neutrino target at the Main Injector produce $π^0$ and $η$ mesons, which could decay into dark-matter (DM) particles mediated via a dark photon $A^\prime$. A convolutional neural network is trained to identify interactions of the DM particles in the liquid-argon time projection chamber (LArTPC) exploiting its image-like reconstruction capability. In the absence of a DM signal, we provide limits at the $90\%$ confidence level on the squared kinematic mixing parameter $\varepsilon^2$ as a function of the dark-photon mass in the range $10\le M_{A^\prime}\le 400$ MeV. The limits cover previously unconstrained parameter space for the production of fermion or scalar DM particles $χ$ for two benchmark models with mass ratios $M_χ/M_{A^\prime}=0.6$ and $2$ and for dark fine-structure constants $0.1\leα_D\le 1$.

Submitted 16 May, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

arXiv:2312.08371 [pdf, other]

PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection

Authors: Kuan-Chih Huang, Weijie Lyu, Ming-Hsuan Yang, Yi-Hsuan Tsai

Abstract: Recent temporal LiDAR-based 3D object detectors achieve promising performance based on the two-stage proposal-based approach. They generate 3D box candidates from the first-stage dense detector, followed by different temporal aggregation methods. However, these approaches require per-frame objects or whole point clouds, posing challenges related to memory bank utilization. Moreover, point clouds and trajectory features are combined solely based on concatenation, which may neglect effective interactions between them. In this paper, we propose a point-trajectory transformer with long short-term memory for efficient temporal 3D object detection. To this end, we only utilize point clouds of current-frame objects and their historical trajectories as input to minimize the memory bank storage requirement. Furthermore, we introduce modules to encode trajectory features, focusing on long short-term and future-aware perspectives, and then effectively aggregate them with point cloud features. We conduct extensive experiments on the large-scale Waymo dataset to demonstrate that our approach performs well against state-of-the-art methods. Code and models will be made publicly available at https://github.com/kuanchihhuang/PTT.

Submitted 24 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

Comments: Accepted to CVPR 2024. Project page: https://github.com/kuanchihhuang/PTT

arXiv:2312.07530 [pdf, other]

Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance

Authors: Kuan-Chih Huang, Yi-Hsuan Tsai, Ming-Hsuan Yang

Abstract: Weakly supervised 3D object detection aims to learn a 3D detector with lower annotation cost, e.g., 2D labels. Unlike prior work, which still relies on a few accurate 3D annotations, we propose a framework to study how to leverage constraints between 2D and 3D domains without requiring any 3D labels. Specifically, we employ visual data from three perspectives to establish connections between 2D and 3D domains. First, we design a feature-level constraint to align LiDAR and image features based on object-aware regions. Second, the output-level constraint is developed to enforce the overlap between 2D and projected 3D box estimations. Finally, the training-level constraint is utilized by producing accurate and consistent 3D pseudo-labels that align with the visual data. We conduct extensive experiments on the KITTI dataset to validate the effectiveness of the proposed three constraints. Without using any 3D labels, our method achieves favorable performance against state-of-the-art approaches and is competitive with the method that uses 500-frame 3D annotations. Code and models will be made publicly available at https://github.com/kuanchihhuang/VG-W3D.

Submitted 23 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

Comments: Project page: https://github.com/kuanchihhuang/VG-W3D

arXiv:2312.03985 [pdf, other]

Flux tunable graphene-based superconducting quantum circuits coupled to 3D cavity

Authors: Kuei-Lin Chiu, Youyi Chang, Avishma J. Lasrado, Cheng-Han Lo, Yung-Hsiang Chen, Tao-Yi Hsu, Yen-Chih Chen, Yi-Chen Tsai, Samina, Yen-Hsiang Lin, Chung-Ting Ke

Abstract: Correlation between transmon and its composite Josephson junctions (JJ) plays an important role in designing new types of superconducting qubits based on quantum materials. It is desirable to have a type of device that not only allows exploration for use in quantum information processing but also probing intrinsic properties in the composite JJs. Here, we construct a flux-tunable 3D transmon-type superconducting quantum circuit made of graphene as a proof-of-concept prototype device. This 3D transmon-type device not only enables coupling to 3D cavities for microwave probes but also permits DC transport measurements on the same device, providing useful connections between transmon properties and critical currents associated with JJ's properties. We have demonstrated how flux-modulation in cavity frequency and DC critical current can be correlated under the influence of Fraunhofer pattern of JJs in an asymmetric SQUID. The correlation analysis was further extended to link the flux-modulated transmon properties, such as flux-tunability in qubit and cavity frequencies, with SQUID symmetry analysis based on DC measurements. Our study paves the way towards integrating novel materials for exploration of new types of quantum devices for future technology while probing underlying physics in the composite materials.

Submitted 6 December, 2023; originally announced December 2023.

arXiv:2312.03130 [pdf, other]

The DUNE Far Detector Vertical Drift Technology, Technical Design Report

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, et al. (1304 additional authors not shown)

Abstract: DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precision measurements of the PMNS matrix parameters, including the CP-violating phase. It will also stand ready to observe supernova neutrino bursts, and seeks to observe nucleon decay as a signature of a grand unified theory underlying the standard model. The DUNE far detector implements liquid argon time-projection chamber (LArTPC) technology, and combines the many tens-of-kiloton fiducial mass necessary for rare event searches with the sub-centimeter spatial resolution required to image those events with high precision. The addition of a photon detection system enhances physics capabilities for all DUNE physics drivers and opens prospects for further physics explorations. Given its size, the far detector will be implemented as a set of modules, with LArTPC designs that differ from one another as newer technologies arise. In the vertical drift LArTPC design, a horizontal cathode bisects the detector, creating two stacked drift volumes in which ionization charges drift towards anodes at either the top or bottom. The anodes are composed of perforated PCB layers with conductive strips, enabling reconstruction in 3D. Light-trap-style photon detection modules are placed both on the cryostat's side walls and on the central cathode where they are optically powered. This Technical Design Report describes in detail the technical implementations of each subsystem of this LArTPC that, together with the other far detector modules and the near detector, will enable DUNE to achieve its physics goals.

Submitted 5 December, 2023; originally announced December 2023.

Comments: 425 pages; 281 figures. Central editing team: A. Heavey, S. Kettell, A. Marchionni, S. Palestini, S. Rajogopalan, R. J. Wilson

Report number: Fermilab Report no: TM-2813-LBNF
