subscribe to arXiv mailings

arXiv:2406.19740 [pdf, ps, other]

doi 10.3847/1538-4357/ad47a3

Spatial distribution of C4H and c-C3H2 in cold molecular cores

Authors: Yijia Liu, Junzhi Wang, Shu Liu, Ningyu Tang, Yan Gong, Yuqiang Li, Juan LI, Rui Luo, Yani Xu

Abstract: C$_4$H and $c$-C$_3$H$_2$, as unsaturated hydrocarbon molecules, are important for forming large organic molecules in the interstellar medium. We present mapping observations of C$_4$H ($N$=9$-8$) lines, $c$-C$_3$H$_2$ ($J_{Ka,Kb}$=2$_{1,2}$-1$_{0,1}$) %at 85338.894 MHz and H$^{13}$CO$^+$ ($J$=1$-0$) %at 86754.2884 MHz toward 19 nearby cold molecular cores in the Milky Way with the IRAM 30m telesc… ▽ More C$_4$H and $c$-C$_3$H$_2$, as unsaturated hydrocarbon molecules, are important for forming large organic molecules in the interstellar medium. We present mapping observations of C$_4$H ($N$=9$-8$) lines, $c$-C$_3$H$_2$ ($J_{Ka,Kb}$=2$_{1,2}$-1$_{0,1}$) %at 85338.894 MHz and H$^{13}$CO$^+$ ($J$=1$-0$) %at 86754.2884 MHz toward 19 nearby cold molecular cores in the Milky Way with the IRAM 30m telescope. C$_4$H 9--8 was detected in 13 sources, while $c$-C$_3$H$_2$ was detected in 18 sources. The widely existing C$_4$H and $c$-C$_3$H$_2$ molecules in cold cores provide material to form large organic molecules. Different spatial distributions between C$_4$H 9--8 and $c$-C$_3$H$_2$ 2--1 were found. The relative abundances of these three molecules were obtained under the assumption of local thermodynamic equilibrium conditions with a fixed excitation temperature. The abundance ratio of C$_4$H to $c$-C$_3$H$_2$ ranged from 0.34 $\pm$ 0.09 in G032.93+02 to 4.65 $\pm$ 0.50 in G008.67+22. A weak correlation between C$_4$H/H$^{13}$CO$^+$ and $c$-C$_3$H$_2$/H$^{13}$CO$^+$ abundance ratios was found, with a correlation coefficient of 0.46, which indicates that there is no tight astrochemical connection between C$_4$H and $c$-C$_3$H$_2$ molecules. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Comments: 17 pages, 2 figures

arXiv:2406.17559 [pdf, other]

Minimal Interaction Edge Tuning: A New Paradigm for Visual Adaptation

Authors: Ningyuan Tang, Minghao Fu, Jianxin Wu

Abstract: The rapid scaling of large vision pretrained models makes fine-tuning tasks more and more difficult on edge devices with low computational resources. We explore a new visual adaptation paradigm called edge tuning, which treats large pretrained models as standalone feature extractors that run on powerful cloud servers. The fine-tuning carries out on edge devices with small networks which require lo… ▽ More The rapid scaling of large vision pretrained models makes fine-tuning tasks more and more difficult on edge devices with low computational resources. We explore a new visual adaptation paradigm called edge tuning, which treats large pretrained models as standalone feature extractors that run on powerful cloud servers. The fine-tuning carries out on edge devices with small networks which require low computational resources. Existing methods that are potentially suitable for our edge tuning paradigm are discussed. But, three major drawbacks hinder their application in edge tuning: low adaptation capability, large adapter network, and high information transfer overhead. To address these issues, we propose Minimal Interaction Edge Tuning, or MIET, which reveals that the sum of intermediate features from pretrained models not only has minimal information transfer but also has high adaptation capability. With a lightweight attention-based adaptor network, MIET achieves information transfer efficiency, parameter efficiency, computational and memory efficiency, and at the same time demonstrates competitive results on various visual adaptation benchmarks. △ Less

Submitted 25 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

Comments: 9 pages

arXiv:2406.11131 [pdf, other]

Are Large Language Models a Good Replacement of Taxonomies?

Authors: Yushi Sun, Hao Xin, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen

Abstract: Large language models (LLMs) demonstrate an impressive ability to internalize knowledge and answer natural language questions. Although previous studies validate that LLMs perform well on general knowledge while presenting poor performance on long-tail nuanced knowledge, the community is still doubtful about whether the traditional knowledge graphs should be replaced by LLMs. In this paper, we ask… ▽ More Large language models (LLMs) demonstrate an impressive ability to internalize knowledge and answer natural language questions. Although previous studies validate that LLMs perform well on general knowledge while presenting poor performance on long-tail nuanced knowledge, the community is still doubtful about whether the traditional knowledge graphs should be replaced by LLMs. In this paper, we ask if the schema of knowledge graph (i.e., taxonomy) is made obsolete by LLMs. Intuitively, LLMs should perform well on common taxonomies and at taxonomy levels that are common to people. Unfortunately, there lacks a comprehensive benchmark that evaluates the LLMs over a wide range of taxonomies from common to specialized domains and at levels from root to leaf so that we can draw a confident conclusion. To narrow the research gap, we constructed a novel taxonomy hierarchical structure discovery benchmark named TaxoGlimpse to evaluate the performance of LLMs over taxonomies. TaxoGlimpse covers ten representative taxonomies from common to specialized domains with in-depth experiments of different levels of entities in this taxonomy from root to leaf. Our comprehensive experiments of eighteen state-of-the-art LLMs under three prompting settings validate that LLMs can still not well capture the knowledge of specialized taxonomies and leaf-level entities. △ Less

Submitted 20 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

Comments: Accepted by VLDB 2024

arXiv:2406.11033 [pdf, other]

HAIChart: Human and AI Paired Visualization System

Authors: Yupeng Xie, Yuyu Luo, Guoliang Li, Nan Tang

Abstract: The growing importance of data visualization in business intelligence and data science emphasizes the need for tools that can efficiently generate meaningful visualizations from large datasets. Existing tools fall into two main categories: human-powered tools (e.g., Tableau and PowerBI), which require intensive expert involvement, and AI-powered automated tools (e.g., Draco and Table2Charts), whic… ▽ More The growing importance of data visualization in business intelligence and data science emphasizes the need for tools that can efficiently generate meaningful visualizations from large datasets. Existing tools fall into two main categories: human-powered tools (e.g., Tableau and PowerBI), which require intensive expert involvement, and AI-powered automated tools (e.g., Draco and Table2Charts), which often fall short of guessing specific user needs. In this paper, we aim to achieve the best of both worlds. Our key idea is to initially auto-generate a set of high-quality visualizations to minimize manual effort, then refine this process iteratively with user feedback to more closely align with their needs. To this end, we present HAIChart, a reinforcement learning-based framework designed to iteratively recommend good visualizations for a given dataset by incorporating user feedback. Specifically, we propose a Monte Carlo Graph Search-based visualization generation algorithm paired with a composite reward function to efficiently explore the visualization space and automatically generate good visualizations. We devise a visualization hints mechanism to actively incorporate user feedback, thus progressively refining the visualization generation module. We further prove that the top-k visualization hints selection problem is NP-hard and design an efficient algorithm. We conduct both quantitative evaluations and user studies, showing that HAIChart significantly outperforms state-of-the-art human-powered tools (21% better at Recall and 1.8 times faster) and AI-powered automatic tools (25.1% and 14.9% better in terms of Hit@3 and R10@30, respectively). △ Less

Submitted 16 June, 2024; originally announced June 2024.

Comments: 16 pages, 14 figures, 7 tables

arXiv:2406.08278 [pdf, other]

doi 10.1088/1674-4527/ad5398

HiFAST : An HI Data Calibration and Imaging Pipeline for FAST II. Flux Density Calibration

Authors: Ziming Liu, Jie Wang, Yingjie Jing, Zhi-Yu Zhang, Chen Xu, Tiantian Liang, Qingze Chen, Ningyu Tang, Qingliang Yang

Abstract: Accurate flux density calibration is essential for precise analysis and interpretation of observations across different observation modes and instruments. In this research, we firstly introduce the flux calibration model incorporated in HIFAST pipeline, designed for processing HI 21-cm spectra. Furthermore, we investigate different calibration techniques and assess the dependence of the gain param… ▽ More Accurate flux density calibration is essential for precise analysis and interpretation of observations across different observation modes and instruments. In this research, we firstly introduce the flux calibration model incorporated in HIFAST pipeline, designed for processing HI 21-cm spectra. Furthermore, we investigate different calibration techniques and assess the dependence of the gain parameter on the time and environmental factors. A comparison is carried out in various observation modes (e.g. tracking and scanning modes) to determine the flux density gain ($G$), revealing insignificant discrepancies in $G$ among different methods. Long-term monitoring data shows a linear correlation between $G$ and atmospheric temperature. After subtracting the $G$--Temperature dependence, the dispersion of $G$ is reduced to $<$3% over a one-year time scale. The stability of the receiver response of FAST is considered sufficient to facilitate HI observations that can accommodate a moderate error in flux calibration (e.g., $>\sim5\%$) when utilizing a constant $G$ for calibration purposes. Our study will serve as a useful addition to the results provided by Jiang et al. (2020). Detailed measurement of $G$ for the 19 beams of FAST, covering the frequency range 1000 MHz -- 1500 MHz can be found on the HIFAST homepage: https://hifast.readthedocs.io/fluxgain. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 14 pages, 15 figures, accepted by RAA

arXiv:2406.07815 [pdf, other]

Are Large Language Models Good Statisticians?

Authors: Yizhang Zhu, Shiyin Du, Boyan Li, Yuyu Luo, Nan Tang

Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities across a range of scientific tasks including mathematics, physics, and chemistry. Despite their successes, the effectiveness of LLMs in handling complex statistical tasks remains systematically under-explored. To bridge this gap, we introduce StatQA, a new benchmark designed for statistical analysis tasks. StatQA comprises 11,6… ▽ More Large Language Models (LLMs) have demonstrated impressive capabilities across a range of scientific tasks including mathematics, physics, and chemistry. Despite their successes, the effectiveness of LLMs in handling complex statistical tasks remains systematically under-explored. To bridge this gap, we introduce StatQA, a new benchmark designed for statistical analysis tasks. StatQA comprises 11,623 examples tailored to evaluate LLMs' proficiency in specialized statistical tasks and their applicability assessment capabilities, particularly for hypothesis testing methods. We systematically experiment with representative LLMs using various prompting strategies and show that even state-of-the-art models such as GPT-4o achieve a best performance of only 64.83%, indicating significant room for improvement. Notably, while open-source LLMs (e.g. LLaMA-3) show limited capability, those fine-tuned ones exhibit marked improvements, outperforming all in-context learning-based methods (e.g. GPT-4o). Moreover, our comparative human experiments highlight a striking contrast in error types between LLMs and humans: LLMs primarily make applicability errors, whereas humans mostly make statistical task confusion errors. This divergence highlights distinct areas of proficiency and deficiency, suggesting that combining LLM and human expertise could lead to complementary strengths, inviting further investigation into their collaborative potential. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 31 pages, 10 figures,19 tables. Work in progress

arXiv:2406.04744 [pdf, other]

CRAG -- Comprehensive RAG Benchmark

Authors: Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar , et al. (2 additional authors not shown)

Abstract: Retrieval-Augmented Generation (RAG) has recently emerged as a promising solution to alleviate Large Language Model (LLM)'s deficiency in lack of knowledge. Existing RAG datasets, however, do not adequately represent the diverse and dynamic nature of real-world Question Answering (QA) tasks. To bridge this gap, we introduce the Comprehensive RAG Benchmark (CRAG), a factual question answering bench… ▽ More Retrieval-Augmented Generation (RAG) has recently emerged as a promising solution to alleviate Large Language Model (LLM)'s deficiency in lack of knowledge. Existing RAG datasets, however, do not adequately represent the diverse and dynamic nature of real-world Question Answering (QA) tasks. To bridge this gap, we introduce the Comprehensive RAG Benchmark (CRAG), a factual question answering benchmark of 4,409 question-answer pairs and mock APIs to simulate web and Knowledge Graph (KG) search. CRAG is designed to encapsulate a diverse array of questions across five domains and eight question categories, reflecting varied entity popularity from popular to long-tail, and temporal dynamisms ranging from years to seconds. Our evaluation on this benchmark highlights the gap to fully trustworthy QA. Whereas most advanced LLMs achieve <=34% accuracy on CRAG, adding RAG in a straightforward manner improves the accuracy only to 44%. State-of-the-art industry RAG solutions only answer 63% questions without any hallucination. CRAG also reveals much lower accuracy in answering questions regarding facts with higher dynamism, lower popularity, or higher complexity, suggesting future research directions. The CRAG benchmark laid the groundwork for a KDD Cup 2024 challenge, attracting thousands of participants and submissions within the first 50 days of the competition. We commit to maintaining CRAG to serve research communities in advancing RAG solutions and general QA solutions. △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.01265 [pdf, other]

The Dawn of Natural Language to SQL: Are We Fully Ready?

Authors: Boyan Li, Yuyu Luo, Chengliang Chai, Guoliang Li, Nan Tang

Abstract: Translating users' natural language questions into SQL queries (i.e., NL2SQL) significantly lowers the barriers to accessing relational databases. The emergence of Large Language Models has introduced a novel paradigm in NL2SQL tasks, enhancing capabilities dramatically. However, this raises a critical question: Are we fully prepared to deploy NL2SQL models in production? To address the posed qu… ▽ More Translating users' natural language questions into SQL queries (i.e., NL2SQL) significantly lowers the barriers to accessing relational databases. The emergence of Large Language Models has introduced a novel paradigm in NL2SQL tasks, enhancing capabilities dramatically. However, this raises a critical question: Are we fully prepared to deploy NL2SQL models in production? To address the posed questions, we present a multi-angle NL2SQL evaluation framework, NL2SQL360, to facilitate the design and test of new NL2SQL methods for researchers. Through NL2SQL360, we conduct a detailed comparison of leading NL2SQL methods across a range of application scenarios, such as different data domains and SQL characteristics, offering valuable insights for selecting the most appropriate NL2SQL methods for specific needs. Moreover, we explore the NL2SQL design space, leveraging NL2SQL360 to automate the identification of an optimal NL2SQL solution tailored to user-specific needs. Specifically, NL2SQL360 identifies an effective NL2SQL method, SuperSQL, distinguished under the Spdier dataset using the execution accuracy metric. Remarkably, SuperSQL achieves competitive performance with execution accuracy of 87% and 62.66% on the Spider and BIRD test sets, respectively. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 14 pages, 15 figures, 7 tables

arXiv:2405.18573 [pdf, other]

Programmer Visual Attention During Context-Aware Code Summarization

Authors: Aakash Bansal, Robert Wallace, Zachary Karas, Ningzhi Tang, Yu Huang, Toby Jia-Jun Li, Collin McMillan

Abstract: Abridged: Programmer attention represents the visual focus of programmers on parts of the source code in pursuit of programming tasks. We conducted an in-depth human study with XY Java programmers, where each programmer generated summaries for 40 methods from five large Java projects over five one-hour sessions. We used eye-tracking equipment to map the visual attention of programmers while they w… ▽ More Abridged: Programmer attention represents the visual focus of programmers on parts of the source code in pursuit of programming tasks. We conducted an in-depth human study with XY Java programmers, where each programmer generated summaries for 40 methods from five large Java projects over five one-hour sessions. We used eye-tracking equipment to map the visual attention of programmers while they wrote the summaries. We also rate the quality of each summary. We found eye-gaze patterns and metrics that define common behaviors between programmer attention during context-aware code summarization. Specifically, we found that programmers need to read significantly (p<0.01) fewer words and make significantly fewer revisits to words (p\textless0.03) as they summarize more methods during a session, while maintaining the quality of summaries. We also found that the amount of source code a participant looks at correlates with a higher quality summary, but this trend follows a bell-shaped curve, such that after a threshold reading more source code leads to a significant decrease (p<0.01) in the quality of summaries. We also gathered insight into the type of methods in the project that provide the most contextual information for code summarization based on programmer attention. Specifically, we observed that programmers spent a majority of their time looking at methods inside the same class as the target method to be summarized. Surprisingly, we found that programmers spent significantly less time looking at methods in the call graph of the target method. We discuss how our empirical observations may aid future studies towards modeling programmer attention and improving context-aware automatic source code summarization. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: 10 pages, 4 figures, 4 tables. this is a pre-print submitted to IEEE Transactions on Software Engineering for review

arXiv:2405.17039 [pdf, other]

BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation

Authors: Chengxing Jia, Pengyuan Wang, Ziniu Li, Yi-Chen Li, Zhilong Zhang, Nan Tang, Yang Yu

Abstract: Large language models (LLMs) have catalyzed a paradigm shift in natural language processing, yet their limited controllability poses a significant challenge for downstream applications. We aim to address this by drawing inspiration from the neural mechanisms of the human brain, specifically Broca's and Wernicke's areas, which are crucial for language generation and comprehension, respectively. In… ▽ More Large language models (LLMs) have catalyzed a paradigm shift in natural language processing, yet their limited controllability poses a significant challenge for downstream applications. We aim to address this by drawing inspiration from the neural mechanisms of the human brain, specifically Broca's and Wernicke's areas, which are crucial for language generation and comprehension, respectively. In particular, Broca's area receives cognitive decision signals from Wernicke's area, treating the language generation as an intricate decision-making process, which differs from the fully auto-regressive language generation of existing LLMs. In a similar vein, our proposed system, the BWArea model, conceptualizes language generation as a decision-making task. This model has three components: a language world model, an inverse dynamics model, and a cognitive policy. Like Wernicke's area, the inverse dynamics model is designed to deduce the underlying cognitive intentions, or latent actions, behind each token. The BWArea model is amenable to both pre-training and fine-tuning like existing LLMs. With 30B clean pre-training tokens, we have trained a BWArea model, which achieves competitive performance with LLMs of equal size (1B parameters). Unlike fully auto-regressive LLMs, its pre-training performance does not degenerate if dirty data unintentionally appears. This shows the advantage of a decomposed structure of BWArea model in reducing efforts in laborious data selection and labeling. Finally, we reveal that the BWArea model offers enhanced controllability via fine-tuning the cognitive policy with downstream reward metrics, thereby facilitating alignment with greater simplicity. On 9 out of 10 tasks from two suites, TextWorld and BigBench Hard, our method shows superior performance to auto-regressive LLMs. △ Less

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.16113 [pdf, other]

Enabling On-Device Learning via Experience Replay with Efficient Dataset Condensation

Authors: Gelei Xu, Ningzhi Tang, Jun Xia, Wei Jin, Yiyu Shi

Abstract: Upon deployment to edge devices, it is often desirable for a model to further learn from streaming data to improve accuracy. However, extracting representative features from such data is challenging because it is typically unlabeled, non-independent and identically distributed (non-i.i.d), and is seen only once. To mitigate this issue, a common strategy is to maintain a small data buffer on the ed… ▽ More Upon deployment to edge devices, it is often desirable for a model to further learn from streaming data to improve accuracy. However, extracting representative features from such data is challenging because it is typically unlabeled, non-independent and identically distributed (non-i.i.d), and is seen only once. To mitigate this issue, a common strategy is to maintain a small data buffer on the edge device to hold the most representative data for further learning. As most data is either never stored or quickly discarded, identifying the most representative data to avoid significant information loss becomes critical. In this paper, we propose an on-device framework that addresses this issue by condensing incoming data into more informative samples. Specifically, to effectively handle unlabeled incoming data, we propose a pseudo-labeling technique designed for unlabeled on-device learning environments. Additionally, we develop a dataset condensation technique that only requires little computation resources. To counteract the effects of noisy labels during the condensation process, we further utilize a contrastive learning objective to improve the purity of class data within the buffer. Our empirical results indicate substantial improvements over existing methods, particularly when buffer capacity is severely restricted. For instance, with a buffer capacity of just one sample per class, our method achieves an accuracy that outperforms the best existing baseline by 58.4% on the CIFAR-10 dataset. △ Less

Submitted 25 May, 2024; originally announced May 2024.

Comments: 9 pages, 10 figures

arXiv:2405.16081 [pdf, other]

A Study on Developer Behaviors for Validating and Repairing LLM-Generated Code Using Eye Tracking and IDE Actions

Authors: Ningzhi Tang, Meng Chen, Zheng Ning, Aakash Bansal, Yu Huang, Collin McMillan, Toby Jia-Jun Li

Abstract: The increasing use of large language model (LLM)-powered code generation tools, such as GitHub Copilot, is transforming software engineering practices. This paper investigates how developers validate and repair code generated by Copilot and examines the impact of code provenance awareness during these processes. We conducted a lab study with 28 participants, who were tasked with validating and rep… ▽ More The increasing use of large language model (LLM)-powered code generation tools, such as GitHub Copilot, is transforming software engineering practices. This paper investigates how developers validate and repair code generated by Copilot and examines the impact of code provenance awareness during these processes. We conducted a lab study with 28 participants, who were tasked with validating and repairing Copilot-generated code in three software projects. Participants were randomly divided into two groups: one informed about the provenance of LLM-generated code and the other not. We collected data on IDE interactions, eye-tracking, cognitive workload assessments, and conducted semi-structured interviews. Our results indicate that, without explicit information, developers often fail to identify the LLM origin of the code. Developers generally employ similar validation and repair strategies for LLM-generated code, but exhibit behaviors such as frequent switching between code and comments, different attentional focus, and a tendency to delete and rewrite code. Being aware of the code's provenance led to improved performance, increased search efforts, more frequent Copilot usage, and higher cognitive workload. These findings enhance our understanding of how developers interact with LLM-generated code and carry implications for designing tools that facilitate effective human-LLM collaboration in software development. △ Less

Submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.07001

Evaluating Task-based Effectiveness of MLLMs on Charts

Authors: Yifan Wu, Lutao Yan, Yuyu Luo, Yunhai Wang, Nan Tang

Abstract: In this paper, we explore a forward-thinking question: Is GPT-4V effective at low-level data analysis tasks on charts? To this end, we first curate a large-scale dataset, named ChartInsights, consisting of 89,388 quartets (chart, task, question, answer) and covering 10 widely-used low-level data analysis tasks on 7 chart types. Firstly, we conduct systematic evaluations to understand the capabilit… ▽ More In this paper, we explore a forward-thinking question: Is GPT-4V effective at low-level data analysis tasks on charts? To this end, we first curate a large-scale dataset, named ChartInsights, consisting of 89,388 quartets (chart, task, question, answer) and covering 10 widely-used low-level data analysis tasks on 7 chart types. Firstly, we conduct systematic evaluations to understand the capabilities and limitations of 18 advanced MLLMs, which include 12 open-source models and 6 closed-source models. Starting with a standard textual prompt approach, the average accuracy rate across the 18 MLLMs is 36.17%. Among all the models, GPT-4V achieves the highest accuracy, reaching 56.13%. To understand the limitations of multimodal large models in low-level data analysis tasks, we have designed various experiments to conduct an in-depth test of capabilities of GPT-4V. We further investigate how visual modifications to charts, such as altering visual elements (e.g. changing color schemes) and introducing perturbations (e.g. adding image noise), affect performance of GPT-4V. Secondly, we present 12 experimental findings. These findings suggest potential of GPT-4V to revolutionize interaction with charts and uncover the gap between human analytic needs and capabilities of GPT-4V. Thirdly, we propose a novel textual prompt strategy, named Chain-of-Charts, tailored for low-level analysis tasks, which boosts model performance by 24.36%, resulting in an accuracy of 80.49%. Furthermore, by incorporating a visual prompt strategy that directs attention of GPT-4V to question-relevant visual elements, we further improve accuracy to 83.83%. Our study not only sheds light on the capabilities and limitations of GPT-4V in low-level data analysis tasks but also offers valuable insights for future research. △ Less

Submitted 17 June, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

Comments: The experimental part needs to be revised. Withdraw this version

arXiv:2405.02055 [pdf, other]

doi 10.1051/0004-6361/202450067

The CO-dark molecular gas in the cold HI arc

Authors: Gan Luo, Di Li, Zhi-yu Zhang, Thomas G. Bisbas, Ningyu Tang, Lingrui Lin, Yichen Sun, Pei Zuo, Jing Zhou

Abstract: The CO-dark molecular gas (DMG), which refers to the molecular gas not traced by CO emission, is crucial for the evolution of the interstellar medium (ISM). While the gas properties of DMG have been widely explored in the Solar neighborhood, whether or not they are similar in the outer disk regions of the Milky Way is still not well understood. In this Letter, we confirm the existence of DMG towar… ▽ More The CO-dark molecular gas (DMG), which refers to the molecular gas not traced by CO emission, is crucial for the evolution of the interstellar medium (ISM). While the gas properties of DMG have been widely explored in the Solar neighborhood, whether or not they are similar in the outer disk regions of the Milky Way is still not well understood. In this Letter, we confirm the existence of DMG toward a cold HI arc structure at 13 kpc away from the Galactic center with both OH emission and HI narrow self-absorption (HINSA). This is the first detection of HINSA in the outer disk region, in which the HINSA fraction ($N_{\rm HINSA}$/$N_{\rm H_2}$ = 0.022$\pm$0.011) is an order of magnitude higher than the average value observed in nearby evolved dark clouds, but is consistent with that of the early evolutionary stage of dark clouds. The inferred H$_2$ column density from both extinction and OH emission ($N_{\rm H_2} \approx 10^{20}$ cm$^{-2}$) is an order of magnitude higher than previously estimated. Although the ISM environmental parameters are expected to be different between the outer Galactic disk regions and the Solar neighborhood, we find that the visual extinction ($A_{\rm V}$ = 0.19$\pm$0.03 mag), H$_2$-gas density ($n_{\rm H_2} = 91\pm46$ cm$^{-3}$), and molecular fraction (58\%$\pm$28\%) of the DMG are rather similar to those of nearby diffuse molecular clouds. The existence of DMG associated with the expanding HI supershell supports a scenario where the expansion of supershells may trigger the formation of molecular clouds within a crossing timescale of the shock wave ($\sim$10$^6$ yr). △ Less

Submitted 3 May, 2024; originally announced May 2024.

Comments: 9 pages, 5 figures, accepted by A&A Letter

Journal ref: A&A 685, L12 (2024)

arXiv:2404.18545 [pdf, ps, other]

Implication of odd-even staggering in the charge radii of calcium isotopes

Authors: Rong An, Xiang Jiang, Na Tang, Li-Gang Cao, Feng-Shou Zhang

Abstract: Inspired by the evidently observed odd-even staggering and the inverted parabolic-like shape of charge radii along calcium isotopic chain, the ground state properties of calcium isotopes are investigated by constraining the root-mean-square (rms) charge radii under the covariant energy density functionals with effective forces NL3 and PK1. In this work, the pairing correlations are tackled by solv… ▽ More Inspired by the evidently observed odd-even staggering and the inverted parabolic-like shape of charge radii along calcium isotopic chain, the ground state properties of calcium isotopes are investigated by constraining the root-mean-square (rms) charge radii under the covariant energy density functionals with effective forces NL3 and PK1. In this work, the pairing correlations are tackled by solving the state-dependent Bardeen-Cooper-Schrieffer equations. The calculated results suggest that the binding energies obtained by constraint method have been reduced less than $0.1\%$. But for charge radii, the corresponding results deriving from NL3 and PK1 forces have been increased by about $1.0\%$ and $2.0\%$, respectively. This means that charge radius is a more sensitive quantity in the calibrated protocol. Meanwhile, it is found that the reproduced charge radii of calcium isotopes is attributed to the rather strong isospin dependence of effective potential. The odd-even oscillation behavior can also be presented in the neutron skin thickness and proton Fermi energy along calcium isotopic chain, but keep opposite trends with respect to the corresponding binding energy and charge radius. As encountered in charge radii, the weakening odd-even oscillation behavior is still emerged from the proton Fermi energies at the neutron numbers $N=20$ and $28$ as well, but not in binding energy and neutron skin thickness. △ Less

Submitted 29 April, 2024; originally announced April 2024.

Comments: 9 pages, 5 figures

arXiv:2404.09248 [pdf, other]

Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts

Authors: Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu

Abstract: Reinforcement learning (RL) trains agents to accomplish complex tasks through environmental interaction data, but its capacity is also limited by the scope of the available data. To obtain a knowledgeable agent, a promising approach is to leverage the knowledge from large language models (LLMs). Despite previous studies combining LLMs with RL, seamless integration of the two components remains cha… ▽ More Reinforcement learning (RL) trains agents to accomplish complex tasks through environmental interaction data, but its capacity is also limited by the scope of the available data. To obtain a knowledgeable agent, a promising approach is to leverage the knowledge from large language models (LLMs). Despite previous studies combining LLMs with RL, seamless integration of the two components remains challenging due to their semantic gap. This paper introduces a novel method, Knowledgeable Agents from Language Model Rollouts (KALM), which extracts knowledge from LLMs in the form of imaginary rollouts that can be easily learned by the agent through offline reinforcement learning methods. The primary challenge of KALM lies in LLM grounding, as LLMs are inherently limited to textual data, whereas environmental data often comprise numerical vectors unseen to LLMs. To address this, KALM fine-tunes the LLM to perform various tasks based on environmental data, including bidirectional translation between natural language descriptions of skills and their corresponding rollout data. This grounding process enhances the LLM's comprehension of environmental dynamics, enabling it to generate diverse and meaningful imaginary rollouts that reflect novel skills. Initial empirical evaluations on the CLEVR-Robot environment demonstrate that KALM enables agents to complete complex rephrasings of task goals and extend their capabilities to novel tasks requiring unprecedented optimal behaviors. KALM achieves a success rate of 46% in executing tasks with unseen goals, substantially surpassing the 26% success rate achieved by baseline methods. Furthermore, KALM effectively enables the LLM to comprehend environmental dynamics, resulting in the generation of meaningful imaginary rollouts that reflect novel skills and demonstrate the seamless integration of large language models and reinforcement learning. △ Less

Submitted 14 April, 2024; originally announced April 2024.

arXiv:2403.17285 [pdf, other]

An Analysis of Switchback Designs in Reinforcement Learning

Authors: Qianglin Wen, Chengchun Shi, Ying Yang, Niansheng Tang, Hongtu Zhu

Abstract: This paper offers a detailed investigation of switchback designs in A/B testing, which alternate between baseline and new policies over time. Our aim is to thoroughly evaluate the effects of these designs on the accuracy of their resulting average treatment effect (ATE) estimators. We propose a novel "weak signal analysis" framework, which substantially simplifies the calculations of the mean squa… ▽ More This paper offers a detailed investigation of switchback designs in A/B testing, which alternate between baseline and new policies over time. Our aim is to thoroughly evaluate the effects of these designs on the accuracy of their resulting average treatment effect (ATE) estimators. We propose a novel "weak signal analysis" framework, which substantially simplifies the calculations of the mean squared errors (MSEs) of these ATEs in Markov decision process environments. Our findings suggest that (i) when the majority of reward errors are positively correlated, the switchback design is more efficient than the alternating-day design which switches policies in a daily basis. Additionally, increasing the frequency of policy switches tends to reduce the MSE of the ATE estimator. (ii) When the errors are uncorrelated, however, all these designs become asymptotically equivalent. (iii) In cases where the majority of errors are negative correlated, the alternating-day design becomes the optimal choice. These insights are crucial, offering guidelines for practitioners on designing experiments in A/B testing. Our analysis accommodates a variety of policy value estimators, including model-based estimators, least squares temporal difference learning estimators, and double reinforcement learning estimators, thereby offering a comprehensive understanding of optimal design strategies for policy evaluation in reinforcement learning. △ Less

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2402.04009 [pdf, other]

Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuning

Authors: Ningyuan Tang, Minghao Fu, Ke Zhu, Jianxin Wu

Abstract: In finetuning a large pretrained model to downstream tasks, parameter-efficient fine-tuning (PEFT) methods can effectively finetune pretrained models with few trainable parameters, but suffer from high GPU memory consumption and slow training speed. Because learnable parameters from these methods are entangled with the pretrained model, gradients related to the frozen pretrained model's parameters… ▽ More In finetuning a large pretrained model to downstream tasks, parameter-efficient fine-tuning (PEFT) methods can effectively finetune pretrained models with few trainable parameters, but suffer from high GPU memory consumption and slow training speed. Because learnable parameters from these methods are entangled with the pretrained model, gradients related to the frozen pretrained model's parameters have to be computed and stored during finetuning. We propose Low-rank Attention Side-Tuning (LAST), which disentangles the trainable module from the pretrained model by freezing not only parameters but also outputs of the pretrained network. LAST trains a side-network composed of only low-rank self-attention modules. By viewing the pretrained model as a frozen feature extractor, the side-network takes intermediate output from the pretrained model and focus on learning task-specific knowledge. We also show that LAST can be highly parallel across multiple optimization objectives, making it very efficient in downstream task adaptation, for example, in finding optimal hyperparameters. LAST outperforms previous state-of-the-art methods on VTAB-1K and other visual adaptation tasks with roughly only 30\% of GPU memory footprint and 60\% of training time compared to existing PEFT methods, but achieves significantly higher accuracy. △ Less

Submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.03719 [pdf, other]

Empowering Language Models with Active Inquiry for Deeper Understanding

Authors: Jing-Cheng Pang, Heng-Bo Fan, Pengyuan Wang, Jia-Hao Xiao, Nan Tang, Si-Hang Yang, Chengxing Jia, Sheng-Jun Huang, Yang Yu

Abstract: The rise of large language models (LLMs) has revolutionized the way that we interact with artificial intelligence systems through natural language. However, LLMs often misinterpret user queries because of their uncertain intention, leading to less helpful responses. In natural human interactions, clarification is sought through targeted questioning to uncover obscure information. Thus, in this pap… ▽ More The rise of large language models (LLMs) has revolutionized the way that we interact with artificial intelligence systems through natural language. However, LLMs often misinterpret user queries because of their uncertain intention, leading to less helpful responses. In natural human interactions, clarification is sought through targeted questioning to uncover obscure information. Thus, in this paper, we introduce LaMAI (Language Model with Active Inquiry), designed to endow LLMs with this same level of interactive engagement. LaMAI leverages active learning techniques to raise the most informative questions, fostering a dynamic bidirectional dialogue. This approach not only narrows the contextual gap but also refines the output of the LLMs, aligning it more closely with user expectations. Our empirical studies, across a variety of complex datasets where LLMs have limited conversational context, demonstrate the effectiveness of LaMAI. The method improves answer accuracy from 31.9% to 50.9%, outperforming other leading question-answering frameworks. Moreover, in scenarios involving human participants, LaMAI consistently generates responses that are superior or comparable to baseline methods in more than 82% of the cases. The applicability of LaMAI is further evidenced by its successful integration with various LLMs, highlighting its potential for the future of interactive language models. △ Less

Submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.00630 [pdf, other]

doi 10.1103/PhysRevB.109.205152

Tensile and compressive strain tuning of a Kondo lattice

Authors: Soumendra Nath Panja, Anton Jesche, Nan Tang, Philipp Gegenwart

Abstract: We present electrical resistivity measurements on the prototypical heavy-fermion metal YbRh$_{2}$Si$_{2}$ (YRS) under $a$-axis tensile and compressive strain and focus on the evolution of the resistivity maximum near 136~K that arises from the interplay of the Kondo effect and the crystal electric field (CEF) splitting. While compressive strain reduces $T_{\rm max}$, similar as previously reported… ▽ More We present electrical resistivity measurements on the prototypical heavy-fermion metal YbRh$_{2}$Si$_{2}$ (YRS) under $a$-axis tensile and compressive strain and focus on the evolution of the resistivity maximum near 136~K that arises from the interplay of the Kondo effect and the crystal electric field (CEF) splitting. While compressive strain reduces $T_{\rm max}$, similar as previously reported for hydrostatic pressure, $T_{\rm max}$ is enhanced up to 145~K for 0.13\% tensile strain. Model calculations for the strain effect on CEF splitting in YRS reveal a negligible shift of the levels. Instead, the enhancement of the resistivity maximum indicates a 20\% increase of the Kondo temperature. This opens the perspective to access the hidden zero-field QCP in pure YRS. △ Less

Submitted 10 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Journal ref: Phys. Rev. B 109, 205152 (2024)

arXiv:2401.17364 [pdf, other]

doi 10.1007/s11433-023-2333-8

HiFAST: an HI data calibration and imaging pipeline for FAST

Authors: Yingjie Jing, Jie Wang, Chen Xu, Ziming Liu, Qingze Chen, Tiantian Liang, Jinlong Xu, Yixian Cao, Jing Wang, Huijie Hu, Chuan-Peng Zhang, Qi Guo, Liang Gao, Mei Ai, Hengqian Gan, Xuyang Gao, Jinlin Han, Ligang Hou, Zhipeng Hou, Peng Jiang, Xu Kong, Fujia Li, Zerui Liu, Li Shao, Hengxing Pan , et al. (8 additional authors not shown)

Abstract: The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has the largest aperture and a 19-beam L-band receiver, making it powerful for investigating the neutral hydrogen atomic gas (HI) in the universe. We present HiFAST (https://hifast.readthedocs.io), a dedicated, modular, and self-contained calibration and imaging pipeline for processing the HI data of FAST. The pipeline consists of fr… ▽ More The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has the largest aperture and a 19-beam L-band receiver, making it powerful for investigating the neutral hydrogen atomic gas (HI) in the universe. We present HiFAST (https://hifast.readthedocs.io), a dedicated, modular, and self-contained calibration and imaging pipeline for processing the HI data of FAST. The pipeline consists of frequency-dependent noise diode calibration, baseline fitting, standing wave removal using an FFT-based method, flux density calibration, stray radiation correction, and gridding to produce data cubes. These modules can be combined as needed to process the data from most FAST observation modes: tracking, drift scanning, On-The-Fly mapping, and most of their variants. With HiFAST, the RMS noises of the calibrated spectra from all 19 beams were only slightly (~ 5%) higher than the theoretical expectation. The results for the extended source M33 and the point sources are consistent with the results from Arecibo. The moment maps (0,1 and 2) of M33 agree well with the results from the Arecibo Galaxy Environment Survey (AGES) with a fractional difference of less than 10%. For a common sample of 221 sources with signal-to-noise ratio S/N >10 from the Arecibo Legacy Fast ALFA (ALFALFA) survey, the mean value of fractional difference in the integrated flux density, $S_{\mathrm{int}}$, between the two datasets is approximately 0.005 %, with a dispersion of 15.4%. Further checks on the integrated flux density of 23 sources with seven observations indicate that the variance in the flux density of the source with luminous objects ($S_\mathrm{int}$ $ > 2.5$ Jy km s$^{-1}$) is less than 5%. Our tests suggest that the FAST telescope, with the efficient, precise, and user-friendly pipeline HiFAST, will yield numerous significant scientific findings in the investigation of the HI in the universe. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: Accepted by SCPMA. 21 pages, 14 figures. The pipeline is accessible at https://hifast.readthedocs.io

arXiv:2312.04912 [pdf, ps, other]

doi 10.1103/PhysRevC.109.064302

Improved description of nuclear charge radii: Global trends beyond $N=28$ shell closure

Authors: Rong An, Xiang Jiang, Na Tang, Li-Gang Cao, Feng-Shou Zhang

Abstract: Charge radii measured with high accuracy provide a stringent benchmark for characterizing nuclear structure phenomena. In this work, the systematic evolution of charge radii for nuclei with $Z=19$-$29$ is investigated through relativistic mean field theory with effective forces NL3, PK1, and NL3$^{*}$. The neutron-proton ($np$) correlation around Fermi surface originated from the unpaired neutron… ▽ More Charge radii measured with high accuracy provide a stringent benchmark for characterizing nuclear structure phenomena. In this work, the systematic evolution of charge radii for nuclei with $Z=19$-$29$ is investigated through relativistic mean field theory with effective forces NL3, PK1, and NL3$^{*}$. The neutron-proton ($np$) correlation around Fermi surface originated from the unpaired neutron and proton has been taken into account tentatively in order to reduce the overestimated odd-even staggering of charge radii. This improved method can give an available description of charge radii across $N=28$ shell closure. A remarkable observation is that the charge radii beyond $N=28$ shell closure follow the similarly steep increasing trend, namely irrespective of the number of protons in the nucleus. Especially, the latest results of charge radii for nickel and copper isotopes can be reproduced remarkably well. Along $N=28$ isotonic chain, the sudden increase of charge radii is weakened across $Z=20$, but presented evidently across $Z=28$ closed shell. The abrupt changes of charge radii across $Z=22$ are also shown along $N=32$ and $34$ isotones, but the latter with a less slope. This seems to provide a sensitive indicator to identify the new magicity of a nucleus with universal trend of charge radii. △ Less

Submitted 4 June, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

Comments: 13 pages, 4 figures,

Journal ref: Published in Phys. Rev. C 109, 064302 (2024)

arXiv:2312.03987 [pdf, other]

Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration

Authors: Meihao Fan, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, Xiaoyong Du

Abstract: Entity resolution (ER) is an important data integration task with a wide spectrum of applications. The state-of-the-art solutions on ER rely on pre-trained language models (PLMs), which require fine-tuning on a lot of labeled matching/non-matching entity pairs. Recently, large languages models (LLMs), such as GPT-4, have shown the ability to perform many tasks without tuning model parameters, whic… ▽ More Entity resolution (ER) is an important data integration task with a wide spectrum of applications. The state-of-the-art solutions on ER rely on pre-trained language models (PLMs), which require fine-tuning on a lot of labeled matching/non-matching entity pairs. Recently, large languages models (LLMs), such as GPT-4, have shown the ability to perform many tasks without tuning model parameters, which is known as in-context learning (ICL) that facilitates effective learning from a few labeled input context demonstrations. However, existing ICL approaches to ER typically necessitate providing a task description and a set of demonstrations for each entity pair and thus have limitations on the monetary cost of interfacing LLMs. To address the problem, in this paper, we provide a comprehensive study to investigate how to develop a cost-effective batch prompting approach to ER. We introduce a framework BATCHER consisting of demonstration selection and question batching and explore different design choices that support batch prompting for ER. We also devise a covering-based demonstration selection strategy that achieves an effective balance between matching accuracy and monetary cost. We conduct a thorough evaluation to explore the design space and evaluate our proposed strategies. Through extensive experiments, we find that batch prompting is very cost-effective for ER, compared with not only PLM-based methods fine-tuned with extensive labeled data but also LLM-based methods with manually designed prompting. We also provide guidance for selecting appropriate design choices for batch prompting. △ Less

Submitted 6 December, 2023; originally announced December 2023.

Comments: 14 pages, 7 figures

arXiv:2311.05469 [pdf]

doi 10.1002/adma.202300416

Skyrmion-Excited Spin Wave Fractal Network

Authors: Nan Tang, W. L. N. C. Liyanage, Sergio A. Montoya, Sheena Patel, Lizabeth J. Quigley, Alexander J. Grutter, Michael R. Fitzsimmons, Sunil Sinha, Julie A. Borchers, Eric E. Fullerton, Lisa DeBeer-Schmitt, Dustin A. Gilbert

Abstract: Magnetic skyrmions exhibit unique, technologically relevant pseudo-particle behaviors which arise from their topological protection, including well-defined, three-dimensional dynamic modes that occur at microwave frequencies. During dynamic excitation, spin waves are ejected into the interstitial regions between skyrmions, creating the magnetic equivalent of a turbulent sea. However, since the spi… ▽ More Magnetic skyrmions exhibit unique, technologically relevant pseudo-particle behaviors which arise from their topological protection, including well-defined, three-dimensional dynamic modes that occur at microwave frequencies. During dynamic excitation, spin waves are ejected into the interstitial regions between skyrmions, creating the magnetic equivalent of a turbulent sea. However, since the spin waves in these systems have a well-defined length scale, and the skyrmions are on an ordered lattice, ordered structures from spin wave interference can precipitate from the chaos. This work uses small angle neutron scattering (SANS) to capture the dynamics in hybrid skyrmions and investigate the spin wave structure. Performing simultaneous ferromagnetic resonance and SANS, the diffraction pattern shows a large increase in low-angle scattering intensity which is present only in the resonance condition. This scattering pattern is best fit using a mass fractal model, which suggests the spin waves form a long-range fractal network. The fractal structure is constructed of fundamental units with a size that encodes the spin wave emissions and are constrained by the skyrmion lattice. These results offer critical insights into the nanoscale dynamics of skyrmions, identify a new dynamic spin wave fractal structure, and demonstrates SANS as a unique tool to probe high-speed dynamics. △ Less

Submitted 9 November, 2023; originally announced November 2023.

Journal ref: Advanced Materials, 2300416 (2023)

arXiv:2310.00749 [pdf, other]

SEED: Domain-Specific Data Curation With Large Language Models

Authors: Zui Chen, Lei Cao, Sam Madden, Tim Kraska, Zeyuan Shang, Ju Fan, Nan Tang, Zihui Gu, Chunwei Liu, Michael Cafarella

Abstract: Data curation tasks that prepare data for analytics are critical for turning data into actionable insights. However, due to the diverse requirements of applications in different domains, generic off-the-shelf tools are typically insufficient. As a result, data scientists often have to develop domain-specific solutions tailored to both the dataset and the task, e.g. writing domain-specific code or… ▽ More Data curation tasks that prepare data for analytics are critical for turning data into actionable insights. However, due to the diverse requirements of applications in different domains, generic off-the-shelf tools are typically insufficient. As a result, data scientists often have to develop domain-specific solutions tailored to both the dataset and the task, e.g. writing domain-specific code or training machine learning models on a sufficient number of annotated examples. This process is notoriously difficult and time-consuming. We present SEED, an LLM-as-compiler approach that automatically generates domain-specific data curation solutions via Large Language Models (LLMs). Once the user describes a task, input data, and expected output, the SEED compiler produces a hybrid pipeline that combines LLM querying with more cost-effective alternatives, such as vector-based caching, LLM-generated code, and small models trained on LLM-annotated data. SEED features an optimizer that automatically selects from the four LLM-assisted modules and forms a hybrid execution pipeline that best fits the task at hand. To validate this new, revolutionary approach, we conducted experiments on $9$ datasets spanning over $5$ data curation tasks. In comparison to solutions that use the LLM on every data record, SEED achieves state-of-the-art or comparable few-shot performance, while significantly reducing the number of LLM calls. △ Less

Submitted 24 April, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

Comments: preprint, 20 pages, 4 figures

arXiv:2309.09544 [pdf, other]

doi 10.1093/mnras/stad2507

Opacities of dense gas tracers in galactic massive star-forming regions

Authors: Shu Liu, Junzhi Wang, Fei Li, Jingwen Wu, Zhi-Yu Zhang, Di Li, Ningyu Tang, Pei Zuo

Abstract: Optical depths of dense molecular gas are commonly used in Galactic and extragalactic studies to constrain the dense gas mass of the clouds or galaxies. The optical depths are often obtained based on spatially unresolved data, especially in galaxies, which may affect the reliability of such measurements. We examine such effects in spatially resolved Galactic massive star-forming regions. Using the… ▽ More Optical depths of dense molecular gas are commonly used in Galactic and extragalactic studies to constrain the dense gas mass of the clouds or galaxies. The optical depths are often obtained based on spatially unresolved data, especially in galaxies, which may affect the reliability of such measurements. We examine such effects in spatially resolved Galactic massive star-forming regions. Using the 10-m SMT telescope, we mapped HCN and H13CN 3-2, HCO+, and H13CO+ 3-2 towards 51 Galactic massive star-forming regions, 30 of which resulted in robust determination of spatially resolved optical depths. Conspicuous spatial variations of optical depths have been detected within each source. We first obtained opacities for each position and calculated an optical-thick line intensity-weighted average, then averaged all the spectra and derived a single opacity for each region. The two were found to agree extremely well, with a linear least square correlation coefficient of 0.997 for the whole sample. △ Less

Submitted 18 September, 2023; originally announced September 2023.

Comments: 41 pages, 33 figures, 5 tables, publication in MNRAS

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 525, Issue 3, November 2023, Pages 4761-4800

arXiv:2309.01890 [pdf, ps, other]

FAST discovery of a fast neutral hydrogen outflow

Authors: Renzhi Su, Minfeng Gu, S. J. Curran, Elizabeth K. Mahony, Ningyu Tang, James R. Allison, Di Li, Ming Zhu, J. N. H. S. Aditya, Hyein Yoon, Zheng Zheng, Zhongzu Wu

Abstract: In this letter, we report the discovery of a fast neutral hydrogen outflow in SDSS J145239.38+062738.0, a merging radio galaxy containing an optical type I active galactic nuclei (AGN). This discovery was made through observations conducted by the Five-hundred-meter Aperture Spherical radio Telescope (FAST) using redshifted 21-cm absorption. The outflow exhibits a blueshifted velocity likely up to… ▽ More In this letter, we report the discovery of a fast neutral hydrogen outflow in SDSS J145239.38+062738.0, a merging radio galaxy containing an optical type I active galactic nuclei (AGN). This discovery was made through observations conducted by the Five-hundred-meter Aperture Spherical radio Telescope (FAST) using redshifted 21-cm absorption. The outflow exhibits a blueshifted velocity likely up to $\sim-1000\,\rm km\,s^{-1}$ with respect to the systemic velocity of the host galaxy with an absorption strength of $\sim -0.6\,\rm mJy\,beam^{-1}$ corresponding to an optical depth of 0.002 at $v=-500\,\rm km\,s^{-1}$. The mass outflow rate ranges between $2.8\times10^{-2}$ and $3.6\, \rm M_\odot \, yr^{-1}$, implying an energy outflow rate ranging between $4.2\times10^{39}$ and $9.7\times10^{40}\rm\,erg\,s^{-1}$, assuming 100 K $<T_{\rm s}<$ 1000 K. Plausible drivers of the outflow include the star bursts, the AGN radiation, and the radio jet, the last of which is considered the most likely culprit according to the kinematics. By analysing the properties of the outflow, the AGN, and the jet, we find that if the HI outflow is driven by the AGN radiation, the AGN radiation seems not powerful enough to provide negative feedback whereas the radio jet shows the potential to provide negative feedback. Our observations contribute another example of a fast outflow detected in neutral hydrogen, as well as demonstrate the capability of FAST in detecting such outflows. △ Less

Submitted 4 September, 2023; originally announced September 2023.

Comments: Accepted by ApJL

arXiv:2307.02796 [pdf, other]

VerifAI: Verified Generative AI

Authors: Nan Tang, Chenyu Yang, Ju Fan, Lei Cao, Yuyu Luo, Alon Halevy

Abstract: Generative AI has made significant strides, yet concerns about the accuracy and reliability of its outputs continue to grow. Such inaccuracies can have serious consequences such as inaccurate decision-making, the spread of false information, privacy violations, legal liabilities, and more. Although efforts to address these risks are underway, including explainable AI and responsible AI practices s… ▽ More Generative AI has made significant strides, yet concerns about the accuracy and reliability of its outputs continue to grow. Such inaccuracies can have serious consequences such as inaccurate decision-making, the spread of false information, privacy violations, legal liabilities, and more. Although efforts to address these risks are underway, including explainable AI and responsible AI practices such as transparency, privacy protection, bias mitigation, and social and environmental responsibility, misinformation caused by generative AI will remain a significant challenge. We propose that verifying the outputs of generative AI from a data management perspective is an emerging issue for generative AI. This involves analyzing the underlying data from multi-modal data lakes, including text files, tables, and knowledge graphs, and assessing its quality and consistency. By doing so, we can establish a stronger foundation for evaluating the outputs of generative AI models. Such an approach can ensure the correctness of generative AI, promote transparency, and enable decision-making with greater confidence. Our vision is to promote the development of verifiable generative AI and contribute to a more trustworthy and responsible use of AI. △ Less

Submitted 10 October, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

Comments: 8 pages, 4 figures

arXiv:2306.11231 [pdf, other]

Deep HI Mapping of Stephan's Quintet and Its Neighborhood

Authors: Cheng Cheng, Cong Kevin Xu, P. N. Appleton, P. -A. Duc, N. -Y. Tang, Y. S. Dai, J. -S. Huang, U. Lisenfeld, F. Renaud, Chuan He, Hai-Cheng Feng

Abstract: We carried out deep mapping observations of the atomic hydrogen (HI) 21 cm line emission in a field centered on the famous galaxy group Stephan's Quintet (SQ), using the Five-hundred-meter Aperture Spherical Telescope (FAST) equipped with the 19-Beam Receiver. The final data cube reaches an HI column density sensitivity of $5 σ= 2.1\times 10^{17}$ cm$^{-2}$ per 20 km s$^{-1}$ channel with an angul… ▽ More We carried out deep mapping observations of the atomic hydrogen (HI) 21 cm line emission in a field centered on the famous galaxy group Stephan's Quintet (SQ), using the Five-hundred-meter Aperture Spherical Telescope (FAST) equipped with the 19-Beam Receiver. The final data cube reaches an HI column density sensitivity of $5 σ= 2.1\times 10^{17}$ cm$^{-2}$ per 20 km s$^{-1}$ channel with an angular resolution of $4'.0$. The discovery of a large diffuse feature of the HI emission in the outskirt of the intragroup medium of SQ was reported in a previous paper (Xu et al. 2022). Here we present a new study of the total HI emission of SQ and the detection of several neighboring galaxies, exploiting the high sensitivity and the large sky coverage of the FAST observations. A total HI mass of $M_{\rm HI} = 3.48 \pm 0.35 \times 10^{10}\; M_\odot$ is found for SQ, which is significantly higher than previous measurements in the literature. This indicates that, contrary to earlier claims, SQ is not HI deficient. The excessive HI gas is mainly found in the velocity ranges of 6200 - 6400 km s$^{-1}$ and 6800 - 7000 km s$^{-1}$, which was undetected in previous observations that are less sensitive than ours. Our results suggest that the ``missing HI" in compact groups may be hidden in the low-density diffuse neutral gas instead of in the ionized gas. △ Less

Submitted 19 June, 2023; originally announced June 2023.

Comments: 20 pages, 5 figures, Accepted by ApJ

arXiv:2306.08891 [pdf, other]

Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation

Authors: Zihui Gu, Ju Fan, Nan Tang, Songyue Zhang, Yuxin Zhang, Zui Chen, Lei Cao, Guoliang Li, Sam Madden, Xiaoyong Du

Abstract: Zero-shot NL2SQL is crucial in achieving natural language to SQL that is adaptive to new environments (e.g., new databases, new linguistic phenomena or SQL structures) with zero annotated NL2SQL samples from such environments. Existing approaches either fine-tune pre-trained language models (PLMs) based on annotated data or use prompts to guide fixed large language models (LLMs) such as ChatGPT. P… ▽ More Zero-shot NL2SQL is crucial in achieving natural language to SQL that is adaptive to new environments (e.g., new databases, new linguistic phenomena or SQL structures) with zero annotated NL2SQL samples from such environments. Existing approaches either fine-tune pre-trained language models (PLMs) based on annotated data or use prompts to guide fixed large language models (LLMs) such as ChatGPT. PLMs can perform well in schema alignment but struggle to achieve complex reasoning, while LLMs is superior in complex reasoning tasks but cannot achieve precise schema alignment. In this paper, we propose a ZeroNL2SQL framework that combines the complementary advantages of PLMs and LLMs for supporting zero-shot NL2SQL. ZeroNL2SQL first uses PLMs to generate an SQL sketch via schema alignment, then uses LLMs to fill the missing information via complex reasoning. Moreover, in order to better align the generated SQL queries with values in the given database instances, we design a predicate calibration method to guide the LLM in completing the SQL sketches based on the database instances and select the optimal SQL query via an execution-based strategy. Comprehensive experiments show that ZeroNL2SQL can achieve the best zero-shot NL2SQL performance on real-world benchmarks. Specifically, ZeroNL2SQL outperforms the state-of-the-art PLM-based methods by 3.2% to 13% and exceeds LLM-based methods by 10% to 20% on execution accuracy. △ Less

Submitted 15 June, 2023; originally announced June 2023.

Comments: Working in progress

arXiv:2304.13400 [pdf]

doi 10.1021/acs.nanolett.3c03085

Observation of Fluctuation Spin Hall Effect in Antiferromagnet

Authors: Chi Fang, Caihua Wan, Xiaoyue Zhang, Satoshi Okamoto, Tianyi Ma, Jianying Qin, Xiao Wang, Chenyang Guo, Jing Dong, Guoqiang Yu, Zhenchao Wen, Ning Tang, Stuart S. P. Parkin, Naoto Nagaosa, Yuan Lu, Xiufeng Han

Abstract: The spin Hall effect (SHE) can generate a pure spin current by an electric current, which is promisingly used to electrically control magnetization. To reduce power consumption of this control, a giant spin Hall angle (SHA) in the SHE is desired in low-resistivity systems for practical applications. Here, critical spin fluctuation near the antiferromagnetic (AFM) phase-transition is proved as an e… ▽ More The spin Hall effect (SHE) can generate a pure spin current by an electric current, which is promisingly used to electrically control magnetization. To reduce power consumption of this control, a giant spin Hall angle (SHA) in the SHE is desired in low-resistivity systems for practical applications. Here, critical spin fluctuation near the antiferromagnetic (AFM) phase-transition is proved as an effective mechanism to create an additional part of SHE, named as fluctuation spin Hall effect (FSHE). This FSHE enhances the SHA due to the AFM spin fluctuation between conduction electrons and local spins. We detect the FSHE with the inverse and direct spin Hall effect (ISHE and DSHE) set-up and their temperature (T) dependences in the Cr/MgO/Fe magnetic tunnel junctions (MTJs). The SHA is significantly enhanced when temperature is approached to the Néel temperature (T_N) and has a peak value of -0.34 at 200 K near T_N. This value is higher than the room-temperature value by 240% and comparable to that of heavy metals Ta and W. Furthermore, the spin Hall resistivity of Cr well fits the modeled T-dependence when T approaches T_N from low temperatures, implying the AFM spin fluctuation nature of strong SHA enhancement. Thus, this study demonstrates the critical spin fluctuation as a prospective way of increasing SHA and enriches the AFM material candidates for spin-orbitronic devices. △ Less

Submitted 26 April, 2023; originally announced April 2023.

Comments: 27 pages, 9 figures

arXiv:2304.03540 [pdf, other]

ChatPipe: Orchestrating Data Preparation Program by Optimizing Human-ChatGPT Interactions

Authors: Sibei Chen, Hanbing Liu, Weiting Jin, Xiangyu Sun, Xiaoyao Feng, Ju Fan, Xiaoyong Du, Nan Tang

Abstract: Orchestrating a high-quality data preparation program is essential for successful machine learning (ML), but it is known to be time and effort consuming. Despite the impressive capabilities of large language models like ChatGPT in generating programs by interacting with users through natural language prompts, there are still limitations. Specifically, a user must provide specific prompts to iterat… ▽ More Orchestrating a high-quality data preparation program is essential for successful machine learning (ML), but it is known to be time and effort consuming. Despite the impressive capabilities of large language models like ChatGPT in generating programs by interacting with users through natural language prompts, there are still limitations. Specifically, a user must provide specific prompts to iteratively guide ChatGPT in improving data preparation programs, which requires a certain level of expertise in programming, the dataset used and the ML task. Moreover, once a program has been generated, it is non-trivial to revisit a previous version or make changes to the program without starting the process over again. In this paper, we present ChatPipe, a novel system designed to facilitate seamless interaction between users and ChatGPT. ChatPipe provides users with effective recommendation on next data preparation operations, and guides ChatGPT to generate program for the operations. Also, ChatPipe enables users to easily roll back to previous versions of the program, which facilitates more efficient experimentation and testing. We have developed a web application for ChatPipe and prepared several real-world ML tasks from Kaggle. These tasks can showcase the capabilities of ChatPipe and enable VLDB attendees to easily experiment with our novel features to rapidly orchestrate a high-quality data preparation program. △ Less

Submitted 7 April, 2023; originally announced April 2023.

arXiv:2304.01369 [pdf]

doi 10.1103/PhysRevB.107.184412

Three-Dimensional Structure of Hybrid Magnetic Skyrmions Determined by Neutron Scattering

Authors: WLNC Liyanage, Nan Tang, Lizabeth Quigley, Julie A. Borchers, Alexander J. Grutter, Brian B. Maranville, Sunil K. Sinha, Nicolas Reyren, Sergio A. Montoya, Eric E. Fullerton, Lisa DeBeer-Schmitt, Dustin A. Gilbert

Abstract: Magnetic skyrmions are topologically protected chiral spin textures which present opportunities for next-generation magnetic data storage and logic information technologies. The topology of these structures originates in the geometric configuration of the magnetic spins - more generally described as the structure. While the skyrmion structure is most often depicted using a 2D projection of the thr… ▽ More Magnetic skyrmions are topologically protected chiral spin textures which present opportunities for next-generation magnetic data storage and logic information technologies. The topology of these structures originates in the geometric configuration of the magnetic spins - more generally described as the structure. While the skyrmion structure is most often depicted using a 2D projection of the three-dimensional structure, recent works have emphasized the role of all three dimensions in determining the topology and their response to external stimuli. In this work, grazing-incidence small-angle neutron scattering and polarized neutron reflectometry are used to determine the three-dimensional structure of hybrid skyrmions. The structure of the hybrid skyrmions, which includes a combination of Néel-like and Bloch-like components along their length, is expected to significantly contribute to their notable stability, which includes ambient conditions. To interpret the neutron scattering data, micromagnetic simulations of the hybrid skyrmions were performed, and the corresponding diffraction patterns were determined using a Born approximation transformation. The converged magnetic profile reveals the magnetic structure along with the skyrmion depth profile, including the thickness of the Bloch and Néel segments and the diameter of the core. △ Less

Submitted 19 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

arXiv:2303.16909 [pdf, other]

RetClean: Retrieval-Based Data Cleaning Using Foundation Models and Data Lakes

Authors: Mohammad Shahmeer Ahmad, Zan Ahmad Naeem, Mohamed Eltabakh, Mourad Ouzzani, Nan Tang

Abstract: Can foundation models (such as ChatGPT) clean your data? In this proposal, we demonstrate that indeed ChatGPT can assist in data cleaning by suggesting corrections for specific cells in a data table (scenario 1). However, ChatGPT may struggle with datasets it has never encountered before (e.g., local enterprise data) or when the user requires an explanation of the source of the suggested clean val… ▽ More Can foundation models (such as ChatGPT) clean your data? In this proposal, we demonstrate that indeed ChatGPT can assist in data cleaning by suggesting corrections for specific cells in a data table (scenario 1). However, ChatGPT may struggle with datasets it has never encountered before (e.g., local enterprise data) or when the user requires an explanation of the source of the suggested clean values. To address these issues, we developed a retrieval-based method that complements ChatGPT's power with a user-provided data lake. The data lake is first indexed, we then retrieve the top-k relevant tuples to the user's query tuple and finally leverage ChatGPT to infer the correct value (scenario 2). Nevertheless, sharing enterprise data with ChatGPT, an externally hosted model, might not be feasible for privacy reasons. To assist with this scenario, we developed a custom RoBERTa-based foundation model that can be locally deployed. By fine-tuning it on a small number of examples, it can effectively make value inferences based on the retrieved tuples (scenario 3). Our proposed system, RetClean, seamlessly supports all three scenarios and provides a user-friendly GUI that enables the VLDB audience to explore and experiment with the system. △ Less

Submitted 29 March, 2023; originally announced March 2023.

arXiv:2303.15922 [pdf]

doi 10.1063/5.0142224

Hole doping in compositionally complex correlated oxide enables tunable exchange biasing

Authors: Alessandro R. Mazza, Elizabeth Skoropata, Jason Lapano, Michael A. Chilcote, Cameron Jorgensen, Nan Tang, Zheng Gai, John Singleton, Matthew J. Brahlek, Dustin A. Gilbert, Thomas Z. Ward

Abstract: Magnetic interfaces and the phenomena arising from them drive both the design of modern spintronics and fundamental research. Recently, it was revealed that through designing magnetic frustration in configurationally complex entropy stabilized oxides, exchange bias can occur in structurally single crystal films. This eliminates the need for complex heterostructures and nanocomposites in the design… ▽ More Magnetic interfaces and the phenomena arising from them drive both the design of modern spintronics and fundamental research. Recently, it was revealed that through designing magnetic frustration in configurationally complex entropy stabilized oxides, exchange bias can occur in structurally single crystal films. This eliminates the need for complex heterostructures and nanocomposites in the design and control of magnetic response phenomena. In this work, we demonstrate through hole doping of a high entropy perovskite oxide that tuning of magnetic responses can be achieved. With detailed magnetometry, we show magnetic coupling exhibiting a variety of magnetic responses including exchange bias and antiferromagnetic spin reversal in the entropy stabilized ABO3 perovskite oxide La1-xSrx(Cr0.2Mn0.2Fe0.2Co0.2Ni0.2)O3 family. We find that manipulation of the A-site charge state can be used to balance magnetic phase compositions and coupling responses. This allows for the creation of highly tunable exchange bias responses. In the low Sr doping regime, a spin frustrated region arising at the antiferromagnetic phase boundary is shown to directly couple to the antiferromagnetic moments of the film and emerges as the dominant mechanism, leading to a vertical shift of magnetization loops in response to field biasing. At higher concentrations, direct coupling of antiferromagnetic and ferromagnetic regions is observed. This tunability of magnetic coupling is discussed within the context of these three competing magnetic phases, revealing critical features in designing exchange bias through exploiting spin frustration and disorder in high entropy oxides. △ Less

Submitted 28 March, 2023; originally announced March 2023.

arXiv:2303.09055 [pdf, other]

TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization

Authors: Tuan N. Tang, Kwonyoung Kim, Kwanghoon Sohn

Abstract: Temporal Action Localization (TAL) is a challenging task in video understanding that aims to identify and localize actions within a video sequence. Recent studies have emphasized the importance of applying long-term temporal context modeling (TCM) blocks to the extracted video clip features such as employing complex self-attention mechanisms. In this paper, we present the simplest method ever to a… ▽ More Temporal Action Localization (TAL) is a challenging task in video understanding that aims to identify and localize actions within a video sequence. Recent studies have emphasized the importance of applying long-term temporal context modeling (TCM) blocks to the extracted video clip features such as employing complex self-attention mechanisms. In this paper, we present the simplest method ever to address this task and argue that the extracted video clip features are already informative to achieve outstanding performance without sophisticated architectures. To this end, we introduce TemporalMaxer, which minimizes long-term temporal context modeling while maximizing information from the extracted video clip features with a basic, parameter-free, and local region operating max-pooling block. Picking out only the most critical information for adjacent and local clip embeddings, this block results in a more efficient TAL model. We demonstrate that TemporalMaxer outperforms other state-of-the-art methods that utilize long-term TCM such as self-attention on various TAL datasets while requiring significantly fewer parameters and computational resources. The code for our approach is publicly available at https://github.com/TuanTNG/TemporalMaxer △ Less

Submitted 15 March, 2023; originally announced March 2023.

arXiv:2303.02911 [pdf, other]

Design of the Readout Electronics for the TRIDENT Pathfinder Experiment

Authors: M. X. Wang, G. H. Gong, P. Miao, Z. Y. Sun, J. N. Tang, W. H. Wu, D. L. Xu

Abstract: The tRopIcal DEep-sea Neutrino Telescope (TRIDENT) is a future large-scale next-generation neutrino telescope. In September 2021, the TRIDENT pathfinder experiment (TRIDENT EXplorer, T-REX for short) completed in-situ measurements of deep-sea water properties in the South China Sea. The T-REX apparatus integrates two independent and complementary systems, a photomultiplier tube (PMT) and a camera… ▽ More The tRopIcal DEep-sea Neutrino Telescope (TRIDENT) is a future large-scale next-generation neutrino telescope. In September 2021, the TRIDENT pathfinder experiment (TRIDENT EXplorer, T-REX for short) completed in-situ measurements of deep-sea water properties in the South China Sea. The T-REX apparatus integrates two independent and complementary systems, a photomultiplier tube (PMT) and a camera system, to measure the optical and radioactive properties of the deep-sea water. One light emitter module and two light receiver modules were deployed, which were synchronized by using White Rabbit (WR) technology. The light emitter module generates nanosecond-width LED pulses, while the light receiver module hosts three PMTs and a camera to detect photons. The submerged apparatus and the data acquisition system (DAQ) perform real-time command and data transmission. We report the design and performance of the readout electronics for T-REX, including hardware modules, firmware design for digital signal processing, and host-computer software. △ Less

Submitted 6 March, 2023; originally announced March 2023.

arXiv:2302.12970 [pdf, other]

doi 10.3847/1538-4357/acbf34

Abundance ratios of OH/CO and HCO+/CO as probes of the cosmic ray ionization rate in diffuse clouds

Authors: Gan Luo, Zhiyu Zhang, Thomas G. Bisbas, Di Li, Ping Zhou, Ningyu Tang, Junzhi Wang, Pei Zuo, Nannan Yue

Abstract: The cosmic-ray ionization rate (CRIR, $ζ_2$) is one of the key parameters controlling the formation and destruction of various molecules in molecular clouds. However, the current most commonly used CRIR tracers, such as H$_3^+$, OH$^+$, and H$_2$O$^+$, are hard to detect and require the presence of background massive stars for absorption measurements. In this work, we propose an alternative method… ▽ More The cosmic-ray ionization rate (CRIR, $ζ_2$) is one of the key parameters controlling the formation and destruction of various molecules in molecular clouds. However, the current most commonly used CRIR tracers, such as H$_3^+$, OH$^+$, and H$_2$O$^+$, are hard to detect and require the presence of background massive stars for absorption measurements. In this work, we propose an alternative method to infer the CRIR in diffuse clouds using the abundance ratios of OH/CO and HCO$^+$/CO. We have analyzed the response of chemical abundances of CO, OH, and HCO$^+$ on various environmental parameters of the interstellar medium in diffuse clouds and found that their abundances are proportional to $ζ_2$. Our analytic expressions give an excellent calculation of the abundance of OH for $ζ_2$ $\leq$10$^{-15}$ s$^{-1}$, which are potentially useful for modelling chemistry in hydrodynamical simulations. The abundances of OH and HCO$^+$ were found to monotonically decrease with increasing density, while the CO abundance shows the opposite trend. With high-sensitivity absorption transitions of both CO (1--0) and (2--1) lines from ALMA, we have derived the H$_2$ number densities ($n_{\rm H_2}$) toward 4 line-of-sights (LOSs); assuming a kinetic temperature of $T_{\rm k}=50\,{\rm K}$, we find a range of (0.14$\pm$0.03--1.2$\pm$0.1)$\times$10$^2$ cm$^{-3}$}. By comparing the observed and modelled HCO$^+$/CO ratios, we find that $ζ_2$ in our diffuse gas sample is in the { range of $1.0_{-1.0}^{+14.8}$ $\times$10$^{-16}- 2.5_{-2.4}^{+1.4}$ $\times$10$^{-15}$ s$^{-1}$. This is $\sim$2 times higher than the average value measured at higher extinction, supporting an attenuation of CRs as suggested by theoretical models. △ Less

Submitted 6 April, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

Comments: 22 pages, 9 figures, accepted by ApJ

arXiv:2302.10900 [pdf, other]

Semi-decentralized Federated Ego Graph Learning for Recommendation

Authors: Liang Qu, Ningzhi Tang, Ruiqi Zheng, Quoc Viet Hung Nguyen, Zi Huang, Yuhui Shi, Hongzhi Yin

Abstract: Collaborative filtering (CF) based recommender systems are typically trained based on personal interaction data (e.g., clicks and purchases) that could be naturally represented as ego graphs. However, most existing recommendation methods collect these ego graphs from all users to compose a global graph to obtain high-order collaborative information between users and items, and these centralized CF… ▽ More Collaborative filtering (CF) based recommender systems are typically trained based on personal interaction data (e.g., clicks and purchases) that could be naturally represented as ego graphs. However, most existing recommendation methods collect these ego graphs from all users to compose a global graph to obtain high-order collaborative information between users and items, and these centralized CF recommendation methods inevitably lead to a high risk of user privacy leakage. Although recently proposed federated recommendation systems can mitigate the privacy problem, they either restrict the on-device local training to an isolated ego graph or rely on an additional third-party server to access other ego graphs resulting in a cumbersome pipeline, which is hard to work in practice. In addition, existing federated recommendation systems require resource-limited devices to maintain the entire embedding tables resulting in high communication costs. In light of this, we propose a semi-decentralized federated ego graph learning framework for on-device recommendations, named SemiDFEGL, which introduces new device-to-device collaborations to improve scalability and reduce communication costs and innovatively utilizes predicted interacted item nodes to connect isolated ego graphs to augment local subgraphs such that the high-order user-item collaborative information could be used in a privacy-preserving manner. Furthermore, the proposed framework is model-agnostic, meaning that it could be seamlessly integrated with existing graph neural network-based recommendation methods and privacy protection techniques. To validate the effectiveness of the proposed SemiDFEGL, extensive experiments are conducted on three public datasets, and the results demonstrate the superiority of the proposed SemiDFEGL compared to other federated recommendation methods. △ Less

Submitted 9 February, 2023; originally announced February 2023.

arXiv:2301.06888 [pdf, other]

doi 10.1093/mnras/stad1976

Gravitational wave spectral synthesis

Authors: Wouter G. J. van Zeist, J. J. Eldridge, Petra N. Tang

Abstract: We study the LISA sources that arise from isolated binary evolution, and how these depend on age and metallicity, using model stellar populations from BPASS. We model these as single-aged populations which are analogous to star clusters. We calculate the combined GW spectrum of all the binaries within these model clusters, including all types of compact binaries as well as those with living stars.… ▽ More We study the LISA sources that arise from isolated binary evolution, and how these depend on age and metallicity, using model stellar populations from BPASS. We model these as single-aged populations which are analogous to star clusters. We calculate the combined GW spectrum of all the binaries within these model clusters, including all types of compact binaries as well as those with living stars. These results allow us to evaluate the detectability of star clusters with LISA. We find at late times the dominant sources are WD-WD binaries by factors of 50-200, but at times between $10^8$ and $10^9$ years we find a significant population of NS-WD and BH-WD binaries (2-40 per $10^6$ M$_{\odot}$), which is related to the treatment of mass transfer and common envelope events in BPASS, wherein mass transfer is relatively likely to be stable. Metallicity also has an effect on the GW spectrum and on the relative dominance of different types of binaries. Using the information about known star clusters will aid the identification of sky locations where one could expect LISA to find GW sources. △ Less

Submitted 2 August, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

Comments: 21 pages, 12 figures, 3 tables

Journal ref: MNRAS 524 (2023) 2836-2856

arXiv:2211.13380 [pdf, other]

doi 10.3847/1538-4357/aca657

Dependence of Chemical Abundance on the Cosmic Ray Ionization Rate in IC 348

Authors: Gan Luo, Zhi-Yu Zhang, Thomas G. Bisbas, Di Li, Ningyu Tang, Junzhi Wang, Ping Zhou, Pei Zuo, Nannan Yue, Jing Zhou, Lingrui Lin

Abstract: Ions (e.g., H$_3^+$, H$_2$O$^+$) have been used extensively to quantify the cosmic-ray ionization rate (CRIR) in diffuse sightlines. However, measurements of CRIR in low-to-intermediate density gas environments are rare, especially when background stars are absent. In this work, we combine molecular line observations of CO, OH, CH, and HCO$^+$ in the star-forming cloud IC~348, and chemical models… ▽ More Ions (e.g., H$_3^+$, H$_2$O$^+$) have been used extensively to quantify the cosmic-ray ionization rate (CRIR) in diffuse sightlines. However, measurements of CRIR in low-to-intermediate density gas environments are rare, especially when background stars are absent. In this work, we combine molecular line observations of CO, OH, CH, and HCO$^+$ in the star-forming cloud IC~348, and chemical models to constrain the value of CRIR and study the response of the chemical abundances distribution. The cloud boundary is found to have an $A_{\rm V}$ of approximately 4 mag. From the interior to the exterior of the cloud, the observed $^{13}$CO line intensities drop by an order of magnitude. The calculated average abundance of $^{12}$CO (assuming $^{12}$C/$^{13}$C = 65) is (1.2$\pm$0.9) $\times$10$^{-4}$, which increases by a factor of 6 from the interior to the outside regions. The average abundance of CH (3.3$\pm$0.7 $\times$ 10$^{-8}$) is in good agreement with previous findings in diffuse and translucent clouds ($A_{\rm V}$ $<$ 5 mag). However, we did not find a decline in CH abundance in regions of high extinction ($A_{\rm V}\simeq$8 mag) as previously reported in Taurus. By comparing the observed molecular abundances and chemical models, we find a decreasing trend of CRIR as $A_{\rm V}$ increases. The inferred CRIR of $ζ_{cr}$ = (4.7$\pm$1.5) $\times$ 10$^{-16}$ s$^{-1}$ at low $A_{\rm V}$ is consistent with H$^+_3$ measurements toward two nearby massive stars. △ Less

Submitted 23 November, 2022; originally announced November 2022.

Comments: 21 pages, 11 figures. Submitted to ApJ

arXiv:2211.04905 [pdf, other]

SimOn: A Simple Framework for Online Temporal Action Localization

Authors: Tuan N. Tang, Jungin Park, Kwonyoung Kim, Kwanghoon Sohn

Abstract: Online Temporal Action Localization (On-TAL) aims to immediately provide action instances from untrimmed streaming videos. The model is not allowed to utilize future frames and any processing techniques to modify past predictions, making On-TAL much more challenging. In this paper, we propose a simple yet effective framework, termed SimOn, that learns to predict action instances using the popular… ▽ More Online Temporal Action Localization (On-TAL) aims to immediately provide action instances from untrimmed streaming videos. The model is not allowed to utilize future frames and any processing techniques to modify past predictions, making On-TAL much more challenging. In this paper, we propose a simple yet effective framework, termed SimOn, that learns to predict action instances using the popular Transformer architecture in an end-to-end manner. Specifically, the model takes the current frame feature as a query and a set of past context information as keys and values of the Transformer. Different from the prior work that uses a set of outputs of the model as past contexts, we leverage the past visual context and the learnable context embedding for the current query. Experimental results on the THUMOS14 and ActivityNet1.3 datasets show that our model remarkably outperforms the previous methods, achieving a new state-of-the-art On-TAL performance. In addition, the evaluation for Online Detection of Action Start (ODAS) demonstrates the effectiveness and robustness of our method in the online setting. The code is available at https://github.com/TuanTNG/SimOn △ Less

Submitted 7 November, 2022; originally announced November 2022.

arXiv:2211.02816 [pdf, other]

PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-training

Authors: Zihui Gu, Ju Fan, Nan Tang, Preslav Nakov, Xiaoman Zhao, Xiaoyong Du

Abstract: Fact verification has attracted a lot of research attention recently, e.g., in journalism, marketing, and policymaking, as misinformation and disinformation online can sway one's opinion and affect one's actions. While fact-checking is a hard task in general, in many cases, false statements can be easily debunked based on analytics over tables with reliable information. Hence, table-based fact ver… ▽ More Fact verification has attracted a lot of research attention recently, e.g., in journalism, marketing, and policymaking, as misinformation and disinformation online can sway one's opinion and affect one's actions. While fact-checking is a hard task in general, in many cases, false statements can be easily debunked based on analytics over tables with reliable information. Hence, table-based fact verification has recently emerged as an important and growing research area. Yet, progress has been limited due to the lack of datasets that can be used to pre-train language models (LMs) to be aware of common table operations, such as aggregating a column or comparing tuples. To bridge this gap, in this paper we introduce PASTA, a novel state-of-the-art framework for table-based fact verification via pre-training with synthesized sentence-table cloze questions. In particular, we design six types of common sentence-table cloze tasks, including Filter, Aggregation, Superlative, Comparative, Ordinal, and Unique, based on which we synthesize a large corpus consisting of 1.2 million sentence-table pairs from WikiTables. PASTA uses a recent pre-trained LM, DeBERTaV3, and further pretrains it on our corpus. Our experimental results show that PASTA achieves new state-of-the-art performance on two table-based fact verification benchmarks: TabFact and SEM-TAB-FACTS. In particular, on the complex set of TabFact, which contains multiple operations, PASTA largely outperforms the previous state of the art by 4.7 points (85.6% vs. 80.9%), and the gap between PASTA and human performance on the small TabFact test set is narrowed to just 1.5 points (90.6% vs. 92.1%). △ Less

Submitted 5 November, 2022; originally announced November 2022.

Comments: EMNLP 2022

MSC Class: 68T50 ACM Class: I.2.7; I.2.6

arXiv:2208.05250 [pdf, other]

doi 10.1088/1674-4527/ac784e

The Distribution of UV Radiation Field in the Molecular Clouds of Gould Belt

Authors: Jifeng Xia, Ningyu Tang, Qijun Zhi, Sihan Jiao, Jinjin Xie, Gary A. Fuller, Paul F. Goldsmith, Di Li

Abstract: The distribution of ultraviolet (UV) radiation field provides critical constraints on the physical environments of molecular clouds. Within 1 kpc of our solar system and fostering protostars of different masses, the giant molecular clouds in the Gould Belt present an excellent opportunity to resolve the UV field structure in star forming regions. We performed spectral energy distribution (SED) fit… ▽ More The distribution of ultraviolet (UV) radiation field provides critical constraints on the physical environments of molecular clouds. Within 1 kpc of our solar system and fostering protostars of different masses, the giant molecular clouds in the Gould Belt present an excellent opportunity to resolve the UV field structure in star forming regions. We performed spectral energy distribution (SED) fitting of the archival data from the Herschel Gould Belt Survey (HGBS). Dust radiative transfer analysis with the DUSTY code were applied to 23 regions in 14 molecular complexes of the Gould Belt, resulting in the spatial distribution of radiation field in these regions. For 10 of 15 regions with independent measurements of star formation rate, their star formation rate and UV radiation intensity largely conform to a linear correlation found in previous studies. △ Less

Submitted 10 August, 2022; originally announced August 2022.

Comments: 31 pages, 25 figures,1 table, published in Research in Astronomy and Astrophysics

arXiv:2208.04870 [pdf]

A 0.6 Mpc HI Structure Associated with Stephan's Quintet

Authors: C. K. Xu, C. Cheng, P. N. Appleton, P. -A. Duc, Y. Gao, N. -Y. Tang, M. Yun, Y. S. Dai, J. -S. Huang, U. Lisenfeld, F. Renaud

Abstract: Stephan's Quintet (SQ, distance=85$\pm$6 Mpc) is unique among compact groups of galaxies. Observations have previously shown that interactions between multiple members, including a high-speed intruder galaxy currently colliding into the intragroup medium, have likely generated tidal debris in the form of multiple gaseous and stellar filaments, the formation of tidal dwarfs and intragroup-medium st… ▽ More Stephan's Quintet (SQ, distance=85$\pm$6 Mpc) is unique among compact groups of galaxies. Observations have previously shown that interactions between multiple members, including a high-speed intruder galaxy currently colliding into the intragroup medium, have likely generated tidal debris in the form of multiple gaseous and stellar filaments, the formation of tidal dwarfs and intragroup-medium starbursts, as well as widespread intergalactic shocked gas. The details and timing of the interactions/collisions remain poorly understood because of the multiple nature. Here we report atomic hydrogen (HI) observations in the vicinity of SQ with a smoothed sensitivity of 1$σ$=4.2 $\times 10^{16}\rm cm^{-2}$ per channel ($Δ$v=20 km s$^{-1}$; angular-resolution=4'), which are about two orders of magnitude deeper than previous observations. The data reveal a large HI structure (linear scale ~0.6 Mpc) encompassing an extended source of size ~0.4 Mpc associated with the debris field and a curved diffuse feature of length ~0.5 Mpc attached to the south edge of the extended source. The diffuse feature was likely produced by tidal interactions in early stages of SQ (>1 Gyr ago), though it is not clear how the low density HI gas (N$_{\rm HI}\leq 10^{18}\rm cm^{-2}$) can survive the ionization by the inter-galactic UV background on such a long time scale. Our observations require a rethinking of gas in outer parts of galaxy groups and demand complex modeling of different phases of the intragroup medium in simulations of group formation. △ Less

Submitted 10 August, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

Comments: 21 pages, 10 figures, accepted by Nature

arXiv:2207.11079 [pdf, ps, other]

doi 10.1109/TCOMM.2022.3215998

New Decoding of Reed-Solomon Codes Based on FFT and Modular Approach

Authors: Nianqi Tang, Yunghsiang S. Han

Abstract: Decoding algorithms for Reed--Solomon (RS) codes are of great interest for both practical and theoretical reasons. In this paper, an efficient algorithm, called the modular approach (MA), is devised for solving the Welch--Berlekamp (WB) key equation. By taking the MA as the key equation solver, we propose a new decoding algorithm for systematic RS codes. For $(n,k)$ RS codes, where $n$ is the code… ▽ More Decoding algorithms for Reed--Solomon (RS) codes are of great interest for both practical and theoretical reasons. In this paper, an efficient algorithm, called the modular approach (MA), is devised for solving the Welch--Berlekamp (WB) key equation. By taking the MA as the key equation solver, we propose a new decoding algorithm for systematic RS codes. For $(n,k)$ RS codes, where $n$ is the code length and $k$ is the code dimension, the proposed decoding algorithm has both the best asymptotic computational complexity $O(n\log(n-k) + (n-k)\log^2(n-k))$ and the smallest constant factor achieved to date. By comparing the number of field operations required, we show that when decoding practical RS codes, the new algorithm is significantly superior to the existing methods in terms of computational complexity. When decoding the $(4096, 3584)$ RS code defined over $\mathbb{F}_{2^{12}}$, the new algorithm is 10 times faster than a conventional syndrome-based method. Furthermore, the new algorithm has a regular architecture and is thus suitable for hardware implementation. △ Less

Submitted 22 July, 2022; originally announced July 2022.

arXiv:2207.06585 [pdf, other]

doi 10.3847/1538-4357/ac80ba

Compact and variable radio emission from an active galaxy with supersoft X-ray emission

Authors: Lei Yang, Xinwen Shu, Fabao Zhang, Yogesh Chandola, Daizhong Liu, Yi Liu, Minfeng Gu, Margherita Giustini, Ning Jiang, Ya-Ping Li, Di Li, David Elbaz, Stephanie Juneau, Maurilio Pannella, Luming Sun, Ningyu Tang, Tinggui Wang, Hongyan Zhou

Abstract: RX J1301.9+2747 is a unique active galaxy with supersoft X-ray spectrum that lacks significant emission at energies above 2 keV. In addition, it is one of few galaxies displaying quasi-periodic X-ray eruptions that recur on a timescale of 13-20 ks. We present multi-epoch radio observations of RX J1301.9+2747 using GMRT, VLA and VLBA. The VLBA imaging at 1.6 GHz reveals a compact radio emission unr… ▽ More RX J1301.9+2747 is a unique active galaxy with supersoft X-ray spectrum that lacks significant emission at energies above 2 keV. In addition, it is one of few galaxies displaying quasi-periodic X-ray eruptions that recur on a timescale of 13-20 ks. We present multi-epoch radio observations of RX J1301.9+2747 using GMRT, VLA and VLBA. The VLBA imaging at 1.6 GHz reveals a compact radio emission unresolved at a scale of <0.7 pc, with a brightness temperature of T_b>5x10^7 K. The radio emission is variable by more than a factor of 2.5 over a few days, based on the data taken from VLA monitoring campaigns. The short-term radio variability suggests that the radio emitting region has a size as small as 8x10^{-4} pc, resulting in an even higher brightness temperature of T_b ~10^{12} K. A similar limit on the source size can be obtained if the observed flux variability is not intrinsic and caused by the interstellar scintillation effect. The overall radio spectrum is steep with a time-averaged spectral index alpha=-0.78+/-0.03 between 0.89 GHz and 14 GHz. These observational properties rule out a thermal or star-formation origin of the radio emission, and appear to be consistent with the scenario of episodic jet ejections driven by magnetohydrodynamic process. Simultaneous radio and X-ray monitoring observations down to a cadence of hours are required to test whether the compact and variable radio emission is correlated with the quasi-periodic X-ray eruptions. △ Less

Submitted 13 July, 2022; originally announced July 2022.

Comments: 11 pages, 3 figures, 2 tables. Accepted for publication in ApJ

arXiv:2207.04519 [pdf, other]

A multi-cubic-kilometre neutrino telescope in the western Pacific Ocean

Authors: Z. P. Ye, F. Hu, W. Tian, Q. C. Chang, Y. L. Chang, Z. S. Cheng, J. Gao, T. Ge, G. H. Gong, J. Guo, X. X. Guo, X. G. He, J. T. Huang, K. Jiang, P. K. Jiang, Y. P. Jing, H. L. Li, J. L. Li, L. Li, W. L. Li, Z. Li, N. Y. Liao, Q. Lin, F. Liu, J. L. Liu , et al. (33 additional authors not shown)

Abstract: Next-generation neutrino telescopes with significantly improved sensitivity are required to pinpoint the sources of the diffuse astrophysical neutrino flux detected by IceCube and uncover the century-old puzzle of cosmic ray origins. A detector near the equator will provide a unique viewpoint of the neutrino sky, complementing IceCube and other neutrino telescopes in the Northern Hemisphere. Here… ▽ More Next-generation neutrino telescopes with significantly improved sensitivity are required to pinpoint the sources of the diffuse astrophysical neutrino flux detected by IceCube and uncover the century-old puzzle of cosmic ray origins. A detector near the equator will provide a unique viewpoint of the neutrino sky, complementing IceCube and other neutrino telescopes in the Northern Hemisphere. Here we present results from an expedition to the north-eastern region of the South China Sea, in the western Pacific Ocean. A favorable neutrino telescope site was found on an abyssal plain at a depth of $\sim$ 3.5km. At depths below 3km, the sea current speed, water absorption and scattering lengths for Cherenkov light, were measured to be $v_{\mathrm{c}}<$10cm/s, $λ_{\mathrm{abs} }\simeq$ 27m and $λ_{\mathrm{sca} }\simeq$ 63m, respectively. Accounting for these measurements, we present the design and expected performance of a next-generation neutrino telescope, TRopIcal DEep-sea Neutrino Telescope (TRIDENT). With its advanced photon-detection technology and large dimensions, TRIDENT expects to observe the IceCube steady source candidate NGC 1068 with 5$σ$ significance within 1 year of operation. This level of sensitivity will open a new arena for diagnosing the origin of cosmic rays and probing fundamental physics over astronomical baselines. △ Less

Submitted 13 May, 2024; v1 submitted 10 July, 2022; originally announced July 2022.

Comments: 34 pages,12 figures. Correspondence should be addressed to D. L. Xu: donglianxu@sjtu.edu.cn

arXiv:2206.06908 [pdf, other]

LPCSE: Neural Speech Enhancement through Linear Predictive Coding

Authors: Yang Liu, Na Tang, Xiaoli Chu, Yang Yang, Jun Wang

Abstract: The increasingly stringent requirement on quality-of-experience in 5G/B5G communication systems has led to the emerging neural speech enhancement techniques, which however have been developed in isolation from the existing expert-rule based models of speech pronunciation and distortion, such as the classic Linear Predictive Coding (LPC) speech model because it is difficult to integrate the models… ▽ More The increasingly stringent requirement on quality-of-experience in 5G/B5G communication systems has led to the emerging neural speech enhancement techniques, which however have been developed in isolation from the existing expert-rule based models of speech pronunciation and distortion, such as the classic Linear Predictive Coding (LPC) speech model because it is difficult to integrate the models with auto-differentiable machine learning frameworks. In this paper, to improve the efficiency of neural speech enhancement, we introduce an LPC-based speech enhancement (LPCSE) architecture, which leverages the strong inductive biases in the LPC speech model in conjunction with the expressive power of neural networks. Differentiable end-to-end learning is achieved in LPCSE via two novel blocks: a block that utilizes the expert rules to reduce the computational overhead when integrating the LPC speech model into neural networks, and a block that ensures the stability of the model and avoids exploding gradients in end-to-end training by mapping the Linear prediction coefficients to the filter poles. The experimental results show that LPCSE successfully restores the formants of the speeches distorted by transmission loss, and outperforms two existing neural speech enhancement methods of comparable neural network sizes in terms of the Perceptual evaluation of speech quality (PESQ) and Short-Time Objective Intelligibility (STOI) on the LJ Speech corpus. △ Less

Submitted 22 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

arXiv:2205.00886 [pdf]

doi 10.1038/s41598-022-25122-4

Anti-microbial properties of a multi-component alloy

Authors: Anne F. Murray, Daniel Bryan, David A. Garfinkel, Cameron S. Jogensen, Nan Tang, WLNC Liyanage, Eric A. Lass, Ying Yang, Philip D. Rack, Thomas G. Denes, Dustin A. Gilbert

Abstract: High traffic touch surfaces such as doorknobs, countertops, and handrails can be transmission points for the spread of pathogens, emphasizing the need to develop materials that actively self-sanitize. Metals are frequently used for these surfaces due to their durability, but many metals also possess antimicrobial properties which function through a variety of mechanisms. This work investigates met… ▽ More High traffic touch surfaces such as doorknobs, countertops, and handrails can be transmission points for the spread of pathogens, emphasizing the need to develop materials that actively self-sanitize. Metals are frequently used for these surfaces due to their durability, but many metals also possess antimicrobial properties which function through a variety of mechanisms. This work investigates metallic alloys comprised of several bioactive metals with the target of achieving broad-spectrum, rapid bioactivity through synergistic activity. An entropy-motivated stabilization paradigm is proposed to prepare scalable alloys of copper, silver, nickel and cobalt. Using combinatorial sputtering, thin-film alloys were prepared on 100 mm wafers with 50% compositional grading of each element across the wafer. The films were then annealed and investigated for alloy stability. Bioactivity testing was performed on both the as-grown alloys and the annealed films using four microorganisms -- Phi6, MS2, Bacillus subtilis and Escherichia coli -- as surrogates for human viral and bacterial pathogens. Testing showed that after 30 s of contact with some of the test alloys, Phi6, an enveloped, single-stranded RNA bacteriophage that serves as a SARS-CoV 2 surrogate, was reduced up to 6.9 orders of magnitude (>99.9999%). Additionally, the non-enveloped, double-stranded DNA bacteriophage MS2, and the Gram-negative E. coli and Gram-positive B. subtilis bacterial strains showed a 5.0, 6.4, and 5.7 log reduction in activity after 30, 20 and 10 minutes, respectively. Bioactivity in the alloy samples showed a strong dependence on the composition, with the log reduction scaling directly with the Cu content. Concentration of Cu by phase separation after annealing improved activity in some of the samples. The results motivate a variety of themes which can be leveraged to design ideal bioactive surfaces. △ Less

Submitted 28 April, 2022; originally announced May 2022.

Comments: 15 pages, 6 figures

Journal ref: Scientific Reports 12, 21427 (2022)

Showing 1–50 of 119 results for author: Tang, N