Our Halo of Ice and Fire: Strong Kinematic Asymmetries in the Galactic Halo

Authors: Jiwon Jesse Han, Charlie Conroy, Dennis Zaritsky, Ana Bonaca, Nelson Caldwell, Vedant Chandra, Yuan-Sen Ting

Abstract: The kinematics of the stellar halo hold important clues to the assembly history and mass distribution of the Galaxy. In this study, we map the kinematics of stars across the Galactic halo with the H3 Survey. We find a complex distribution that breaks both azimuthal symmetry about the $Z$-axis and mirror symmetry about the Galactic plane. This asymmetry manifests as large variations in the radial velocity dispersion $σ_r$ from as ``cold'' as 70 $\text{km}\text{ s}^{-1}$ to as ``hot'' as 160 $\text{km}\text{ s}^{-1}$. We use stellar chemistry to distinguish accreted stars from in-situ stars in the halo, and find that the accreted population has higher $σ_r$ and radially biased orbits, while the in-situ population has lower $σ_r$ and isotropic orbits. As a result, the Galactic halo kinematics are highly heterogeneous and poorly approximated as being spherical or axisymmetric. We measure radial profiles of $σ_r$ and the anisotropy parameter $β$ over Galactocentric radii $10-80\text{ kpc}$, and find that discrepancies in the literature are due to the nonspherical geometry and heterogeneous nature of the halo. Investigating the effect of strongly asymmetric $σ_r$ and $β$ on equilibrium models is a path forward to accurately constraining the Galactic gravitational field, including its total mass.

Submitted 18 June, 2024; originally announced June 2024.

Comments: submitted to ApJ; comments welcome
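
Note on the anisotropy parameter: the abstract above refers to $β$ without defining it. A minimal sketch, assuming the standard definition $β = 1 - (σ_θ^2 + σ_φ^2) / (2 σ_r^2)$ used in halo dynamics, shows how isotropic and radially biased orbits map onto $β$. The function name and the tangential dispersion values are illustrative assumptions, not quantities taken from the paper; only the 160 km s$^{-1}$ radial dispersion echoes the abstract.

```python
def anisotropy_beta(sigma_r, sigma_theta, sigma_phi):
    """Velocity anisotropy: beta = 1 - (sigma_theta^2 + sigma_phi^2) / (2 sigma_r^2).

    beta = 0 for isotropic orbits, beta -> 1 for purely radial orbits,
    and beta < 0 for tangentially biased orbits.
    """
    if sigma_r <= 0:
        raise ValueError("sigma_r must be positive")
    return 1.0 - (sigma_theta**2 + sigma_phi**2) / (2.0 * sigma_r**2)


# Hypothetical dispersions in km/s (assumed for illustration only).
beta_isotropic = anisotropy_beta(100.0, 100.0, 100.0)  # equal dispersions
beta_radial = anisotropy_beta(160.0, 80.0, 80.0)       # "hot" radial component

print(f"isotropic case: beta = {beta_isotropic:.2f}")
print(f"radially biased case: beta = {beta_radial:.2f}")
```

With equal dispersions in all three components the sketch returns $β = 0$, and halving the tangential dispersions relative to the radial one gives $β = 0.75$.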

arXiv:2406.01676 [pdf, other]

All-Sky Kinematics of the Distant Halo: The Reflex Response to the LMC

Authors: Vedant Chandra, Rohan P. Naidu, Charlie Conroy, Nicolas Garavito-Camargo, Chervin Laporte, Ana Bonaca, Phillip A. Cargile, Emily Cunningham, Jiwon Jesse Han, Benjamin D. Johnson, Hans-Walter Rix, Yuan-Sen Ting, Turner Woody, Dennis Zaritsky

Abstract: The infall of the Large Magellanic Cloud (LMC) is predicted to displace the inner Milky Way (MW), imprinting an apparent 'reflex motion' on the observed velocities of distant halo stars. We construct the largest all-sky spectroscopic dataset of luminous red giant stars from $50-160$ kpc, including a new survey of the southern celestial hemisphere. We fit the full 6D kinematics of our data to measure the amplitude and direction of the inner MW's motion towards the outer halo. The observed velocity grows with distance such that, relative to halo stars at $100$ kpc, the inner MW is lurching at $\approx 40$ km s$^{-1}$ towards a recent location along the LMC's past orbit. Our measurements align with N-body simulations of the halo's response to a $1.8 \times 10^{11} M_\odot$ LMC on first infall, suggesting that the LMC is at least 15% as massive as the MW. Our findings highlight the dramatic disequilibrium of the MW outskirts, and will enable more accurate measurements of the total mass of our Galaxy.

Submitted 3 June, 2024; originally announced June 2024.

Comments: 25 pages, 15 figures. Submitted to ApJ
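
Note on the mass ratio: the two numbers quoted in the abstract above (an LMC mass of $1.8 \times 10^{11} M_\odot$ and a mass ratio of at least 15%) can be combined in a back-of-the-envelope way. The sketch below assumes both figures are taken at face value and simply inverts the ratio; the implied Milky Way mass is an inference made here for illustration, not a value stated in the abstract, and the function name is hypothetical.

```python
def implied_max_host_mass(satellite_mass, min_mass_ratio):
    """Largest host mass consistent with satellite_mass / host_mass >= min_mass_ratio."""
    if not 0 < min_mass_ratio <= 1:
        raise ValueError("min_mass_ratio must be in (0, 1]")
    return satellite_mass / min_mass_ratio


M_LMC = 1.8e11    # solar masses, as quoted in the abstract
MIN_RATIO = 0.15  # "at least 15% as massive as the MW"

mw_mass_upper = implied_max_host_mass(M_LMC, MIN_RATIO)
print(f"Implied MW mass <= {mw_mass_upper:.2e} solar masses")
```

Under these assumptions the implied Milky Way mass is at most about $1.2 \times 10^{12} M_\odot$.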

arXiv:2406.01391 [pdf, other]

Knowledge Graph in Astronomical Research with Large Language Models: Quantifying Driving Forces in Interdisciplinary Scientific Discovery

Authors: Zechang Sun, Yuan-Sen Ting, Yaobo Liang, Nan Duan, Song Huang, Zheng Cai

Abstract: Identifying and predicting the factors that contribute to the success of interdisciplinary research is crucial for advancing scientific discovery. However, there is a lack of methods to quantify the integration of new ideas and technological advancements in astronomical research and how these new technologies drive further scientific breakthroughs. Large language models, with their ability to extract key concepts from vast literature beyond keyword searches, provide a new tool to quantify such processes. In this study, we extracted concepts in astronomical research from 297,807 publications between 1993 and 2024 using large language models, resulting in a set of 24,939 concepts. These concepts were then used to form a knowledge graph, where the link strength between any two concepts was determined by their relevance through the citation-reference relationships. By calculating this relevance across different time periods, we quantified the impact of numerical simulations and machine learning on astronomical research. The knowledge graph demonstrates two phases of development: a phase where the technology was integrated and another where the technology was explored in scientific discovery. The knowledge graph reveals that although machine learning has made significant inroads in astronomy, there is currently a lack of new concept development at the intersection of AI and astronomy, which may be the current bottleneck preventing machine learning from further transforming the field of astronomy.

Submitted 15 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: An interactive version of the knowledge graph is made publicly available at https://astrokg.github.io/. Accepted to IJCAI 2024 AI4Research Workshop. Comments are welcome

arXiv:2405.18223 [pdf, other]

doi 10.1093/mnras/stae1362

A path towards constraining the evolution of the interstellar medium and outflows in the Milky Way using APOGEE

Authors: Piyush Sharda, Yuan-Sen Ting, Neige Frankel

Abstract: In recent years, the study of the Milky Way has significantly advanced due to extensive spectroscopic surveys of its stars, complemented by asteroseismic and astrometric data. However, it remains disjoint from recent advancements in understanding the physics of the Galactic interstellar medium (ISM). This paper introduces a new model for the chemical evolution of the Milky Way that can be constrained on stellar data, because it combines a state-of-the-art ISM model with a Milky Way stellar disc model. Utilizing a dataset of red clump stars from APOGEE, known for their precise ages and metallicities, we concentrate on the last 6 billion years -- a period marked by the Milky Way's secular evolution. We examine the oxygen abundance in the low-$α$ disc stars relative to their ages and birth radii, validating or constraining critical ISM parameters that remain largely unexplored in extragalactic observations. The models that successfully reproduce the radius -- metallicity distribution and the age -- metallicity distribution of stars without violating existing ISM observations indicate a need for modest differential oxygen enrichment in Galactic outflows, meaning that the oxygen abundance of outflows is higher than the local ISM abundance, irrespective of outflow mass loading. The models also suggest somewhat elevated ISM gas velocity dispersion levels over the past 6 billion years compared to galaxies of similar mass. The extra turbulence necessary could result from energy from gas accretion onto the Galaxy, supernovae clustering in the ISM, or increased star formation efficiency per freefall time. This work provides a novel approach to constraining the Galactic ISM and outflows, leveraging the detailed insights available from contemporary Milky Way surveys.

Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

Comments: 20 pages, 9 figures. Accepted by MNRAS

arXiv:2405.17156 [pdf, other]

The Scaling Law in Stellar Light Curves

Authors: Jia-Shu Pan, Yuan-Sen Ting, Yang Huang, Jie Yu, Ji-Feng Liu

Abstract: Analyzing time series of fluxes from stars, known as stellar light curves, can reveal valuable information about stellar properties. However, most current methods rely on extracting summary statistics, and studies using deep learning have been limited to supervised approaches. In this research, we investigate the scaling law properties that emerge when learning from astronomical time series data using self-supervised techniques. By employing the GPT-2 architecture, we show the learned representation improves as the number of parameters increases from $10^4$ to $10^9$, with no signs of performance plateauing. We demonstrate that a self-supervised Transformer model achieves 3-10 times the sample efficiency compared to the state-of-the-art supervised learning model when inferring the surface gravity of stars as a downstream task. Our research lays the groundwork for analyzing stellar light curves by examining them through large-scale auto-regressive generative models.

Submitted 17 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

Comments: 11 pages, 5 figures, ICML 2024 AI4Science workshop

arXiv:2405.13083 [pdf, other]

Unsupervised Searches for Cosmological Parity Violation: Improving Detection Power with the Neural Field Scattering Transform

Authors: Matthew Craigie, Peter L. Taylor, Yuan-Sen Ting, Carolina Cuesta-Lazaro, Rossana Ruggeri, Tamara M. Davis

Abstract: Recent studies using four-point correlations suggest a parity violation in the galaxy distribution, though the significance of these detections is sensitive to the choice of simulation used to model the noise properties of the galaxy distribution. In a recent paper, we introduce an unsupervised learning approach which offers an alternative method that avoids the dependence on mock catalogs, by learning parity violation directly from observational data. However, the Convolutional Neural Network (CNN) model utilized by our previous unsupervised approach struggles to extend to more realistic scenarios where data is limited. We propose a novel method, the Neural Field Scattering Transform (NFST), which enhances the Wavelet Scattering Transform (WST) technique by adding trainable filters, parameterized as a neural field. We first tune the NFST model to detect parity violation in a simplified dataset, then compare its performance against WST and CNN benchmarks across varied training set sizes. We find the NFST can detect parity violation with $4\times$ less data than the CNN and $32\times$ less than the WST. Furthermore, in cases with limited data the NFST can detect parity violation with up to $6σ$ confidence, where the WST and CNN fail to make any detection. We identify that the added flexibility of the NFST, and particularly the ability to learn asymmetric filters, as well as the specific symmetries built into the NFST architecture, contribute to its improved performance over the benchmark models. We further demonstrate that the NFST is readily interpretable, which is valuable for physical applications such as the detection of parity violation.

Submitted 21 May, 2024; originally announced May 2024.

arXiv:2404.13562 [pdf, other]

New Evidence of Binarity in Young α-Rich Turn-off and Subgiant Stars: Fast Rotation and Strong Magnetic Activity

Authors: Jie Yu, Luca Casagrande, Ioana Ciucă, Yuan-Sen Ting, Simon J. Murphy, Boquan Chen

Abstract: Young α-rich (YAR) stars within the old Galactic thick disk exhibit a dual characteristic of relative youth determined with asteroseismology and abundance enhancement in α elements measured from high-resolution spectroscopy. The youth origin of YAR stars has been proposed to be binary evolution via mass transfer or stellar mergers. If that is the case, YAR stars should spin rapidly and thus be magnetically active, because they are mass and angular momentum gainers. In this study, to seek this binary footprint we select YAR stars on the main-sequence turn-off or the subgiant branch (MSTO-SGB) from APOGEE DR17, whose ages and projected rotation velocities (vsini) can be precisely measured. With APOGEE vsini and LAMOST spectra, we find that YAR stars are indeed fast rotators and magnetically active. In addition, we observe low [C/N] ratios and high Gaia RUWE in some YAR stars, suggesting that these MSTO-SGB stars probably have experienced mass transfer from red-giant companions. Our findings underscore that magnetic activity can serve as a valuable tool for probing the binary evolution for other chemically peculiar stars, such as red giants with lithium anomalies and carbon-enhanced metal-poor stars.

Submitted 21 April, 2024; originally announced April 2024.

Comments: 7 pages, 6 figures. Accepted for publication in MNRAS

arXiv:2404.08975 [pdf, other]

Uncovering the first-infall history of the LMC through its dynamical impact in the Milky Way halo

Authors: Yanjun Sheng, Yuan-Sen Ting, Xiang-Xiang Xue, Jiang Chang, Hao Tian

Abstract: The gravitational interactions between the LMC and the Milky Way can give rise to dynamical perturbations in the MW halo, leading to a biased distribution of stellar density and other kinematic signals. These disequilibrium phenomena exhibit variations under different parameter combinations of the MW-LMC model. In this work, we run 50 high-resolution N-body simulations spanning different masses and halo shapes of the Milky Way and LMC and investigate how the LMC-induced perturbations evolve with these model parameters. We measure the magnitude of kinematic perturbations from the mean velocities of simulated halo stars and identify a discontinuity between the first-infall and second-passage scenarios of the LMC's orbital history. We demonstrate that, due to the short dynamical times of the Galactic inner halo, the reduced perturbation magnitude in the second-passage scenario is mainly a result of the LMC's second infall into the MW, which starts at a much lower velocity relative to the inner halo compared to the first-infall scenario. Using a subset of $\sim 1200$ RR Lyrae stars located in the outer halo ($50 \leq R_{\mathrm{GC}} < 100$ kpc), which are selected from a larger sample of 135,873 RR Lyrae stars with precise distance estimates from Gaia, we find the mean latitudinal velocity ($v_{b}$) in the heliocentric frame to be $\langle v_{b} \rangle = 30.8 \pm 4.0$ km/s. The observation contradicts the second-passage scenario and supports the first-infall scenario with a massive LMC ($\sim 2.1 \times 10^{11} \mathrm{M}_{\odot}$) at infall, an oblate MW halo with a virial mass $M_{200} < 1.4 \times 10^{12} \mathrm{M}_{\odot}$ and a flattening parameter $q > 0.7$.

Submitted 13 April, 2024; originally announced April 2024.

Comments: 21 pages, 14 figures, submitted to MNRAS

arXiv:2404.01556 [pdf, other]

Blind QSO reconstruction challenge: Exploring methods to reconstruct the Ly$α$ emission line of QSOs

Authors: Bradley Greig, Sarah E. I. Bosman, Frederick B. Davies, Dominika Ďurovčíková, Hassan Fathivavsari, Bin Liu, Romain A. Meyer, Zechang Sun, Valentina D'Odorico, Simona Gallerani, Andrei Mesinger, Yuan-Sen Ting

Abstract: Reconstructing the intrinsic Ly$α$ line flux from high-$z$ QSOs can place constraints on the neutral hydrogen content of the intergalactic medium during reionisation. There are now $\gtrsim10$ different Ly$α$ reconstruction pipelines using different methodologies to predict the Ly$α$ line flux from correlations with the spectral information redward of Ly$α$. However, there have been few attempts to directly compare the performance of these pipelines. Therefore, we devised a blind QSO challenge to compare these reconstruction pipelines on a uniform set of objects. Each author was provided de-identified, observed rest-frame QSO spectra with spectral information only redward of 1260Å rest-frame to ensure unbiased reconstruction. We constructed two samples of 30 QSOs, from X-Shooter and SDSS, both spanning $3.5<z<4.5$. Importantly, the purpose of this comparison study was not to champion a single, best performing reconstruction pipeline but rather to explore the relative performance of these pipelines over a range of QSOs with broad observational characteristics to infer general trends. In summary, we find machine learning approaches in general provide the strongest ``best guesses'' but underestimate the accompanying statistical uncertainty, although these can be recalibrated, whilst pipelines that decompose the spectral information, for example principal component or factor analysis, generally perform better at predicting the Ly$α$ profile. Further, we found that reconstruction pipelines trained on SDSS QSOs performed similarly on average for both the X-Shooter and SDSS samples, indicating no discernible biases owing to differences in the observational characteristics of the training set or QSO being reconstructed, although the recovered distributions of reconstructions for X-Shooter were broader, likely due to an increased fraction of outliers.

Submitted 1 April, 2024; originally announced April 2024.

Comments: 30 pages, 16 figures, 4 tables. Submitted to MNRAS, comments welcome

arXiv:2403.14061 [pdf, other]

Exploring the role of the halo mass function for inferring astrophysical parameters during reionisation

Authors: Bradley Greig, David Prelogović, Jordan Mirocha, Yuxiang Qin, Yuan-Sen Ting, Andrei Mesinger

Abstract: The detection of the 21-cm signal at $z\gtrsim6$ will reveal insights into the properties of the first galaxies responsible for driving reionisation. To extract this information, we perform parameter inference which requires embedding 3D simulations of the 21-cm signal within a Bayesian inference pipeline. Presently, when performing inference we must choose which sources of uncertainty to sample and which to hold fixed. Since the astrophysics of galaxies are much more uncertain than those of the underlying halo-mass function (HMF), we usually parameterise and model the former while fixing the latter. However, in doing so we may bias our inference of the properties of these first galaxies. In this work, we explore the consequences of assuming an incorrect choice of HMF and quantify the relative biases in our inferred astrophysical model parameters when considering the wrong HMF. We then relax this assumption by constructing a generalised five parameter model for the HMF and simultaneously recover these parameters along with our underlying astrophysical model. For this analysis, we use 21cmFAST and perform Simulation-Based Inference by applying marginal neural ratio estimation to learn the likelihood-to-evidence ratio using Swyft. Using a mock 1000 hour observation of the 21-cm power spectrum from the forthcoming Square Kilometre Array, conservatively assuming foreground wedge avoidance, we find assuming the incorrect HMF can bias the recovered astrophysical parameters by up to $\sim3-4σ$ even when including independent information from observed luminosity functions. When considering our generalised HMF model, we recover constraints on our astrophysical parameters with a factor of $\sim2-4$ larger marginalised uncertainties. Importantly, these constraints are unbiased, agnostic to the underlying HMF and therefore more conservative.

Submitted 20 March, 2024; originally announced March 2024.

Comments: 27 pages, 14 figures, 3 tables and 3 appendices. Submitted to MNRAS, comments welcome

arXiv:2403.14060 [pdf, other]

Inferring astrophysical parameters using the 2D cylindrical power spectrum from reionisation

Authors: Bradley Greig, David Prelogović, Yuxiang Qin, Yuan-Sen Ting, Andrei Mesinger

Abstract: Enlightening our understanding of the first galaxies responsible for driving reionisation requires detecting the 21-cm signal from neutral hydrogen. Interpreting the wealth of information embedded in this signal requires Bayesian inference. Parameter inference from the 21-cm signal is primarily restricted to the spherically averaged power spectrum (1D PS) owing to its relatively straightforward derivation of an analytic likelihood function enabling traditional Monte-Carlo Markov-Chain (MCMC) approaches. However, in recent years, simulation-based inference (SBI) has become feasible which removes the necessity of having an analytic likelihood, enabling more complex summary statistics of the 21-cm signal to be used for Bayesian inference. In this work, we use SBI, specifically marginal neural ratio estimation to learn the likelihood-to-evidence ratio with Swyft, to explore parameter inference using the cylindrically averaged 2D PS. Since the 21-cm signal is anisotropic, the 2D PS should yield more constraining information compared to the 1D PS which isotropically averages the signal. For this, we consider a mock 1000 hr observation of the 21-cm signal using the SKA and compare the performance of the 2D PS relative to the 1D PS. Additionally, we explore two separate foreground mitigation strategies, perfect foreground removal and wedge avoidance. We find the 2D PS outperforms the 1D PS by improving the marginalised uncertainties on individual astrophysical parameters by up to $\sim30-40$ per cent irrespective of the foreground mitigation strategy. Primarily, these improvements stem from how the 2D PS distinguishes between the transverse, $k_{\perp}$, and redshift dependent, $k_{\parallel}$ information which enables greater sensitivity to the complex reionisation morphology.

Submitted 20 March, 2024; originally announced March 2024.

Comments: 16 pages, 6 figures and 1 table. Submitted to MNRAS, comments welcome

arXiv:2403.13209 [pdf, other]

doi 10.1038/s41586-024-07091-y

At least one in a dozen stars exhibits evidence of planetary ingestion

Authors: Fan Liu, Yuan-Sen Ting, David Yong, Bertram Bitsch, Amanda Karakas, Michael T. Murphy, Meridith Joyce, Aaron Dotter, Fei Dai

Abstract: Stellar chemical compositions can be altered by ingestion of planetary material and/or planet formation which removes refractory material from the proto-stellar disc. These "planet signatures" appear as correlations between elemental abundance differences and the dust condensation temperature. Detecting these planet signatures, however, is challenging due to unknown occurrence rates, small amplitudes, and heterogeneous star samples with large differences in stellar ages, and therefore stars born together (i.e., co-natal) with identical compositions can facilitate such detections. While previous spectroscopic studies were limited to a small number of binary stars, the Gaia satellite provides new opportunities for detecting stellar chemical signatures of planets among co-moving pairs of stars confirmed to be co-natal. Here we report high-precision chemical abundances for a homogeneous sample of 91 co-natal pairs of stars with a well-defined selection function and identify at least seven new instances of planetary ingestion, corresponding to an occurrence rate of 8%. An independent Bayesian indicator is deployed, which can effectively disentangle the planet signatures from other factors, such as random abundance variation and atomic diffusion. Our study provides new evidence of planet signatures and facilitates a deeper understanding of the star-planet-chemistry connection by providing new observational constraints on the mechanisms of planet engulfment, formation and evolution.

Submitted 19 March, 2024; originally announced March 2024.

Comments: 29 pages, 11 figures. Author's submitted version before final edits. Published in Nature on March 21, 2024: https://www.nature.com/articles/s41586-024-07091-y
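
Note on the occurrence rate: the 8% figure in the abstract above follows directly from the quoted counts (at least seven ingestion cases among 91 co-natal pairs). The sketch below reproduces that arithmetic and, as an added assumption not taken from the paper, attaches a simple binomial standard error. The paper itself describes a Bayesian indicator, which this sketch does not attempt to reproduce.

```python
import math

N_PAIRS = 91      # co-natal pairs in the sample, from the abstract
N_INGESTION = 7   # "at least seven" ingestion cases, from the abstract

# Point estimate of the occurrence rate.
rate = N_INGESTION / N_PAIRS

# Simple binomial standard error (an illustrative assumption,
# not the Bayesian treatment described in the paper).
std_err = math.sqrt(rate * (1.0 - rate) / N_PAIRS)

print(f"occurrence rate = {rate:.1%} (rounds to {round(100 * rate)}%)")
print(f"binomial standard error = {std_err:.1%}")
print(f"one in a dozen = {1 / 12:.1%}")
```

The point estimate is about 7.7%, which rounds to the quoted 8% and is close to the "one in a dozen" (about 8.3%) of the title.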

arXiv:2403.05398 [pdf, other]

The Wide-field Spectroscopic Telescope (WST) Science White Paper

Authors: Vincenzo Mainieri, Richard I. Anderson, Jarle Brinchmann, Andrea Cimatti, Richard S. Ellis, Vanessa Hill, Jean-Paul Kneib, Anna F. McLeod, Cyrielle Opitom, Martin M. Roth, Paula Sanchez-Saez, Rodolfo Smiljanic, Eline Tolstoy, Roland Bacon, Sofia Randich, Angela Adamo, Francesca Annibali, Patricia Arevalo, Marc Audard, Stefania Barsanti, Giuseppina Battaglia, Amelia M. Bayo Aran, Francesco Belfiore, Michele Bellazzini, Emilio Bellini, et al. (192 additional authors not shown)

Abstract: The Wide-field Spectroscopic Telescope (WST) is proposed as a new facility dedicated to the efficient delivery of spectroscopic surveys. This white paper summarises the initial concept as well as the corresponding science cases. WST will feature simultaneous operation of a large field-of-view (3 sq. degree), a high multiplex (20,000) multi-object spectrograph (MOS) and a giant 3x3 sq. arcmin integral field spectrograph (IFS). In scientific capability these requirements place WST far ahead of existing and planned facilities. Given the current investment in deep imaging surveys and noting the diagnostic power of spectroscopy, WST will fill a crucial gap in astronomical capability and work synergistically with future ground and space-based facilities. This white paper shows that WST can address outstanding scientific questions in the areas of cosmology; galaxy assembly, evolution, and enrichment, including our own Milky Way; origin of stars and planets; time domain and multi-messenger astrophysics. WST's uniquely rich dataset will deliver unforeseen discoveries in many of these areas. The WST Science Team (already including more than 500 scientists worldwide) is open to the entire astronomical community. To register in the WST Science Team please visit https://www.wstelescope.com/for-scientists/participate

Submitted 12 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

Comments: 194 pages, 66 figures. Comments are welcome (wstelescope@gmail.com)

arXiv:2402.08632 [pdf, other]

Cosmological evolution of metallicity correlation functions from the Auriga simulations

Authors: Zefeng Li, Robert J. J. Grand, Emily Wisnioski, J. Trevor Mendel, Mark R. Krumholz, Yuan-Sen Ting, Ruediger Pakmor, Facundo A. Gómez, Federico Marinacci, Ioana Ciucă

Abstract: We study the cosmological evolution of the two-point correlation functions of galactic gas-phase metal distributions using the 28 simulated galaxies from the Auriga Project. Using mock observations of the $z = 0$ snapshots to mimic our past work, we show that the correlation functions of the simulated mock observations are well matched to the correlation functions measured from local galaxy surveys. This comparison suggests that the simulations capture the processes important for determining metal correlation lengths, the key parameter in metallicity correlation functions. We investigate the evolution of metallicity correlations over cosmic time using the true simulation data, showing that individual galaxies undergo no significant systematic evolution in their metal correlation functions from $z\sim 3$ to today. In addition, the fluctuations in metal correlation length are correlated with but lag ahead fluctuations in star formation rate. This suggests that re-arrangement of metals within galaxies occurs at a higher cadence than star formation activity, and is more sensitive to the changes of environment, such as galaxy mergers, gas inflows / outflows, and fly-bys.

Submitted 13 February, 2024; originally announced February 2024.

Comments: 12 pages, 10 figures, 1 table, accepted for publication in MNRAS

arXiv:2402.06242 [pdf, ps, other]

Determining Stellar Elemental Abundances from DESI Spectra with the Data-Driven Payne

Authors: Meng Zhang, Maosheng Xiang, Yuan-Sen Ting, Jiahui Wang, Haining Li, Hu Zou, Jundan Nie, Lanya Mou, Tianmin Wu, Yaqian Wu, Jifeng Liu

Abstract: Stellar abundances for a large number of stars are key information for the study of Galactic formation history. Large spectroscopic surveys such as DESI and LAMOST take medium-to-low resolution ($R\lesssim5000$) spectra in the full optical wavelength range for millions of stars. However, line blending in these spectra poses great challenges for elemental abundance determination. Here we apply the DD-PAYNE, a data-driven method regularised by differential spectra from stellar physical models, to the DESI EDR spectra for stellar abundance determination. Our implementation delivers 15 labels, including effective temperature $T_{\rm eff}$, surface gravity $\log g$, microturbulence velocity $v_{\rm mic}$, and abundances for 12 individual elements, namely C, N, O, Mg, Al, Si, Ca, Ti, Cr, Mn, Fe, Ni. Given a spectral signal-to-noise ratio of 100 per pixel, the internal precision of the label estimates is about 20 K for $T_{\rm eff}$, 0.05 dex for $\log g$, and 0.05 dex for most elemental abundances. These results agree with theoretical limits from the Cramér-Rao bound calculation within a factor of two. The Gaia-Enceladus-Sausage, which contributes the majority of the accreted halo stars, is discernible from the disk and in-situ halo populations in the resultant [Mg/Fe]-[Fe/H] and [Al/Fe]-[Fe/H] abundance spaces. We also provide distance and orbital parameters for the sample stars, which span distances out to $\sim$100 kpc.
The DESI sample has a significantly higher fraction of distant (or metal-poor) stars than other existing spectroscopic surveys, making it a powerful data set to study the Galactic outskirts. The catalog is publicly available.

Submitted 9 February, 2024; originally announced February 2024.

Comments: 18 pages, 12 figures. Submitted to ApJS

arXiv:2401.11737 [pdf, other]

doi 10.1002/adts.202301227

Sphractal: Estimating the Fractal Dimension of Surfaces Computed from Precise Atomic Coordinates via Box-Counting Algorithm

Authors: Jonathan Yik Chang Ting, Andrew Thomas Agars Wood, Amanda Susan Barnard

Abstract: The fractal dimension of a surface allows its degree of roughness to be characterized quantitatively. However, limited effort has been made to calculate the fractal dimension of surfaces computed from precisely known atomic coordinates from computational biomolecular and nanomaterial studies. This work proposes methods to estimate the fractal dimension of the surface of any 3D object composed of spheres, by representing the surface as either a voxelized point cloud or a mathematically exact surface, and computing its box-counting dimension. Sphractal is published as a Python package that provides these functionalities, and its utility is demonstrated on a set of simulated palladium nanoparticle data.

Submitted 10 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

Comments: 18 pages, 13 figures

ACM Class: J.2

Journal ref: Adv. Theory Simul. 2024, 2301227

arXiv:2401.05620 [pdf, other]

Galactic-Seismology Substructures and Streams Hunter with LAMOST and Gaia. I. Methodology and Local Halo Results

Authors: Guan-Yu Wang, Hai-Feng Wang, Yang-Ping Luo, Yuan-Sen Ting, Thor Tepper-García, Joss Bland-Hawthorn, Jeffrey Carlin

Abstract: We present a novel, deep-learning-based method -- dubbed Galactic-Seismology Substructures and Streams Hunter, or GS$^{3}$ Hunter for short -- to search for substructures and streams in stellar kinematics data. GS$^{3}$ Hunter relies on a combined application of Siamese Neural Networks to transform the phase space information and the K-means algorithm for the clustering. As a validation test, we apply GS$^{3}$ Hunter to a subset of the Feedback in Realistic Environments (FIRE) cosmological simulations. The stellar streams and substructures thus identified are in good agreement with corresponding results reported earlier by the FIRE team. In the same vein, we apply our method to a subset of local halo stars from the Gaia Early Data Release 3 and GALAH DR3 datasets, and recover several previously known dynamical groups, such as Thamnos 1+2, Hot Thick Disk, ED-1, L-RL3, Helmi 1+2, Gaia-Sausage-Enceladus, Sequoia, VRM, Cronus, and Nereus. Finally, we apply our method without fine-tuning to a subset of K-giant stars located in the inner halo region, obtained from the LAMOST Data Release 5 (DR5) dataset. We recover three previously known structures (Sagittarius, Hercules-Aquila Cloud, and the Virgo Overdensity), but we also discover a number of new substructures.
We anticipate that GS$^{3}$ Hunter will become a useful tool for the community dedicated to the search for stellar streams and structures in the Milky Way (MW) and the Local Group, thus helping advance our understanding of the stellar inner and outer halos, and of the assembly and tidal stripping history in and around the MW.

Submitted 10 January, 2024; originally announced January 2024.

Comments: 40 pages, 33 figures, 3 tables

arXiv:2401.02165 [pdf, other]

Optimization of ionic configurations in battery materials by quantum annealing

Authors: Tobias Binninger, Yin-Ying Ting, Piotr M. Kowalski, Michael H. Eikerling

Abstract: Energy materials with disorder in site occupation are challenging for computational studies due to an exponential scaling of the configuration space. We herein present a grand-canonical optimization method that enables the use of quantum annealing (QA) for sampling the ionic ground state. The method relies on a Legendre transformation of the Coulomb energy cost function that strongly reduces the effective coupling strengths of the fully connected problem, which is essential for effectiveness of QA. The approach is expected to be applicable to a variety of materials optimization problems.

Submitted 4 January, 2024; originally announced January 2024.

arXiv:2401.01916 [pdf, other]

AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets

Authors: Ernest Perkowski, Rui Pan, Tuan Dung Nguyen, Yuan-Sen Ting, Sandor Kruk, Tong Zhang, Charlie O'Neill, Maja Jablonska, Zechang Sun, Michael J. Smith, Huiling Liu, Kevin Schawinski, Kartheik Iyer, Ioana Ciucă for UniverseTBD

Abstract: We explore the potential of enhancing LLM performance in astronomy-focused question-answering through targeted, continual pre-training. By employing a compact 7B-parameter LLaMA-2 model and focusing exclusively on a curated set of astronomy corpora -- comprising abstracts, introductions, and conclusions -- we achieve notable improvements in specialized topic comprehension. While general LLMs like GPT-4 excel in broader question-answering scenarios due to superior reasoning capabilities, our findings suggest that continual pre-training with limited resources can still enhance model performance on specialized topics. Additionally, we present an extension of AstroLLaMA: the fine-tuning of the 7B LLaMA model on a domain-specific conversational dataset, culminating in the release of the chat-enabled AstroLLaMA for community use. Comprehensive quantitative benchmarking is currently in progress and will be detailed in an upcoming full paper. The model, AstroLLaMA-Chat, is now available at https://huggingface.co/universeTBD, providing the first open-source conversational AI tool tailored for the astronomy community.

Submitted 5 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

Comments: 4 pages, 1 figure, model is available at https://huggingface.co/universeTBD, published in RNAAS

arXiv:2312.09287 [pdf, other]

Unsupervised Searches for Cosmological Parity-Violation: An Investigation with Convolutional Neural Networks

Authors: Peter L. Taylor, Matthew Craigie, Yuan-Sen Ting

Abstract: Recent measurements of the $4$-point correlation functions (4PCF) from spectroscopic surveys provide evidence for parity-violations in the large-scale structure of the Universe. If physical in origin, this could point to exotic physics during the epoch of inflation. However, searching for parity-violations in the 4PCF signal relies on a large suite of simulations to perform a rank test, or an accurate model of the 4PCF covariance to claim a detection, and this approach is incapable of extracting parity information from the higher-order $N$-point functions. In this work we present an unsupervised method which overcomes these issues, before demonstrating the approach is capable of detecting parity-violations in a few toy models using convolutional neural networks. This technique is complementary to the 4-point method and could be used to discover parity-violations in several upcoming surveys including DESI, Euclid and Roman.

Submitted 14 December, 2023; originally announced December 2023.

Comments: 8 pages, 5 figures. Submitted to PRD

arXiv:2312.08270 [pdf, other]

HRMOS White Paper: Science Motivation

Authors: Laura Magrini, Thomas Bensby, Anna Brucalassi, Sofia Randich, Robin Jeffries, Gayandhi de Silva, Asa Skuladottir, Rodolfo Smiljanic, Oscar Gonzalez, Vanessa Hill, Nadege Lagarde, Eline Tolstoy, Jose' Maria Arroyo-Polonio, Martina Baratella, John R. Barnes, Giuseppina Battaglia, Holger Baumgardt, Michele Bellazzini, Katia Biazzo, Angela Bragaglia, Bradley Carter, Giada Casali, Gabriele Cescutti, Camilla Danielski, Elisa Delgado Mena, et al. (30 additional authors not shown)

Abstract: The High-Resolution Multi-Object Spectrograph (HRMOS) is a facility instrument that we plan to propose for the Very Large Telescope (VLT) of the European Southern Observatory (ESO), following the initial presentation at the VLT 2030 workshop held at ESO in June 2019. HRMOS provides a combination of capabilities that are essential to carry out breakthrough science across a broad range of active research areas from stellar astrophysics and exoplanet studies to Galactic and Local Group archaeology. HRMOS fills a gap in capabilities amongst the landscape of future instrumentation planned for the next decade. The key characteristics of HRMOS will be high spectral resolution (R = 60000 - 80000) combined with multi-object (20-100) capabilities and long-term stability that will provide excellent radial velocity precision and accuracy (10 m/s). Initial designs predict that a SNR~100 will be achievable in about one hour for a star with mag(AB) = 15, while with the same exposure time a SNR~30 will be reached for a star with mag(AB) = 17. The combination of high resolution and multiplexing with wavelength coverage extending to relatively blue wavelengths (down to 380 nm) makes HRMOS a spectrograph that will push the boundaries of our knowledge and that is envisioned as a workhorse instrument in the future.
The science cases presented in this White Paper include topics and ideas developed by the Core Science Team with the contributions from the astronomical community, also through the wide participation in the first HRMOS Workshop (https://indico.ict.inaf.it/event/1547/) that took place in Firenze (Italy) in October 2021.

Submitted 13 December, 2023; originally announced December 2023.

Comments: 88 pages, 39 figures. Comments and expressions of interest are welcome by contacting members of the Core Science Team

arXiv:2311.02057 [pdf, other]

Neural ODEs as a discovery tool to characterize the structure of the hot galactic wind of M82

Authors: Dustin D. Nguyen, Yuan-Sen Ting, Todd A. Thompson, Sebastian Lopez, Laura A. Lopez

Abstract: Dynamic astrophysical phenomena are predominantly described by differential equations, yet our understanding of these systems is constrained by our incomplete grasp of non-linear physics and scarcity of comprehensive datasets. As such, advancing techniques in solving non-linear inverse problems becomes pivotal to addressing numerous outstanding questions in the field. In particular, modeling hot galactic winds is difficult because of unknown structure for various physical terms, and the lack of any kinematic observational data. Additionally, the flow equations contain singularities that lead to numerical instability, making parameter sweeps non-trivial. We leverage differentiable programming, which enables neural networks to be embedded as individual terms within the governing coupled ordinary differential equations (ODEs), and show that this method can adeptly learn hidden physics. We robustly discern the structure of a mass-loading function which captures the physical effects of cloud destruction and entrainment into the hot superwind. Within a supervised learning framework, we formulate our loss function anchored on the astrophysical entropy ($K \propto P/ρ^{5/3}$). Our results demonstrate the efficacy of this approach, even in the absence of kinematic data $v$. We then apply these models to real Chandra X-ray observations of starburst galaxy M82, providing the first systematic description of mass-loading within the superwind.
This work further highlights neural ODEs as a useful discovery tool with mechanistic interpretability in non-linear inverse problems. We make our code public at this GitHub repository (https://github.com/dustindnguyen/2023_NeurIPS_NeuralODEs_M82).

Submitted 28 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

Comments: 9 Pages, 2 Figures, Accepted at the NeurIPS 2023 workshop on Machine Learning and the Physical Sciences

arXiv:2310.20125 [pdf, other]

Zephyr: Stitching Heterogeneous Training Data with Normalizing Flows for Photometric Redshift Inference

Authors: Zechang Sun, Joshua S. Speagle, Song Huang, Yuan-Sen Ting, Zheng Cai

Abstract: We present zephyr, a novel method that integrates cutting-edge normalizing flow techniques into a mixture density estimation framework, enabling the effective use of heterogeneous training data for photometric redshift inference. Compared to previous methods, zephyr demonstrates enhanced robustness for both point estimation and distribution reconstruction by leveraging normalizing flows for density estimation and incorporating careful uncertainty quantification. Moreover, zephyr offers unique interpretability by explicitly disentangling contributions from multi-source training data, which can facilitate future weak lensing analysis by providing an additional quality assessment. As probabilistic generative deep learning techniques gain increasing prominence in astronomy, zephyr should become an inspiration for handling heterogeneous training data while remaining interpretable and robustly accounting for observational uncertainties.

Submitted 30 October, 2023; originally announced October 2023.

Comments: 10 pages, 5 figures, accepted to NeurIPS 2023 workshop on Machine Learning and the Physical Sciences

arXiv:2310.20050 [pdf, other]

Does the $ν_{\max}$ scaling relation depend on metallicity? Insights from 3D convection simulations

Authors: Yixiao Zhou, Jørgen Christensen-Dalsgaard, Martin Asplund, Yaguang Li, Regner Trampedach, Yuan-Sen Ting, Jakob L. Rørsted

Abstract: Solar-like oscillations have been detected in thousands of stars thanks to modern space missions. These oscillations have been used to measure stellar masses and ages, which have been widely applied in Galactic archaeology. One of the pillars of such applications is the $ν_{\max}$ scaling relation: the frequency of maximum power $ν_{\max}$, assumed to be proportional to the acoustic cut-off frequency, $ν_{\rm ac}$, scales with effective temperature and surface gravity. However, the theoretical basis of the $ν_{\max}$ scaling relation is uncertain, and there is an ongoing debate about whether it can be applied to metal-poor stars. We investigate the metallicity dependence of the $ν_{\max}$ scaling relation by carrying out 3D near-surface convection simulations for solar-type stars with [Fe/H] between -3 and 0.5 dex. Firstly, we found a negative correlation between $ν_{\rm ac}$ and metallicity from the 3D models. This is in tension with the positive correlation identified by studies using 1D models. Secondly, we estimated theoretical $ν_{\max}$ values using velocity amplitudes determined from first principles, by quantifying the mode excitation and damping rates with methods validated in our previous works. We found that at solar effective temperature and surface gravity, $ν_{\max}$ does not show correlation with metallicity. This study opens an exciting prospect of testing the asteroseismic scaling relations against realistic 3D hydrodynamical stellar models.

Submitted 20 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

Comments: 14 pages, 7 figures, accepted for publication in ApJ

arXiv:2310.12528 [pdf, other]

Constructing Impactful Machine Learning Research for Astronomy: Best Practices for Researchers and Reviewers

Authors: D. Huppenkothen, M. Ntampaka, M. Ho, M. Fouesneau, B. Nord, J. E. G. Peek, M. Walmsley, J. F. Wu, C. Avestruz, T. Buck, M. Brescia, D. P. Finkbeiner, A. D. Goulding, T. Kacprzak, P. Melchior, M. Pasquato, N. Ramachandra, Y. -S. Ting, G. van de Ven, S. Villar, V. A. Villar, E. Zinger

Abstract: Machine learning has rapidly become a tool of choice for the astronomical community. It is being applied across a wide range of wavelengths and problems, from the classification of transients to neural network emulators of cosmological simulations, and is shifting paradigms about how we generate and report scientific results. At the same time, this class of method comes with its own set of best practices, challenges, and drawbacks, which, at present, are often reported on incompletely in the astrophysical literature. With this paper, we aim to provide a primer to the astronomical community, including authors, reviewers, and editors, on how to implement machine learning models and report their results in a way that ensures the accuracy of the results, reproducibility of the findings, and usefulness of the method.

Submitted 19 October, 2023; originally announced October 2023.

Comments: 14 pages, 3 figures; submitted to the Bulletin of the American Astronomical Society

arXiv:2309.16316 [pdf, other]

doi 10.1093/mnras/stae068

Astroconformer: The Prospects of Analyzing Stellar Light Curves with Transformer-Based Deep Learning Models

Authors: Jia-Shu Pan, Yuan-Sen Ting, Jie Yu

Abstract: Stellar light curves contain valuable information about oscillations and granulation, offering insights into stars' internal structures and evolutionary states. Traditional asteroseismic techniques, primarily focused on power spectral analysis, often overlook the crucial phase information in these light curves. Addressing this gap, recent machine learning applications, particularly those using Convolutional Neural Networks (CNNs), have made strides in inferring stellar properties from light curves. However, CNNs are limited by their localized feature extraction capabilities. In response, we introduce $\textit{Astroconformer}$, a Transformer-based deep learning framework, specifically designed to capture long-range dependencies in stellar light curves. Our empirical analysis centers on estimating surface gravity ($\log g$), using a dataset derived from single-quarter Kepler light curves with $\log g$ values ranging from 0.2 to 4.4. $\textit{Astroconformer}$ demonstrates superior performance, achieving a root-mean-square-error (RMSE) of 0.017 dex at $\log g\approx3$ in data-rich regimes and up to 0.1 dex in sparser areas. This performance surpasses both K-nearest neighbor models and advanced CNNs. Ablation studies highlight the influence of receptive field size on model effectiveness, with larger fields correlating to improved results. $\textit{Astroconformer}$ also excels in extracting $ν_{\max}$ with high precision. It achieves less than 2% relative median absolute error for 90-day red giant light curves.
Notably, the error remains under 3% for 30-day light curves, whose oscillations are undetectable by a conventional pipeline in 30% of cases. Furthermore, the attention mechanisms in $\textit{Astroconformer}$ align closely with the characteristics of stellar oscillations and granulation observed in light curves.

Submitted 18 January, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

Comments: 15 pages, 10 figures, Accepted by MNRAS

arXiv:2309.06126 [pdf, other]

AstroLLaMA: Towards Specialized Foundation Models in Astronomy

Authors: Tuan Dung Nguyen, Yuan-Sen Ting, Ioana Ciucă, Charlie O'Neill, Ze-Chang Sun, Maja Jabłońska, Sandor Kruk, Ernest Perkowski, Jack Miller, Jason Li, Josh Peek, Kartheik Iyer, Tomasz Różański, Pranav Khetarpal, Sharaf Zaman, David Brodrick, Sergio J. Rodríguez Méndez, Thang Bui, Alyssa Goodman, Alberto Accomazzi, Jill Naiman, Jesse Cranney, Kevin Schawinski, UniverseTBD

Abstract: Large language models excel in many human-language tasks but often falter in highly specialized domains like scholarly astronomy. To bridge this gap, we introduce AstroLLaMA, a 7-billion-parameter model fine-tuned from LLaMA-2 using over 300,000 astronomy abstracts from arXiv. Optimized for traditional causal language modeling, AstroLLaMA achieves a 30% lower perplexity than LLaMA-2, showing marked domain adaptation. Our model generates more insightful and scientifically relevant text completions and embedding extraction than state-of-the-art foundation models despite having significantly fewer parameters. AstroLLaMA serves as a robust, domain-specific model with broad fine-tuning potential. Its public release aims to spur astronomy-focused research, including automatic paper summarization and conversational agent development.

Submitted 12 September, 2023; originally announced September 2023.

Comments: 6 pages, 3 figures, submitted to IJCNLP-AACL 2023. Comments are welcome. The model can be found on Hugging Face - https://huggingface.co/universeTBD/astrollama

arXiv:2309.01546 [pdf, ps, other]

C3PO: Towards a complete census of co-moving pairs of stars. I. High precision stellar parameters for 250 stars

Authors: David Yong, Fan Liu, Yuan-Sen Ting, Meridith Joyce, Bertram Bitsch, Fei Dai, Aaron Dotter, Amanda I. Karakas, Michael T. Murphy

Abstract: We conduct a line-by-line differential analysis of a sample of 125 co-moving pairs of stars (dwarfs and subgiants near solar metallicity). We obtain high precision stellar parameters with average uncertainties in effective temperature, surface gravity and metallicity of 16.5 K, 0.033 dex and 0.014 dex, respectively. We classify the co-moving pairs of stars into two groups, chemically homogeneous (conatal; $|\Delta$[Fe/H]$| \le 0.04$ dex) and inhomogeneous (non-conatal), and examine the fraction of chemically homogeneous pairs as a function of separation and effective temperature. The four main conclusions from this study are: (1) A spatial separation of $\Delta s = 10^6$ AU is an approximate boundary between homogeneous and inhomogeneous pairs of stars, and we restrict our conclusions to only consider the 91 pairs with $\Delta s \le 10^6$ AU; (2) There is no trend between velocity separation and the fraction of chemically homogeneous pairs in the range $\Delta v \le 4$ km s$^{-1}$; (3) We confirm that the fraction of chemically inhomogeneous pairs increases with increasing $T_{\rm eff}$ and the trend matches a toy model of that expected from planet ingestion; (4) Atomic diffusion is not the main cause of the chemical inhomogeneity. A major outcome from this study is a sample of 56 bright co-moving pairs of stars with chemical abundance differences $\leq$ 0.02 dex (5%) which is a level of chemical homogeneity comparable to that of the Hyades open cluster.
These important objects can be used, in conjunction with star clusters and the Gaia "benchmark" stars, to calibrate stellar abundances from large-scale spectroscopic surveys.

Submitted 4 September, 2023; originally announced September 2023.

Comments: MNRAS in press (see source file for full versions of long tables)

arXiv:2308.15976 [pdf, other]

The dawn is quiet here: Rise in [$α$/Fe] is a signature of massive gas accretion that fueled proto-Milky Way

Authors: Boquan Chen, Yuan-Sen Ting, Michael Hayden

Abstract: The proto-Milky Way epoch forms the earliest stars in our Galaxy and sets the initial conditions for subsequent disk formation. Recent observations from APOGEE and H3 surveys showed that the [$α$/Fe] ratio slowly declined between [Fe/H] $=-3$ and $-1.3$ until it reached the lowest value ($\sim 0.25$) among the selected in situ metal-poor stars that most likely formed during the proto-Galaxy epoch. [$α$/Fe] rose to meet the traditional high value commonly associated with the thick disk population at [Fe/H] $=-1$. It was suggested that the rise in [$α$/Fe] could be caused by an increase in the star formation efficiency (SFE), known as the "simmering" phase scenario. However, gas inflow also plays a vital role in shaping the star formation history and chemical evolution of galaxies. We investigate this unexpected [$α$/Fe]-rise with a statistical experiment involving a galactic chemical evolution (GCE) model. Our model has five free parameters: the mass of the initial reservoir of the cold interstellar medium (ISM) at birth, the frequency of Type Ia supernovae (SNe Ia), the cooling timescale of the warm ISM, the SFE, and the inflow rate of fresh gas. The last two free parameters were allowed to change after [$α$/Fe] reached its lowest value, dividing the proto-Galaxy epoch into two phases. We find that the rise in [$α$/Fe] is caused by a large inflow of fresh gas and conclude that the [$α$/Fe]-rise is a signature of the cold mode accretion whose materials formed the prototype Milky Way preceding disk formation.
Although the SFE is essential in regulating the chemical evolution, it does not necessarily increase to facilitate the [$α$/Fe]-rise.

Submitted 30 August, 2023; originally announced August 2023.

Comments: 15 pages, 10 figures

arXiv:2308.13768 [pdf, other]

Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content

Authors: Charles O'Neill, Jack Miller, Ioana Ciuca, Yuan-Sen Ting, Thang Bui

Abstract: In this paper, we tackle the emerging challenge of unintended harmful content generation in Large Language Models (LLMs) with a novel dual-stage optimisation technique using adversarial fine-tuning. Our two-pronged approach employs an adversarial model, fine-tuned to generate potentially harmful prompts, and a judge model, iteratively optimised to discern these prompts. In this adversarial cycle, the two models seek to outperform each other in the prompting phase, generating a dataset of rich examples which are then used for fine-tuning. This iterative application of prompting and fine-tuning allows continuous refinement and improved performance. The performance of our approach is evaluated through classification accuracy on a dataset consisting of problematic prompts not detected by GPT-4, as well as a selection of contentious but unproblematic prompts. We show considerable increase in classification accuracy of the judge model on this challenging dataset as it undergoes the optimisation process. Furthermore, we show that a rudimentary model \texttt{ada} can achieve 13\% higher accuracy on the hold-out test set than GPT-4 after only a few rounds of this process, and that this fine-tuning improves performance in parallel tasks such as toxic comment identification.

Submitted 26 August, 2023; originally announced August 2023.

arXiv:2308.13702 [pdf, other]

doi 10.1093/mnras/stae969

Extending the Chemical Reach of the H3 Survey: Detailed Abundances of the Dwarf-galaxy Stellar Stream Wukong/LMS-1

Authors: Guilherme Limberg, Alexander P. Ji, Rohan P. Naidu, Anirudh Chiti, Silvia Rossi, Sam A. Usman, Yuan-Sen Ting, Dennis Zaritsky, Ana Bonaca, Lais Borbolato, Joshua S. Speagle, Vedant Chandra, Charlie Conroy

Abstract: We present the first detailed chemical-abundance analysis of stars from the dwarf-galaxy stellar stream Wukong/LMS-1 covering a wide metallicity range ($-3.5 < \rm[Fe/H] \lesssim -1.3$). We find abundance patterns that are effectively indistinguishable from the bulk of Indus and Jhelum, a pair of smaller stellar streams proposed to be dynamically associated with Wukong/LMS-1. We confirmed a carbon-enhanced metal-poor star ($\rm[C/Fe] > +0.7$ and $\rm[Fe/H] \sim -2.9$) in Wukong/LMS-1 with strong enhancements in Sr, Y, and Zr, which is peculiar given its solar-level [Ba/Fe]. Wukong/LMS-1 stars have high abundances of $α$ elements up to $\rm[Fe/H] \gtrsim -2$, which is expected for relatively massive dwarfs. Towards the high-metallicity end, Wukong/LMS-1 becomes $α$-poor, revealing that it probably experienced fairly standard chemical evolution. We identified a pair of N- and Na-rich stars in Wukong/LMS-1, reminiscent of multiple populations in globular clusters. This indicates that this dwarf galaxy contained at least one globular cluster that was completely disrupted in addition to two intact ones previously known to be associated with Wukong/LMS-1, which is possibly connected to similar evidence found in Indus. From these $\geq$3 globular clusters, we estimate the total mass of Wukong/LMS-1 to be ${\approx}10^{10} M_\odot$, representing ${\sim}1$% of the present-day Milky Way. Finally, the [Eu/Mg] ratio in Wukong/LMS-1 continuously increases with metallicity, making this the first example of a dwarf galaxy where the production of $r$-process elements is clearly dominated by delayed sources, presumably neutron-star mergers.

Submitted 5 April, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

Comments: Accepted to MNRAS. New version fixes abundance uncertainties, which significantly affects elements like Al and N. Use new version of abundance files for correct error bars

arXiv:2308.08584 [pdf, other]

Dynamical masses across the Hertzsprung-Russell diagram

Authors: Hsiang-Chih Hwang, Yuan-Sen Ting, Sihao Cheng, Joshua S. Speagle

Abstract: We infer the dynamical masses of stars across the Hertzsprung-Russell (H-R) diagram using wide binaries from the Gaia survey. Gaia's high-precision astrometry measures the wide binaries' orbital motion, which contains the mass information. Using wide binaries as the training sample, we measure the mass of stars across the two-dimensional H-R diagram using the combination of statistical inference and neural networks. Our results provide the dynamical mass measurements for main-sequence stars from 0.1 to 2 M$_\odot$, unresolved binaries and unresolved triples on the main sequence, and the mean masses of giants and white dwarfs. Two regions in the H-R diagram show interesting behaviors in mass, where one of them is pre-main-sequence stars, and the other one may be related to close compact object companions like M dwarf-white dwarf binaries. These mass measurements depend solely on Newtonian dynamics, providing independent constraints on stellar evolutionary models and the occurrence rate of compact objects.

Submitted 16 August, 2023; originally announced August 2023.

Comments: Fig. 5 and Fig. 12 are the key results. Submitted to MNRAS. Comments are welcome!

arXiv:2308.07645 [pdf, other]

Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation

Authors: Charles O'Neill, Yuan-Sen Ting, Ioana Ciuca, Jack Miller, Thang Bui

Abstract: Large Language Models (LLMs) hold immense potential to generate synthetic data of high quality and utility, which has numerous applications from downstream model training to practical data utilisation. However, contemporary models, despite their impressive capacities, consistently struggle to produce both coherent and diverse data. To address the coherency issue, we introduce contrastive expert guidance, where the difference between the logit distributions of fine-tuned and base language models is emphasised to ensure domain adherence. In order to ensure diversity, we utilise existing real and synthetic examples as negative prompts to the model. We deem this dual-pronged approach to logit reshaping as STEER: Semantic Text Enhancement via Embedding Repositioning. STEER operates at inference-time and systematically guides the LLMs to strike a balance between adherence to the data distribution (ensuring semantic fidelity) and deviation from prior synthetic examples or existing real datasets (ensuring diversity and authenticity). This delicate balancing act is achieved by dynamically moving towards or away from chosen representations in the latent space. STEER demonstrates improved performance over previous synthetic data generation techniques, exhibiting better balance between data diversity and coherency across three distinct tasks: hypothesis generation, toxic and non-toxic comment generation, and commonsense reasoning task generation. We demonstrate how STEER allows for fine-tuned control over the diversity-coherency trade-off via its hyperparameters, highlighting its versatility.
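The logit reshaping described in this abstract can be illustrated with a toy sketch. The abstract does not give the update rule, so the linear form below, the function name `push_away`, and the weight `gamma` are all assumptions for illustration; this is not the STEER implementation. The same form could be applied with negative-prompt logits as the reference.

```python
import math


def softmax(logits):
    """Convert a list of logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def push_away(logits, reference, weight):
    """Emphasise the difference between `logits` and `reference`.

    Hypothetical form: l + weight * (l - reference). With fine-tuned logits
    as `logits` and base-model logits as `reference`, tokens the fine-tuned
    model prefers more than the base model are boosted.
    """
    return [l + weight * (l - r) for l, r in zip(logits, reference)]


# Toy next-token logits over a three-token vocabulary (made-up numbers).
finetuned = [2.0, 1.0, 0.0]
base = [1.0, 1.0, 1.0]
gamma = 1.0  # assumed guidance strength

guided = push_away(finetuned, base, gamma)
probs_before = softmax(finetuned)
probs_after = softmax(guided)
```

In this toy case the token favoured by the fine-tuned model gains probability mass after guidance; larger `gamma` sharpens that effect, which is one plausible way a hyperparameter could trade diversity against coherency.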

Submitted 17 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

arXiv:2307.09568 [pdf, other]

Can Diffusion Model Conditionally Generate Astrophysical Images?

Authors: Xiaosheng Zhao, Yuan-Sen Ting, Kangning Diao, Yi Mao

Abstract: Generative adversarial networks (GANs) are frequently utilized in astronomy to construct an emulator of numerical simulations. Nevertheless, training GANs can prove to be a precarious task, as they are prone to instability and often lead to mode collapse problems. Conversely, the diffusion model also has the ability to generate high-quality data without adversarial training. It has shown superiority over GANs with regard to several natural image datasets. In this study, we undertake a quantitative comparison between the denoising diffusion probabilistic model (DDPM) and StyleGAN2 (one of the most robust types of GANs) via a set of robust summary statistics from scattering transform. In particular, we utilize both models to generate the images of 21 cm brightness temperature mapping, as a case study, conditionally based on astrophysical parameters that govern the process of cosmic reionization. Using our new Fréchet Scattering Distance (FSD) as the evaluation metric to quantitatively compare the sample distribution between generative models and simulations, we demonstrate that DDPM outperforms StyleGAN2 on varied sizes of training sets. Through Fisher forecasts, we demonstrate that on our datasets, StyleGAN2 exhibits mode collapses in varied ways, while DDPM yields a more robust generation. We also explore the role of classifier-free guidance in DDPM and show the preference for a non-zero guidance scale only when the training data is limited. Our findings indicate that the diffusion model presents a promising alternative to GANs in the generation of accurate images. These images can subsequently provide reliable parameter constraints, particularly in the realm of astrophysics.

Submitted 13 November, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

Comments: 14 pages, 10 figures, 1 table. Accepted for publication in MNRAS. Comments welcome

arXiv:2306.15719 [pdf, other]

doi 10.3847/1538-4357/acf7bf

Discovery of the Magellanic Stellar Stream Out to 100 Kiloparsecs

Authors: Vedant Chandra, Rohan P. Naidu, Charlie Conroy, Ana Bonaca, Dennis Zaritsky, Phillip A. Cargile, Nelson Caldwell, Benjamin D. Johnson, Jiwon Jesse Han, Yuan-Sen Ting

Abstract: The Magellanic Stream (MS) - an enormous ribbon of gas spanning $140^\circ$ of the southern sky trailing the Magellanic Clouds - has been exquisitely mapped in the five decades since its discovery. However, despite concerted efforts, no stellar counterpart to the MS has been conclusively identified. This stellar stream would reveal the distance and 6D kinematics of the MS, constraining its formation and the past orbital history of the Clouds. We have been conducting a spectroscopic survey of the most distant and luminous red giant stars in the Galactic outskirts. From this dataset, we have discovered a prominent population of 13 stars matching the extreme angular momentum of the Clouds, spanning up to $100^\circ$ along the MS at distances of $60-120$ kpc. Furthermore, these kinematically selected stars lie along a [$α$/Fe]-deficient track in chemical space from $-2.5 < \mathrm{[Fe/H]} < -0.5$, consistent with their formation in the Clouds themselves. We identify these stars as high-confidence members of the Magellanic Stellar Stream. Half of these stars are metal-rich and closely follow the gaseous MS, whereas the other half are more scattered and metal-poor. We argue that the metal-rich stream is the recently-formed tidal counterpart to the MS, and speculate that the metal-poor population was thrown out of the SMC outskirts during an earlier interaction between the Clouds. The Magellanic Stellar Stream provides a strong set of constraints - distances, 6D kinematics, and birth locations - that will guide future simulations towards unveiling the detailed history of the Clouds.

Submitted 27 June, 2023; originally announced June 2023.

Comments: 21 pages, 12 figures. Submitted to ApJ

arXiv:2306.15703 [pdf, other]

Toward a Spectral Foundation Model: An Attention-Based Approach with Domain-Inspired Fine-Tuning and Wavelength Parameterization

Authors: Tomasz Różański, Yuan-Sen Ting, Maja Jabłońska

Abstract: Astrophysical explorations are underpinned by large-scale stellar spectroscopy surveys, necessitating a paradigm shift in spectral fitting techniques. Our study proposes three enhancements to transcend the limitations of the current spectral emulation models. We implement an attention-based emulator, adept at unveiling long-range information between wavelength pixels. We leverage a domain-specific fine-tuning strategy where the model is pre-trained on spectra with fixed stellar parameters and variable elemental abundances, followed by fine-tuning on the entire domain. Moreover, by treating wavelength as an autonomous model parameter, akin to neural radiance fields, the model can generate spectra on any wavelength grid. In the case with a training set of O(1000), our approach exceeds current leading methods by a factor of 5-10 across all metrics.

Submitted 26 June, 2023; originally announced June 2023.

Comments: 7 pages, 3 figures, accepted to ICML 2023 Workshop on Machine Learning for Astrophysics

arXiv:2306.14206 [pdf, other]

Weisfeiler-Lehman Graph Kernel Method: A New Approach to Weak Chemical Tagging

Authors: Yuan-Sen Ting, Bhavesh Sharma

Abstract: Stars' chemical signatures provide invaluable insights into stellar cluster formation. This study utilized the Weisfeiler-Lehman (WL) Graph Kernel to examine a 15-dimensional elemental abundance space. Through simulating chemical distributions using normalizing flows, the effectiveness of our algorithm was affirmed. The results highlight the capability of the WL algorithm, coupled with Gaussian Process Regression, to identify patterns within elemental abundance point clouds correlated with various cluster mass functions. Notably, the WL algorithm exhibits superior interpretability, efficacy and robustness compared to deep sets and graph convolutional neural networks and enables optimal training with significantly fewer simulations (O(10)), a reduction of at least two orders of magnitude relative to graph neural networks.

Submitted 25 June, 2023; originally announced June 2023.

Comments: 7 pages, 4 figures, accepted to the ICML 2023 Machine Learning for Astrophysics workshop

arXiv:2306.11648 [pdf, other]

Harnessing the Power of Adversarial Prompting and Large Language Models for Robust Hypothesis Generation in Astronomy

Authors: Ioana Ciucă, Yuan-Sen Ting, Sandor Kruk, Kartheik Iyer

Abstract: This study investigates the application of Large Language Models (LLMs), specifically GPT-4, within Astronomy. We employ in-context prompting, supplying the model with up to 1000 papers from the NASA Astrophysics Data System, to explore the extent to which performance can be improved by immersing the model in domain-specific literature. Our findings point towards a substantial boost in hypothesis generation when using in-context prompting, a benefit that is further accentuated by adversarial prompting. We illustrate how adversarial prompting empowers GPT-4 to extract essential details from a vast knowledge base to produce meaningful hypotheses, signaling an innovative step towards employing LLMs for scientific research in Astronomy.

Submitted 20 June, 2023; originally announced June 2023.

Comments: 8 pages, 3 figures, accepted to ICML ML4Astro Workshop. Comments and suggestions are welcome

arXiv:2306.04688 [pdf, other]

The Prevalence of the $α$-bimodality: First JWST $α$-abundance Results in M31

Authors: David L. Nidever, Karoline Gilbert, Erik Tollerud, Charles Siders, Ivanna Escala, Carlos Allende Prieto, Verne Smith, Katia Cunha, Victor P. Debattista, Yuan-Sen Ting, Evan N. Kirby

Abstract: We present initial results from our JWST NIRSpec program to study the $α$-abundances in the M31 disk. The Milky Way has two chemically-defined disks, the low-$α$ and high-$α$ disks, which are closely related to the thin and thick disks, respectively. The origin of the two populations and the $α$-bimodality between them is not entirely clear, although there are now several models that can reproduce the observed features. To help constrain the models and discern the origin, we have undertaken a study of the chemical abundances of the M31 disk using JWST NIRSpec, in order to determine whether stars in M31's disk also show an $α$-abundance bimodality. Approximately 100 stars were observed in our single NIRSpec field at a projected distance of 18 kpc from the M31 center. The 1-D extracted spectra have an average signal-to-noise ratio of 85, leading to a statistical metallicity precision of 0.016 dex, an $α$-abundance precision of 0.012 dex, and a radial velocity precision of 8 km/s. The initial results indicate that, in contrast to the Milky Way, there is no $α$-bimodality in the M31 disk, and no low-$α$ sequence. The entire stellar population falls along a single chemical sequence very similar to the MW's high-$α$ component, which had a high star formation rate. While this is somewhat unexpected, the result is not that surprising based on other studies that found the M31 disk has a larger velocity dispersion than the MW and is dominated by a thick component. M31 has had a more active accretion and merger history than the MW, which might explain the chemical differences.

Submitted 7 June, 2023; originally announced June 2023.

Comments: 8 pages, 4 figures, IAU Symposium 377, Early Disk-Galaxy Formation: From JWST to the Milky Way

arXiv:2305.15634 [pdf, other]

Disentangling Stellar Age Estimates from Galactic Chemodynamical Evolution

Authors: Jeff Shen, Joshua S. Speagle, J. Ted Mackereth, Yuan-Sen Ting, Jo Bovy

Abstract: Stellar ages are key for determining the formation history of the Milky Way, but are difficult to measure precisely. Furthermore, methods that use chemical abundances to infer ages may entangle the intrinsic evolution of stars with the chemodynamical evolution of the Galaxy. In this paper, we present a framework for making probabilistic predictions of stellar ages, and then quantify the contribution of both stellar evolution and Galactic chemical evolution to those predictions using SHAP values. We apply this interpretable prediction framework to both a simulated Milky Way sample containing stars in a variety of evolutionary stages and an APOGEE-mocked sample of red clump stars. We find that in the former case, stellar evolution is the dominant driver for age estimates, while in the latter case, the more restricted evolutionary information causes the model to proxy ages through the chemical evolution model. We show that as a result of the use of non-intrinsic Galactic chemical information, trends estimated with the predicted ages, such as the age-metallicity relation, can deviate from the truth.

Submitted 24 May, 2023; originally announced May 2023.

Comments: 18 pages, 17 figures, submitted to ApJ

arXiv:2305.05854 [pdf, other]

doi 10.3847/1538-4365/acce36

Stellar Parameters and Chemical Abundances Estimated from LAMOST-II DR8 MRS based on Cycle-StarNet

Authors: Rui Wang, A-Li Luo, Shuo Zhang, Yuan-Sen Ting, Teaghan O'Briain, LAMOST MRS Collaboration

Abstract: Deriving stellar atmospheric parameters and chemical abundances from stellar spectra is crucial for understanding the evolution of the Milky Way. By performing a fitting with MARCS model atmospheric theoretical synthetic spectra combined with a domain-adaptation method, we estimate the fundamental stellar parameters (Teff, log g, [Fe/H], vmic, and vmac) and 11 chemical abundances for 1.38 million FGKM-type stars of the Medium-Resolution Spectroscopic Survey (MRS) from LAMOST-II DR8. The domain-adaptation method, Cycle-StarNet, is employed to reduce the gap between observed and synthetic spectra, and the L-BFGS algorithm is used to search for the best-fit synthetic spectra. By combining the 2MASS photometric survey data, Gaia EDR3 parallax, and MIST isochrones, the surface gravities of the stars are constrained after estimating their bolometric luminosities. The accuracy of Teff, log g, and [Fe/H] can reach 150 K, 0.11 dex, and 0.15 dex, evaluated by the PASTEL catalog, asteroseismic samples, and other spectroscopic surveys. The precision of these parameters and elemental abundances ([C/Fe], [Na/Fe], [Mg/Fe], [Si/Fe], [Ca/Fe], [Ti/Fe], [Cr/Fe], [Mn/Fe], [Co/Fe], [Ni/Fe], and [Cu/Fe]) is assessed by repeated observations and validated by cluster members. For spectra with signal-to-noise (S/N) ratios greater than 10, the precision of the three stellar parameters and elemental abundances can achieve 76 K, 0.014 dex, 0.096 dex, and 0.04-0.15 dex. For spectra with S/N ratios higher than 100, the precision stabilizes at 22 K, 0.006 dex, 0.043 dex, and 0.01-0.06 dex. The full LAMOST MRS stellar properties catalog is available online.

Submitted 9 May, 2023; originally announced May 2023.

Comments: Accepted for publication in ApJS

arXiv:2304.05406 [pdf, other]

Galactic ChitChat: Using Large Language Models to Converse with Astronomy Literature

Authors: Ioana Ciucă, Yuan-Sen Ting

Abstract: We demonstrate the potential of the state-of-the-art OpenAI GPT-4 large language model to engage in meaningful interactions with Astronomy papers using in-context prompting. To optimize for efficiency, we employ a distillation technique that effectively reduces the size of the original input paper by 50\%, while maintaining the paragraph structure and overall semantic integrity. We then explore the model's responses using a multi-document context (ten distilled documents). Our findings indicate that GPT-4 excels in the multi-document domain, providing detailed answers contextualized within the framework of related research findings. Our results showcase the potential of large language models for the astronomical community, offering a promising avenue for further exploration, particularly the possibility of utilizing the models for hypothesis generation.

Submitted 11 September, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

Comments: 3 pages, published in RNAAS

arXiv:2304.02958 [pdf, other]

doi 10.1051/0004-6361/202245761

Spatial metallicity variations of mono-temperature stellar populations revealed by early-type stars in LAMOST

Authors: Chun Wang, Haibo Yuan, Maosheng Xiang, Yuan-Sen Ting, Yang Huang, Xiaowei Liu

Abstract: We investigate the radial metallicity gradients and azimuthal metallicity distributions on the Galactocentric $X$--$Y$ plane using mono-temperature stellar populations selected from the LAMOST MRS young stellar sample. The estimated radial metallicity gradient ranges from $-$0.015\,dex/kpc to $-$0.07\,dex/kpc, which decreases as effective temperature decreases (or stellar age increases) at $7500 < T_{\rm eff} < 12500$\,K ($τ< $1.5 Gyr). The azimuthal metallicity excess (metallicity after subtracting radial metallicity gradient, $Δ$\,[M/H]) distributions exhibit inhomogeneities with dispersions of 0.04\,dex to 0.07\,dex, which decrease as effective temperature decreases. We also identify five potential metal-poor substructures with large metallicity excess dispersions. The metallicity excess distributions of these five metal-poor substructures suggest that they contain a larger fraction of metal-poor stars compared to other control samples. These metal-poor substructures may be associated with high-velocity clouds that fall into the Galactic disk from the Galactic halo and do not mix quickly with the pre-existing ISM of the Galactic disk. As a result, these high-velocity clouds produce some metal-poor stars and the observed metal-poor substructures. The variations of metallicity inhomogeneities with different stellar populations indicate that high-velocity clouds are not well mixed with the pre-existing Galactic disk ISM within 0.3\,Gyr.
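The metallicity excess defined in this abstract (metallicity after subtracting the radial metallicity gradient) can be sketched as a linear fit followed by a residual. This is a minimal illustration assuming an ordinary least-squares straight line in Galactocentric radius; the function names and the toy data are hypothetical, and the paper's actual fitting procedure is not described in the abstract.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def metallicity_excess(radius_kpc, m_h):
    """Return (excess per star in dex, fitted gradient in dex/kpc).

    The excess is each star's [M/H] minus the value predicted by the
    fitted radial gradient at that star's radius.
    """
    slope, intercept = fit_line(radius_kpc, m_h)
    excess = [m - (slope * r + intercept) for r, m in zip(radius_kpc, m_h)]
    return excess, slope


# Toy data: stars lying exactly on a -0.05 dex/kpc gradient (made-up values).
radii = [8.0, 9.0, 10.0, 11.0]
metallicities = [-0.05 * r + 0.4 for r in radii]

excess, gradient = metallicity_excess(radii, metallicities)
```

For stars that follow the gradient exactly, the excess is zero; a metal-poor substructure would show up as a group of stars with negative excess at similar positions.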

Submitted 6 April, 2023; originally announced April 2023.

Comments: 14 pages, 12 figures, Accepted for publication in A&A

Journal ref: A&A 674, A129 (2023)

arXiv:2303.04098 [pdf, other]

doi 10.3847/1538-4365/acd37b

Validating Stellar Abundance Measurements from Multi-Resolution Spectroscopy

Authors: Nathan R. Sandford, Daniel R. Weisz, Yuan-Sen Ting

Abstract: Large-scale surveys will provide spectroscopy for $\sim$50 million resolved stars in the Milky Way and Local Group. However, these data will have a high degree of heterogeneity and most will be low-resolution ($R<10000$), posing challenges to measuring consistent and reliable stellar labels. Here, we introduce a framework for identifying and remedying these issues. By simultaneously fitting the full spectrum and Gaia photometry with the Payne, we measure $\sim$40 abundances for 8 red giants in M15. From degraded quality Keck/HIRES spectra, we evaluate trends with resolution and S/N and find that (i) $\sim$20 abundances are recovered consistently within $\lesssim$0.1 dex agreement and with $\lesssim$0.05-0.15~dex systematic uncertainties from $10000\lesssim R\lesssim80000$; (ii) for 9 elements (C, Mg, Ca, Sc, Ti, Fe, Ni, Y, Nd), this systematic precision and accuracy extends down to $R\sim2500$; and (iii) while most elements do not exhibit strong S/N-dependent systematics, there are non-negligible biases for 4 elements (C, Mg, Ca, and Dy) below $\text{S/N}\sim10$ pixel$^{-1}$. We compare statistical uncertainties from MCMC sampling to the easier-to-compute Cramér-Rao bounds and find that they agree for $\sim$75% of elements, indicating the latter to be a reliable and faster way to estimate uncertainties. Our analysis illustrates the great promise of low-resolution spectroscopy for stellar chemical abundance work, and ongoing improvements to stellar models (e.g., 3D-NLTE physics) will only further extend its viability to more elements and to higher precision and accuracy.

Submitted 7 March, 2023; originally announced March 2023.

Comments: 46 pages, 26 figures, submitted to ApJS. Comments welcome!

arXiv:2302.13357 [pdf, other]

DESI survey validation data in the COSMOS/HSC field: Cool gas trace main sequence star-forming galaxies at the cosmic noon

Authors: Siwei Zou, Linhua Jiang, Zheng Cai, John Moustakas, Zechang Sun, Zhiwei Pan, Jiani Ding, Jaime E Forero-Romero, Hu Zou, Yuan-sen Ting, Matthew Pieri, Steven Ahlen, David Alexander, David Brooks, Arjun Dey, Andreu Font-Ribera, Satya Gontcho A Gontcho, Klaus Honscheid, Martin Landriau, Axel de la Macorra, Mariana Vargas Magana, Aaron Meisner, Ramon Miquel, Michael Schubnell, Gregory Tarle, et al. (1 additional author not shown)

Abstract: We present the first result in exploring the gaseous halo and galaxy correlation using the Dark Energy Spectroscopic Instrument (DESI) survey validation data in the Cosmic Evolution Survey (COSMOS) and Hyper Suprime-Cam (HSC) field. We obtain the multiphase gaseous halo properties in the circumgalactic medium (CGM) by using 115 quasar spectra (S/N > 3). We detect MgII absorption at redshift 0.6 < z < 2.5, CIV absorption at 1.6 < z < 3.6, and HI absorption associated with the MgII and CIV. By cross-matching the COSMOS2020 catalog, we identify the MgII and CIV host galaxies in ten quasar fields at 0.9 < z < 3.1. We find that within the impact parameter of 250 kpc, a tight correlation is seen between strong MgII equivalent width and the host galaxy star formation rate. The covering fraction fc of strong MgII selected galaxies, which is the ratio of absorbing galaxy in a certain galaxy population, shows significant evolution in the main-sequence galaxies and marginal evolution in all the galaxy populations within 250 kpc at 0.9 < z < 2.2. The fc increase in the main-sequence galaxies likely suggests the co-evolution of strong MgII absorbing gas and the main-sequence galaxies at the cosmic noon. Furthermore, several MgII and CIV absorbing gas is detected out of the galaxy virial radius, tentatively indicating the feedback produced by the star formation and/or the environmental effects.

Submitted 7 November, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

Comments: 24 pages, 12 figures, 6 tables, accepted for publication in ApJ

arXiv:2302.10504 [pdf, ps, other]

doi 10.3847/1538-4357/acbcc4

Ba-enhanced dwarf and subgiant stars in the LAMOST Galactic surveys

Authors: Meng Zhang, Maosheng Xiang, Hua-Wei Zhang, Yuan-Sen Ting, Ya-Qian Wu, Xiao-Wei Liu

Abstract: Ba-enhanced stars are interesting probes of stellar astrophysics and Galactic formation history. In this work, we investigate the chemistry and kinematics for a large sample of Ba-enhanced ([Ba/Fe]$>$1.0) dwarf and subgiant stars with $5000 < T_{\rm eff }< 6700$\,K from LAMOST. We find that both stellar internal evolution process and external mass exchange due to binary evolution are responsible for the origins of the Ba-enhancement of our sample stars. About one third of them exhibit C and N enhancement and ultraviolet brightness excess, indicating they are products of binary evolution. The remaining Ba-enhanced stars with normal C and N abundances are mostly warm stars with $T_{\rm eff} > 6000$\,K. They are likely consequences of stellar internal elemental transport processes, but they show very different elemental patterns to the hotter Am/Fm stars. Our results reveal a substantially lack of high-[$α$/Fe] Ba-enhanced stars in the [Fe/H]--[$α$/Fe] plane, which we dub as a {\em high-$α$ desert}. We suggest it is due to a lower efficiency for producing Ba-enhanced stars by low-mass AGB progenitors in binary systems. Our results call for detailed modellings for these Ba-enhanced stellar peculiars, in the context of both stellar internal elemental transport and external mass accretion.

Submitted 21 February, 2023; originally announced February 2023.

Comments: 20 pages, 17 figures. Accepted for publication in ApJ

arXiv:2212.06051 [pdf, other]

doi 10.3847/1538-4357/acab5b

Evidence for Populations-dependent vertical motions and the Long-lived Non-Steady Lopsided Milky Way Warp

Authors: Xiang Li, Hai-Feng Wang, Yang-Ping Luo, Martín López-Corredoira, Yuan-Sen Ting, Žofia Chrobáková

Abstract: We present the Galactic disk vertical velocity analysis using OB type stars (OB), Red Clump stars (RC), and Main-Sequence-Turn-Off stars (MSTO) with different average age populations crossed matched with LAMOST DR5 and Gaia DR3. We reveal the vertical velocities of the three populations varies clearly with the Galactocentric distance ($R$) and the younger stellar population has stronger increasing trend in general. The bending and breathing modes indicated by the vertical motions are dependent on the populations and they are varying with spatial locations. These vertical motions may be due to the Galactic warp, or minor mergers, or non-equilibrium of the disk. Assuming the warp is the dominant component, we find that the warp amplitude ($γ$, $Z_ω$) for OB (younger population) is larger than that for RC (medium population) and the later one is also larger than that for MSTO (older population), which is in agreement with other independent analyses of stellar density distribution, and supports the warp is long-lived, non-steady structure and has time evolution. This conclusion is robust whether or not the line-of-nodes $φ_w$ is fixed or as a free parameter (with $φ_w$ is around 3$-$8.5$^{\circ}$ as best fit). Furthermore, we find that warp is lopsided with asymmetries along azimuthal angle ($φ$).

Submitted 12 December, 2022; originally announced December 2022.

Comments: 15 pages, 13 figures. Accepted for publication in The Astrophysical Journal

arXiv:2212.00806 [pdf, other]

doi 10.3847/1538-4357/accf13

Distant Echoes of the Milky Way's Last Major Merger

Authors: Vedant Chandra, Rohan P. Naidu, Charlie Conroy, Alexander P. Ji, Hans-Walter Rix, Ana Bonaca, Phillip Cargile, Jiwon Jesse Han, Benjamin D. Johnson, Yuan-Sen Ting, Turner Woody, Dennis Zaritsky

Abstract: The majority of the Milky Way's stellar halo consists of debris from our Galaxy's last major merger, the Gaia-Sausage-Enceladus (GSE). In the past few years, stars from GSE have been kinematically and chemically studied in the inner $30$ kpc of our Galaxy. However, simulations predict that accreted debris could lie at greater distances, forming substructures in the outer halo. Here we derive metallicities and distances using Gaia DR3 XP spectra for an all-sky sample of luminous red giant stars, and map the outer halo with kinematics and metallicities out to $100$ kpc. We obtain follow-up spectra of stars in two strong overdensities - including the previously identified Outer Virgo Overdensity - and find them to be relatively metal-rich and on predominantly retrograde orbits, matching predictions from simulations of the GSE merger. We argue that these are apocentric shells of GSE debris, forming $60-90$ kpc counterparts to the $15-20$ kpc shells that are known to dominate the inner stellar halo. Extending our search across the sky with literature radial velocities, we find evidence for a coherent stream of retrograde stars encircling the Milky Way from $50-100$ kpc, in the same plane as the Sagittarius stream but moving in the opposite direction. These are the first discoveries of distant and structured imprints from the GSE merger, cementing the picture of an inclined and retrograde collision that built up our Galaxy's stellar halo.

Submitted 30 June, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

Comments: 22 pages, 10 figures. Accepted to ApJ

Journal ref: The Astrophysical Journal, 951, 26 (2023)

arXiv:2211.16782 [pdf, ps, other]

Panel Discussion: Practical Problem Solving for Machine Learning

Authors: Guillermo Cabrera, Sungwook E. Hong, Lilianne Nakazono, David Parkinson, Yuan-Sen Ting

Abstract: Machine Learning is a powerful tool for astrophysicists, which has already had significant uptake in the community. But there remain some barriers to entry, relating to proper understanding, the difficulty of interpretability, and the lack of cohesive training. In this discussion session we addressed some of these questions, and suggest how the field may move forward.

Submitted 30 November, 2022; originally announced November 2022.

Comments: 6 pages. Prepared for the proceedings of the International Astronomical Union Symposium 368 "Machine Learning in Astronomy: Possibilities and Pitfalls"

arXiv:2211.14349 [pdf]

doi 10.3847/1538-4357/aca295

A Panchromatic Study of Massive Stars in the Extremely Metal-Poor Local Group Dwarf Galaxy Leo A

Authors: Maude Gull, Daniel R. Weisz, Peter Senchyna, Nathan R. Sandford, Yumi Choi, Anna F. McLeod, Kareem El-Badry, Ylva Götberg, Karoline M. Gilbert, Martha Boyer, Julianne J. Dalcanton, Puragra GuhaThakurta, Steven Goldman, Paola Marigo, Kristen B. W. McQuinn, Giada Pastorelli, Daniel P. Stark, Evan Skillman, Yuan-sen Ting, Benjamin F. Williams

Abstract: We characterize massive stars (M>8 M_sun) in the nearby (D~0.8 Mpc) extremely metal-poor (Z~5% Z_sun) galaxy Leo A using Hubble Space Telescope ultra-violet (UV), optical, and near-infrared (NIR) imaging along with Keck/LRIS and MMT/Binospec optical spectroscopy for 18 main sequence OB stars. We find that: (a) 12 of our 18 stars show emission lines, despite not being associated with an H II region, suggestive of stellar activity (e.g., mass loss, accretion, binary star interaction), which is consistent with previous predictions of enhanced activity at low metallicity; (b) 6 are Be stars, which are the first to be spectroscopically studied at such low metallicity -- these Be stars have unusual panchromatic SEDs; (c) for stars well-fit by the TLUSTY non-local thermodynamic equilibrium (non-LTE) models, the photometric and spectroscopic values of T_eff and log(g) agree to within ~0.01 dex and ~0.18 dex, respectively, indicating that NUV/optical/NIR imaging can be used to reliably characterize massive (M ~ 8-30 M_sun) main sequence star properties relative to optical spectroscopy; (d) the properties of the most massive stars in H II regions are consistent with constraints from previous nebular emission line studies; and (e) 13 stars with M>8 M_sun are >40 pc from a known star cluster or H II region. Our sample comprises ~50% of all known massive stars at Z < 10% Z_sun with derived stellar parameters, high-quality optical spectra, and panchromatic photometry.

Submitted 28 December, 2022; v1 submitted 25 November, 2022; originally announced November 2022.

Comments: 35 pages, 18 figures
