Parameter Estimation and Identifiability in Kinetic Flux Profiling Models of Metabolism

Authors: Breanna Guppy, Colleen Mitchell, Eric Taylor

Abstract: Metabolic fluxes are the rates of life-sustaining chemical reactions within a cell and metabolites are the components. Determining the changes in these fluxes is crucial to understanding diseases with metabolic causes and consequences. Kinetic flux profiling (KFP) is a method for estimating flux that utilizes data from isotope tracing experiments. In these experiments, the isotope-labeled nutrient is metabolized through a pathway and integrated into the downstream metabolite pools. Measurements of proportion labeled for each metabolite in the pathway are taken at multiple time points and used to fit an ordinary differential equations model with fluxes as parameters. We begin by generalizing the process of converting diagrams of metabolic pathways into mathematical models composed of differential equations and algebraic constraints. The scaled differential equations for proportions of unlabeled metabolite contain parameters related to the metabolic fluxes in the pathway. We investigate flux parameter identifiability given data collected only at the steady state of the differential equation. Next, we give criteria for valid parameter estimations in the case of a large separation of timescales with fast-slow analysis. Bayesian parameter estimation on simulated data from KFP experiments containing both irreversible and reversible reactions illustrates the accuracy and reliability of flux estimations. These analyses provide constraints that serve as guidelines for the design of KFP experiments to estimate metabolic fluxes.

Submitted 11 July, 2024; originally announced July 2024.

MSC Class: 92

arXiv:2407.08633 [pdf, other]

A Novel Framework for Automated Warehouse Layout Generation

Authors: Atefeh Shahroudnejad, Payam Mousavi, Oleksii Perepelytsia, Sahir, David Staszak, Matthew E. Taylor, Brent Bawel

Abstract: Optimizing warehouse layouts is crucial due to its significant impact on efficiency and productivity. We present an AI-driven framework for automated warehouse layout generation. This framework employs constrained beam search to derive optimal layouts within given spatial parameters, adhering to all functional requirements. The feasibility of the generated layouts is verified based on criteria such as item accessibility, required minimum clearances, and aisle connectivity. A scoring function is then used to evaluate the feasible layouts considering the number of storage locations, access points, and accessibility costs. We demonstrate our method's ability to produce feasible, optimal layouts for a variety of warehouse dimensions and shapes, diverse door placements, and interconnections. This approach, currently being prepared for deployment, will enable human designers to rapidly explore and confirm options, facilitating the selection of the most appropriate layout for their use-case.

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.01661 [pdf, other]

doi 10.1093/mnras/stae1637

The infall region as a complementary probe to cluster abundance

Authors: Charlie T. Mpetha, James E. Taylor, Yuba Amoura, Roan Haggar

Abstract: Galaxy cluster abundance measurements provide a classic test of cosmology. They are most sensitive to the evolved amplitude of fluctuations, usually expressed as $S_8 = σ_8\sqrt{Ω_m/0.3}$. Thus, abundance constraints exhibit a strong degeneracy between $σ_8$ and $Ω_{\rm m}$, as do other similar low-redshift tests such as cosmic shear. The mass distribution in the infall region around galaxy clusters, where material is being accreted from the surrounding field, also exhibits a cosmological dependence, but in this case it is nearly orthogonal to the $S_8$ direction in the $Ω_m$--$σ_8$ plane, making it highly complementary to halo abundance or cosmic shear studies. We explore how weak lensing measurements of the infall region might be used to complement abundance studies, considering three different tests. The splashback radius is a prominent feature of the infall region; we show that detection of this feature in lensing data from the Euclid survey could independently constrain $Ω_{\rm m}$ and $σ_8$ to $\pm 0.05$. Another feature, the depletion radius where the bias reaches a minimum, also shows cosmological dependence, though it is challenging to observe in practice. The strongest constraints come from direct measurements of the shear profile in the infall region at $2$--$4\,r_{200{\rm c}}$. Combining the latter with abundance constraints such as those reported from SRG$/$eROSITA should reduce the area of the error contours by an estimated factor of $1.2$ using a sample of clusters observed by the UNIONS survey, or a factor of $3$ using clusters observed by the Euclid Wide survey over a broader range of redshift.

Submitted 1 July, 2024; originally announced July 2024.

Comments: 14 pages, 10 figures. Accepted for publication in MNRAS
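
Illustrative note on the entry above (arXiv:2407.01661): the abstract defines $S_8 = σ_8\sqrt{Ω_m/0.3}$ and states that abundance constraints are degenerate between $σ_8$ and $Ω_{\rm m}$. The following is a minimal Python sketch of that degeneracy, not the authors' code. The function names and all numerical values are assumptions chosen only for illustration.

```python
import math


def s8(sigma8: float, omega_m: float) -> float:
    """S_8 = sigma_8 * sqrt(Omega_m / 0.3), as written in the abstract."""
    return sigma8 * math.sqrt(omega_m / 0.3)


def sigma8_on_contour(s8_value: float, omega_m: float) -> float:
    """sigma_8 that keeps S_8 fixed at s8_value for a given Omega_m."""
    return s8_value / math.sqrt(omega_m / 0.3)


# Illustrative values only (not taken from the paper).
target = s8(0.8, 0.3)

# Different (Omega_m, sigma_8) pairs that all share the same S_8.
pairs = [(om, sigma8_on_contour(target, om)) for om in (0.25, 0.30, 0.35)]

for om, sig in pairs:
    print(f"Omega_m={om:.2f}  sigma_8={sig:.4f}  S_8={s8(sig, om):.4f}")
```

Every pair in `pairs` gives the same $S_8$, so a measurement of $S_8$ alone cannot separate the two parameters; a probe with a different dependence is needed to do that.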

arXiv:2406.19970 [pdf]

A determination of FL at xmin with HERA data

Authors: Frank E. Taylor

Abstract: It is well known that there are persistent statistical tensions with the standard model in the low Q2 HERA deep inelastic scattering neutral current data characterized by a turn-over of F2(x, Q2) at low x and low Q2. One important experimental signature that sheds light on this low Q2 region is the determination of the longitudinal structure function FL(x, Q2). This paper describes a novel method to determine FL based on an extrapolation of the reduced NC cross section at fixed s and Q to the minimum value of x given by Q2/s. At this kinematic point, the reduced cross section equals 2xF1 = F2 - FL so that a determination of both this value and the value of F2 determines FL. Since the polarization of the exchanged photon is transverse at this kinematic point, we expect FL to be small because its dominant gluon component is strongly suppressed. Surprisingly, we find FL at low Q2 to be much larger than expectation and observe that both FL and F2 at x = Q2/s show several properties consistent with the dipole picture. We discuss the statistical as well as chief systematic errors of our method and we tabulate our determinations of F2, 2xF1 and FL in the Appendix.

Submitted 1 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

Comments: 42 pages, 13 figures
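
Illustrative note on the entry above (arXiv:2406.19970): the abstract states that at x = Q2/s the reduced cross section equals 2xF1 = F2 - FL, so FL follows from F2 and the extrapolated reduced cross section. The following minimal Python sketch shows only that arithmetic. It does not reproduce the paper's extrapolation procedure, and the function names and every input value are hypothetical placeholders.

```python
def x_min(q2: float, s: float) -> float:
    """Minimum Bjorken x at fixed Q^2 and s: x_min = Q^2 / s."""
    return q2 / s


def longitudinal_sf(f2: float, sigma_r_at_xmin: float) -> float:
    """F_L = F_2 - 2xF_1, where 2xF_1 is the reduced cross section at x_min."""
    return f2 - sigma_r_at_xmin


# Placeholder inputs, NOT values from the paper.
q2 = 2.0         # GeV^2
s = 318.0 ** 2   # GeV^2, assumed centre-of-mass energy squared
f2 = 1.00        # hypothetical F_2 at x_min
sigma_r = 0.70   # hypothetical extrapolated reduced cross section at x_min

xm = x_min(q2, s)
fl = longitudinal_sf(f2, sigma_r)
print(f"x_min = {xm:.3e}, F_L = {fl:.2f}")
```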

arXiv:2406.17849 [pdf, other]

Constraining cosmological parameters using the splashback radius of galaxy clusters

Authors: Roan Haggar, Yuba Amoura, Charlie T. Mpetha, James E. Taylor, Kris Walker, Chris Power

Abstract: Cosmological parameters such as $Ω_{\rm{M}}$ and $σ_{8}$ can be measured indirectly using various methods, including galaxy cluster abundance and cosmic shear. These measurements constrain the composite parameter $S_{8}$, leading to degeneracy between $Ω_{\rm{M}}$ and $σ_{8}$. However, some structural properties of galaxy clusters also correlate with cosmological parameters, due to their dependence on a cluster's accretion history. In this work, we focus on the splashback radius, an observable cluster feature that represents a boundary between a cluster and the surrounding Universe. Using a suite of cosmological simulations with a range of values for $Ω_{\rm{M}}$ and $σ_{8}$, we show that the position of the splashback radius around cluster-mass halos is greater in cosmologies with smaller values of $Ω_{\rm{M}}$ or larger values of $σ_{8}$. This variation breaks the degeneracy between $Ω_{\rm{M}}$ and $σ_{8}$ that comes from measurements of the $S_{8}$ parameter. We also show that this variation is, in principle, measurable in observations. As the splashback radius can be determined from the same weak lensing analysis already used to estimate $S_{8}$, this new approach can tighten low-redshift constraints on cosmological parameters, either using existing data, or using upcoming data such as that from Euclid and LSST.

Submitted 25 June, 2024; originally announced June 2024.

Comments: 12 pages, 7 figures, 1 table, accepted for publication in ApJ

arXiv:2406.17842 [pdf, other]

doi 10.1093/mnras/stae1582

The hyperplane of early-type galaxies: using stellar population properties to increase the precision and accuracy of the fundamental plane as a distance indicator

Authors: Francesco D'Eugenio, Matthew Colless, Arjen van der Wel, Sam P. Vaughan, Khaled Said, Jesse van de Sande, Joss Bland-Hawthorn, Julia J. Bryant, Scott M. Croom, Angel R. Lopez-Sanchez, Nuria P. F. Lorente, Roberto Maiolino, Edward N. Taylor

Abstract: We use deep spectroscopy from the SAMI Galaxy Survey to explore the precision of the fundamental plane of early-type galaxies (FP) as a distance indicator for future single-fibre spectroscopy surveys. We study the optimal trade-off between sample size and signal-to-noise ratio (SNR), and investigate which additional observables can be used to construct hyperplanes with smaller intrinsic scatter than the FP. We add increasing levels of random noise (parametrised as effective exposure time) to the SAMI spectra to study the effect of increasing measurement uncertainties on the FP- and hyperplane-inferred distances. We find that, using direct-fit methods, the values of the FP and hyperplane best-fit coefficients depend on the spectral SNR, and reach asymptotic values for a mean SNR=40 Å$^{-1}$. As additional variables for the FP we consider three stellar-population observables: light-weighted age, stellar mass-to-light ratio and a novel combination of Lick indices (I$_{\rm age}$). For a SNR=45 Å$^{-1}$ (equivalent to 1-hour exposure on a 4-m telescope), all three hyperplanes outperform the FP as distance indicators. Being an empirical spectral index, I$_{\rm age}$ avoids the model-dependent uncertainties and bias underlying age and mass-to-light ratio measurements, yet yields a 10 per cent reduction of the median distance uncertainty compared to the FP. We also find that, as a by-product, the I$_{\rm age}$ hyperplane removes most of the reported environment bias of the FP. After accounting for the different signal-to-noise ratio, these conclusions also apply to a 50 times larger sample from SDSS-III. However, in this case, only age removes the environment bias.

Submitted 25 June, 2024; originally announced June 2024.

Comments: 24 pages, 18 figures, accepted for publication in MNRAS

arXiv:2406.15555 [pdf, other]

doi 10.1093/mnras/stae1566

Reconsidering the dynamical states of galaxy clusters using PCA and UMAP

Authors: Roan Haggar, Federico De Luca, Marco De Petris, Elizaveta Sazonova, James E. Taylor, Alexander Knebe, Meghan E. Gray, Frazer R. Pearce, Ana Contreras-Santos, Weiguang Cui, Ulrike Kuchner, Robert A. Mostoghiu Paun, Chris Power

Abstract: Numerous metrics exist to quantify the dynamical state of galaxy clusters, both observationally and within simulations. Many of these correlate strongly with one another, but it is not clear whether all of these measures probe the same intrinsic properties. In this work, we use two different statistical approaches -- principal component analysis (PCA) and uniform manifold approximation and projection (UMAP) -- to investigate which dynamical properties of a cluster are in fact the best descriptors of its dynamical state. We use measurements taken directly from The Three Hundred suite of galaxy cluster simulations, as well as morphological properties calculated using mock X-ray and SZ maps of the same simulated clusters. We find that four descriptions of dynamical state naturally arise, and although correlations exist between these, a given cluster can be "dynamically relaxed" according to all, none, or some of these four descriptions. These results demonstrate that it is highly important for future observational and theoretical studies to consider in which sense clusters are dynamically relaxed. Cluster dynamical states are complex and multi-dimensional, and so it is not meaningful to classify them simply as "relaxed" and "unrelaxed" based on a single linear scale.

Submitted 21 June, 2024; originally announced June 2024.

Comments: 18 pages, 7 figures, 3 tables, accepted for publication in MNRAS

arXiv:2406.10877 [pdf, other]

WALLABY Pilot Survey: the Tully-Fisher relation in the NGC 4808, Vela and NGC 5044 fields

Authors: Jeremy Mould, T. H. Jarrett, Hélène Courtois, Albert Bosma, Nathan Deg, Alexandra Dupuy, Lister Staveley-Smith, E. N. Taylor, Jayanne English, S. H. A. Rajohnson, Renée Kraan-Korteweg, Duncan Forbes, Helga Dénes, Karen Lee-Waddell, Austin Shen, O. I. Wong, Benne Holwerda, Bärbel Koribalski, Denis Leahy, Pavel Mancera Piña, Niankun Yu

Abstract: The Tully-Fisher Relation (TFR) is a well-known empirical relationship between the luminosity of a spiral galaxy and its circular velocity, allowing us to estimate redshift-independent distances. Here we use high signal-to-noise HI 21-cm integrated spectra from the second pilot data release (PDR2, 180 deg$^2$) of the Widefield ASKAP L-band Legacy All-sky Blind surveY (WALLABY). In order to prepare for the full WALLABY survey, we have investigated the TFR in phase 2 of the pilot survey with a further three fields. The data were obtained with wide-field Phased Array Feeds on the Australian Square Kilometre Array Pathfinder (ASKAP) and have an angular resolution of 30 arcsec and a velocity resolution of ~4 km/s. Galaxy luminosities have been measured from the Wide-field Infrared Survey Explorer (WISE), and optical galaxy inclinations from the Dark Energy Camera Legacy Survey. We present TFRs for wavelengths from 0.8-3.4 μm. We examine sources of galaxy inclination data and investigate magnitudes from the DECam Local Volume Exploration Survey (DELVE) and DENIS catalogues and the 4HS target catalogue based on the VISTA Hemisphere Survey (VHS). We consider the baryonic TFR. These are all of interest for TFR using the full WALLABY survey of 200,000 galaxies. We demonstrate that WALLABY TFR distances can take their place among state-of-the-art studies of the local velocity field.

Submitted 19 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

Comments: to appear in MNRAS. One figure removed

arXiv:2406.06495 [pdf, other]

Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity

Authors: Calarina Muslimani, Bram Grooten, Deepak Ranganatha Sastry Mamillapalli, Mykola Pechenizkiy, Decebal Constantin Mocanu, Matthew E. Taylor

Abstract: For autonomous agents to successfully integrate into human-centered environments, agents should be able to learn from and adapt to humans in their native settings. Preference-based reinforcement learning (PbRL) is a promising approach that learns reward functions from human preferences. This enables RL agents to adapt their behavior based on human desires. However, humans live in a world full of diverse information, most of which is not relevant to completing a particular task. It becomes essential that agents learn to focus on the subset of task-relevant environment features. Unfortunately, prior work has largely ignored this aspect, primarily focusing on improving PbRL algorithms in standard RL environments that are carefully constructed to contain only task-relevant features. This can result in algorithms that may not effectively transfer to a noisier real-world setting. To that end, this work proposes R2N (Robust-to-Noise), the first PbRL algorithm that leverages principles of dynamic sparse training to learn robust reward models that can focus on task-relevant features. We study the effectiveness of R2N in the Extremely Noisy Environment setting, an RL problem setting where up to 95% of the state features are irrelevant distractions. In experiments with a simulated teacher, we demonstrate that R2N can adapt the sparse connectivity of its neural networks to focus on task-relevant features, enabling R2N to significantly outperform several state-of-the-art PbRL algorithms in multiple locomotion and control environments.

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2405.19286 [pdf, other]

EDGE: A new model for Nuclear Star Cluster formation in dwarf galaxies

Authors: Emily I. Gray, Justin I. Read, Ethan Taylor, Matthew D. A. Orkney, Martin P. Rey, Robert M. Yates, Stacy Y. Kim, Noelia E. D. Noël, Oscar Agertz, Eric Andersson, Andrew Pontzen

Abstract: Nuclear Star Clusters (NSCs) are amongst the densest stellar systems in the Universe and are found at the centres of many bright spiral and elliptical galaxies, and up to ${\sim}$40% of dwarf galaxies. However, their formation mechanisms, and possible links to globular clusters (GCs), remain debated. This paper uses the EDGE simulations -- a collection of zoom-in, cosmological simulations of isolated dwarf galaxies -- to present a new formation mechanism for NSCs. We find that, at a gas spatial and mass resolution of ${\sim}3\,$pc and ${\sim}161$ M$_\odot$, respectively, NSCs naturally emerge in a subset of our EDGE dwarfs with redshift-zero halo masses of $\rm{M}_{\rm{r}200\rm{c}} \sim 5 \times 10^9$ M$_\odot$. These dwarfs are quenched by reionisation, but retain a significant reservoir of gas that is unable to cool and form stars. Sometime after reionisation, the dwarfs then undergo a major (${\sim}$1:1) merger that excites rapid gas cooling, leading to a significant starburst. An NSC forms in this starburst that then quenches star formation thereafter. The result is a nucleated dwarf that has two stellar populations with distinct ages: one pre-reionisation and one post-reionisation. Our mechanism is unique for two key reasons. Firstly, the low mass of the host dwarf means that NSCs, formed in this way, can accrete onto galaxies of almost all masses, potentially seeding the formation of NSCs everywhere. Secondly, our model predicts that NSCs should have at least two stellar populations with a large ($\gtrsim$1 billion year) age separation. This yields a predicted colour magnitude diagram for our nucleated dwarfs that has two distinct main sequence turnoffs. Several GCs orbiting the Milky Way, including Omega Centauri and M54, show exactly this behaviour, suggesting that they may, in fact, be accreted NSCs.

Submitted 29 May, 2024; originally announced May 2024.

Comments: Main text 12 pages, 8 figures. Submitted to MNRAS

arXiv:2405.13491 [pdf, other]

Euclid. I. Overview of the Euclid mission

Authors: Euclid Collaboration, Y. Mellier, Abdurro'uf, J. A. Acevedo Barroso, A. Achúcarro, J. Adamek, R. Adam, G. E. Addison, N. Aghanim, M. Aguena, V. Ajani, Y. Akrami, A. Al-Bahlawan, A. Alavi, I. S. Albuquerque, G. Alestas, G. Alguero, A. Allaoui, S. W. Allen, V. Allevato, A. V. Alonso-Tetilla, B. Altieri, A. Alvarez-Candal, A. Amara, L. Amendola, et al. (1086 additional authors not shown)

Abstract: The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14,000 deg^2 of extragalactic sky. In addition to accurate weak lensing and clustering measurements that probe structure formation over half of the age of the Universe, its primary probes for cosmology, these exquisite data will enable a wide range of science. This paper provides a high-level overview of the mission, summarising the survey characteristics, the various data-processing steps, and data products. We also highlight the main science objectives and expected performance.

Submitted 22 May, 2024; originally announced May 2024.

Comments: Paper submitted as part of the A&A special issue 'Euclid on Sky'

arXiv:2405.10866 [pdf, other]

Galaxy And Mass Assembly (GAMA): Stellar-to-Dynamical Mass Relation II. Peculiar Velocities

Authors: M. Burak Dogruel, Edward Taylor, Michelle Cluver, Matthew Colless, Anna de Graaff, Alessandro Sonnenfeld, John R. Lucey, Francesco D'Eugenio, Cullan Howlett, Khaled Said

Abstract: Empirical correlations connecting starlight to galaxy dynamics (e.g., the fundamental plane (FP) of elliptical/quiescent galaxies and the Tully--Fisher relation of spiral/star-forming galaxies) provide cosmology-independent distance estimation and are central to local Universe cosmology. In this work, we introduce the mass hyperplane (MH), which is the stellar-to-dynamical mass relation $(M_\star/M_\mathrm{dyn})$ recast as a linear distance indicator. Building on recent FP studies, we show that both star-forming and quiescent galaxies follow the same empirical MH, then use this to measure the peculiar velocities (PVs) for a sample of 2496 galaxies at $z<0.12$ from GAMA. The limiting precision of MH-derived distance/PV estimates is set by the intrinsic scatter in size, which we find to be $\approx$0.1~dex for both quiescent and star-forming galaxies (when modeled independently) and $\approx$0.11~dex when all galaxies are modeled together, showing that the MH is as good as the FP. To empirically validate our framework and distance/PV estimates, we compare the inferred distances to groups as derived using either quiescent or star-forming galaxies. A good agreement is obtained with no discernible bias or offset, having a scatter of $\approx$0.05~dex $\approx$12\% in distance. Further, we compare our PV measurements for the quiescent galaxies to the previous PV measurements of the galaxies in common between GAMA and the Sloan Digital Sky Survey (SDSS), which shows similarly good agreement. Finally, we provide comparisons of PV measurements made with the FP and the MH, then discuss possible improvements in the context of upcoming surveys such as the 4MOST Hemisphere Survey (4HS).

Submitted 17 May, 2024; originally announced May 2024.

Comments: Accepted: 15th May 2024
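
Illustrative note on the entry above (arXiv:2405.10866): the abstract equates a scatter of $\approx$0.05 dex with $\approx$12\% in distance. The following minimal Python sketch shows one common way to make that conversion, assuming the percentage is read as the multiplicative factor $10^{0.05}$ minus one. That reading and the function names are assumptions, not statements about how the authors computed the figure.

```python
import math


def dex_to_fraction(scatter_dex: float) -> float:
    """Fractional change corresponding to a multiplicative factor of 10**scatter_dex."""
    return 10.0 ** scatter_dex - 1.0


def fraction_to_dex(fraction: float) -> float:
    """Inverse conversion: fractional change back to dex."""
    return math.log10(1.0 + fraction)


percent = 100.0 * dex_to_fraction(0.05)
print(f"0.05 dex corresponds to about {percent:.1f} per cent")
```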

arXiv:2405.00746 [pdf, other]

Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning

Authors: Calarina Muslimani, Matthew E. Taylor

Abstract: To create useful reinforcement learning (RL) agents, step zero is to design a suitable reward function that captures the nuances of the task. However, reward engineering can be a difficult and time-consuming process. Instead, human-in-the-loop (HitL) RL allows agents to learn reward functions from human feedback. Despite recent successes, many of the HitL RL methods still require numerous human interactions to learn successful reward functions. To improve the feedback efficiency of HitL RL methods (i.e., to require less feedback), this paper introduces Sub-optimal Data Pre-training, SDP, an approach that leverages reward-free, sub-optimal data to improve scalar- and preference-based HitL RL algorithms. In SDP, we start by pseudo-labeling all low-quality data with rewards of zero. Through this process, we obtain free reward labels to pre-train our reward model. This pre-training phase provides the reward model a head start in learning, whereby it can identify that low-quality transitions should have a low reward, all without any actual feedback. Through extensive experiments with a simulated teacher, we demonstrate that SDP can significantly improve or achieve competitive performance with state-of-the-art (SOTA) HitL RL algorithms across nine robotic manipulation and locomotion tasks.

Submitted 30 April, 2024; originally announced May 2024.
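
Illustrative note on the entry above (arXiv:2405.00746): the abstract describes pseudo-labeling all low-quality data with rewards of zero to obtain free labels for reward-model pre-training. The following minimal Python sketch covers only that labeling step. The `Transition` container, the `pseudo_label` function, and the toy data are hypothetical; the reward model and its training loop are not specified in the abstract and are left out.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Transition:
    """A reward-free (state, action) pair; a hypothetical container."""
    state: Tuple[float, ...]
    action: Tuple[float, ...]


def pseudo_label(
    transitions: Sequence[Transition],
) -> List[Tuple[Transition, float]]:
    """Attach a reward label of zero to every low-quality transition."""
    return [(t, 0.0) for t in transitions]


# Toy sub-optimal data, invented for illustration.
sub_optimal = [
    Transition(state=(0.0, 1.0), action=(0.5,)),
    Transition(state=(0.2, 0.9), action=(-0.1,)),
]

# (transition, reward) pairs that a reward model could be pre-trained on.
pretraining_set = pseudo_label(sub_optimal)
print(pretraining_set)
```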

arXiv:2404.16319 [pdf, other]

The MAGPI Survey: Evolution of radial trends in star formation activity across cosmic time

Authors: Marcie Mun, Emily Wisnioski, Andrew J. Battisti, J. Trevor Mendel, Sara L. Ellison, Edward N. Taylor, Claudia D. P. Lagos, Katherine E. Harborne, Caroline Foster, Scott M. Croom, Sabine Bellstedt, Stefania Barsanti, Anshu Gupta, Lucas M. Valenzuela, Qian-Hui Chen, Kathryn Grasha, Tamal Mukherjee, Hye-Jin Park, Piyush Sharda, Sarah M. Sweet, Rhea-Silvia Remus, Tayyaba Zafar

Abstract: Using adaptive optics with the Multi-Unit Spectroscopic Explorer (MUSE) on the Very Large Telescope (VLT), the Middle Ages Galaxy Properties with Integral Field Spectroscopy (MAGPI) survey allows us to study the spatially resolved Universe at a crucial time of ~4 Gyr ago ($z$ ~ 0.3) when simulations predict the greatest diversity in evolutionary pathways for galaxies. We investigate the radial trends in the star formation (SF) activity and luminosity-weighted stellar ages as a function of offset from the star-forming main sequence (SFMS) for a total of 294 galaxies. Using both H$α$ emission and the 4000 Angstrom break (i.e., D4000) as star formation rate (SFR) tracers, we find overall flat radial profiles for galaxies lying on and above the SFMS, suggestive of physical processes that enhance/regulate SF throughout the entire galaxy disc. However, for galaxies lying below the SFMS, we find positive gradients in SF suggestive of inside-out quenching. Placing our results in context with results from other redshift regimes suggests an evolution in radial trends at $z$ ~ 0.3 for SF galaxies above the SFMS, from uniformly enhanced SF at $z$ ~ 1 and $z$ ~ 0.3 to centrally enhanced SF at $z$ ~ 0 (when averaged over a wide range of mass). We also capture higher local SFRs for galaxies below the SFMS compared to that of $z$ ~ 0, which can be explained by a larger population of quenched satellites in the local Universe and/or different treatments of limitations set by the D4000-sSFR relation.

Submitted 24 April, 2024; originally announced April 2024.

Comments: 19 pages, 15 figures, 4 tables, accepted for publication in MNRAS

arXiv:2404.13777 [pdf, other]

Explainable Interfaces for Rapid Gaze-Based Interactions in Mixed Reality

Authors: Mengjie Yu, Dustin Harris, Ian Jones, Ting Zhang, Yue Liu, Naveen Sendhilnathan, Narine Kokhlikyan, Fulton Wang, Co Tran, Jordan L. Livingston, Krista E. Taylor, Zhenhong Hu, Mary A. Hood, Hrvoje Benko, Tanya R. Jonker

Abstract: Gaze-based interactions offer a potential way for users to naturally engage with mixed reality (XR) interfaces. Black-box machine learning models have enabled higher accuracy for gaze-based interactions. However, due to the black-box nature of the model, users might not be able to understand and effectively adapt their gaze behaviour to achieve high quality interaction. We posit that explainable AI (XAI) techniques can facilitate understanding of and interaction with gaze-based model-driven systems in XR. To study this, we built a real-time, multi-level XAI interface for gaze-based interaction using a deep learning model, and evaluated it during a visual search task in XR. A between-subjects study revealed that participants who interacted with XAI made more accurate selections compared to those who did not use the XAI system (i.e., F1 score increase of 10.8%). Additionally, participants who used the XAI system adapted their gaze behavior over time to make more effective selections. These findings suggest that XAI can potentially be used to assist users in more effective collaboration with model-driven interactions in XR.

Submitted 21 April, 2024; originally announced April 2024.

arXiv:2404.13061 [pdf, other]

FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning

Authors: Shang Wang, Deepak Ranganatha Sastry Mamillapalli, Tianpei Yang, Matthew E. Taylor

Abstract: This paper introduces the problem of learning to place logic blocks in Field-Programmable Gate Arrays (FPGAs) and a learning-based method. In contrast to previous search-based placement algorithms, we instead employ Reinforcement Learning (RL) with the goal of minimizing wirelength. In addition to our preliminary learning results, we also evaluated a novel decomposition to address the nature of large search space when placing many blocks on a chipboard. Empirical experiments evaluate the effectiveness of the learning and decomposition paradigms on FPGA placement tasks.

Submitted 11 April, 2024; originally announced April 2024.

Comments: accepted by ISEDA2024

arXiv:2402.18520 [pdf, other]

doi 10.1051/0004-6361/202347705

Do galaxy mergers prefer under-dense environments?

Authors: U. Sureshkumar, A. Durkalec, A. Pollo, W. J. Pearson, D. J. Farrow, A. Narayanan, J. Loveday, E. N. Taylor, L. E. Suelves

Abstract: Galaxy mergers play a crucial role in galaxy evolution. However, the correlation between mergers and the local environment of galaxies is not fully understood. We aim to address the question of whether galaxy mergers prefer denser or less dense environments by quantifying the spatial clustering of mergers and non-mergers. We use two different indicators to classify mergers and non-mergers - classification based on a deep learning technique ($f$) and non-parametric measures of galaxy morphology, Gini-$M_{20}$ ($g$). We used a set of galaxy samples in the redshift range $0.1 < z < 0.15$ from the Galaxy and Mass Assembly (GAMA) survey with a stellar mass cut of $\log (M_{\star}/M_{\odot} ) > 9.5$. We measured and compared the two-point correlation function (2pCF) of mergers and non-mergers classified using the two merger indicators $f$ and $g$. We measured the marked correlation function (MCF), in which the galaxies are weighted by $f$ to probe the environmental dependence of galaxy mergers. We do not observe a statistically significant difference between the clustering strengths of mergers and non-mergers obtained using 2pCF. However, using the MCF measurements with $f$ as a mark, we observe an anti-correlation between the likelihood of a galaxy being a merger and its environment. Our results emphasise the advantage of MCF over 2pCF in probing the environmental correlations. Based on the MCF measurements, we conclude that the galaxy mergers prefer to occur in the under-dense environments on scales $> 50 \, h^{-1} \mathrm{kpc}$ of the large-scale structure (LSS). We attribute this observation to the high relative velocities of galaxies in the densest environments that prevent them from merging.

Submitted 30 May, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

Comments: 13 pages, 9 figures, matches the version published in A&A

Journal ref: A&A 686, A40 (2024)

arXiv:2402.06819 [pdf, other]

Monitored Markov Decision Processes

Authors: Simone Parisi, Montaser Mohammedalamen, Alireza Kazemipour, Matthew E. Taylor, Michael Bowling

Abstract: In reinforcement learning (RL), an agent learns to perform a task by interacting with an environment and receiving feedback (a numerical reward) for its actions. However, the assumption that rewards are always observable is often not applicable in real-world problems. For example, the agent may need to ask a human to supervise its actions or activate a monitoring system to receive feedback. There may even be a period of time before rewards become observable, or a period of time after which rewards are no longer given. In other words, there are cases where the environment generates rewards in response to the agent's actions but the agent cannot observe them. In this paper, we formalize a novel but general RL framework - Monitored MDPs - where the agent cannot always observe rewards. We discuss the theoretical and practical consequences of this setting, show challenges raised even in toy environments, and propose algorithms to begin to tackle this novel setting. This paper introduces a powerful new formalism that encompasses both new and existing problems and lays the foundation for future research.

Submitted 13 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

Comments: AAMAS 2024, Main Track

arXiv:2401.02991 [pdf, other]

GLIDE-RL: Grounded Language Instruction through DEmonstration in RL

Authors: Chaitanya Kharyal, Sai Krishna Gottipati, Tanmay Kumar Sinha, Srijita Das, Matthew E. Taylor

Abstract: One of the final frontiers in the development of complex human - AI collaborative systems is the ability of AI agents to comprehend the natural language and perform tasks accordingly. However, training efficient Reinforcement Learning (RL) agents grounded in natural language has been a long-standing challenge due to the complexity and ambiguity of the language and sparsity of the rewards, among other factors. Several advances in reinforcement learning, curriculum learning, continual learning, and language models have independently contributed to effective training of grounded agents in various environments. Leveraging these developments, we present a novel algorithm, Grounded Language Instruction through DEmonstration in RL (GLIDE-RL), that introduces a teacher-instructor-student curriculum learning framework for training an RL agent capable of following natural language instructions that can generalize to previously unseen language instructions. In this multi-agent framework, the teacher and the student agents learn simultaneously based on the student's current skill level. We further demonstrate the necessity for training the student agent with not just one, but multiple teacher agents. Experiments on a complex sparse reward environment validate the effectiveness of our proposed approach.

Submitted 3 January, 2024; originally announced January 2024.

Comments: 12 pages, 6 figures, to be presented at AAMAS 2024

arXiv:2401.00907 [pdf, other]

LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models

Authors: Qianxi Li, Yingyue Cao, Jikun Kang, Tianpei Yang, Xi Chen, Jun Jin, Matthew E. Taylor

Abstract: Fine-tuning Large Language Models (LLMs) adapts a trained model to specific downstream tasks, significantly improving task-specific performance. Supervised Fine-Tuning (SFT) is a common approach, where an LLM is trained to produce desired answers. However, LLMs trained with SFT sometimes make simple mistakes and result in hallucinations on reasoning tasks such as question-answering. Without external feedback, it is difficult for SFT to learn a good mapping between the question and the desired answer, especially with a small dataset. This paper introduces an alternative to SFT called Natural Language Feedback for Finetuning LLMs (LaFFi). LaFFi has LLMs directly predict the feedback they will receive from an annotator. We find that requiring such reflection can significantly improve the accuracy in in-domain question-answering tasks, providing a promising direction for the application of natural language feedback in the realm of SFT LLMs. Additional ablation studies show that the portion of human-annotated data in the annotated datasets affects the fine-tuning performance.

Submitted 31 December, 2023; originally announced January 2024.

Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/)

arXiv:2312.17155 [pdf, other]

Numerical Simulation of Quantum Field Fluctuations

Authors: Emily R. Taylor, Samuel Yencho, L. H. Ford

Abstract: The quantum fluctuations of fields can exhibit subtle correlations in space and time. As the interval between a pair of measurements varies, the correlation function can change sign, signaling a shift between correlation and anti-correlation. A numerical simulation of the fluctuations requires a knowledge of both the probability distribution and the correlation function. Although there are widely used methods to generate a sequence of random numbers which obey a given probability distribution, the imposition of a given correlation function can be more difficult. Here we propose a simple method in which the outcome of a given measurement determines a shift in the peak of the probability distribution, to be used for the next measurement. We illustrate this method for three examples of quantum field correlation functions, and show that the resulting simulated function agrees well with the original, analytically derived function. We then discuss the application of this method to numerical studies of the effects of correlations on the random walks of test particles coupled to the fluctuating field.

Submitted 28 December, 2023; originally announced December 2023.

Comments: 8 pages, 3 figures

arXiv:2312.15339 [pdf, other]

MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning

Authors: Bram Grooten, Tristan Tomilin, Gautham Vasan, Matthew E. Taylor, A. Rupam Mahmood, Meng Fang, Mykola Pechenizkiy, Decebal Constantin Mocanu

Abstract: The visual world provides an abundance of information, but many input pixels received by agents often contain distracting stimuli. Autonomous agents need the ability to distinguish useful information from task-irrelevant perceptions, enabling them to generalize to unseen environments with new distractions. Existing works approach this problem using data augmentation or large auxiliary networks with additional loss functions. We introduce MaDi, a novel algorithm that learns to mask distractions by the reward signal only. In MaDi, the conventional actor-critic structure of deep reinforcement learning agents is complemented by a small third sibling, the Masker. This lightweight neural network generates a mask to determine what the actor and critic will receive, such that they can focus on learning the task. The masks are created dynamically, depending on the current input. We run experiments on the DeepMind Control Generalization Benchmark, the Distracting Control Suite, and a real UR5 Robotic Arm. Our algorithm improves the agent's focus with useful masks, while its efficient Masker network only adds 0.2% more parameters to the original structure, in contrast to previous work. MaDi consistently achieves generalization results better than or competitive to state-of-the-art methods.

Submitted 23 December, 2023; originally announced December 2023.

Comments: Accepted as full-paper (oral) at AAMAS 2024. Code is available at https://github.com/bramgrooten/mask-distractions and see our 40-second video at https://youtu.be/2oImF0h1k48

arXiv:2312.11883 [pdf, other]

EMU/GAMA: Radio detected galaxies are more obscured than optically selected galaxies

Authors: U. T. Ahmed, A. M. Hopkins, J. Ware, Y. A. Gordon, M. Bilicki, M. J. I. Brown, M. Cluver, G. Gürkan, Á. R. López-Sánchez, D. A. Leahy, L. Marchetti, S. Phillipps, I. Prandoni, N. Seymour, E. N. Taylor, E. Vardoulaki

Abstract: We demonstrate the importance of radio selection in probing heavily obscured galaxy populations. We combine Evolutionary Map of the Universe (EMU) Early Science data in the Galaxy and Mass Assembly (GAMA) G23 field with the GAMA data, providing optical photometry and spectral line measurements, together with Wide-field Infrared Survey Explorer (WISE) infrared (IR) photometry, providing IR luminosities and colours. We investigate the degree of obscuration in star forming galaxies, based on the Balmer decrement (BD), and explore how this trend varies, over a redshift range of 0<z<0.345. We demonstrate that the radio detected population has on average higher levels of obscuration than the parent optical sample, arising through missing the lowest BD and lowest mass galaxies, which are also the lower star formation rate (SFR) and metallicity systems. We discuss possible explanations for this result, including speculation around whether it might arise from steeper stellar initial mass functions in low mass, low SFR galaxies.

Submitted 19 December, 2023; originally announced December 2023.

Comments: Accepted for publication in PASA, 17 pages, 14 figures, 3 tables

arXiv:2312.11768 [pdf, other]

Curriculum Learning for Cooperation in Multi-Agent Reinforcement Learning

Authors: Rupali Bhati, Sai Krishna Gottipati, Clodéric Mars, Matthew E. Taylor

Abstract: While there has been significant progress in curriculum learning and continuous learning for training agents to generalize across a wide variety of environments in the context of single-agent reinforcement learning, it is unclear if these algorithms would still be valid in a multi-agent setting. In a competitive setting, a learning agent can be trained by making it compete with a curriculum of increasingly skilled opponents. However, a general intelligent agent should also be able to learn to act around other agents and cooperate with them to achieve common goals. When cooperating with other agents, the learning agent must (a) learn how to perform the task (or subtask), and (b) increase the overall team reward. In this paper, we aim to answer the question of what kind of cooperative teammate, and what curriculum of teammates, a learning agent should be trained with to achieve these two objectives. Our results on the game Overcooked show that a pre-trained teammate who is less skilled is the best teammate for overall team reward but the worst for the learning of the agent. Moreover, somewhat surprisingly, a curriculum of teammates with decreasing skill levels performs better than other types of curricula.

Submitted 18 December, 2023; originally announced December 2023.

Comments: 9 pages, 5 figures. Presented at Agent Learning in Open-Endedness Workshop at Neural Information Processing Systems (NeurIPS 2023)

arXiv:2312.11718 [pdf, other]

Human-Machine Teaming for UAVs: An Experimentation Platform

Authors: Laila El Moujtahid, Sai Krishna Gottipati, Clodéric Mars, Matthew E. Taylor

Abstract: Full automation is often not achievable or desirable in critical systems with high-stakes decisions. Instead, human-AI teams can achieve better results. To research, develop, evaluate, and validate algorithms suited for such teaming, lightweight experimentation platforms that enable interactions between humans and multiple AI agents are necessary. However, there are limited examples of such platforms for defense environments. To address this gap, we present the Cogment human-machine teaming experimentation platform, which implements human-machine teaming (HMT) use cases that feature heterogeneous multi-agent systems and can involve learning AI agents, static AI agents, and humans. It is built on the Cogment platform and has been used for academic research, including work presented at the ALA workshop at AAMAS this year [1]. With this platform, we hope to facilitate further research on human-machine teaming in critical systems and defense environments.

Submitted 18 December, 2023; originally announced December 2023.

Comments: 9 pages, 6 figures. Presented at Conference on Artificial Intelligence for Defense (CAID) 2023

arXiv:2311.03580 [pdf, other]

doi 10.1093/mnras/stad3416

Halo Growth and Merger Rates as a Cosmological Test

Authors: Yuba Amoura, Nicole E. Drakos, Anael Berrouet, James E. Taylor

Abstract: Dark matter haloes grow at a rate that depends on the value of the cosmological parameters $σ_8$ and $Ω_{\rm m}$ through the initial power spectrum and the linear growth factor. While halo abundance is routinely used to constrain these parameters, through cluster abundance studies, the halo growth rate is not. In recent work, we proposed constraining the cosmological parameters using observational estimates of the overall dynamical "age" of clusters, expressed, for instance, by their half-mass assembly redshift $z_{50}$. Here we explore the prospects for using the instantaneous growth rate, as estimated from the halo merger rate, from the average growth rate over the last dynamical time, or from the fraction of systems with recent episodes of major growth. We show that the merger rate is mainly sensitive to the amplitude of fluctuations $σ_8$, while the rates of recent growth provide constraints in the $Ω_{\rm m}$-$σ_8$ plane that are almost orthogonal to those provided by abundance studies. Data collected for forthcoming cluster abundance studies, or studies of the galaxy merger rate in current and future galaxy surveys, may thus provide additional constraints on the cosmological parameters complementary to those already derived from halo abundance.

Submitted 6 November, 2023; originally announced November 2023.

Comments: 16 pages, 12 figures, 1 table and two figures in appendix

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 527, Issue 2, January 2024, Pages 3459-3473

arXiv:2311.00810 [pdf, other]

A Call to Arms: AI Should be Critical for Social Media Analysis of Conflict Zones

Authors: Afia Abedin, Abdul Bais, Cody Buntain, Laura Courchesne, Brian McQuinn, Matthew E. Taylor, Muhib Ullah

Abstract: The massive proliferation of social media data represents a transformative moment in conflict studies. This data can provide unique insights into the spread and use of weaponry, but the scale and types of data are problematic for traditional open-source intelligence. This paper presents preliminary, transdisciplinary work using computer vision to identify specific weapon systems and the insignias of the armed groups using them. There is potential to not only track how weapons are distributed through networks of armed units but also to track which types of weapons are being used by the different types of state and non-state military actors in Ukraine. Such a system could ultimately be used to understand conflicts in real-time, including where humanitarian and medical aid is most needed. We believe that using AI to help automate such processes should be a high-priority goal for our community, with near-term real-world payoffs.

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2310.17085 [pdf, other]

doi 10.1093/mnras/stad3235

Testing the Surface Brightness Fluctuation Method on Dwarf Galaxies in the COSMOS Field

Authors: Lauren M. Foster, James E. Taylor, John P. Blakeslee

Abstract: Dwarf galaxies are important tracers of small-scale cosmological structure, yet much of our knowledge about these systems comes from the limited sample of dwarf galaxies within the Local Group. To make a comprehensive inventory of dwarf populations in the local Universe, we require effective methods for deriving distance estimates for large numbers of faint, low surface brightness objects. Here we test the surface brightness fluctuation (SBF) method, traditionally applied to brighter early-type galaxies, on a sample of 20 nearby dwarf galaxies detected in the COSMOS field. These objects are partially resolved in HST ACS images, and have confirmed redshift distances in the range 17-130 Mpc. We discuss the many model choices required in applying the SBF method, and explore how these affect the final distance estimates. Amongst other variations on the method, when applying the SBF method, we alter the standard equation to include a term accounting for the power spectrum of the background, greatly improving our results. For the most robust modelling choices, we find a roughly Gaussian SBF signal that correlates linearly with distance out to distances of 50-100 Mpc, but with only a fraction of the power expected. At larger distances, there is excess power relative to that predicted, probably from undetected point sources. Overall, obtaining accurate SBF distances to faint, irregular galaxies remains challenging, but may yet prove possible with the inclusion of more information about galaxy properties and point source populations, and the use of more advanced techniques.

Submitted 25 October, 2023; originally announced October 2023.

Comments: 18 pages, 26 figures, accepted by MNRAS

arXiv:2310.10740 [pdf, other]

Unbiased Estimation of Structured Prediction Error

Authors: Kevin Fry, Jonathan E. Taylor

Abstract: Many modern datasets, such as those in ecology and geology, are composed of samples with spatial structure and dependence. With such data violating the usual independent and identically distributed (IID) assumption in machine learning and classical statistics, it is unclear a priori how one should measure the performance and generalization of models. Several authors have empirically investigated cross-validation (CV) methods in this setting, reaching mixed conclusions. We provide a class of unbiased estimation methods for general quadratic errors, correlated Gaussian response, and arbitrary prediction function $g$, for a noise-elevated version of the error. Our approach generalizes the coupled bootstrap (CB) from the normal means problem to general normal data, allowing correlation both within and between the training and test sets. CB relies on creating bootstrap samples that are intelligently decoupled, in the sense of being statistically independent. Specifically, the key to CB lies in generating two independent "views" of our data and using them as stand-ins for the usual independent training and test samples. Beginning with Mallows' $C_p$, we generalize the estimator to develop our generalized $C_p$ estimators (GC). We show that under only a moment condition on $g$, this noise-elevated error estimate converges smoothly to the noiseless error estimate. We show that when Stein's unbiased risk estimator (SURE) applies, GC converges to SURE as in the normal means problem. Further, we use these same tools to analyze CV and provide some theoretical analysis to help understand when CV will provide good estimates of error. Simulations align with our theoretical results, demonstrating the effectiveness of GC and illustrating the behavior of CV methods. Lastly, we apply our estimator to a model selection task on geothermal data in Nevada.

Submitted 16 October, 2023; originally announced October 2023.

Comments: 28 pages, 13 figures

arXiv:2310.01276 [pdf, other]

doi 10.3847/1538-4365/ad0846

The UNCOVER Survey: A First-look HST+JWST Catalog of Galaxy Redshifts and Stellar Population Properties Spanning $0.2 \lesssim z \lesssim 15$

Authors: Bingjie Wang, Joel Leja, Ivo Labbé, Rachel Bezanson, Katherine E. Whitaker, Gabriel Brammer, Lukas J. Furtak, John R. Weaver, Sedona H. Price, Adi Zitrin, Hakim Atek, Dan Coe, Sam E. Cutler, Pratika Dayal, Pieter van Dokkum, Robert Feldmann, Danilo Marchesini, Marijn Franx, Natascha Förster Schreiber, Seiji Fujimoto, Marla Geha, Karl Glazebrook, Anna de Graaff, Jenny E. Greene, Stéphanie Juneau, et al. (19 additional authors not shown)

Abstract: The recent UNCOVER survey with the James Webb Space Telescope (JWST) exploits the nearby cluster Abell 2744 to create the deepest view of our universe to date by leveraging strong gravitational lensing. In this work, we perform photometric fitting of more than 50,000 robustly detected sources out to $z \sim 15$. We show the redshift evolution of stellar ages, star formation rates, and rest-frame colors across the full range of $0.2 \lesssim z \lesssim 15$. The galaxy properties are inferred using the Prospector Bayesian inference framework using informative Prospector-$β$ priors on masses and star formation histories to produce joint redshift and stellar population posteriors, and additionally lensing magnification is performed on-the-fly to ensure consistency with the scale-dependent priors. We show that this approach produces excellent photometric redshifts with $σ_{\rm NMAD} \sim 0.03$, of a similar quality to the established photometric redshift code EAzY. In line with the open-source scientific objective of the Treasury survey, we publicly release the stellar population catalog with this paper, derived from the photometric catalog adapting aperture sizes based on source profiles. This release includes posterior moments, maximum-likelihood spectra, star-formation histories, and full posterior distributions, offering a rich data set to explore the processes governing galaxy formation and evolution over a parameter space now accessible by JWST.

Submitted 16 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: Corrected typos: Eq.1 should've been (1-kappa)^2, and the lens maps are normalized to D_ds/D_s=1. These errors were only in the writing; no data products or results were affected. The SPS catalogs are accessible via the UNCOVER survey webpage: https://jwst-uncover.github.io/DR2.html#SPSCatalogs, with a copy deposited to Zenodo: https://doi.org/10.5281/zenodo.8401181

Journal ref: The Astrophysical Journal Supplement Series, 270, 12 (2024)

arXiv:2309.11113 [pdf, ps, other]

Groups with at most 13 nonpower subgroups

Authors: Jiwei Zheng, Wei Zhou, D. E. Taylor

Abstract: For a group $G$ and positive integer $m$, $G^m$ denotes the subgroup generated by the elements $g^m$ where $g$ runs through $G$. The subgroups not of the form $G^m$ are called nonpower subgroups. We extend the classification of groups with few nonpower subgroups from groups with at most 9 nonpower subgroups to groups with at most 13 nonpower subgroups.

Submitted 20 September, 2023; originally announced September 2023.

Comments: 16 pages, 0 figures

MSC Class: 20D25; 20D60

arXiv:2308.00830 [pdf, other]

Do assumptions about the central density of subhaloes affect dark matter annihilation and lensing calculations?

Authors: Nicole E. Drakos, James E. Taylor, Andrew J. Benson

Abstract: A growing body of evidence suggests that the central density of cuspy dark matter subhaloes is conserved in minor mergers. However, empirical models of subhalo evolution, calibrated from simulations, often assume a drop in the central density. Since empirical models of subhaloes are used in galaxy-galaxy lensing studies and dark matter annihilation calculations, we explore the consequences of assuming different subhalo models. We find that dark matter annihilation calculations are very sensitive to the assumed subhalo mass profile, and different models can give more than a magnitude difference in the J-factor and boost factor in individual haloes. On the other hand, the shear and convergence profiles used in galaxy-galaxy lensing are sensitive to the initial profile assumed (e.g., NFW versus Einasto) but are otherwise well-approximated by a simple model in which the original profile is sharply truncated. We conclude that since the innermost parts of haloes are difficult to resolve in simulations, it is important to have a theoretical understanding of how subhaloes evolve to make accurate predictions of the dark matter annihilation signal.

Submitted 1 August, 2023; originally announced August 2023.

Comments: 15 pages, 13 figures. Submitted to MNRAS

arXiv:2307.05603 [pdf, other]

Can You Improve My Code? Optimizing Programs with Local Search

Authors: Fatemeh Abdollahi, Saqib Ameen, Matthew E. Taylor, Levi H. S. Lelis

Abstract: This paper introduces a local search method for improving an existing program with respect to a measurable objective. Program Optimization with Locally Improving Search (POLIS) exploits the structure of a program, defined by its lines. POLIS improves a single line of the program while keeping the remaining lines fixed, using existing brute-force synthesis algorithms, and continues iterating until it is unable to improve the program's performance. POLIS was evaluated with a 27-person user study, where participants wrote programs attempting to maximize the score of two single-agent games: Lunar Lander and Highway. POLIS was able to substantially improve the participants' programs with respect to the game scores. A proof-of-concept demonstration on existing Stack Overflow code measures applicability in real-world problems. These results suggest that POLIS could be used as a helpful programming assistant for programming problems with measurable objectives.

Submitted 10 July, 2023; originally announced July 2023.

Comments: International Joint Conference on Artificial Intelligence (IJCAI) 2023

arXiv:2306.10693 [pdf, other]

doi 10.3847/1538-4357/acde56

Galaxy And Mass Assembly (GAMA): Stellar-to-Dynamical Mass Relation I. Constraining the Precision of Stellar Mass Estimates

Authors: M. Burak Dogruel, Edward N. Taylor, Michelle Cluver, Francesco D'Eugenio, Anna de Graaff, Matthew Colless, Alessandro Sonnenfeld

Abstract: In this empirical work, we aim to quantify the systematic uncertainties in stellar mass $(M_\star)$ estimates made from spectral energy distribution (SED) fitting through stellar population synthesis (SPS), for galaxies in the local Universe, by using the dynamical mass $(M_\text{dyn})$ estimator as an SED-independent check on stellar mass. We first construct a statistical model of the high-dimensional space of galaxy properties: size $(R_e)$, velocity dispersion $(σ_e)$, surface brightness $(I_e)$, mass-to-light ratio $(M_\star/L)$, rest-frame colour, Sérsic index $(n)$ and dynamical mass $(M_\text{dyn})$, accounting for selection effects and covariant errors. We disentangle the correlations among galaxy properties and find that the variation in $M_\star/M_\text{dyn}$ is driven by $σ_e$, Sérsic index and colour. We use these parameters to calibrate an SED-independent $M_\star$ estimator, $\hat{M}_\star$. We find the random scatter of the relation $M_\star-\hat{M}_\star$ to be $0.108\text{dex}$ and $0.147\text{dex}$ for quiescent and star-forming galaxies respectively. Finally, we inspect the residuals as a function of SPS parameters (dust, age, metallicity, star formation rate) and spectral indices (H$α$, H$δ$, $D_n4000$). For quiescent galaxies, $\sim65\%$ of the scatter can be explained by the uncertainty in SPS parameters, with dust and age being the largest sources of uncertainty. For star-forming galaxies, while age and metallicity are the leading factors, SPS parameters account for only $\sim13\%$ of the scatter.
These results leave us with remaining unmodelled scatters of $0.055\text{dex}$ and $0.122\text{dex}$ for quiescent and star-forming galaxies respectively. This can be interpreted as a conservative limit on the precision in $M_\star$ that can be achieved via simple SPS-modelling.

Submitted 19 June, 2023; originally announced June 2023.

Comments: Accepted for publication in the Astrophysical Journal on 14 June 2023

arXiv:2306.04675 [pdf, other]

Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

Authors: George Stein, Jesse C. Cresswell, Rasa Hosseinzadeh, Yi Sui, Brendan Leigh Ross, Valentin Villecroze, Zhaoyan Liu, Anthony L. Caterini, J. Eric T. Taylor, Gabriel Loaiza-Ganem

Abstract: We systematically study a wide variety of generative models spanning semantically-diverse image datasets to understand and improve the feature extractors and metrics used to evaluate them. Using best practices in psychophysics, we measure human perception of image realism for generated samples by conducting the largest experiment evaluating generative models to date, and find that no existing metric strongly correlates with human evaluations. Comparing to 17 modern metrics for evaluating the overall performance, fidelity, diversity, rarity, and memorization of generative models, we find that the state-of-the-art perceptual realism of diffusion models as judged by humans is not reflected in commonly reported metrics such as FID. This discrepancy is not explained by diversity in generated samples, though one cause is over-reliance on Inception-V3. We address these flaws through a study of alternative self-supervised feature extractors, find that the semantic information encoded by individual networks strongly depends on their training procedure, and show that DINOv2-ViT-L/14 allows for much richer evaluation of generative models. Next, we investigate data memorization, and find that generative models do memorize training examples on simple, smaller datasets like CIFAR10, but not necessarily on more complex datasets like ImageNet. However, our experiments show that current metrics do not properly detect memorization: none in the literature is able to separate memorization from other phenomena such as underfitting or mode shrinkage.
To facilitate further development of generative models and their evaluation we release all generated image datasets, human evaluation data, and a modular library to compute 17 common metrics for 9 different encoders at https://github.com/layer6ai-labs/dgm-eval.

Submitted 30 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: NeurIPS 2023. 53 pages, 29 figures, 12 tables. Code at https://github.com/layer6ai-labs/dgm-eval, reviews at https://openreview.net/forum?id=08zf7kTOoh

Journal ref: Thirty-seventh Conference on Neural Information Processing Systems (2023)

arXiv:2305.13826 [pdf, other]

"Is the Pope Catholic?" Applying Chain-of-Thought Reasoning to Understanding Conversational Implicatures

Authors: Zae Myung Kim, David E. Taylor, Dongyeop Kang

Abstract: Conversational implicatures are pragmatic inferences that require listeners to deduce the intended meaning conveyed by a speaker from their explicit utterances. Although such inferential reasoning is fundamental to human communication, recent research indicates that large language models struggle to comprehend these implicatures as effectively as the average human. This paper demonstrates that by incorporating Grice's Four Maxims into the model through chain-of-thought prompting, we can significantly enhance its performance, surpassing even the average human performance on this task.

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.09215 [pdf, other]

doi 10.1093/mnras/stad1500

A Geometric Calibration of the Tip of the Red Giant Branch in the Milky Way using Gaia DR3

Authors: M. Dixon, J. Mould, C. Flynn, E. N. Taylor, C. Lidman, A. R. Duffy

Abstract: We use the latest parallax measurements from Gaia DR3 to obtain a geometric calibration of the tip of the red giant branch (TRGB) in Cousins $I$ magnitudes as a standard candle for cosmology. We utilise the following surveys: SkyMapper DR3, APASS DR9, ATLAS Refcat2, and Gaia DR3 synthetic photometry to obtain multiple zero-point calibrations of the TRGB magnitude, $M_{I}^{TRGB}$. Our sample contains Milky Way halo stars at high galactic latitudes ($|b| > 36$) where the impact of metallicity, dust, and crowding is minimised. The magnitude of the TRGB is identified using Sobel edge detection, but this approach introduced a systematic offset. To address this issue, we utilised simulations with PARSEC isochrones and showed how to calibrate and remove this bias. Applying our method within the colour range where the slope of the TRGB is relatively flat for metal-poor halo stars (1.55 $<$ $(BP-RP)$ $<$ 2.25), we find a weighted average $M_{I}^{TRGB} = -4.042 \pm 0.041$ (stat) $\pm0.031$ (sys) mag. A geometric calibration of the Milky Way TRGB has the benefit of being independent of other distance indicators and will help probe systematics in the local distance ladder, leading to improved measurements of the Hubble constant.

Submitted 16 May, 2023; originally announced May 2023.

Comments: 14 pages, 13 figures. Accepted for publication in MNRAS

arXiv:2305.05687 [pdf, other]

doi 10.3847/1538-4357/accc89

Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies

Authors: James Paul Mason, Alexandra Werth, Colin G. West, Allison A. Youngblood, Donald L. Woodraska, Courtney Peck, Kevin Lacjak, Florian G. Frick, Moutamen Gabir, Reema A. Alsinan, Thomas Jacobsen, Mohammad Alrubaie, Kayla M. Chizmar, Benjamin P. Lau, Lizbeth Montoya Dominguez, David Price, Dylan R. Butler, Connor J. Biron, Nikita Feoktistov, Kai Dewey, N. E. Loomis, Michal Bodzianowski, Connor Kuybus, Henry Dietrick, Aubrey M. Wolfe, et al. (977 additional authors not shown)

Abstract: Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms that could explain it: nanoflares or Alfvén waves. To date, neither can be directly observed. Nanoflares are, by definition, extremely small, but their aggregate energy release could represent a substantial heating mechanism, presuming they are sufficiently abundant. One way to test this presumption is via the flare frequency distribution, which describes how often flares of various energies occur. If the slope of the power law fitting the flare frequency distribution is above a critical threshold, $α=2$ as established in prior literature, then there should be a sufficient abundance of nanoflares to explain coronal heating. We performed $>$600 case studies of solar flares, made possible by an unprecedented number of data analysts via three semesters of an undergraduate physics laboratory course. This allowed us to include two crucial, but nontrivial, analysis methods: pre-flare baseline subtraction and computation of the flare energy, which requires determining flare start and stop times. We aggregated the results of these analyses into a statistical study to determine that $α= 1.63 \pm 0.03$.
This is below the critical threshold, suggesting that Alfvén waves are an important driver of coronal heating.

Submitted 9 May, 2023; originally announced May 2023.

Comments: 1,002 authors, 14 pages, 4 figures, 3 tables, published by The Astrophysical Journal on 2023-05-09, volume 948, page 71

arXiv:2304.09169 [pdf, other]

doi 10.1093/mnras/stad1098

The role of mass and environment in the build up of the quenched galaxy population since cosmic noon

Authors: E. Taylor, O. Almaini, M. Merrifield, D. Maltby, V. Wild, W. G. Hartley, K. Rowlands

Abstract: We conduct the first study of how the relative quenching probability of galaxies depends on environment over the redshift range $0.5 < z < 3$, using data from the UKIDSS Ultra-Deep Survey. By constructing the stellar mass functions for quiescent and post-starburst (PSB) galaxies in high, medium and low density environments to $z = 3$, we find an excess of quenched galaxies in dense environments out to at least $z \sim 2$. Using the growth rate in the number of quenched galaxies, combined with the star-forming galaxy mass function, we calculate the probability that a given star-forming galaxy is quenched per unit time. We find a significantly higher quenching rate in dense environments (at a given stellar mass) at all redshifts. Massive galaxies (M$_* > 10^{10.7}$ M$_{\odot}$) are on average 1.7 $\pm$ 0.2 times more likely to quench per Gyr in the densest third of environments compared to the sparsest third. Finally, we compare the quiescent galaxy growth rate to the rate at which galaxies pass through a PSB phase. Assuming a visibility timescale of 500 Myr, we find that the PSB route can explain $\sim$ 50\% of the growth in the quiescent population at high stellar mass (M$_* > 10^{10.7}$ M$_{\odot}$) in the redshift range $0.5 < z < 3$, and potentially all of the growth at lower stellar masses.

Submitted 18 April, 2023; originally announced April 2023.

Comments: 12 pages, 8 figures. Accepted for publication in MNRAS

arXiv:2303.06121 [pdf, other]

Ignorance is Bliss: Robust Control via Information Gating

Authors: Manan Tomar, Riashat Islam, Matthew E. Taylor, Sergey Levine, Philip Bachman

Abstract: Informational parsimony provides a useful inductive bias for learning representations that achieve better generalization by being robust to noise and spurious correlations. We propose \textit{information gating} as a way to learn parsimonious representations that identify the minimal information required for a task. When gating information, we can learn to reveal as little information as possible so that a task remains solvable, or hide as little information as possible so that a task becomes unsolvable. We gate information using a differentiable parameterization of the signal-to-noise ratio, which can be applied to arbitrary values in a network, e.g., erasing pixels at the input layer or activations in some intermediate layer. When gating at the input layer, our models learn which visual cues matter for a given task. When gating intermediate layers, our models learn which activations are needed for subsequent stages of computation. We call our approach \textit{InfoGating}. We apply InfoGating to various objectives such as multi-step forward and inverse dynamics models, Q-learning, and behavior cloning, highlighting how InfoGating can naturally help in discarding information not relevant for control. Results show that learning to identify and use minimal information can improve generalization in downstream tasks. Policies based on InfoGating are considerably more robust to irrelevant visual features, leading to improved pretraining and finetuning of RL models.

Submitted 8 December, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

Comments: NeurIPS 2023

arXiv:2303.05520 [pdf, other]

Evolution in the orbital structure of quiescent galaxies from MAGPI, LEGA-C and SAMI surveys: direct evidence for merger-driven growth over the last 7 Gy

Authors: Francesco D'Eugenio, Arjen van der Wel, Joanna M. Piotrowska, Rachel Bezanson, Edward N. Taylor, Jesse van de Sande, William M. Baker, Eric F. Bell, Sabine Bellstedt, Joss Bland-Hawthorn, Asa F. L. Bluck, Sarah Brough, Julia J. Bryant, Matthew Colless, Luca Cortese, Scott M. Croom, Caro Derkenne, Pieter van Dokkum, Deanne Fisher, Caroline Foster, Anna Gallazzi, Anna de Graaff, Brent Groves, Josha van Houdt, Claudia del P. Lagos, et al. (15 additional authors not shown)

Abstract: We present the first study of spatially integrated higher-order stellar kinematics over cosmic time. We use deep rest-frame optical spectroscopy of quiescent galaxies at redshifts z=0.05, 0.3 and 0.8 from the SAMI, MAGPI and LEGA-C surveys to measure the excess kurtosis $h_4$ of the stellar velocity distribution, the latter parametrised as a Gauss-Hermite series. Conservatively using a redshift-independent cut in stellar mass ($M_\star = 10^{11}\,{\rm M}_\odot$), and matching the stellar-mass distributions of our samples, we find 7 $σ$ evidence of $h_4$ increasing with cosmic time, from a median value of 0.019$\pm$0.002 at z=0.8 to 0.059$\pm$0.004 at z=0.06. Alternatively, we use a physically motivated sample selection, based on the mass distribution of the progenitors of local quiescent galaxies as inferred from numerical simulations; in this case, we find 10 $σ$ evidence. This evolution suggests that, over the last 7 Gyr, there has been a gradual decrease in the rotation-to-dispersion ratio and an increase in the radial anisotropy of the stellar velocity distribution, qualitatively consistent with accretion of gas-poor satellites. These findings demonstrate that massive galaxies continue to accrete mass and increase their dispersion support after becoming quiescent.

Submitted 9 March, 2023; originally announced March 2023.

Comments: 19 pages, 9 figures. Accepted for publication in MNRAS

arXiv:2303.04157 [pdf, other]

Different higher-order kinematics between star-forming and quiescent galaxies based on the SAMI, MAGPI and LEGA-C surveys

Authors: Francesco D'Eugenio, Arjen van der Wel, Caro Derkenne, Josha van Houdt, Rachel Bezanson, Edward N. Taylor, Jesse van de Sande, William M. Baker, Eric F. Bell, Joss Bland-Hawthorn, Asa F. L. Bluck, Sarah Brough, Julia J. Bryant, Matthew Colless, Luca Cortese, Scott M. Croom, Pieter van Dokkum, Deanne Fisher, Caroline Foster, Amelia Fraser-McKelvie, Anna Gallazzi, Anna de Graaff, Brent Groves, Claudia del P. Lagos, Tobias J. Looser, et al. (16 additional authors not shown)

Abstract: We present the first statistical study of spatially integrated non-Gaussian stellar kinematics spanning 7 Gyr in cosmic time. We use deep, rest-frame optical spectroscopy of massive galaxies (stellar mass $M_\star > 10^{10.5} {\rm M}_\odot$) at redshifts z = 0.05, 0.3 and 0.8 from the SAMI, MAGPI and LEGA-C surveys, to measure the excess kurtosis $h_4$ of the stellar velocity distribution, the latter parametrised as a Gauss-Hermite series. We find that at all redshifts where we have large enough samples, $h_4$ anti-correlates with the ratio between rotation and dispersion, highlighting the physical connection between these two kinematic observables. In addition, and independently from the anti-correlation with rotation-to-dispersion ratio, we also find a correlation between $h_4$ and $M_\star$, potentially connected to the assembly history of galaxies. In contrast, after controlling for mass, we find no evidence of independent correlation between $h_4$ and aperture velocity dispersion or galaxy size. These results hold for both star-forming and quiescent galaxies. For quiescent galaxies, $h_4$ also correlates with projected shape, even after controlling for the rotation-to-dispersion ratio. At any given redshift, star-forming galaxies have lower $h_4$ compared to quiescent galaxies, highlighting the link between kinematic structure and star-forming activity.

Submitted 7 March, 2023; originally announced March 2023.

Comments: 26 pages, 15 figures. Accepted for publication in MNRAS

arXiv:2302.12818 [pdf, other]

doi 10.1093/mnras/stad2516

EDGE: The shape of dark matter haloes in the faintest galaxies

Authors: Matthew D. A. Orkney, Ethan Taylor, Justin I. Read, Martin P. Rey, Andrew Pontzen, Oscar Agertz, Stacy Y. Kim, Maxime Delorme

Abstract: Collisionless Dark Matter Only (DMO) structure formation simulations predict that Dark Matter (DM) haloes are prolate in their centres and triaxial towards their outskirts. The addition of gas condensation transforms the central DM shape to be rounder and more oblate. It is not clear, however, whether such shape transformations occur in `ultra-faint' dwarfs, which have extremely low baryon fractions. We present the first study of the shape and velocity anisotropy of ultra-faint dwarf galaxies that have gas mass fractions of $f_{\rm gas}(r<R_{\rm half}) < 0.06$. These dwarfs are drawn from the Engineering Dwarfs at Galaxy formation's Edge (EDGE) project, using high resolution simulations that allow us to resolve DM halo shapes within the half light radius ($\sim 100\,$pc). We show that gas-poor ultra-faints ($M_{\rm 200c} \leqslant 1.5\times10^9\,$M$_\odot$; $f_{\rm gas} < 10^{-5}$) retain their pristine prolate DM halo shape even when gas, star formation and feedback are included. This could provide a new and robust test of DM models. By contrast, gas-rich ultra-faints ($M_{\rm 200c} > 3\times10^9\,$M$_\odot$; $f_{\rm gas} > 10^{-4}$) become rounder and more oblate within $\sim 10$ half light radii. Finally, we find that most of our simulated dwarfs have significant radial velocity anisotropy that rises to $\tilde{β} > 0.5$ at $R \gtrsim 3 R_{\rm half}$. The one exception is a dwarf that forms a rotating gas/stellar disc because of a planar, major merger.
Such strong anisotropy should be taken into account when building mass models of gas-poor ultra-faints.

Submitted 5 September, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

Comments: 16 pages and 11 figures (excluding appendices), accepted by MNRAS

arXiv:2302.06548 [pdf, other]

Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning

Authors: Bram Grooten, Ghada Sokar, Shibhansh Dohare, Elena Mocanu, Matthew E. Taylor, Mykola Pechenizkiy, Decebal Constantin Mocanu

Abstract: Tomorrow's robots will need to distinguish useful information from noise when performing different tasks. A household robot for instance may continuously receive a plethora of information about the home, but needs to focus on just a small subset to successfully execute its current chore. Filtering distracting inputs that contain irrelevant data has received little attention in the reinforcement learning literature. To start resolving this, we formulate a problem setting in reinforcement learning called the $\textit{extremely noisy environment}$ (ENE), where up to $99\%$ of the input features are pure noise. Agents need to detect which features provide task-relevant information about the state of the environment. Consequently, we propose a new method termed $\textit{Automatic Noise Filtering}$ (ANF), which uses the principles of dynamic sparse training in synergy with various deep reinforcement learning algorithms. The sparse input layer learns to focus its connectivity on task-relevant features, such that ANF-SAC and ANF-TD3 outperform standard SAC and TD3 by a large margin, while using up to $95\%$ fewer weights. Furthermore, we devise a transfer learning setting for ENEs, by permuting all features of the environment after 1M timesteps to simulate the fact that other information sources can become relevant as the world evolves. Again, ANF surpasses the baselines in final performance and sample complexity. Our code is available at https://github.com/bramgrooten/automatic-noise-filtering

Submitted 13 February, 2023; originally announced February 2023.

Comments: Accepted as full-paper at AAMAS 2023

arXiv:2302.03930 [pdf]

A Model for Forecasting Air Quality Index in Port Harcourt Nigeria Using Bi-LSTM Algorithm

Authors: O. E. Taylor, P. S. Ezekiel

Abstract: The release of toxic gases by industries, emissions from vehicles, and an increase in the concentration of harmful gases and particulate matter in the atmosphere are all contributing factors to the deterioration of the quality of the air. Factors such as industries, urbanization, population growth, and the increased use of vehicles contribute to the rapid increase in pollution levels, which can adversely impact human health. This paper presents a model for forecasting the air quality index in Nigeria using the Bi-directional LSTM model. The air pollution data was downloaded from an online database (UCL). The dataset was pre-processed using pandas tools in Python. The pre-processed result was used as input features in training a Bi-LSTM model to make future forecasts of the values of the particulate matter PM2.5 and PM10. The Bi-LSTM model was evaluated using evaluation parameters such as mean square error, mean absolute error, absolute mean square, and R^2. The result of the Bi-LSTM shows a mean square error of 52.99%, relative mean square error of 7.28%, mean absolute error of 3.4%, and R^2 of 97%. This shows that the model follows a seamless trend in forecasting the air quality in Port Harcourt, Nigeria.

Submitted 8 February, 2023; originally announced February 2023.

arXiv:2301.13230 [pdf, other]

doi 10.1051/0004-6361/202346026

Strong lensing selection effects

Authors: Alessandro Sonnenfeld, Shun-Sheng Li, Giulia Despali, Raphael Gavazzi, Anowar J. Shajib, Edward N. Taylor

Abstract: Context. Strong lenses are a biased subset of the general population of galaxies. Aims. The goal of this work is to quantify how lens galaxies and lensed sources differ from their parent distribution, namely the strong lensing bias. Methods. We first studied how the strong lensing cross-section varies as a function of lens and source properties. Then, we simulated strong lensing surveys with data similar to that expected for Euclid and measured the strong lensing bias in different scenarios. We focused particularly on two quantities: the stellar population synthesis mismatch parameter, $α_{sps}$, defined as the ratio between the true stellar mass of a galaxy and the stellar mass obtained from photometry, and the central dark matter mass at fixed stellar mass and size. Results. Strong lens galaxies are biased towards larger stellar masses, smaller half-mass radii and larger dark matter masses. The amplitude of the bias depends on the intrinsic scatter in the mass-related parameters of the galaxy population and on the completeness in Einstein radius of the lens sample. For values of the scatter that are consistent with observed scaling relations and a minimum detectable Einstein radius of $0.5''$, the strong lensing bias in $α_{sps}$ is $10\%$, while that in the central dark matter mass is $5\%$. The bias has little dependence on the properties of the source population: samples of galaxy-galaxy lenses and galaxy-quasar lenses that probe the same Einstein radius distribution are biased in a very similar way. Conclusions.
Given current uncertainties, strong lensing observations can be used directly to improve our current knowledge of the inner structure of galaxies, without the need to correct for selection effects. Time-delay measurements of $H_0$ from lensed quasars can take advantage of prior information obtained from galaxy-galaxy lenses with similar Einstein radii.

Submitted 28 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

Comments: Published on Astronomy & Astrophysics. A two-minute summary video of this paper is available at https://youtu.be/UmS9jRHTmZU

Journal ref: A&A 678, A4 (2023)

arXiv:2301.11153 [pdf, other]

Learning from Multiple Independent Advisors in Multi-agent Reinforcement Learning

Authors: Sriram Ganapathi Subramanian, Matthew E. Taylor, Kate Larson, Mark Crowley

Abstract: Multi-agent reinforcement learning typically suffers from the problem of sample inefficiency, where learning suitable policies involves the use of many data samples. Learning from external demonstrators is a possible solution that mitigates this problem. However, most prior approaches in this area assume the presence of a single demonstrator. Leveraging multiple knowledge sources (i.e., advisors) with expertise in distinct aspects of the environment could substantially speed up learning in complex environments. This paper considers the problem of simultaneously learning from multiple independent advisors in multi-agent reinforcement learning. The approach leverages a two-level Q-learning architecture, and extends this framework from single-agent to multi-agent settings. We provide principled algorithms that incorporate a set of advisors by both evaluating the advisors at each state and subsequently using the advisors to guide action selection. We also provide theoretical convergence and sample complexity guarantees. Experimentally, we validate our approach in three different test-beds and show that our algorithms give better performances than baselines, can effectively integrate the combined expertise of different advisors, and learn to ignore bad advice.

Submitted 2 March, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

Comments: Paper to appear in AAMAS 2023, London, UK

arXiv:2301.05952 [pdf, other]

A New WISE Calibration of Stellar Mass

Authors: T. H. Jarrett, M. E. Cluver, Edward N. Taylor, Sabine Bellstedt, A. S. G. Robotham, H. F. M. Yao

Abstract: We derive new empirical scaling relations between WISE mid-infrared galaxy photometry and well-determined stellar masses from SED modeling of a suite of optical-infrared photometry provided by the DR4 Catalogue of the GAMA-KiDS-VIKING survey of the southern G23 field. The mid-infrared source extraction and characterization are drawn from the WISE Extended Source Catalogue (WXSC) and the archival ALLWISE catalog, combining both resolved and compact galaxies in the G23 sample to a redshift of 0.15. Three scaling relations are derived: W1 3.4 micron luminosity versus stellar mass, and WISE W1-W2, W1-W3 colors versus mass-to-light ratio (sensitive to a variety of galaxy types from passive to star-forming). For each galaxy in the sample, we then derive the combined stellar mass from these scaling relations, producing Mstellar estimates with better than $\sim$25-30% accuracy for galaxies with $>$10$^{9}$ Msolar and $<$40 - 50% for lower luminosity dwarf galaxies. We also provide simple prescriptions for rest-frame corrections and estimating stellar masses using only the W1 flux and the W1-W2 color, making stellar masses more accessible to users of the WISE data. Given a redshift or distance, these new scaling relations will enable stellar mass estimates for any galaxy in the sky detected by WISE with high fidelity across a range of mass-to-light.

Submitted 14 January, 2023; originally announced January 2023.

Comments: Accepted for publication in the Astrophysical Journal (ApJ)

arXiv:2301.02671 [pdf, other]

The UNCOVER Survey: A first-look HST+JWST catalog of 60,000 galaxies near Abell 2744 and beyond

Authors: John R. Weaver, Sam E. Cutler, Richard Pan, Katherine E. Whitaker, Ivo Labbe, Sedona H. Price, Rachel Bezanson, Gabriel Brammer, Danilo Marchesini, Joel Leja, Bingjie Wang, Lukas J. Furtak, Adi Zitrin, Hakim Atek, Dan Coe, Pratika Dayal, Pieter van Dokkum, Robert Feldmann, Natascha Forster Schreiber, Marijn Franx, Seiji Fujimoto, Yoshinobu Fudamoto, Karl Glazebrook, Anna de Graaff, Jenny E. Greene, et al. (19 additional authors not shown)

Abstract: In November 2022, the James Webb Space Telescope (JWST) returned deep near-infrared images of Abell 2744 -- a powerful lensing cluster capable of magnifying distant, incipient galaxies beyond it. Together with the existing Hubble Space Telescope (HST) imaging, this publicly available dataset opens a fundamentally new discovery space to understand the remaining mysteries of the formation and evolution of galaxies across cosmic time. In this work, we detect and measure some 60,000 objects across the 49 arcmin$^2$ JWST footprint down to a $5\,σ$ limiting magnitude of $\sim$30 mag in 0.32" apertures. Photometry is performed using circular apertures on images matched to the point spread function of the reddest NIRCam broad band, F444W, and cleaned of bright cluster galaxies and the related intra-cluster light. To give an impression of the photometric performance, we measure photometric redshifts and achieve a $σ_{\rm NMAD}\approx0.03$ based on known, but relatively small, spectroscopic samples. With this paper, we publicly release our HST and JWST PSF-matched photometric catalog with optimally assigned aperture sizes for easy use, along with single aperture catalogs, photometric redshifts, rest-frame colors, and individual magnification estimates. These catalogs will set the stage for efficient and deep spectroscopic follow-up of some of the first JWST-selected samples in Summer 2023.

Submitted 2 October, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

Comments: 28 pages, 19 figures, resubmitted to ApJS following significant data product improvements. Comments welcome. Catalogs can be accessed at https://jwst-uncover.github.io/DR2.html#PhotometricCatalogs

arXiv:2212.08302 [pdf, other]

Safe Evaluation For Offline Learning: Are We Ready To Deploy?

Authors: Hager Radi, Josiah P. Hanna, Peter Stone, Matthew E. Taylor

Abstract: The world currently offers an abundance of data in multiple domains, from which we can learn reinforcement learning (RL) policies without further interaction with the environment. RL agents learning offline from such data is possible but deploying them while learning might be dangerous in domains where safety is critical. Therefore, it is essential to find a way to estimate how a newly-learned agent will perform if deployed in the target environment before actually deploying it and without the risk of overestimating its true performance. To achieve this, we introduce a framework for safe evaluation of offline learning using approximate high-confidence off-policy evaluation (HCOPE) to estimate the performance of offline policies during learning. In our setting, we assume a source of data, which we split into a train-set, to learn an offline policy, and a test-set, to estimate a lower-bound on the offline policy using off-policy evaluation with bootstrapping. A lower-bound estimate tells us how good a newly-learned target policy would perform before it is deployed in the real environment, and therefore allows us to decide when to deploy our learned policy.

Submitted 16 December, 2022; originally announced December 2022.

Comments: NeurIPS 2021 Workshop on Deployable Decision Making in Embodied Systems [Spotlight]

Showing 1–50 of 473 results for author: Taylor, E