The Lowell Observatory Solar Telescope: A fiber feed into the EXtreme PREcision Spectrometer

Authors: Joe Llama, Lily L. Zhao, John M. Brewer, Andrew Szymkowiak, Debra A. Fischer, Michael Collins, Jake Tiegs, Frank Cornelius

Abstract: The signal induced by a temperate, terrestrial planet orbiting a Sun-like star is an order of magnitude smaller than the host star's intrinsic variability. Understanding stellar activity is, therefore, a fundamental obstacle in confirming the smallest exoplanets. We present the Lowell Observatory Solar Telescope (LOST), a solar feed for the EXtreme PREcision Spectrometer (EXPRES) at the 4.3-m Lowell Discovery Telescope (LDT). EXPRES is one of the newest high-resolution spectrographs that accurately measure extreme radial velocity. With LOST/EXPRES, we observe disk-integrated sunlight autonomously throughout the day. In clear conditions, we achieve an R~137,500 optical spectrum of the Sun with a signal-to-noise of 500 in ~150s. Data are reduced using the standard EXPRES pipeline with minimal modification to ensure the data are comparable to the observations of other stars with the LDT. During the first three years of operation, we find a daily RMS of 71 cm/s. Additionally, having two EPRV spectrometers located in Arizona gives us an unprecedented opportunity to benchmark the performance of these planet-finders. We find an RMS of just 55 cm/s when comparing data taken simultaneously with EXPRES and NEID.

Submitted 10 July, 2024; originally announced July 2024.

Comments: SPIE Astronomical Telescopes & Instrumentation proceedings paper

arXiv:2407.04698

Stellar Metallicities and Gradients in the Faint M31 Satellites Andromeda XVI and Andromeda XXVIII

Authors: Sal Wanying Fu, Daniel R. Weisz, Else Starkenburg, Nicolas Martin, Michelle L. M. Collins, Alessandro Savino, Michael Boylan-Kolchin, Patrick Côté, Andrew E. Dolphin, Nicolas Longeard, Mario L. Mateo, Francisco J. Mercado, Nathan R. Sandford, Evan D. Skillman

Abstract: We present $\sim300$ stellar metallicity measurements in two faint M31 dwarf galaxies, Andromeda XVI ($M_V = -7.5$) and Andromeda XXVIII ($M_V = -8.8$) derived using metallicity-sensitive Calcium H & K narrow-band Hubble Space Telescope imaging. These are the first individual stellar metallicities in And XVI (95 stars). Our And XXVIII sample (191 stars) is a factor of $\sim15$ increase over literature metallicities. For And XVI, we measure $\langle \mbox{[Fe/H]}\rangle = -2.17^{+0.05}_{-0.05}$, $\sigma_{\mbox{[Fe/H]}}=0.33^{+0.07}_{-0.07}$, and $\nabla_{\mbox{[Fe/H]}} = -0.23\pm0.15$ dex $R_e^{-1}$. We find that And XVI is more metal-rich than MW UFDs of similar luminosity, which may be a result of its unusually extended star formation history. For And XXVIII, we measure $\langle \mbox{[Fe/H]}\rangle = -1.95^{+0.04}_{-0.04}$, $\sigma_{\mbox{[Fe/H]}}=0.34^{+0.07}_{-0.07}$, and $\nabla_{\mbox{[Fe/H]}} = -0.46 \pm 0.10$ dex $R_e^{-1}$, placing it on the dwarf galaxy mass-metallicity relation. Neither galaxy has a metallicity distribution function with an abrupt metal-rich truncation, suggesting that star formation fell off gradually. The stellar metallicity gradient measurements are among the first for faint ($L \lesssim 10^6~L_{\odot}$) galaxies outside the Milky Way halo. Both galaxies' gradients are consistent with predictions from the FIRE simulations, where an age-gradient strength relationship is the observational consequence of stellar feedback that produces dark matter cores. We include a catalog for community spectroscopic follow-up, including 19 extremely metal-poor ($\mbox{[Fe/H]} < -3.0$) star candidates, which make up 7% of And XVI's MDF and 6% of And XXVIII's.

Submitted 5 July, 2024; originally announced July 2024.

Comments: 17 pages, 5 figures, 4 tables, ApJ submitted; comments welcome!

arXiv:2407.04349

Elemental Abundances in And XIX From Coadded Spectra

Authors: L. R. Cullinane, Karoline M. Gilbert, Ivanna Escala, J. Leigh Wojno, Evan N. Kirby, Kateryna A. Kvasova, Erik Tollerud, Michelle L. M. Collins, R. Michael Rich

Abstract: With a luminosity similar to that of Milky Way dwarf spheroidal (dSph) systems like Sextans, but a spatial extent similar to that of ultradiffuse galaxies (UDGs), Andromeda (And) XIX is an unusual satellite of M31. To investigate the origin of this galaxy, we measure chemical abundances for And XIX derived from medium-resolution ($R\sim6000$) spectra from Keck II/DEIMOS. We coadd 79 red giant branch stars, grouped by photometric metallicity, in order to obtain a sufficiently high signal-to-noise ratio (S/N) to measure 20 [Fe/H] and [$\alpha$/Fe] abundances via spectral synthesis. The latter are the first such measurements for And XIX. The mean metallicity we derive for And XIX places it $\sim2\sigma$ higher than the present-day stellar mass-metallicity relation for Local Group dwarf galaxies, potentially indicating it has experienced tidal stripping. A loss of gas and associated quenching during such a process, which prevents the extended star formation necessary to produce shallow [$\alpha$/Fe]--[Fe/H] gradients in massive systems, is also consistent with the steeply decreasing [$\alpha$/Fe]--[Fe/H] trend we observe. In combination with the diffuse structure and disturbed kinematic properties of And XIX, this suggests tidal interactions, rather than galaxy mergers, are strong contenders for its formation.

Submitted 5 July, 2024; originally announced July 2024.

Comments: 26 pages, 10 figures. Accepted by ApJ

arXiv:2406.16807

Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

Authors: Katherine M. Collins, Najoung Kim, Yonatan Bitton, Verena Rieser, Shayegan Omidshafiei, Yushi Hu, Sherol Chen, Senjuti Dutta, Minsuk Chang, Kimin Lee, Youwei Liang, Georgina Evans, Sahil Singla, Gang Li, Adrian Weller, Junfeng He, Deepak Ramachandran, Krishnamurthy Dj Dvijotham

Abstract: Human feedback plays a critical role in learning and refining reward models for text-to-image generation, but the optimal form the feedback should take for learning an accurate reward function has not been conclusively established. This paper investigates the effectiveness of fine-grained feedback which captures nuanced distinctions in image quality and prompt-alignment, compared to traditional coarse-grained feedback (for example, thumbs up/down or ranking between a set of options). While fine-grained feedback holds promise, particularly for systems catering to diverse societal preferences, we show that demonstrating its superiority to coarse-grained feedback is not automatic. Through experiments on real and synthetic preference data, we surface the complexities of building effective models due to the interplay of model choice, feedback type, and the alignment between human judgment and computational interpretation. We identify key challenges in eliciting and utilizing fine-grained feedback, prompting a reassessment of its assumed benefits and practicality. Our findings -- e.g., that fine-grained feedback can lead to worse models for a fixed budget, in some settings; however, in controlled settings with known attributes, fine-grained rewards can indeed be more helpful -- call for careful consideration of feedback attributes and potentially beckon novel modeling approaches to appropriately unlock the potential value of fine-grained feedback in-the-wild.

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.12431

The influence of dislocations on R-phase transformations in a NiTi shape memory alloy

Authors: Himanshu Vashishtha, David M. Collins

Abstract: The ability to control the stress-induced phase transformation of the shape memory alloy, NiTi, is an important technological challenge that must be understood for its wide application in devices that can exploit its reversible strain properties. This study elucidates the direct relationship between dislocation density and the R-phase transformation, including its formation temperature from interrupted annealing of rolled NiTi samples. Deformation is shown to determine the enthalpy change required for the B2$\rightarrow$R-phase transformation, with associated transformation temperatures being modifiable via dislocation density and recovery processes. Recovery is shown to be rapid, highly heterogeneous and sensitive to crystal orientation. Grains with a $\langle100\rangle$ direction close to the macroscopic rolling direction recover more rapidly than $\langle110\rangle$ and $\langle111\rangle$ orientated grains. Considered to be governed by processing-induced residual stresses and resultant crystallographic-dependent annihilation/slip pathways, there are opportunities to tune B2$\rightarrow$R-phase transformation on either a grain-averaged or an orientation-dependent per-grain basis.

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.10140

A micromechanical study of heat treatment induced hardening in α-brass

Authors: Jonathan Birch, Emily Jenkins, Anastasia Vrettou, Mohammed Said, Himanshu Vashishtha, Thomas Connolley, Jeff Brooks, David M. Collins

Abstract: The mechanisms that govern a previously unexplained hardening effect of a single phase Cu-30wt%Zn α-brass after heating have been investigated. After cold-work, the alloy possesses an increased yield strength and hardening rate only when heat treated to temperatures close to 220 °C, and is otherwise softer. Crystallographic texture and microstructure, explored using electron backscatter diffraction (EBSD), describe the deformation heterogeneity including twin development, as a function of heat treatment. When heated, an increased area fraction of deformation twins is observed, with dimensions reaching a critical size that maximises the resistance to dislocation slip in the parent grains. The effect is shown to dominate over other alloy characteristics including short range order, giving serrated yielding during tensile testing which is mostly eliminated after heating. In-situ X-ray diffraction during tensile testing corroborates these findings; dislocation-related line broadening and lattice strain development between as worked and heated α-brass is directly related to the interaction between the dislocations and the population of deformation twins. The experiments unambiguously disprove that other thermally-induced microstructure features contribute to thermal hardening. Specifically, the presence of recrystallised grains or second phases does not play a role. As these heat treatments match annealing conditions subjected to α-brass during deformation-related manufacturing processes, the results here are considered critical to understand, predict and exploit, where appropriate, any beneficial process-induced structural behaviour.

Submitted 14 June, 2024; originally announced June 2024.

arXiv:2406.04302

Representational Alignment Supports Effective Machine Teaching

Authors: Ilia Sucholutsky, Katherine M. Collins, Maya Malaviya, Nori Jacoby, Weiyang Liu, Theodore R. Sumers, Michalis Korakakis, Umang Bhatt, Mark Ho, Joshua B. Tenenbaum, Brad Love, Zachary A. Pardos, Adrian Weller, Thomas L. Griffiths

Abstract: A good teacher should not only be knowledgeable, but should be able to communicate in a way that the student understands -- to share the student's representation of the world. In this work, we integrate insights from machine teaching and pragmatic communication with the burgeoning literature on representational alignment to characterize a utility curve defining a relationship between representational alignment and teacher capability for promoting student learning. To explore the characteristics of this utility curve, we design a supervised learning environment that disentangles representational alignment from teacher accuracy. We conduct extensive computational experiments with machines teaching machines, complemented by a series of experiments in which machines teach humans. Drawing on our findings that improved representational alignment with a student improves student learning outcomes (i.e., task accuracy), we design a classroom matching procedure that assigns students to teachers based on the utility curve. If we are to design effective machine teachers, it is not enough to build teachers that are accurate -- we want teachers that can align, representationally, to their students too.

Submitted 6 June, 2024; originally announced June 2024.

Comments: Preprint

arXiv:2406.00179

Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation

Authors: Bernd Bohnet, Kevin Swersky, Rosanne Liu, Pranjal Awasthi, Azade Nova, Javier Snaider, Hanie Sedghi, Aaron T Parisi, Michael Collins, Angeliki Lazaridou, Orhan Firat, Noah Fiedel

Abstract: We explore the use of long-context capabilities in large language models to create synthetic reading comprehension data from entire books. Previous efforts to construct such datasets relied on crowd-sourcing, but the emergence of transformers with a context size of 1 million or more tokens now enables entirely automatic approaches. Our objective is to test the capabilities of LLMs to analyze, understand, and reason over problems that require a detailed comprehension of long spans of text, such as questions involving character arcs, broader themes, or the consequences of early actions later in the story. We propose a holistic pipeline for automatic data generation including question generation, answering, and model scoring using an "Evaluator". We find that a relative approach, comparing answers between models in a pairwise fashion and ranking with a Bradley-Terry model, provides a more consistent and differentiating scoring mechanism than an absolute scorer that rates answers individually. We also show that LLMs from different model families produce moderate agreement in their ratings. We ground our approach using the manually curated NarrativeQA dataset, where our evaluator shows excellent agreement with human judgement and even finds errors in the dataset. Using our automatic evaluation approach, we show that using an entire book as context produces superior reading comprehension performance compared to baseline no-context (parametric knowledge only) and retrieval-based approaches.

Submitted 31 May, 2024; originally announced June 2024.

arXiv:2405.13499

Euclid: Early Release Observations -- Deep anatomy of nearby galaxies

Authors: L. K. Hunt, F. Annibali, J.-C. Cuillandre, A. M. N. Ferguson, P. Jablonka, S. S. Larsen, F. R. Marleau, E. Schinnerer, M. Schirmer, C. Stone, C. Tortora, T. Saifollahi, A. Lançon, M. Bolzonella, S. Gwyn, M. Kluge, R. Laureijs, D. Carollo, M. L. M. Collins, P. Dimauro, P.-A. Duc, D. Erkal, J. M. Howell, C. Nally, E. Saremi, et al. (174 additional authors not shown)

Abstract: Euclid is poised to make significant advances in the study of nearby galaxies in the local Universe. Here we present a first look at 6 galaxies observed for the Nearby Galaxy Showcase as part of the Euclid Early Release Observations acquired between August and November, 2023. These targets, 3 dwarf galaxies (Holmberg II, IC 10, NGC 6822) and 3 spirals (IC 342, NGC 2403, NGC 6744), range in distance from about 0.5 Mpc to 8.8 Mpc. Our assessment of the surface brightness depths in the stacked Euclid images confirms previous estimates in 100 arcsec$^2$ regions of $1\sigma=30.5$ mag/arcsec$^2$ for VIS, but is slightly deeper than previous estimates for NISP with $1\sigma=29.2$-$29.4$ mag/arcsec$^2$. By combining Euclid $H_E$, $Y_E$, and $I_E$ into RGB images, we illustrate the large field-of-view covered by a single Reference Observing Sequence, together with exquisite detail on parsec scales in these nearby galaxies. Radial surface brightness and color profiles demonstrate galaxy colors in agreement with stellar population synthesis models. Standard stellar photometry selection techniques find approximately 1.3 million stars across the 6 galaxy fields. Euclid's resolved stellar photometry allows us to constrain the star-formation histories of these galaxies, by disentangling the distributions of young stars, as well as asymptotic giant branch and red giant branch stellar populations. We finally examine 2 galaxies individually for surrounding satellite systems. Our analysis of the ensemble of dwarf satellites around NGC 6744 reveals a new galaxy, EDwC1, a nucleated dwarf spheroidal at the end of a spiral arm. Our new census of the globular clusters around NGC 2403 yields 9 new star-cluster candidates, 8 of which have colors indicative of evolved stellar populations. In summary, our investigation of the 6 Showcase galaxies demonstrates that Euclid is a powerful probe of the anatomy of nearby galaxies [abridged].

Submitted 22 May, 2024; originally announced May 2024.

Comments: 36 pages; 20 figures in main text; 4 Appendices. Submitted to A&A, as part of the A&A special issue `Euclid on Sky', which contains Euclid key reference papers and first results from the Euclid Early Release Observations

arXiv:2405.13496

Euclid: Early Release Observations -- Programme overview and pipeline for compact- and diffuse-emission photometry

Authors: J.-C. Cuillandre, E. Bertin, M. Bolzonella, H. Bouy, S. Gwyn, S. Isani, M. Kluge, O. Lai, A. Lançon, D. A. Lang, R. Laureijs, T. Saifollahi, M. Schirmer, C. Stone, Abdurro'uf, N. Aghanim, B. Altieri, F. Annibali, H. Atek, P. Awad, M. Baes, E. Bañados, D. Barrado, S. Belladitta, V. Belokurov, et al. (240 additional authors not shown)

Abstract: The Euclid ERO showcase Euclid's capabilities in advance of its main mission, targeting 17 astronomical objects, from galaxy clusters, nearby galaxies, globular clusters, to star-forming regions. A total of 24 hours observing time was allocated in the early months of operation, engaging the scientific community through an early public data release. We describe the development of the ERO pipeline to create visually compelling images while simultaneously meeting the scientific demands within months of launch, leveraging a pragmatic, data-driven development strategy. The pipeline's key requirements are to preserve the image quality and to provide flux calibration and photometry for compact and extended sources. The pipeline's five pillars are: removal of instrumental signatures; astrometric calibration; photometric calibration; image stacking; and the production of science-ready catalogues for both the VIS and NISP instruments. We report a PSF with a full width at half maximum of 0.16" in the optical and 0.49" in the three NIR bands. Our VIS mean absolute flux calibration is accurate to about 1%, and 10% for NISP due to a limited calibration set; both instruments have considerable colour terms. The median depth is 25.3 and 23.2 AB mag with a SNR of 10 for galaxies, and 27.1 and 24.5 AB mag at an SNR of 5 for point sources for VIS and NISP, respectively. Euclid's ability to observe diffuse emission is exceptional due to its extended PSF nearly matching a pure diffraction halo, the best ever achieved by a wide-field, high-resolution imaging telescope. Euclid offers unparalleled capabilities for exploring the LSB Universe across all scales, also opening a new observational window in the NIR. Median surface-brightness levels of 29.9 and 28.3 AB mag per square arcsec are achieved for VIS and NISP, respectively, for detecting a 10 arcsec x 10 arcsec extended feature at the 1 sigma level.

Submitted 22 May, 2024; originally announced May 2024.

Comments: Submitted to A&A, 44 pages, 36 figures - Part of the A&A special issue `Euclid on Sky', which contains Euclid key reference papers and first results from the Euclid Early Release Observations

arXiv:2404.17559

Decoherence in Neutrino Oscillation at the ESSnuSB Experiment

Authors: ESSnuSB: J. Aguilar, M. Anastasopoulos, E. Baussan, A. K. Bhattacharyya, A. Bignami, M. Blennow, M. Bogomilov, B. Bolling, E. Bouquerel, F. Bramati, A. Branca, G. Brunetti, I. Bustinduy, C. J. Carlile, J. Cederkall, T. W. Choi, S. Choubey, P. Christiansen, M. Collins, E. Cristaldo Morales, P. Cupiał, H. Danared, D. Dancila, et al. (72 additional authors not shown)

Abstract: Neutrino oscillation experiments provide a unique window in exploring several new physics scenarios beyond the standard three flavour. One such scenario is quantum decoherence in neutrino oscillation which tends to destroy the interference pattern of neutrinos reaching the far detector from the source. In this work, we study the decoherence in neutrino oscillation in the context of the ESSnuSB experiment. We consider the energy-independent decoherence parameter and derive the analytical expressions for $P_{\mu e}$ and $P_{\mu\mu}$ probabilities in vacuum. We have computed the capability of ESSnuSB to put bounds on the decoherence parameters namely, $\Gamma_{21}$ and $\Gamma_{32}$ and found that the constraints on $\Gamma_{21}$ are competitive compared to the DUNE bounds and better than the current T2K and MINOS ones. We have also investigated the impact of decoherence on the ESSnuSB measurement of the Dirac CP phase $\delta_{\rm CP}$ and concluded that it remains robust in the presence of new physics.

Submitted 26 April, 2024; originally announced April 2024.

Comments: 30 pages, 9 figures, 2 tables

arXiv:2404.04186

Probabilistically Informed Robot Object Search with Multiple Regions

Authors: Matthew Collins, Jared J. Beard, Nicholas Ohi, Yu Gu

Abstract: The increasing use of autonomous robot systems in hazardous environments underscores the need for efficient search and rescue operations. Despite significant advancements, existing literature on object search often falls short in overcoming the difficulty of long planning horizons and dealing with sensor limitations, such as noise. This study introduces a novel approach that formulates the search problem as a belief Markov decision process with options (BMDP-O) to make Monte Carlo tree search (MCTS) a viable tool for overcoming these challenges in large-scale environments. The proposed formulation incorporates sequences of actions (options) to move between regions of interest, enabling the algorithm to efficiently scale to large environments. This approach also enables the use of customizable fields of view, for use with multiple types of sensors. Experimental results demonstrate the superiority of this approach in large environments when compared to the problem without options and alternative tools such as receding horizon planners. Given compute time for the proposed formulation is relatively high, a further approximated "lite" formulation is proposed. The lite formulation finds objects in a comparable number of steps with faster computation.

Submitted 5 April, 2024; originally announced April 2024.

Comments: 6 pages, 7 figures. Submitted to the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems in Abu Dhabi, UAE (Oct 14-18, 2024)

arXiv:2403.01314

Superflows: A New Tool for Forensic Network Flow Analysis

Authors: Michael Collins, Jyotirmoy V. Deshmukh, Dristi Dinesh, Mukund Raghothaman, Srivatsan Ravi, Yuan Xia

Abstract: Network security analysts gather data from diverse sources, from high-level summaries of network flow and traffic volumes to low-level details such as service logs from servers and the contents of individual packets. They validate and check this data against traffic patterns and historical indicators of compromise. Based on the results of this analysis, a decision is made to either automatically manage the traffic or report it to an analyst for further investigation. Unfortunately, due to rapidly increasing traffic volumes, there are far more events to check than operational teams can handle for effective forensic analysis. However, just as packets are grouped into flows that share a commonality, we argue that a high-level construct for grouping network flows into a set of flows that share a hypothesis is needed to significantly improve the quality of operational network response by increasing Events Per Analysts Hour (EPAH). In this paper, we propose a formalism for describing a superflow construct, which we characterize as an aggregation of one or more flows based on an analyst-specific hypothesis about traffic behavior. We demonstrate simple superflow constructions and representations, and perform a case study to explain how the formalism can be used to reduce the volume of data for forensic analysis.

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2402.16955

Discovery of Globular Cluster Candidates in the Dwarf Irregular Galaxy IC 2574 Using HST/ACS Imaging

Authors: Noushin Karim, Michelle L. M. Collins, Duncan A. Forbes, Justin I. Read

Abstract: We report the discovery of 23 globular cluster (GC) candidates around the relatively isolated dwarf galaxy IC 2574 within the Messier 81 (M81) group, at a distance of 3.86 Mpc. We use observations from the HST Advanced Camera for Surveys (ACS) to analyse the imaging in the F814W and F555W broadband filters. Our GC candidates have luminosities ranging from $-5.9 \geq M_V \geq -10.4$ and half-light radii of $1.4 \leq r_h \leq 11.5$ pc. We find the total number of GCs $N_{\mathrm{GC}}=27\pm5$ after applying completeness corrections, which implies a specific frequency of $S_N = 4.0\pm0.8$, consistent with expectations based on its luminosity. The GC system appears to have a bimodal colour distribution, with 30% of the GC candidates having redder colours. We also find 5 objects with extremely blue colours that could be young star clusters linked to an intense star formation episode that occurred in IC 2574 $\sim$1 Gyr ago. We make an independent measurement of the halo mass of IC 2574 from its kinematic data, which is rare for low mass galaxies, and find log $M_{200} = 10.93 \pm 0.08$. We place the galaxy on the well-known GC system mass-halo mass relation and find that it agrees well with the observed near-linear relation. IC 2574 has a rich GC population for a dwarf galaxy, which includes an unusually bright $\omega$ Cen-like GC, making it an exciting nearby laboratory for probing the peculiar efficiency of forming massive GCs in dwarf galaxies.

Submitted 26 February, 2024; originally announced February 2024.

Comments: 14 pages, 11 figures, accepted for publication in MNRAS

arXiv:2402.13314 [pdf, other]

Probing the dark matter haloes of external galaxies with stellar streams

Authors: Madison Walder, Denis Erkal, Michelle Collins, David Martinez-Delgado

Abstract: Stellar streams have proven to be powerful tools for measuring the Milky Way's gravitational potential and hence its dark matter halo. In the coming years, Vera Rubin, Euclid, ARRAKIHS, and NGRST will uncover a plethora of streams around external galaxies. Although great in number, observations of these distant streams will often be limited to only the on-sky position of the stream. In this work, we explore how well we will be able to measure the dark matter haloes of these galaxies by fitting simplified mock streams with a variety of intrinsic and orbital properties in a range of data availability scenarios. We find that streams with multiple wraps around their host galaxy can constrain the overall radial profile and scale radius of the potential without radial velocities. In many other cases, a single radial velocity measurement often provides a significant boost to constraining power for the radial profile, scale radius, and enclosed mass of the dark matter halo. Given the wealth of data expected soon, this suggests that we will be able to measure the dark matter haloes of a statistically significant sample of galaxies with stellar streams in the coming years.

Submitted 20 February, 2024; originally announced February 2024.

Comments: 16 pages, 15 figures (+4 in appendix), submitted to MNRAS

arXiv:2402.00559 [pdf, other]

A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains

Authors: Alon Jacovi, Yonatan Bitton, Bernd Bohnet, Jonathan Herzig, Or Honovich, Michael Tseng, Michael Collins, Roee Aharoni, Mor Geva

Abstract: Prompting language models to provide step-by-step answers (e.g., "Chain-of-Thought") is the prominent approach for complex reasoning tasks, where more accurate reasoning chains typically improve downstream task performance. Recent literature discusses automatic methods to verify reasoning to evaluate and improve their correctness. However, no fine-grained step-level datasets are available to enable thorough evaluation of such verification methods, hindering progress in this direction. We introduce REVEAL: Reasoning Verification Evaluation, a dataset to benchmark automatic verifiers of complex Chain-of-Thought reasoning in open-domain question-answering settings. REVEAL includes comprehensive labels for the relevance, attribution to evidence passages, and logical correctness of each reasoning step in a language model's answer, across a variety of datasets and state-of-the-art language models. Evaluation on REVEAL shows that verifiers struggle at verifying reasoning chains - in particular, verifying logical correctness and detecting contradictions. Available at https://reveal-dataset.github.io/.

Submitted 21 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Accepted to ACL 2024

arXiv:2312.08063 [pdf, other]

Estimation of Concept Explanations Should be Uncertainty Aware

Authors: Vihari Piratla, Juyeon Heo, Katherine M. Collins, Sukriti Singh, Adrian Weller

Abstract: Model explanations can be valuable for interpreting and debugging predictive models. We study a specific kind called Concept Explanations, where the goal is to interpret a model using human-understandable concepts. Although popular for their easy interpretation, concept explanations are known to be noisy. We begin our work by identifying various sources of uncertainty in the estimation pipeline that lead to such noise. We then propose an uncertainty-aware Bayesian estimation method to address these issues, which readily improved the quality of explanations. We demonstrate with theoretical analysis and empirical evaluation that explanations computed by our method are robust to train-time choices while also being label-efficient. Further, our method proved capable of recovering relevant concepts amongst a bank of thousands, in an evaluation with real datasets and off-the-shelf models, demonstrating its scalability. We believe the improved quality of uncertainty-aware concept explanations makes them a strong candidate for more reliable model interpretation. We release our code at https://github.com/vps-anonconfs/uace.

Submitted 5 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

arXiv:2312.07480 [pdf, other]

Microscale stress-geometry interactions in an additively manufactured NiTi cardiovascular stent: A synchrotron dual imaging tomography and diffraction study

Authors: Himanshu Vashishtha, Parastoo Jamshidi, Anastasia Vrettou, Anna Kareer, Michael Goode, Hans Deyhle, Andrew James, Sharif Ahmad, Christina Reinhard, Moataz M. Attallah, David M. Collins

Abstract: This study explores cardiovascular stents fabricated using laser powder bed fusion (LPBF), an emerging method to offer patient-specific customisable parts. Here, the shape memory alloy NiTi, in a near equiatomic composition, was investigated to deconvolve the material response from macroscopic component effects. Specifically, stress-geometry interactions were revealed, in-situ, for a miniaturised cardiovascular stent subjected to an externally applied cylindrical stress whilst acquiring synchrotron X-ray imaging and diffraction data. The approach enabled the collection of spatially resolved micromechanical deformation data; the formation of stress-induced martensite and R-phase was evident, occurring in locations near junctions between stent ligaments where stress concentrations exist. In the as-fabricated condition, hardness maps were obtained through nanoindentation, demonstrating that the localised deformation and deformation patterning is further controlled by porosity and microstructural heterogeneity. Electron backscatter diffraction (EBSD) supported these observations, showing a finer grain structure near stent junctions with higher associated lattice curvature. These features, combined with stress concentrations when loaded, will initiate localised phase transformations. If the stent was subjected to repeated loading, representing in-vivo conditions, these regions would be susceptible to cyclic damage through transformation memory loss, leading to premature component failure. This study highlights the challenges that must be addressed for the post-processing treatment of LPBF-processed stents for healthcare-related applications.

Submitted 12 December, 2023; originally announced December 2023.

arXiv:2312.06793 [pdf]

Serious errors impair an assessment of forest carbon projects: A rebuttal of West et al. (2023)

Authors: Edward T. A. Mitchard, Harry Carstairs, Riccardo Cosenza, Sassan S. Saatchi, Jason Funk, Paula Nieto Quintano, Thom Brade, Iain M. McNicol, Patrick Meir, Murray B. Collins, Eric Nowak

Abstract: Independent retrospective analyses of the effectiveness of reducing deforestation and forest degradation (REDD) projects are vital to ensure climate change benefits are being delivered. A recent study in Science by West et al. (1) appeared therefore to be a timely alert that the majority of projects operating in the 2010s failed to reduce deforestation rates. Unfortunately, their analysis suffered from major flaws in the choice of underlying data, resulting in poorly matched and unstable counterfactual scenarios. These were compounded by calculation errors, biasing the study against finding that projects significantly reduced deforestation. This flawed analysis of 24 projects unfairly condemned all 100+ REDD projects, and risks cutting off finance for protecting vulnerable tropical forests from destruction at a time when funding needs to grow rapidly.

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2311.16397 [pdf, other]

The Hubble Space Telescope Survey of M31 Satellite Galaxies. III. Calibrating the Horizontal Branch as an Age Indicator for Nearby Galaxies

Authors: Connor Jennings, Alessandro Savino, Daniel Weisz, Nitya Kallivayalil, Andrew Cole, Michelle Collins, Andrew Dolphin, Annette Ferguson, Karoline Gilbert, Puragra Guhathakurta, Evan Kirby, Geraint Lewis, Nicolas Martin, Michael Rich, Evan Skillman, Roeland van der Marel, Jack Warfield

Abstract: We present a new method for measuring the mean age of old/intermediate stellar populations in resolved, metal-poor ($\rm \langle[Fe/H]\rangle \lesssim -1.5$) galaxies using only the morphology of the horizontal branch (HB) and an estimate of the average metallicity. We calculate the ratio of blue-to-red HB stars and the mass-weighted mean ages of 27 M31 satellite galaxies that have star formation histories (SFHs) measured from Hubble Space Telescope-based color-magnitude diagrams (CMDs) that include the oldest Main Sequence Turn-off (MSTO) ages. We find a strong correlation between mean age, metallicity, and HB morphology, for stellar populations older than $\sim6$~Gyr. The correlation allows us to predict a galaxy's mean age from its HB morphology to a precision of $\lesssim 1$~Gyr. We validate our method by recovering the correct ages of Local Group galaxies that have robust MSTO-based ages and are not in our calibration sample. We also use our technique to measure the mean ages of isolated field galaxies KKR25 ($11.21^{+0.70}_{-0.65}$~Gyr) and VV124 ($11.03^{+0.73}_{-0.68}$~Gyr), which indicate that their main star formation episodes may have lasted several Gyr and support the picture that they achieved their early-type characteristics (e.g., low gas content, low star formation activity) in isolation and not through environment. Because the HB is $\sim80\times$ brighter than the oldest MSTO, our method can provide precise characteristic ages of predominantly old galaxies at distances $\sim 9$ times farther. We provide our calibrations in commonly used HST/ACS filters.

Submitted 27 November, 2023; originally announced November 2023.

Comments: 21 pages, 13 figures, 5 tables, submitted to ApJ

arXiv:2311.03088 [pdf, other]

doi 10.1016/j.actamat.2023.119608

Grain-level effects on in-situ deformation-induced phase transformations in a complex-phase steel using 3DXRD and EBSD

Authors: James A. D. Ball, Claire Davis, Carl Slater, Himanshu Vashishtha, Mohammed Said, Louis Hébrard, Florian Steinhilber, Jonathan P. Wright, Thomas Connolley, Stefan Michalik, David M. Collins

Abstract: A novel complex-phase steel alloy is conceived with a deliberately unstable austenite, $γ$, phase that enables the deformation-induced martensitic transformations (DIMT) to be explored at low levels of plastic strain. The DIMT was thus explored, in-situ and non-destructively, using both far-field Three-Dimensional X-Ray Diffraction (3DXRD) and Electron Back-Scatter Diffraction (EBSD). Substantial $α'$ martensite formation was observed under 10% applied strain with EBSD, and many $\varepsilon$ grain formation events were captured with 3DXRD, indicative of the indirect transformation of martensite via the reaction $γ\rightarrow \varepsilon \rightarrow α'$. Using $\varepsilon$ grain formation as a direct measurement of $γ$ grain stability, the influence of several microstructural properties, such as grain size, orientation and neighbourhood configuration, on $γ$ stability has been identified. Larger $γ$ grains were found to be less stable than smaller grains. Any $γ$ grains oriented with {100} parallel to the loading direction preferentially transformed with lower stresses. Parent $\varepsilon$-forming $γ$ grains possessed a neighbourhood with increased ferritic/martensitic volume fraction. This finding shows, unambiguously, that $α$/$α'$ promotes $\varepsilon$ formation in neighbouring grains. The minimum strain work criterion model for $\varepsilon$ variant prediction was also evaluated, which worked well for most grains. However, $\varepsilon$-forming grains with a lower stress were less well predicted by the model, indicating crystal-level behaviour must be considered for accurate $\varepsilon$ formation. The findings from this work are considered key for the future design of alloys where the deformation response can be controlled by tailoring microstructure and local or macroscopic crystal orientations.

Submitted 7 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

Comments: 20 pages, 3 supplementary pages, 14 figures, 3 supplementary figures. Preprint submitted to Acta Materialia. Updated to correct Figure 14

Journal ref: Acta Materialia 265 (2024) 119608

arXiv:2310.17022 [pdf, other]

Controlled Decoding from Language Models

Authors: Sidharth Mudgal, Jong Lee, Harish Ganapathy, YaGuang Li, Tao Wang, Yanping Huang, Zhifeng Chen, Heng-Tze Cheng, Michael Collins, Trevor Strohman, Jilin Chen, Alex Beutel, Ahmad Beirami

Abstract: KL-regularized reinforcement learning (RL) is a popular alignment framework to control the language model responses towards high reward outcomes. We pose a tokenwise RL objective and propose a modular solver for it, called controlled decoding (CD). CD exerts control through a separate prefix scorer module, which is trained to learn a value function for the reward. The prefix scorer is used at inference time to control the generation from a frozen base model, provably sampling from a solution to the RL objective. We empirically demonstrate that CD is effective as a control mechanism on popular benchmarks. We also show that prefix scorers for multiple rewards may be combined at inference time, effectively solving a multi-objective RL problem with no additional training. We show that the benefits of applying CD transfer to an unseen base model with no further tuning as well. Finally, we show that CD can be applied in a blockwise decoding fashion at inference-time, essentially bridging the gap between the popular best-of-K strategy and tokenwise control through reinforcement learning. This makes CD a promising approach for alignment of language models.

Submitted 3 June, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

Comments: ICML 2024

arXiv:2310.13021 [pdf, other]

AI for Mathematics: A Cognitive Science Perspective

Authors: Cedegao E. Zhang, Katherine M. Collins, Adrian Weller, Joshua B. Tenenbaum

Abstract: Mathematics is one of the most powerful conceptual systems developed and used by the human species. Dreams of automated mathematicians have a storied history in artificial intelligence (AI). Rapid progress in AI, particularly propelled by advances in large language models (LLMs), has sparked renewed, widespread interest in building such systems. In this work, we reflect on these goals from a cognitive science perspective. We call attention to several classical and ongoing research directions from cognitive science, which we believe are valuable for AI practitioners to consider when seeking to build truly human (or superhuman)-level mathematical systems. We close with open discussions and questions that we believe necessitate a multi-disciplinary perspective -- cognitive scientists working in tandem with AI researchers and mathematicians -- as we move toward better mathematical AI systems which not only help us push the frontier of mathematics, but also offer glimpses into how we as humans are even capable of such great cognitive feats.

Submitted 18 October, 2023; originally announced October 2023.

arXiv:2310.13018 [pdf, other]

Getting aligned on representational alignment

Authors: Ilia Sucholutsky, Lukas Muttenthaler, Adrian Weller, Andi Peng, Andreea Bobu, Been Kim, Bradley C. Love, Erin Grant, Iris Groen, Jascha Achterberg, Joshua B. Tenenbaum, Katherine M. Collins, Katherine L. Hermann, Kerem Oktar, Klaus Greff, Martin N. Hebart, Nori Jacoby, Qiuyi Zhang, Raja Marjieh, Robert Geirhos, Sherol Chen, Simon Kornblith, Sunayana Rane, Talia Konkle, Thomas P. O'Connell, et al. (5 additional authors not shown)

Abstract: Biological and artificial information processing systems form representations that they can use to categorize, reason, plan, navigate, and make decisions. How can we measure the extent to which the representations formed by these diverse systems agree? Do similarities in representations then translate into similar behavior? How can a system's representations be modified to better match those of another system? These questions pertaining to the study of representational alignment are at the heart of some of the most active research areas in cognitive science, neuroscience, and machine learning. For example, cognitive scientists measure the representational alignment of multiple individuals to identify shared cognitive priors, neuroscientists align fMRI responses from multiple individuals into a shared representational space for group-level analyses, and ML researchers distill knowledge from teacher models into student models by increasing their alignment. Unfortunately, there is limited knowledge transfer between research communities interested in representational alignment, so progress in one field often ends up being rediscovered independently in another. Thus, greater cross-field communication would be advantageous. To improve communication between these fields, we propose a unifying framework that can serve as a common language between researchers studying representational alignment. We survey the literature from all three fields and demonstrate how prior work fits into this framework. Finally, we lay out open problems in representational alignment where progress can benefit all three of these fields. We hope that our work can catalyze cross-disciplinary collaboration and accelerate progress for all communities studying and developing information processing systems. We note that this is a working paper and encourage readers to reach out with their suggestions for future revisions.

Submitted 2 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

Comments: Working paper, changes to be made in upcoming revisions

arXiv:2310.10749 [pdf, other]

Study of non-standard interaction mediated by a scalar field at ESSnuSB experiment

Authors: ESSnuSB, J. Aguilar, M. Anastasopoulos, E. Baussan, A. K. Bhattacharyya, A. Bignami, M. Blennow, M. Bogomilov, B. Bolling, E. Bouquerel, F. Bramati, A. Branca, W. Brorsson, I. Bustinduy, C. J. Carlile, J. Cederkall, T. W. Choi, S. Choubey, P. Christiansen, M. Collins, E. Cristaldo Morales, H. Danared, D. Dancila, J. P. A. M. de André, et al. (67 additional authors not shown)

Abstract: In this paper we study non-standard interactions mediated by a scalar field (SNSI) in the context of the ESSnuSB experiment. In particular we study the capability of ESSnuSB to put bounds on the SNSI parameters and also study the impact of SNSI in the measurement of the leptonic CP phase $δ_{\rm CP}$. Existence of SNSI modifies the neutrino mass matrix and this modification can be expressed in terms of three diagonal real parameters ($η_{ee}$, $η_{μμ}$ and $η_{ττ}$) and three off-diagonal complex parameters ($η_{e μ}$, $η_{eτ}$ and $η_{μτ}$). Our study shows that the upper bounds on the parameters $η_{μμ}$, $η_{ττ}$ and $η_{μτ}$ depend upon how $Δm^2_{31}$ is minimized in the theory. However, this is not the case when one tries to measure the impact of SNSI on $δ_{\rm CP}$. Further, we show that the CP sensitivity of ESSnuSB can be completely lost for certain values of $η_{ee}$ and $η_{μτ}$ for which the appearance channel probability becomes independent of $δ_{\rm CP}$.

Submitted 26 April, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: 14 pages, 6 figures, 2 tables, Version accepted for publication in Phys. Rev. D

arXiv:2309.17159 [pdf, other]

Numerically computed Double, Triple, and Quadruple Planar Bubbles for Density $r^p$

Authors: Marcus Collins

Abstract: Using Brakke's Evolver, we numerically verify conjectured optimal planar double bubbles for density $r^p$ and provide conjectures for triple and quadruple bubbles.

Submitted 12 May, 2023; originally announced September 2023.

Comments: 14 pages, 11 figures. arXiv admin note: substantial text overlap with arXiv:1908.10766 by other authors

arXiv:2309.16928 [pdf, other]

Learning to Receive Help: Intervention-Aware Concept Embedding Models

Authors: Mateo Espinosa Zarlenga, Katherine M. Collins, Krishnamurthy Dvijotham, Adrian Weller, Zohreh Shams, Mateja Jamnik

Abstract: Concept Bottleneck Models (CBMs) tackle the opacity of neural architectures by constructing and explaining their predictions using a set of high-level concepts. A special property of these models is that they permit concept interventions, wherein users can correct mispredicted concepts and thus improve the model's performance. Recent work, however, has shown that intervention efficacy can be highly dependent on the order in which concepts are intervened on and on the model's architecture and training hyperparameters. We argue that this is rooted in a CBM's lack of train-time incentives for the model to be appropriately receptive to concept interventions. To address this, we propose Intervention-aware Concept Embedding models (IntCEMs), a novel CBM-based architecture and training paradigm that improves a model's receptiveness to test-time interventions. Our model learns a concept intervention policy in an end-to-end fashion from where it can sample meaningful intervention trajectories at train-time. This conditions IntCEMs to effectively select and receive concept interventions when deployed at test-time. Our experiments show that IntCEMs significantly outperform state-of-the-art concept-interpretable models when provided with test-time concept interventions, demonstrating the effectiveness of our approach.

Submitted 25 October, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

Comments: Accepted as a spotlight at the Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

arXiv:2309.04467 [pdf, other]

A search for faint resolved galaxies beyond the Milky Way in DES Year 6: A new faint, diffuse dwarf satellite of NGC 55

Authors: M. McNanna, K. Bechtol, S. Mau, E. O. Nadler, J. Medoff, A. Drlica-Wagner, W. Cerny, D. Crnojevic, B. Mutlu-Pakdil, A. K. Vivas, A. B. Pace, J. L. Carlin, M. L. M. Collins, P. S. Ferguson, D. Martinez-Delgado, C. E. Martinez-Vazquez, N. E. D. Noel, A. H. Riley, D. J. Sand, A. Smercina, E. Tollerud, R. H. Wechsler, T. M. C. Abbott, M. Aguena, O. Alves, et al. (41 additional authors not shown)

Abstract: We report results from a systematic wide-area search for faint dwarf galaxies at heliocentric distances from 0.3 to 2 Mpc using the full six years of data from the Dark Energy Survey (DES). Unlike previous searches over the DES data, this search specifically targeted a field population of faint galaxies located beyond the Milky Way virial radius. We derive our detection efficiency for faint, resolved dwarf galaxies in the Local Volume with a set of synthetic galaxies and expect our search to be complete to $M_V$ ~ $(-7, -10)$ mag for galaxies at $D = (0.3, 2.0)$ Mpc respectively. We find no new field dwarfs in the DES footprint, but we report the discovery of one high-significance candidate dwarf galaxy at a distance of $2.2\substack{+0.05\\-0.12}$ Mpc, a potential satellite of the Local Volume galaxy NGC 55, separated by $47$ arcmin (physical separation as small as 30 kpc). We estimate this dwarf galaxy to have an absolute V-band magnitude of $-8.0\substack{+0.5\\-0.3}$ mag and an azimuthally averaged physical half-light radius of $2.2\substack{+0.5\\-0.4}$ kpc, making this one of the lowest surface brightness galaxies ever found with $μ= 32.3$ mag ${\rm arcsec}^{-2}$. This is the largest, most diffuse galaxy known at this luminosity, suggesting possible tidal interactions with its host.

Submitted 4 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

Comments: 20 pages, 7 figures

Report number: FERMILAB-PUB-23-478-PPD

arXiv:2308.16589 [pdf]

Microscopic crystallographic analysis of dislocations in molecular crystals

Authors: Sang T. Pham, Natalia Koniuch, Emily Wynne, Andy Brown, Sean M. Collins

Abstract: Organic molecular crystals encompass a vast range of materials from pharmaceuticals to organic optoelectronics and proteins to waxes in biological and industrial settings. Crystal defects from grain boundaries to dislocations are known to play key roles in mechanisms of growth and also in the functional properties of molecular crystals. In contrast to the precise analysis of individual defects in metals, ceramics, and inorganic semiconductors enabled by electron microscopy, significantly greater ambiguity remains in the experimental determination of individual dislocation character and slip systems in molecular materials. In large part, nanoscale dislocation analysis in molecular crystals has been hindered by the severely constrained electron exposures required to avoid irreversibly degrading these crystals. Here, we present a low-dose, single-exposure approach enabling nanometre-resolved analysis of individual extended dislocations in molecular crystals. We demonstrate the approach for a range of crystal types to reveal dislocation character and operative slip systems unambiguously.

Submitted 31 August, 2023; originally announced August 2023.

Comments: Manuscript (14 pages, 4 figures) and Supplementary Material (32 pages, 19 figures) in a single PDF file

arXiv:2308.08062 [pdf, other]

doi 10.1051/0004-6361/202346892

A large topographic feature on the surface of the trans-Neptunian object (307261) 2002 MS$_4$ measured from stellar occultations

Authors: F. L. Rommel, F. Braga-Ribas, J. L. Ortiz, B. Sicardy, P. Santos-Sanz, J. Desmars, J. I. B. Camargo, R. Vieira-Martins, M. Assafin, B. E. Morgado, R. C. Boufleur, G. Benedetti-Rossi, A. R. Gomes-Júnior, E. Fernández-Valenzuela, B. J. Holler, D. Souami, R. Duffard, G. Margoti, M. Vara-Lubiano, J. Lecacheux, J. L. Plouvier, N. Morales, A. Maury, J. Fabrega, P. Ceravolo, et al. (179 additional authors not shown)

Abstract: This work aims at constraining the size, shape, and geometric albedo of the dwarf planet candidate 2002 MS4 through the analysis of nine stellar occultation events. Using multichord detection, we also studied the object's topography by analyzing the obtained limb and the residuals between observed chords and the best-fitted ellipse. We predicted and organized the observational campaigns of nine stellar occultations by 2002 MS4 between 2019 and 2022, resulting in two single-chord events, four double-chord detections, and three events with between three and sixty-one positive chords. Using 13 selected chords from the 8 August 2020 event, we determined the global elliptical limb of 2002 MS4. The best-fitted ellipse, combined with the object's rotational information from the literature, constrains the object's size, shape, and albedo. Additionally, we developed a new method to characterize topographic features on the object's limb. The global limb has a semi-major axis of 412 $\pm$ 10 km, a semi-minor axis of 385 $\pm$ 17 km, and the position angle of the minor axis is 121$^\circ$ $\pm$ 16$^\circ$. From this instantaneous limb, we obtained 2002 MS4's geometric albedo and the projected area-equivalent diameter. Significant deviations from the fitted ellipse in the northernmost limb are detected from multiple sites, highlighting three distinct topographic features: one 11 km deep depression, followed by a 25$^{+4}_{-5}$ km high elevation, next to a crater-like depression with an extension of 322 $\pm$ 39 km and a depth of 45.1 $\pm$ 1.5 km. Our results present an object that is $\approx$138 km smaller in diameter than derived from thermal data, possibly indicating the presence of a so-far unknown satellite. However, within the error bars, the geometric albedo in the V-band agrees with the results published in the literature, even with the radiometric-derived albedo.

Submitted 23 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

Journal ref: A&A 678, A167 (2023)

arXiv:2307.15475 [pdf, other]

FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning Pipelines

Authors: Matthew Barker, Emma Kallina, Dhananjay Ashok, Katherine M. Collins, Ashley Casovan, Adrian Weller, Ameet Talwalkar, Valerie Chen, Umang Bhatt

Abstract: Even though machine learning (ML) pipelines affect an increasing array of stakeholders, there is little work on how input from stakeholders is recorded and incorporated. We propose FeedbackLogs, addenda to existing documentation of ML pipelines, to track the input of multiple stakeholders. Each log records important details about the feedback collection process, the feedback itself, and how the feedback is used to update the ML pipeline. In this paper, we introduce and formalise a process for collecting a FeedbackLog. We also provide concrete use cases where FeedbackLogs can be employed as evidence for algorithmic auditing and as a tool to record updates based on stakeholder feedback.

Submitted 28 July, 2023; originally announced July 2023.

arXiv:2307.13502 [pdf, ps, other]

Growth and displacement of free product automorphisms

Authors: Matthew Collins

Abstract: It is well known for an irreducible free group automorphism that its growth rate is equal to the minimal Lipschitz displacement of its action on Culler-Vogtmann space. This follows as a consequence of the existence of train track representatives for the automorphism. We extend this result to the general - possibly reducible - case as well as to the free product situation where growth is replaced by `relative growth'.

Submitted 26 July, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

MSC Class: 20F65 (Primary) 20E36; 20E08 (Secondary)

arXiv:2307.10760 [pdf, ps, other]

Length functions on groups and actions on graphs

Authors: Matthew Collins, Armando Martino

Abstract: We study generalisations of Chiswell's Theorem that $0$-hyperbolic Lyndon length functions on groups always arise as based length functions of the group acting isometrically on a tree. We produce counter-examples to show that this Theorem fails if one replaces $0$-hyperbolicity with $δ$-hyperbolicity. We then propose a set of axioms for the length function on a finitely generated group that ensures the function is bi-Lipschitz equivalent to a (or any) length function of the group acting on its Cayley graph.

Submitted 20 July, 2023; originally announced July 2023.

MSC Class: 20E08; 20F65; 20F67

arXiv:2306.17011 [pdf, other]

doi 10.1016/j.matchar.2023.113228

Registration between DCT and EBSD datasets for multiphase microstructures

Authors: James A. D. Ball, Jette Oddershede, Claire Davis, Carl Slater, Mohammed Said, Himanshu Vashishtha, Stefan Michalik, David M. Collins

Abstract: The ability to characterise the three-dimensional microstructure of multiphase materials is essential for understanding the interaction between phases and associated materials properties. Here, laboratory-based diffraction-contrast tomography (DCT), a recently-established materials characterization technique that can determine grain phases, morphologies, positions and orientations in a voxel-based reconstruction method, was used to map part of a dual-phase steel alloy sample. To assess the resulting microstructures that were produced by the DCT technique, an EBSD map was collected within the same sample volume. To identify the 2D slice of the 3D DCT reconstruction that best corresponded to the EBSD map, a novel registration technique based solely on grain-averaged orientations was developed; this registration technique requires very little a priori knowledge of dataset alignment and can be extended to other techniques that only recover grain-averaged orientation data such as far-field 3D X-ray diffraction microscopy. Once the corresponding 2D slice was identified in the DCT dataset, comparisons of phase balance, grain size, shape and texture were performed between DCT and EBSD techniques. More complicated aspects of the microstructural morphology such as grain boundary shape and grains less than a critical size were poorly reproduced by the DCT reconstruction, primarily due to the difference in resolutions of the technique compared with EBSD. However, lab-based DCT is shown to accurately determine the centre-of-mass position, orientation, and size of the large grains for each phase present, austenite and martensitic ferrite. The results reveal a complex ferrite grain network of similar crystal orientations that are absent from the EBSD dataset. Such detail demonstrates that lab-based DCT, as a technique, shows great promise in the field of multi-phase material characterization.

Submitted 29 June, 2023; originally announced June 2023.

Comments: 15 pages, 11 figures. Preprint submitted to Materials Characterization

Journal ref: Materials Characterization, October 2023, Volume 204, Page 113228

arXiv:2306.14325 [pdf, other]

The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences from Linguistic Inputs

Authors: Lance Ying, Katherine M. Collins, Megan Wei, Cedegao E. Zhang, Tan Zhi-Xuan, Adrian Weller, Joshua B. Tenenbaum, Lionel Wong

Abstract: Human beings are social creatures. We routinely reason about other agents, and a crucial component of this social reasoning is inferring people's goals as we learn about their actions. In many settings, we can perform intuitive but reliable goal inference from language descriptions of agents, actions, and the background environments. In this paper, we study this process of language driving and influencing social reasoning in a probabilistic goal inference domain. We propose a neuro-symbolic model that carries out goal inference from linguistic inputs of agent scenarios. The "neuro" part is a large language model (LLM) that translates language descriptions to code representations, and the "symbolic" part is a Bayesian inverse planning engine. To test our model, we design and run a human experiment on a linguistic goal inference task. Our model closely matches human response patterns and better predicts human judgements than using an LLM alone.

Submitted 27 June, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

Comments: To appear at ICML Workshop on Theory of Mind in Communicating Agents

arXiv:2306.12302 [pdf, other]

RomAndromeda: The Roman Survey of the Andromeda Halo

Authors: Arjun Dey, Joan Najita, Carrie Filion, Jiwon Jesse Han, Sarah Pearson, Rosemary Wyse, Adrien C. R. Thob, Borja Anguiano, Miranda Apfel, Magda Arnaboldi, Eric F. Bell, Leandro Beraldo e Silva, Gurtina Besla, Aparajito Bhattacharya, Souradeep Bhattacharya, Vedant Chandra, Yumi Choi, Michelle L. M. Collins, Emily C. Cunningham, Julianne J. Dalcanton, Ivanna Escala, Hayden R. Foote, Annette M. N. Ferguson, Benjamin J. Gibson, Oleg Y. Gnedin, et al. (28 additional authors not shown)

Abstract: As our nearest large neighbor, the Andromeda Galaxy provides a unique laboratory for investigating galaxy formation and the distribution and substructure properties of dark matter in a Milky Way-like galaxy. Here, we propose an initial 2-epoch ($Δt\approx 5$ yr), 2-band Roman survey of the entire halo of Andromeda, covering 500 square degrees, which will detect nearly every red giant star in the halo (10$σ$ detection in F146, F062 of 26.5, 26.1 AB mag respectively) and yield proper motions to $\sim$25 microarcsec/year (i.e., $\sim$90 km/s) for all stars brighter than F146 $\approx 23.6$ AB mag (i.e., reaching the red clump stars in the Andromeda halo). This survey will yield (through averaging) high-fidelity proper motions for all satellites and compact substructures in the Andromeda halo and will enable statistical searches for clusters in chemo-dynamical space. Adding a third epoch during the extended mission will improve these proper motions by $\sim t^{-1.5}$, to $\approx 11$ km/s, but this requires obtaining the first epoch in Year 1 of Roman operations. In combination with ongoing and imminent spectroscopic campaigns with ground-based telescopes, this Roman survey has the potential to yield full 3-d space motions of $>$100,000 stars in the Andromeda halo, including (by combining individual measurements) robust space motions of its entire globular cluster and most of its dwarf galaxy satellite populations. It will also identify high-velocity stars in Andromeda, providing unique information on the processes that create this population. These data offer a unique opportunity to study the immigration history, halo formation, and underlying dark matter scaffolding of a galaxy other than our own.

Submitted 21 June, 2023; originally announced June 2023.

Comments: Submitted in response to the call for Roman Space Telescope Core Community Survey white papers

arXiv:2306.08424 [pdf, other]

Selective Concept Models: Permitting Stakeholder Customisation at Test-Time

Authors: Matthew Barker, Katherine M. Collins, Krishnamurthy Dvijotham, Adrian Weller, Umang Bhatt

Abstract: Concept-based models perform prediction using a set of concepts that are interpretable to stakeholders. However, such models often involve a fixed, large number of concepts, which may place a substantial cognitive load on stakeholders. We propose Selective COncept Models (SCOMs) which make predictions using only a subset of concepts and can be customised by stakeholders at test-time according to their preferences. We show that SCOMs only require a fraction of the total concepts to achieve optimal accuracy on multiple real-world datasets. Further, we collect and release a new dataset, CUB-Sel, consisting of human concept set selections for 900 bird images from the popular CUB dataset. Using CUB-Sel, we show that humans have unique individual preferences for the choice of concepts they prefer to reason about, and struggle to identify the most theoretically informative concepts. The customisation and concept selection provided by SCOM improves the efficiency of interpretation and intervention for stakeholders.

Submitted 14 June, 2023; originally announced June 2023.

arXiv:2306.01694 [pdf, other]

Evaluating Language Models for Mathematics through Interactions

Authors: Katherine M. Collins, Albert Q. Jiang, Simon Frieder, Lionel Wong, Miri Zilka, Umang Bhatt, Thomas Lukasiewicz, Yuhuai Wu, Joshua B. Tenenbaum, William Hart, Timothy Gowers, Wenda Li, Adrian Weller, Mateja Jamnik

Abstract: There is much excitement about the opportunity to harness the power of large language models (LLMs) when building problem-solving assistants. However, the standard methodology of evaluating LLMs relies on static pairs of inputs and outputs, and is insufficient for making an informed decision about which LLMs can be sensibly used and under which assistive settings. Static assessment fails to account for the essential interactive element in LLM deployment, and therefore limits how we understand language model capabilities. We introduce CheckMate, an adaptable prototype platform for humans to interact with and evaluate LLMs. We conduct a study with CheckMate to evaluate three language models (InstructGPT, ChatGPT, and GPT-4) as assistants in proving undergraduate-level mathematics, with a mixed cohort of participants from undergraduate students to professors of mathematics. We release the resulting interaction and rating dataset, MathConverse. By analysing MathConverse, we derive a taxonomy of human behaviours and uncover that despite a generally positive correlation, there are notable instances of divergence between correctness and perceived helpfulness in LLM generations, amongst other findings. Further, we garner a more granular understanding of GPT-4 mathematical problem-solving through a series of case studies, contributed by expert mathematicians. We conclude with actionable takeaways for ML practitioners and mathematicians: models that communicate uncertainty, respond well to user corrections, and are more interpretable and concise may constitute better assistants. Interactive evaluation is a promising way to navigate the capability of these models; humans should be aware of language models' algebraic fallibility and discern where they are appropriate to use.

Submitted 5 November, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

arXiv:2305.14793 [pdf, other]

Faithful Low-Resource Data-to-Text Generation through Cycle Training

Authors: Zhuoer Wang, Marcus Collins, Nikhita Vedula, Simone Filice, Shervin Malmasi, Oleg Rokhlenko

Abstract: Methods to generate text from structured data have advanced significantly in recent years, primarily due to fine-tuning of pre-trained language models on large datasets. However, such models can fail to produce output faithful to the input data, particularly on out-of-domain data. Sufficient annotated data is often not available for specific domains, leading us to seek an unsupervised approach to improve the faithfulness of output text. Since the problem is fundamentally one of consistency between the representations of the structured data and text, we evaluate the effectiveness of cycle training in this work. Cycle training uses two models which are inverses of each other: one that generates text from structured data, and one which generates the structured data from natural language text. We show that cycle training, when initialized with a small amount of supervised data (100 samples in our case), achieves nearly the same performance as fully supervised approaches for the data-to-text generation task on the WebNLG, E2E, WTQ, and WSQL datasets. We perform extensive empirical analysis with automated evaluation metrics and a newly designed human evaluation schema to reveal different cycle training strategies' effectiveness of reducing various types of generation errors. Our code is publicly available at https://github.com/Edillower/CycleNLG.

Submitted 11 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: 19 pages, 4 figures, ACL 2023

arXiv:2305.13966 [pdf, other]

Pisces VII/Triangulum III -- M33's second dwarf satellite galaxy

Authors: Michelle L. M. Collins, Noushin Karim, David Martinez-Delgado, Matteo Monelli, Erik J. Tollerud, Giuseppe Donatiello, Mahdieh Navabi, Emily Charles, Walter Boschin

Abstract: Pisces VII/Triangulum III (Pisc VII) was discovered in the DESI Legacy Imaging Survey and was shown to be a Local Group dwarf galaxy with follow-up imaging from the 4-m Telescopio Nazionale Galileo. However, this imaging was unable to reach the horizontal branch of Pisc VII, preventing a precision distance measurement. The distance bound from the red giant branch population placed Pisc VII as either an isolated ultra-faint dwarf galaxy or the second known satellite galaxy of Triangulum (M33). Using deep imaging from Gemini GMOS-N, we have resolved the horizontal branch of Pisc VII, and measure a distance of $D=916^{+65}_{-53}$ kpc, making Pisc VII a likely satellite of M33. We also remeasure its size and luminosity from this deeper data, finding $r_{\rm half}=186^{+58}_{-32}$ pc, $M_V=-6.0\pm0.3$ and $L=2.2^{+0.7}_{-0.5}\times10^4\,{\rm L}_\odot$. Given its position in the M33 halo, we argue that Pisc VII could support the theory that M33 is on its first infall to the Andromeda system. We also discuss the presence of blue plume and helium burning stars in the colour-magnitude diagram of Pisc VII that are consistent with ages of $\sim1.5$ Gyr. If these are truly members of the galaxy, it would transform our understanding of how reionisation affects the faintest galaxies. Future deep imaging and dynamics could allow significant insight into both the stellar populations of Pisc VII and the evolution of M33.

Submitted 30 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: 7 pages, 5 figures, accepted by MNRAS after minor revisions

arXiv:2305.13360 [pdf, other]

The Hubble Space Telescope Survey of M31 Satellite Galaxies II. The Star Formation Histories of Ultra-Faint Dwarf Galaxies

Authors: A. Savino, D. R. Weisz, E. D. Skillman, A. Dolphin, A. A. Cole, N. Kallivayalil, A. Wetzel, J. Anderson, G. Besla, M. Boylan-Kolchin, T. M. Brown, J. S. Bullock, M. L. M. Collins, M. C. Cooper, A. J. Deason, A. L. Dotter, M. Fardal, A. M. N. Ferguson, T. K. Fritz, M. C. Geha, K. M. Gilbert, P. Guhathakurta, R. Ibata, M. J. Irwin, M. Jeon, et al. (12 additional authors not shown)

Abstract: We present the lifetime star formation histories (SFHs) for six ultra-faint dwarf (UFD; $M_V>-7.0$, $4.9<\log_{10}({M_*(z=0)}/{M_{\odot}})<5.5$) satellite galaxies of M31 based on deep color-magnitude diagrams constructed from \textit{Hubble Space Telescope} imaging. These are the first SFHs obtained from the oldest main sequence turn-off of UFDs outside the halo of the Milky Way (MW). We find that five UFDs formed at least 50\% of their stellar mass by $z=5$ (12.6 Gyr ago), similar to known UFDs around the MW, but that 10-40\% of their stellar mass formed at later times. We uncover one remarkable UFD, And XIII, which formed only 10\% of its stellar mass by $z=5$, and 75\% in a rapid burst at $z\sim2-3$, a result that is robust to choices of underlying stellar model and is consistent with its predominantly red horizontal branch. This "young" UFD is the first of its kind and indicates that not all UFDs are necessarily quenched by reionization, which is consistent with predictions from several cosmological simulations of faint dwarf galaxies. SFHs of the combined MW and M31 samples suggest reionization did not homogeneously quench UFDs. We find that the least massive MW UFDs ($M_*(z=5) \lesssim 5\times10^4 M_{\odot}$) are likely quenched by reionization, whereas more massive M31 UFDs ($M_*(z=5) \gtrsim 10^5 M_{\odot}$) may only have their star formation suppressed by reionization and quench at a later time. We discuss these findings in the context of the evolution and quenching of UFDs.

Submitted 13 September, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

Comments: 18 pages, 14 figures, 5 appendices, accepted for publication in ApJ

arXiv:2305.01451 [pdf, ps, other]

Fixed points of irreducible, displacement one automorphisms of free products

Authors: Matthew Collins

Abstract: We consider the action of outer automorphisms on the deformation space $\mathcal{O}$ of $G$-trees given by a free product decomposition of a group $G$. We show that an irreducible, displacement 1 automorphism fixes exactly one point in $\mathcal{O}_1$ (the covolume 1 slice of $\mathcal{O}$).

Submitted 2 May, 2023; originally announced May 2023.

MSC Class: 20F65 (Primary) 20E36; 20E08 (Secondary)

arXiv:2304.13000 [pdf, other]

Segment anything, from space?

Authors: Simiao Ren, Francesco Luzi, Saad Lahrichi, Kaleb Kassaw, Leslie M. Collins, Kyle Bradbury, Jordan M. Malof

Abstract: Recently, the first foundation model developed specifically for image segmentation tasks was introduced, termed the "Segment Anything Model" (SAM). SAM can segment objects in input imagery based on cheap input prompts, such as one (or more) points, a bounding box, or a mask. The authors examined the \textit{zero-shot} image segmentation accuracy of SAM on a large number of vision benchmark tasks and found that SAM usually achieved recognition accuracy similar to, or sometimes exceeding, vision models that had been trained on the target tasks. The impressive generalization of SAM for segmentation has major implications for vision researchers working on natural imagery. In this work, we examine whether SAM's performance extends to overhead imagery problems and help guide the community's response to its development. We examine SAM's performance on a set of diverse and widely studied benchmark tasks. We find that SAM does often generalize well to overhead imagery, although it fails in some cases due to the unique characteristics of overhead imagery and its common target objects. We report on these unique systematic failure cases for remote sensing imagery that may comprise useful future research for the community.

Submitted 9 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: Work accepted at WACV 2024, this is only a pre-print, please go to WACV website for the official version

arXiv:2304.09565 [pdf, other]

Signatures of quantum chaos of Rydberg dressed bosons in a triple-well potential

Authors: Tianyi Yan, Matthew Collins, Rejish Nath, Weibin Li

Abstract: We study signatures of quantum chaos in the dynamics of Rydberg dressed bosonic atoms held in a one dimensional triple-well potential. Long-range nearest-neighbor and next-nearest-neighbor interactions, induced by laser dressing atoms to strongly interacting Rydberg states, drastically affect mean field and quantum many-body dynamics. By analyzing the mean field dynamics, classical chaos regions with positive and large Lyapunov exponents are identified as a function of the potential well tilting and dressed interactions. In the quantum regime, it is found that level statistics of the eigen-energies gains a Wigner-Dyson distribution when the Lyapunov exponents are large, giving rise to signatures of strong quantum chaos. We find that both the time averaged entanglement entropy and survival probability of the initial state have distinctively large values in the quantum chaos regime. We further show that population variances could be used as an indicator of the emergence of quantum chaos. This might provide a way to directly probe quantum chaotic dynamics through analyzing population dynamics in individual potential wells.

Submitted 6 June, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

Comments: 13 pages, 7 figures

arXiv:2304.07787 [pdf, other]

Harnessing Digital Pathology And Causal Learning To Improve Eosinophilic Esophagitis Dietary Treatment Assignment

Authors: Eliel Aknin, Ariel Larey, Julie M. Caldwell, Margaret H. Collins, Juan P. Abonia, Seema S. Aceves, Nicoleta C. Arva, Mirna Chehade, Evan S. Dellon, Nirmala Gonsalves, Sandeep K. Gupta, John Leung, Kathryn A. Peterson, Tetsuo Shoda, Jonathan M. Spergel, Marc E. Rothenberg, Yonatan Savir

Abstract: Eosinophilic esophagitis (EoE) is a chronic, food antigen-driven, allergic inflammatory condition of the esophagus associated with elevated esophageal eosinophils. EoE is a top cause of chronic dysphagia after GERD. Diagnosis of EoE relies on counting eosinophils in histological slides, a manual and time-consuming task that limits the ability to extract complex patient-dependent features. The treatment of EoE includes medication and food elimination. A personalized food elimination plan is crucial for engagement and efficiency, but previous attempts failed to produce significant results. In this work, on the one hand, we utilize AI for inferring histological features from the entire biopsy slide, features that cannot be extracted manually. On the other hand, we develop causal learning models that can process this wealth of data. We applied our approach to the 'Six-Food vs. One-Food Eosinophilic Esophagitis Diet Study', where 112 symptomatic adults aged 18-60 years with active EoE were assigned to either a six-food elimination diet (6FED) or a one-food elimination diet (1FED) for six weeks. Our results show that the average treatment effect (ATE) of the 6FED treatment compared with the 1FED treatment is not significant, that is, neither diet was superior to the other. We examined several causal models and show that the best treatment strategy was obtained using T-learner with two XGBoost modules. While 1FED only and 6FED only provide improvement for 35%-38% of the patients, which is not significantly different from a random treatment assignment, our causal model yields a significantly better improvement rate of 58.4%. This study illustrates the significance of AI in enhancing treatment planning by analyzing molecular features' distribution in histological slides through causal learning. Our approach can be harnessed for other conditions that rely on histology for diagnosis and treatment.

Submitted 16 April, 2023; originally announced April 2023.

Comments: 11 pages, 5 figures

arXiv:2304.06701 [pdf, other]

Learning Personalized Decision Support Policies

Authors: Umang Bhatt, Valerie Chen, Katherine M. Collins, Parameswaran Kamalaruban, Emma Kallina, Adrian Weller, Ameet Talwalkar

Abstract: Individual human decision-makers may benefit from different forms of support to improve decision outcomes, but when will each form of support yield better outcomes? In this work, we posit that personalizing access to decision support tools can be an effective mechanism for instantiating the appropriate use of AI assistance. Specifically, we propose the general problem of learning a decision support policy that, for a given input, chooses which form of support to provide to decision-makers for whom we initially have no prior information. We develop $\texttt{Modiste}$, an interactive tool to learn personalized decision support policies. $\texttt{Modiste}$ leverages stochastic contextual bandit techniques to personalize a decision support policy for each decision-maker and supports extensions to the multi-objective setting to account for auxiliary objectives like the cost of support. We find that personalized policies outperform offline policies, and, in the cost-aware setting, reduce the incurred cost with minimal degradation to performance. Our experiments include various realistic forms of support (e.g., expert consensus and predictions from a large language model) on vision and language tasks. Our human subject experiments validate our computational experiments, demonstrating that personalization can yield benefits in practice for real users, who interact with $\texttt{Modiste}$.

Submitted 27 May, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

Comments: 29 pages, 12 figures

arXiv:2303.17356 [pdf, other]

doi 10.3390/universe9080347

The ESSnuSB design study: overview and future prospects

Authors: ESSnuSB Collaboration, A. Alekou, E. Baussan, A. K. Bhattacharyya, N. Blaskovic Kraljevic, M. Blennow, M. Bogomilov, B. Bolling, E. Bouquerel, F. Bramati, A. Branca, O. Buchan, A. Burgman, C. J. Carlile, J. Cederkall, S. Choubey, P. Christiansen, M. Collins, E. Cristaldo Morales, L. D'Alessi, H. Danared, D. Dancila, J. P. A. M. de André, J. P. Delahaye, M. Dracos, et al. (61 additional authors not shown)

Abstract: ESSnuSB is a design study for an experiment to measure the CP violation in the leptonic sector at the second neutrino oscillation maximum using a neutrino beam driven by the uniquely powerful ESS linear accelerator. The reduced impact of systematic errors on sensitivity at the second maximum allows for a very precise measurement of the CP violating parameter. This review describes the fundamental advantages of measurement at the 2nd maximum, the necessary upgrades to the ESS linac in order to produce a neutrino beam, the near and far detector complexes, the expected physics reach of the proposed ESSnuSB experiment, concluding with the near future developments aimed at the project realization.

Submitted 8 August, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

Comments: 19 pages, 12 figures; Final version after review by the Universe journal

arXiv:2303.12872 [pdf, other]

Human Uncertainty in Concept-Based AI Systems

Authors: Katherine M. Collins, Matthew Barker, Mateo Espinosa Zarlenga, Naveen Raman, Umang Bhatt, Mateja Jamnik, Ilia Sucholutsky, Adrian Weller, Krishnamurthy Dvijotham

Abstract: Placing a human in the loop may abate the risks of deploying AI systems in safety-critical settings (e.g., a clinician working with a medical AI system). However, mitigating risks arising from human error and uncertainty within such human-AI interactions is an important and understudied issue. In this work, we study human uncertainty in the context of concept-based models, a family of AI systems that enable human feedback via concept interventions where an expert intervenes on human-interpretable concepts relevant to the task. Prior work in this space often assumes that humans are oracles who are always certain and correct. Yet, real-world decision-making by humans is prone to occasional mistakes and uncertainty. We study how existing concept-based models deal with uncertain interventions from humans using two novel datasets: UMNIST, a visual dataset with controlled simulated uncertainty based on the MNIST dataset, and CUB-S, a relabeling of the popular CUB concept dataset with rich, densely-annotated soft labels from humans. We show that training with uncertain concept labels may help mitigate weaknesses of concept-based systems when handling uncertain interventions. These results allow us to identify several open challenges, which we argue can be tackled through future multidisciplinary research on building interactive uncertainty-aware systems. To facilitate further research, we release a new elicitation platform, UElic, to collect uncertain feedback from humans in collaborative prediction tasks.

Submitted 22 March, 2023; originally announced March 2023.

arXiv:2303.04764 [pdf, other]

High Resolution 3D Strain and Orientation Mapping within a Grain of a Directed Energy Deposition Laser Additively Manufactured Superalloy

Authors: Y. Chen, Y. T. Tang, D. M. Collins, S. J. Clark, W. Ludwig, R. Rodriguez-Lamas, C. Detlefs, R. C. Reed, P. D. Lee, P. J. Withers, C. Yildirim

Abstract: The industrialization of Laser Additive Manufacturing (LAM) is challenged by the undesirable microstructures and high residual stresses originating from the fast and complex solidification process. Non-destructive assessment of the mechanical performance controlling deformation patterning is therefore critical. Here, we use Dark Field X-ray Microscopy (DFXM) to non-destructively map the 3D intragranular orientation and strain variations throughout a surface breaking grain within a directed energy deposition nickel superalloy. DFXM results reveal a highly heterogenous 3D microstructure in terms of the local orientation and lattice strain. The grain comprises $\approx$ 5 $μ$m-sized cells with alternating strain states, as high as 5 $\times 10^{-3}$, and orientation differences <0.5°. The DFXM results are compared to Electron Backscatter Diffraction measurements of the same grain from its cut-off surface. We discuss the microstructure developments during LAM, rationalising the development of the deformation patterning from the extreme thermal gradients during processing and the susceptibility for solute segregation.

Submitted 8 March, 2023; originally announced March 2023.

Comments: Corresponding author: can.yildirim@esrf.fr - Submitted to Scripta Materialia on 8 March 2023

arXiv:2302.10711 [pdf, other]

doi 10.1038/s43246-024-00466-8

Per-grain and neighbourhood stress interactions during deformation of a ferritic steel obtained using three-dimensional X-ray diffraction

Authors: James A. D. Ball, Anna Kareer, Oxana V. Magdysyuk, Stefan Michalik, Thomas Connolley, David M. Collins

Abstract: Three-dimensional X-ray diffraction (3DXRD) has been used to measure, in-situ, the evolution of $\sim 1800$ grains in a single phase low carbon ferritic steel sample during uniaxial deformation. The distribution of initial residual grain stresses in the material was observed to prevail as plasticity builds, though became less pronounced, and therefore less influential as strain increased. The initial Schmid factor of a grain was found to be strongly correlated to the intergranular stress change and the range of stresses that are permissible; a grain well aligned for easy slip is more likely to exhibit a range of stresses than those orientated poorly for dislocation motion. The orientation path of a grain, however, is not only dependent on its initial orientation, but hypothesised to be influenced by its stress state and the stress state of its grain environment. A grain neighbourhood effect is observed: the Schmid factor of serial adjoining grains influences the stress state of a grain of interest, whereas parallel neighbours are much less influential. This phenomenon is strongest at low plastic strains only, with the effect diminishing as plasticity builds. The influence of initial residual stresses becomes less evident, and grains rotate to eliminate any orientation dependent load shedding. The ability of the BCC ferrite to exhaust such neighbourhood interactions, which would otherwise be detrimental in crystal structures with lower symmetry and fewer slip systems, is considered key to the high ductility possessed by these materials.

Submitted 21 February, 2023; originally announced February 2023.

Comments: 17 pages (+2 supplementary pages), 13 figures (+3 supplementary figures). Preprint submitted to Acta Materialia

Journal ref: Commun Mater 5, 27 (2024)

Showing 1–50 of 271 results for author: Collins, M