-
curvedSpaceSim: A framework for simulating particles interacting along geodesics
Authors:
Toler H. Webb,
Daniel M. Sussman
Abstract:
A large number of powerful, high-quality, and open-source simulation packages exist to efficiently perform molecular dynamics simulations, and their prevalence has greatly accelerated discoveries across a wide range of scientific domains. These packages typically simulate particles in free (Euclidean) space, with options to specify a variety of boundary conditions. While more exotic, many physical systems are constrained to and interact across curved surfaces, such as organisms moving across the landscape, colloids pinned at curved fluid-fluid interfaces, and layers of epithelial cells forming highly curved tissues. The calculation of distances and the updating of equations of motion in idealized geometries (namely, on surfaces of constant curvature) can be done analytically, but it is much more challenging to efficiently perform molecular-dynamics-like simulations on arbitrarily curved surfaces. This article discusses a simulation framework which combines tools from particle-based simulations with recent work in discrete differential geometry to model particles that interact via geodesic distances and move on an arbitrarily curved surface. We present computational cost estimates for a variety of surface complexities with and without various algorithmic specializations (e.g., restrictions to short-range interaction potentials, or multi-threaded parallelization). Our flexible and extensible framework is set up to easily handle both equilibrium and non-equilibrium dynamics, and will enable researchers to access time- and particle-number-scales previously inaccessible.
Submitted 30 April, 2024;
originally announced April 2024.
-
Evidence from counterfactual tasks supports emergent analogical reasoning in large language models
Authors:
Taylor Webb,
Keith J. Holyoak,
Hongjing Lu
Abstract:
We recently reported evidence that large language models are capable of solving a wide range of text-based analogy problems in a zero-shot manner, indicating the presence of an emergent capacity for analogical reasoning. Two recent commentaries have challenged these results, citing evidence from so-called `counterfactual' tasks in which the standard sequence of the alphabet is arbitrarily permuted so as to decrease similarity with materials that may have been present in the language model's training data. Here, we reply to these critiques, clarifying some misunderstandings about the test materials used in our original work, and presenting evidence that language models are also capable of generalizing to these new counterfactual task variants.
Submitted 29 April, 2024; v1 submitted 14 April, 2024;
originally announced April 2024.
-
Resolved UV and optical color gradients reveal environmental influence on galaxy evolution at redshift z$\sim$1.6
Authors:
William J. Cramer,
A. G. Noble,
G. Rudnick,
A. Pigarelli,
G. Wilson,
Y. M. Bahé,
M. C. Cooper,
R. Demarco,
J. Matharu,
T. B. Miller,
A. Muzzin,
J. Nantais,
W. Sportsman,
E. van Kampen,
T. M. A. Webb,
H. K. C. Yee
Abstract:
The changes in colors across a galaxy are intimately connected to the galaxy's formation, growth, quenching history, and dust content. A particularly important epoch in the growth of galaxies is near $z \sim 2$, often referred to as 'cosmic noon', when galaxies on average reach the peak of their star formation. We study a population of 125 cluster galaxies at $z \sim 1.6$ in three Hubble Space Telescope (HST) filters, F475W, F625W, and F160W, roughly corresponding to the rest-frame FUV, NUV, and r band, respectively. By comparing to a control sample of 200 field galaxies at similar redshift, we reveal clear, statistically significant differences in the overall spatially resolved colors and color gradients in galaxies across these two different environments. On average, cluster galaxies have redder UV colors in both the inner and outer regions bounded by $r_{\mathrm{50}}$, as well as an overall wider dispersion of outside-in color gradients. The presence of these observed differences, along with evidence from ancillary data from previous studies, strongly suggests that the environment drives these population-level color differences, by affecting the stellar populations and/or dust content.
Submitted 10 April, 2024;
originally announced April 2024.
-
Slot Abstractors: Toward Scalable Abstract Visual Reasoning
Authors:
Shanka Subhra Mondal,
Jonathan D. Cohen,
Taylor W. Webb
Abstract:
Abstract visual reasoning is a characteristically human ability, allowing the identification of relational patterns that are abstracted away from object features, and the systematic generalization of those patterns to unseen problems. Recent work has demonstrated strong systematic generalization in visual reasoning tasks involving multi-object inputs, through the integration of slot-based methods used for extracting object-centric representations coupled with strong inductive biases for relational abstraction. However, this approach was limited to problems containing a single rule, and was not scalable to visual reasoning problems containing a large number of objects. Other recent work proposed Abstractors, an extension of Transformers that incorporates strong relational inductive biases, thereby inheriting the Transformer's scalability and multi-head architecture, but it has yet to be demonstrated how this approach might be applied to multi-object visual inputs. Here we combine the strengths of the above approaches and propose Slot Abstractors, an approach to abstract visual reasoning that can be scaled to problems involving a large number of objects and multiple relations among them. The approach displays state-of-the-art performance across four abstract visual reasoning tasks, as well as an abstract reasoning task involving real-world images.
Submitted 2 June, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
Detection of Diffuse Hot Gas Around the Young, Potential Superstar Cluster H72.97-69.39
Authors:
Trinity L. Webb,
Jennifer A. Rodriguez,
Laura A. Lopez,
Anna L. Rosen,
Lachlan Lancaster,
Omnarayani Nayak,
Anna F. McLeod,
Paarmita Pandey,
Grace M. Olivier
Abstract:
We present the first Chandra X-ray observations of H72.97-69.39, a highly-embedded, potential super-star cluster (SSC) in its infancy located in the star-forming complex N79 of the Large Magellanic Cloud. We detect particularly hard, diffuse X-ray emission that is coincident with the young stellar object (YSO) clusters identified with JWST, and the hot gas fills cavities in the dense gas mapped by ALMA. The X-ray spectra are best fit with either a thermal plasma or power-law model, and assuming the former, we show that the X-ray luminosity of $L_X = (1.5 \pm 0.3) \times 10^{34}$ erg/s is a factor of $\sim$20 below the expectation for a fully-confined wind bubble. Our results suggest that stellar wind feedback produces diffuse hot gas in the earliest stages of massive star cluster formation and that wind energy can be lost quickly via either turbulent mixing followed by radiative cooling or by physical leakage.
Submitted 21 February, 2024;
originally announced February 2024.
-
High-Spectral Resolution Observations of the Optical Filamentary Nebula in NGC 1275
Authors:
Benjamin Vigneron,
Julie Hlavacek-Larrondo,
Carter Lee Rhea,
Marie-Lou Gendron-Marsolais,
Jeremy Lim,
Jake Reinheimer,
Yuan Li,
Laurent Drissen,
Greg L. Bryan,
Megan Donahue,
Alastair Edge,
Andrew Fabian,
Stephen Hamer,
Thomas Martin,
Michael McDonald,
Brian McNamara,
Annabelle Richard-Lafferriere,
Laurie Rousseau-Nepton,
G. Mark Voit,
Tracy Webb,
Norbert Werner
Abstract:
We present new high-spectral resolution observations (R = $λ/Δλ$ = 7000) of the filamentary nebula surrounding NGC 1275, the central galaxy of the Perseus cluster. These observations have been obtained with SITELLE, an imaging Fourier transform spectrometer installed on the Canada-France-Hawaii Telescope (CFHT) with a field of view of $11\text{ arcmin }\times 11 \text{ arcmin}$ encapsulating the entire filamentary structure of ionised gas despite its large size of $80 \text{ kpc}\times50 \text{ kpc}$. Here, we present renewed flux, velocity and velocity dispersion maps that show in great detail the kinematics of the optical nebula at [S II]$\lambda6716$, [S II]$\lambda6731$, [N II]$\lambda6584$, H$α$(6563Å), and [N II]$\lambda6548$. These maps reveal the existence of a bright flattened disk-shaped structure in the core extending to r $\sim 10$ kpc and dominated by a chaotic velocity field. This structure is located in the wake of X-ray cavities and characterised by a high mean velocity dispersion of $134$ km/s. The disk-shaped structure is surrounded by an extended array of filaments spread out to $r\sim 50$ kpc that are 10 times fainter in flux, remarkably quiescent, and have a uniform mean velocity dispersion of $44$ km/s. This stability is puzzling given that the cluster core exhibits several energetic phenomena. Based on these results, we argue that there are two mechanisms to form multiphase gas in clusters of galaxies: a first triggered in the wake of X-ray cavities leading to more turbulent multiphase gas and a second, distinct mechanism, that is gentle and leads to large-scale multiphase gas spread throughout the core.
Submitted 27 March, 2024; v1 submitted 27 November, 2023;
originally announced November 2023.
-
First-order phase transition vs. spin-state quantum-critical scenarios in strain-tuned epitaxial cobaltite thin films
Authors:
J. E. Dewey,
V. Chaturvedi,
T. A. Webb,
P. Sharma,
W. M. Postiglione,
P. Quarterman,
P. P. Balakrishnan,
B. J. Kirby,
L. Figari,
C. Korostynski,
A. Jacobson,
T. Birol,
R. M. Fernandes,
A. N. Pasupathy,
C. Leighton
Abstract:
Pr-containing perovskite cobaltites exhibit unusual valence transitions, coupled to coincident structural, spin-state, and metal-insulator transitions. Heteroepitaxial strain was recently used to control these phenomena in the model (Pr$_{1-y}$Y$_y$)$_{1-x}$Ca$_x$CoO$_{3-δ}$ system, stabilizing a nonmagnetic insulating phase under compression (with a room-temperature valence/spin-state/metal-insulator transition) and a ferromagnetic metallic phase under tension, thus exposing a potential spin-state quantum critical point. The latter has been proposed in cobaltites and can be probed in this system as a function of a disorder-free variable (strain). We study this here via thickness-dependent strain relaxation in compressive SrLaAlO$_4$(001)/(Pr$_{0.85}$Y$_{0.15}$)$_{0.70}$Ca$_{0.30}$CoO$_{3-δ}$ epitaxial thin films to quasi-continuously probe structural, electronic, and magnetic behaviors across the nonmagnetic-insulator/ferromagnetic-metal boundary. High-resolution X-ray diffraction, electronic transport, magnetometry, polarized neutron reflectometry, and temperature-dependent magnetic force microscopy provide a detailed picture, including abundant evidence of temperature- and strain-dependent phase coexistence. This indicates a first-order phase transition as opposed to spin-state quantum-critical behavior, which we discuss theoretically via a phenomenological Landau model for coupled spin-state and magnetic phase transitions.
Submitted 10 November, 2023;
originally announced November 2023.
-
A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models
Authors:
Taylor Webb,
Shanka Subhra Mondal,
Chi Wang,
Brian Krabach,
Ida Momennejad
Abstract:
Large language models (LLMs) demonstrate impressive performance on a wide variety of tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed planning. To address this, we take inspiration from the human brain, in which planning is accomplished via the recurrent interaction of specialized modules in the prefrontal cortex (PFC). These modules perform functions such as conflict monitoring, state prediction, state evaluation, task decomposition, and task coordination. We find that LLMs are sometimes capable of carrying out these functions in isolation, but struggle to autonomously coordinate them in the service of a goal. Therefore, we propose a black box architecture with multiple LLM-based (GPT-4) modules. The architecture improves planning through the interaction of specialized PFC-inspired modules that break down a larger problem into multiple brief automated calls to the LLM. We evaluate the combined architecture on three challenging planning tasks -- graph traversal, Tower of Hanoi, and logistics -- finding that it yields significant improvements over standard LLM methods (e.g., zero-shot prompting, in-context learning, and chain-of-thought). These results demonstrate the benefit of utilizing knowledge from cognitive neuroscience to improve planning in LLMs.
Submitted 5 March, 2024; v1 submitted 29 September, 2023;
originally announced October 2023.
-
Strong evidence for 9N and the limits of existence of atomic nuclei
Authors:
R. J. Charity,
J. Wylie,
S. M. Wang,
T. B. Webb,
K. W. Brown,
G. Cerizza,
Z. Chajecki,
J. M. Elson,
J. Estee,
D. E. M Hoff,
S. A. Kuvin,
W. G. Lynch,
J. Manfredi,
N. Michel,
D. G. McNeel,
P. Morfouace,
W. Nazarewicz,
C. D. Pruitt,
C. Santamaria,
S. Sweany,
J. Smith,
L. G. Sobotka,
M. B. Tsang,
A. H. Wuosmaa
Abstract:
The boundaries of the Chart of Nuclides contain exotic isotopes that possess extreme proton-to-neutron asymmetries. Here we report on strong evidence of 9N, one of the most exotic proton-rich isotopes where more than one half of its constituent nucleons are unbound. With seven protons and two neutrons, this extremely proton-rich system would represent the first-known example of a ground-state five-proton emitter. The invariant-mass spectrum of its decay products can be fit with two peaks whose energies are consistent with the theoretical predictions of an open-quantum-system approach; however, we cannot rule out the possibility that only a single resonance-like peak is present in the spectrum.
Submitted 26 September, 2023;
originally announced September 2023.
-
The Relational Bottleneck as an Inductive Bias for Efficient Abstraction
Authors:
Taylor W. Webb,
Steven M. Frankland,
Awni Altabaa,
Simon Segert,
Kamesh Krishnamurthy,
Declan Campbell,
Jacob Russin,
Tyler Giallanza,
Zack Dulberg,
Randall O'Reilly,
John Lafferty,
Jonathan D. Cohen
Abstract:
A central challenge for cognitive science is to explain how abstract concepts are acquired from limited experience. This has often been framed in terms of a dichotomy between connectionist and symbolic cognitive models. Here, we highlight a recently emerging line of work that suggests a novel reconciliation of these approaches, by exploiting an inductive bias that we term the relational bottleneck. In that approach, neural networks are constrained via their architecture to focus on relations between perceptual inputs, rather than the attributes of individual inputs. We review a family of models that employ this approach to induce abstractions in a data-efficient manner, emphasizing their potential as candidate models for the acquisition of abstract concepts in the human mind and brain.
Submitted 1 May, 2024; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Tunable magnetic domains in ferrimagnetic MnSb$_2$Te$_4$
Authors:
Tatiana A. Webb,
Afrin N. Tamanna,
Xiaxin Ding,
Jikai Xu,
Lia Krusin-Elbaum,
Cory R. Dean,
Dmitri N. Basov,
Abhay N. Pasupathy
Abstract:
Highly tunable properties make Mn(Bi,Sb)$_2$Te$_4$ a rich playground for exploring the interplay between band topology and magnetism: On one end, MnBi$_2$Te$_4$ is an antiferromagnetic topological insulator, while the magnetic structure of MnSb$_2$Te$_4$ (MST) can be tuned between antiferromagnetic and ferrimagnetic. Motivated to control electronic properties through real-space magnetic textures, we use magnetic force microscopy (MFM) to image the domains of ferrimagnetic MST. We find that magnetic field tunes between stripe and bubble domain morphologies, raising the possibility of topological spin textures. Moreover, we combine in situ transport with domain manipulation and imaging to both write MST device properties and directly measure the scaling of the Hall response with domain area. This work demonstrates measurement of the local anomalous Hall response using MFM, and opens the door to reconfigurable domain-based devices in the M(B,S)T family.
Submitted 31 August, 2023;
originally announced August 2023.
-
Systematic Visual Reasoning through Object-Centric Relational Abstraction
Authors:
Taylor W. Webb,
Shanka Subhra Mondal,
Jonathan D. Cohen
Abstract:
Human visual reasoning is characterized by an ability to identify abstract patterns from only a small number of examples, and to systematically generalize those patterns to novel inputs. This capacity depends in large part on our ability to represent complex visual inputs in terms of both objects and relations. Recent work in computer vision has introduced models with the capacity to extract object-centric representations, leading to the ability to process multi-object visual inputs, but falling short of the systematic generalization displayed by human reasoning. Other recent models have employed inductive biases for relational abstraction to achieve systematic generalization of learned abstract rules, but have generally assumed the presence of object-focused inputs. Here, we combine these two approaches, introducing Object-Centric Relational Abstraction (OCRA), a model that extracts explicit representations of both objects and abstract relations, and achieves strong systematic generalization in tasks (including a novel dataset, CLEVR-ART, with greater visual complexity) involving complex visual displays.
Submitted 10 November, 2023; v1 submitted 4 June, 2023;
originally announced June 2023.
-
Determinantal Point Process Attention Over Grid Cell Code Supports Out of Distribution Generalization
Authors:
Shanka Subhra Mondal,
Steven Frankland,
Taylor Webb,
Jonathan D. Cohen
Abstract:
Deep neural networks have made tremendous gains in emulating human-like intelligence, and have been used increasingly as ways of understanding how the brain may solve the complex computational problems on which this relies. However, these still fall short of, and therefore fail to provide insight into, how the brain supports the strong forms of generalization of which humans are capable. One such case is out-of-distribution (OOD) generalization, that is, successful performance on test examples that lie outside the distribution of the training set. Here, we identify properties of processing in the brain that may contribute to this ability. We describe a two-part algorithm that draws on specific features of neural computation to achieve OOD generalization, and provide a proof of concept by evaluating performance on two challenging cognitive tasks. First, we draw on the fact that the mammalian brain represents metric spaces using grid cell code (e.g., in the entorhinal cortex): abstract representations of relational structure, organized in recurring motifs that cover the representational space. Second, we propose an attentional mechanism that operates over the grid cell code using a Determinantal Point Process (DPP), which we call DPP attention (DPP-A) -- a transformation that ensures maximum sparseness in the coverage of that space. We show that a loss function that combines standard task-optimized error with DPP-A can exploit the recurring motifs in the grid cell code, and can be integrated with common architectures to achieve strong OOD generalization performance on analogy and arithmetic tasks. This provides both an interpretation of how the grid cell code in the mammalian brain may contribute to generalization performance, and at the same time a potential means for improving such capabilities in artificial neural networks.
Submitted 23 January, 2024; v1 submitted 28 May, 2023;
originally announced May 2023.
-
Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers
Authors:
Awni Altabaa,
Taylor Webb,
Jonathan Cohen,
John Lafferty
Abstract:
An extension of Transformers is proposed that enables explicit relational reasoning through a novel module called the Abstractor. At the core of the Abstractor is a variant of attention called relational cross-attention. The approach is motivated by an architectural inductive bias for relational learning that disentangles relational information from object-level features. This enables explicit relational reasoning, supporting abstraction and generalization from limited data. The Abstractor is first evaluated on simple discriminative relational tasks and compared to existing relational architectures. Next, the Abstractor is evaluated on purely relational sequence-to-sequence tasks, where dramatic improvements are seen in sample efficiency compared to standard Transformers. Finally, Abstractors are evaluated on a collection of tasks based on mathematical problem solving, where consistent improvements in performance and sample efficiency are observed.
Submitted 12 April, 2024; v1 submitted 31 March, 2023;
originally announced April 2023.
-
Learning to reason over visual objects
Authors:
Shanka Subhra Mondal,
Taylor Webb,
Jonathan D. Cohen
Abstract:
A core component of human intelligence is the ability to identify abstract patterns inherent in complex, high-dimensional perceptual data, as exemplified by visual reasoning tasks such as Raven's Progressive Matrices (RPM). Motivated by the goal of designing AI systems with this capacity, recent work has focused on evaluating whether neural networks can learn to solve RPM-like problems. Previous work has generally found that strong performance on these problems requires the incorporation of inductive biases that are specific to the RPM problem format, raising the question of whether such models might be more broadly useful. Here, we investigated the extent to which a general-purpose mechanism for processing visual scenes in terms of objects might help promote abstract visual reasoning. We found that a simple model, consisting only of an object-centric encoder and a transformer reasoning module, achieved state-of-the-art results on both of two challenging RPM-like benchmarks (PGM and I-RAVEN), as well as a novel benchmark with greater visual complexity (CLEVR-Matrices). These results suggest that an inductive bias for object-centric processing may be a key component of abstract visual reasoning, obviating the need for problem-specific inductive biases.
Submitted 26 October, 2023; v1 submitted 3 March, 2023;
originally announced March 2023.
-
Emergent Analogical Reasoning in Large Language Models
Authors:
Taylor Webb,
Keith J. Holyoak,
Hongjing Lu
Abstract:
The recent advent of large language models has reinvigorated debate over whether human cognitive capacities might emerge in such generic models given sufficient training data. Of particular interest is the ability of these models to reason about novel problems zero-shot, without any direct training. In human cognition, this capacity is closely tied to an ability to reason by analogy. Here, we performed a direct comparison between human reasoners and a large language model (the text-davinci-003 variant of GPT-3) on a range of analogical tasks, including a non-visual matrix reasoning task based on the rule structure of Raven's Standard Progressive Matrices. We found that GPT-3 displayed a surprisingly strong capacity for abstract pattern induction, matching or even surpassing human capabilities in most settings; preliminary tests of GPT-4 indicated even better performance. Our results indicate that large language models such as GPT-3 have acquired an emergent ability to find zero-shot solutions to a broad range of analogy problems.
Submitted 2 August, 2023; v1 submitted 18 December, 2022;
originally announced December 2022.
-
Zero-shot visual reasoning through probabilistic analogical mapping
Authors:
Taylor W. Webb,
Shuhao Fu,
Trevor Bihl,
Keith J. Holyoak,
Hongjing Lu
Abstract:
Human reasoning is grounded in an ability to identify highly abstract commonalities governing superficially dissimilar visual inputs. Recent efforts to develop algorithms with this capacity have largely focused on approaches that require extensive direct training on visual reasoning tasks, and yield limited generalization to problems with novel content. In contrast, a long tradition of research in cognitive science has focused on elucidating the computational principles underlying human analogical reasoning; however, this work has generally relied on manually constructed representations. Here we present visiPAM (visual Probabilistic Analogical Mapping), a model of visual reasoning that synthesizes these two approaches. VisiPAM employs learned representations derived directly from naturalistic visual inputs, coupled with a similarity-based mapping operation derived from cognitive theories of human reasoning. We show that without any direct training, visiPAM outperforms a state-of-the-art deep learning model on an analogical mapping task. In addition, visiPAM closely matches the pattern of human performance on a novel task involving mapping of 3D objects across disparate categories.
Submitted 29 September, 2022;
originally announced September 2022.
-
A large-scale kinematic study of molecular gas in high-z cluster galaxies: Evidence for high levels of kinematic asymmetry
Authors:
W. J. Cramer,
A. G. Noble,
K. Massingill,
J. Cairns,
D. L. Clements,
M. C. Cooper,
R. Demarco,
J. Matharu,
M. McDonald,
A. Muzzin,
J. Nantais,
G. Rudnick,
H. Übler,
E. van Kampen,
T. M. A. Webb,
G. Wilson,
H. K. C. Yee
Abstract:
We investigate the resolved kinematics of the molecular gas, as traced by ALMA in CO (2-1), of 25 cluster member galaxies across three different clusters at a redshift of $z\sim1.6$. This is the first large-scale analysis of the molecular gas kinematics of cluster galaxies at this redshift. By separately estimating the rotation curve of the approaching and receding side of each galaxy via kinematic modeling, we quantify the difference in total circular velocity to characterize the overall kinematic asymmetry of each galaxy. Of the 14 galaxies in our sample that we are able to model, 3 have degrees of asymmetry similar to those observed in field galaxies at similar redshift. However, this leaves 11/14 galaxies in our sample with significantly higher asymmetry, and some of these galaxies have degrees of asymmetry up to $\sim$50 times higher than field galaxies observed at similar redshift. Some of these extreme cases also have one-sided tail-like morphology seen in the molecular gas, supporting a scenario of tidal and/or ram pressure interaction. Such stark differences in the kinematic asymmetry in clusters versus the field suggest that the evolutionary influence of dense environments, established as being a major driver of galaxy evolution at low redshift, is also active in the high-redshift universe.
Submitted 14 September, 2022;
originally announced September 2022.
-
Dislocation healing during hydrogen absorption and desorption in palladium
Authors:
T. A. Webb,
C. J. Webb,
E. MacA. Gray
Abstract:
An in-situ neutron diffraction investigation of the annealing and healing of dislocations in the bulk Pd-D2 system was carried out. Lattice misfit between the alpha and beta hydride phases produces dislocations during the phase transition in either direction, relieving elastic strain, which is reflected in reduced pressure hysteresis compared to the spinodal hysteresis. The effects on the dislocation density of annealing the metal under vacuum, of annealing in the beta hydride phase, and of the phase transformation itself were investigated by measuring diffraction peak breadths during annealing and hydrogen cycling. During annealing under vacuum the dislocations were removed at a lower temperature than was previously reported, but annealing in the beta phase gave nearly the same result. However, when cycling hydrogen in and out of the sample, the dislocation density decreased much faster with increasing temperature compared to annealing. In other words the process of phase transformation allows for healing of dislocations at lower temperatures than would be required to anneal them purely by heating. This healing effect was observed during both absorption and desorption. This result illuminates the mechanism by which misfit dislocations can be healed at the same rate that they are created in a sample undergoing absorption-desorption cycling, as proposed in theoretical models of the origin of pressure hysteresis.
Submitted 18 April, 2022;
originally announced April 2022.
-
An Assessment of the In-Situ Growth of the Intracluster Light in the High Redshift Galaxy Cluster SpARCS1049+56
Authors:
Capucine Barfety,
Félix-Antoine Valin,
Tracy M. A. Webb,
Min Yun,
Heath Shipley,
Kyle Boone,
Brian Hayden,
Julie Hlavacek-Larrondo,
Adam Muzzin,
Allison G. Noble,
Saul Perlmutter,
Carter Rhea,
Gillian Wilson,
H. K. C. Yee
Abstract:
The formation of the stellar mass within galaxy cluster cores is a poorly understood process. It features the complicated physics of cooling flows, AGN feedback, star formation and more. Here, we study the growth of the stellar mass in the vicinity of the Brightest Cluster Galaxy (BCG) in a z = 1.7 cluster, SpARCS1049+56. We synthesize a reanalysis of existing HST imaging, a previously published measurement of the star formation rate, and the results of new radio molecular gas spectroscopy. These analyses represent the past, present and future star formation respectively within this system. We show that a large amount of stellar mass -- between $(2.2 \pm 0.5) \times 10^{10} \: M_\odot$ and $(6.6 \pm 1.2) \times 10^{10}\: M_\odot$ depending on the data processing -- exists in a long and clumpy tail-like structure that lies roughly 12 kpc off the BCG. Spatially coincident with this stellar mass is a similarly massive reservoir ($(1.0 \pm 0.7) \times 10^{11} \: M_\odot$) of molecular gas that we suggest is the fuel for the immense star formation rate of $860 \pm 130 \: M_\odot$/yr, as measured by infrared observations. Hlavacek-Larrondo et al. 2021 surmised that massive, runaway cooling of the hot intracluster X-ray gas was feeding this star formation, a process that had not been observed before at high-redshift. We conclude, based on the amount of fuel and current stars, that this event may be rare in the lifetime of a cluster, producing roughly 15 to 21% of the Intracluster Light (ICL) mass in one go, though perhaps a common event for all galaxy clusters.
Submitted 25 March, 2022;
originally announced March 2022.
-
A Hardware-Aware System for Accelerating Deep Neural Network Optimization
Authors:
Anthony Sarah,
Daniel Cummings,
Sharath Nittur Sridhar,
Sairam Sundaresan,
Maciej Szankin,
Tristan Webb,
J. Pablo Munoz
Abstract:
Recent advances in Neural Architecture Search (NAS) which extract specialized hardware-aware configurations (a.k.a. "sub-networks") from a hardware-agnostic "super-network" have become increasingly popular. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still largely under-explored. For example, some recent network morphism techniques allow a super-network to be trained once and then have hardware-specific networks extracted from it as needed. These methods decouple the super-network training from the sub-network search and thus decrease the computational burden of specializing to different hardware platforms. We propose a comprehensive system that automatically and efficiently finds sub-networks from a pre-trained super-network that are optimized to different performance metrics and hardware configurations. By combining novel search tactics and algorithms with intelligent use of predictors, we significantly decrease the time needed to find optimal sub-networks from a given super-network. Further, our approach does not require the super-network to be refined for the target task a priori, thus allowing it to interface with any super-network. We demonstrate through extensive experiments that our system works seamlessly with existing state-of-the-art super-network training methods in multiple domains. Moreover, we show how novel search tactics paired with evolutionary algorithms can accelerate the search process for ResNet50, MobileNetV3 and Transformer while maintaining objective space Pareto front diversity and demonstrate an 8x faster search result than the state-of-the-art Bayesian optimization WeakNAS approach.
Submitted 25 February, 2022;
originally announced February 2022.
-
Operationalizing Convolutional Neural Network Architectures for Prohibited Object Detection in X-Ray Imagery
Authors:
Thomas W. Webb,
Neelanjan Bhowmik,
Yona Falinie A. Gaus,
Toby P. Breckon
Abstract:
Recent advances in deep Convolutional Neural Networks (CNNs) have brought insight into the automation of X-ray security screening for aviation security and beyond. Here, we explore the viability of two recent end-to-end object detection CNN architectures, Cascade R-CNN and FreeAnchor, for prohibited item detection by balancing processing time and the impact of image data compression from an operational viewpoint. Overall, we achieve maximal detection performance using a FreeAnchor architecture with a ResNet50 backbone, obtaining mean Average Precision (mAP) of 87.7 and 85.8 on the OPIXray and SIXray benchmark datasets, respectively, showing superior performance over prior work on both. With fewer parameters and less training time, FreeAnchor achieves the highest detection inference speed of ~13 fps (3.9 ms per image). Furthermore, we evaluate the impact of lossy image compression upon detector performance. The CNN models display substantial resilience to lossy compression, resulting in only a 1.1% decrease in mAP at the JPEG compression level of 50. Additionally, a thorough evaluation of data augmentation techniques is provided, including adaptations of the MixUp and CutMix strategies as well as other standard transformations, further improving the detection accuracy.
Submitted 10 October, 2021;
originally announced October 2021.
-
Using spin alignment of inelastically-excited fast beams to make spin assignments: the spectroscopy of 13O as a test case
Authors:
R. J. Charity,
T. B. Webb,
J. M. Elson,
D. E. M. Hoff,
C. D. Pruitt,
L. G. Sobotka,
P. Navratil,
G. Hupin,
K. Kravvaris,
S. Quaglioni,
K. W. Brown,
G. Cerizza,
J. Estee,
W. G. Lynch,
J. Manfredi,
P. Morfouace,
C. Santamaria,
S. Sweany,
M. B. Tsang,
T. Tsang,
K. Zhu,
S. A. Kuvin,
D. McNeel,
J. Smith,
A. H. Wuosmaa
, et al. (1 additional author not shown)
Abstract:
Excited states in $^{13}$O were investigated using inelastic scattering of an E/A=69.5-MeV $^{13}$O beam off a $^{9}$Be target. The excited states were identified in the invariant-mass spectra of the decay products. Both single-proton and sequential two-proton decays of the excited states were examined. For a number of the excited states, the protons were emitted with strong anisotropy, with emissions transverse to the beam axis favored. The measured proton-decay angular distributions were compared to predictions from distorted-wave Born-approximation (DWBA) calculations of the spin alignment, which was shown to be largely independent of the excitation mechanism. The deduced $^{13}$O level scheme is compared to ab initio no-core shell model with continuum (NCSMC) predictions. The lowest-energy excited states decay isotropically, consistent with predictions of strong proton $1s_{1/2}$ structure. Above these states in the level scheme, we observed a number of higher-spin states not predicted within the model. These are possibly associated with rotational bands built on deformed cluster configurations predicted by antisymmetrized molecular dynamics (AMD) calculations. The spin alignment mechanism is shown to be useful for making spin assignments and may have widespread use.
Submitted 7 July, 2021;
originally announced July 2021.
-
Modelling the development of counting with memory-augmented neural networks
Authors:
Zack Dulberg,
Taylor Webb,
Jonathan Cohen
Abstract:
Learning to count is an important example of the broader human capacity for systematic generalization, and the development of counting is often characterized by an inflection point when children rapidly acquire proficiency with the procedures that support this ability. We aimed to model this process by training a reinforcement learning agent to select N items from a binary vector when instructed (known as the give-$N$ task). We found that a memory-augmented modular network architecture based on the recently proposed Emergent Symbol Binding Network (ESBN) exhibited an inflection during learning that resembled human development. This model was also capable of systematic extrapolation outside the range of its training set - for example, trained only to select between 1 and 10 items, it could succeed at selecting 11 to 15 items as long as it could make use of an arbitrary count sequence of at least that length. The close parallels to child development and the capacity for extrapolation suggest that our model could shed light on the emergence of systematicity in humans.
Submitted 21 May, 2021;
originally announced May 2021.
-
The HST See Change Program: I. Survey Design, Pipeline, and Supernova Discoveries
Authors:
Brian Hayden,
David Rubin,
Kyle Boone,
Greg Aldering,
Jakob Nordin,
Mark Brodwin,
Susana Deustua,
Sam Dixon,
Parker Fagrelius,
Andy Fruchter,
Peter Eisenhardt,
Anthony Gonzalez,
Ravi Gupta,
Isobel Hook,
Chris Lidman,
Kyle Luther,
Adam Muzzin,
Zachary Raha,
Pilar Ruiz-Lapuente,
Clare Saunders,
Caroline Sofiatti,
Adam Stanford,
Nao Suzuki,
Tracy Webb,
Steven C. Williams
, et al. (31 additional authors not shown)
Abstract:
The See Change survey was designed to make $z>1$ cosmological measurements by efficiently discovering high-redshift Type Ia supernovae (SNe Ia) and improving cluster mass measurements through weak lensing. This survey observed twelve galaxy clusters with the Hubble Space Telescope spanning the redshift range $z=1.13$ to $1.75$, discovering 57 likely transients and 27 likely SNe Ia at $z\sim 0.8-2.3$. As in similar previous surveys (Dawson et al. 2009), this proved to be a highly efficient use of HST for SN observations; the See Change survey additionally tested the feasibility of maintaining, or further increasing, the efficiency at yet higher redshifts, where we have less detailed information on the expected cluster masses and star-formation rates. We find that the resulting number of SNe Ia per orbit is a factor of $\sim 8$ higher than for a field search, and 45% of our orbits contained an active SN Ia within 22 rest-frame days of peak, with one of the clusters by itself yielding 6 of the SNe Ia. We present the survey design, pipeline, and SN discoveries. Novel features include fully blinded SN searches, the first random forest candidate classifier for undersampled IR data (with a 50% detection threshold within 0.05 magnitudes of human searchers), real-time forward-modeling photometry of candidates, and semi-automated photometric classifications and follow-up forecasts. We also describe the spectroscopic follow-up, instrumental in measuring host-galaxy redshifts. The cosmology analysis of our sample will be presented in a companion paper.
Submitted 24 March, 2021;
originally announced March 2021.
-
A CO Survey of SpARCS Star-Forming Brightest Cluster Galaxies: Evidence for Uniformity in BCG Molecular Gas Processing Across Cosmic Time
Authors:
Delaney A. Dunne,
Tracy M. A. Webb,
Allison Noble,
Christopher Lidman,
Heath Shipley,
Adam Muzzin,
Gillian Wilson,
H. K. C. Yee
Abstract:
We present ALMA CO (2-1) detections of 24 star-forming Brightest Cluster Galaxies (BCGs) over $0.2<z<1.2$, constituting the largest and most distant sample of molecular gas measurements in BCGs to date. The BCGs are selected from the Spitzer Adaptation of the Red-Sequence Cluster Survey (SpARCS) to be IR-bright and therefore star-forming. We find that molecular gas is common in star-forming BCGs, detecting CO in 80% of our target sample of 30 objects. We additionally provide measurements of the star formation rate (SFR) and stellar mass, calculated from existing MIPS 24 $\mu$m and IRAC 3.6 $\mu$m fluxes, respectively. We find these galaxies have molecular gas masses of $0.7-11.0\times 10^{10}\ \mathrm{M}_\odot$, comparable to other BCGs in this redshift range, and specific star formation rates which trace the Elbaz et al. (2011) Main Sequence. We compare our BCGs to those of the lower-redshift, cooling-flow BCG sample assembled by Edge (2001) and find that at $z \lesssim 0.6$ the two samples show very similar correlations between their gas masses and specific SFRs. We suggest that, in this redshift regime, the $\sim10\%$ (Webb et al., 2015) of BCGs that are star-forming process any accreted molecular gas into stars through means that are agnostic to both their redshift and their cluster mass.
Submitted 3 March, 2021;
originally announced March 2021.
-
Emergent Symbols through Binding in External Memory
Authors:
Taylor W. Webb,
Ishan Sinha,
Jonathan D. Cohen
Abstract:
A key aspect of human intelligence is the ability to infer abstract rules directly from high-dimensional sensory data, and to do so given only a limited amount of training experience. Deep neural network algorithms have proven to be a powerful tool for learning directly from high-dimensional data, but currently lack this capacity for data-efficient induction of abstract rules, leading some to argue that symbol-processing mechanisms will be necessary to account for this capacity. In this work, we take a step toward bridging this gap by introducing the Emergent Symbol Binding Network (ESBN), a recurrent network augmented with an external memory that enables a form of variable-binding and indirection. This binding mechanism allows symbol-like representations to emerge through the learning process without the need to explicitly incorporate symbol-processing machinery, enabling the ESBN to learn rules in a manner that is abstracted away from the particular entities to which those rules apply. Across a series of tasks, we show that this architecture displays nearly perfect generalization of learned rules to novel entities given only a limited number of training examples, and outperforms a number of other competitive neural network architectures.
Submitted 9 March, 2021; v1 submitted 28 December, 2020;
originally announced December 2020.
-
A Memory-Augmented Neural Network Model of Abstract Rule Learning
Authors:
Ishan Sinha,
Taylor W. Webb,
Jonathan D. Cohen
Abstract:
Human intelligence is characterized by a remarkable ability to infer abstract rules from experience and apply these rules to novel domains. As such, designing neural network algorithms with this capacity is an important step toward the development of deep learning systems with more human-like intelligence. However, doing so is a major outstanding challenge, one that some argue will require neural networks to use explicit symbol-processing mechanisms. In this work, we focus on neural networks' capacity for arbitrary role-filler binding, the ability to associate abstract "roles" to context-specific "fillers," which many have argued is an important mechanism underlying the ability to learn and apply rules abstractly. Using a simplified version of Raven's Progressive Matrices, a hallmark test of human intelligence, we introduce a sequential formulation of a visual problem-solving task that requires this form of binding. Further, we introduce the Emergent Symbol Binding Network (ESBN), a recurrent neural network model that learns to use an external memory as a binding mechanism. This mechanism enables symbol-like variable representations to emerge through the ESBN's training process without the need for explicit symbol-processing machinery. We empirically demonstrate that the ESBN successfully learns the underlying abstract rule structure of our task and perfectly generalizes this rule structure to novel fillers.
Submitted 14 December, 2020; v1 submitted 13 December, 2020;
originally announced December 2020.
-
The GOGREEN and GCLASS Surveys: First Data Release
Authors:
Michael L. Balogh,
Remco F. J. van der Burg,
Adam Muzzin,
Gregory Rudnick,
Gillian Wilson,
Kristi Webb,
Andrea Biviano,
Kevin Boak,
Pierluigi Cerulo,
Jeffrey Chan,
M. C. Cooper,
David G. Gilbank,
Stephen Gwyn,
Chris Lidman,
Jasleen Matharu,
Sean L. McGee,
Lyndsay Old,
Irene Pintos-Castro,
Andrew M. M. Reeves,
Heath Shipley,
Benedetta Vulcani,
Howard K. C. Yee,
M. Victoria Alonso,
Callum Bellhouse,
Kevin C. Cooke
, et al. (20 additional authors not shown)
Abstract:
We present the first public data release of the GOGREEN and GCLASS surveys of galaxies in dense environments, spanning a redshift range $0.8<z<1.5$. The surveys consist of deep, multiwavelength photometry and extensive Gemini GMOS spectroscopy of galaxies in 26 overdense systems ranging in halo mass from small groups to the most massive clusters. The objective of both projects was primarily to understand how the evolution of galaxies is affected by their environment, and to determine the physical processes that lead to the quenching of star formation. There was an emphasis on obtaining unbiased spectroscopy over a wide stellar mass range ($M\gtrsim 2\times 10^{10}~\mathrm{M}_\odot$), throughout and beyond the cluster virialized regions. The final spectroscopic sample includes 2771 unique objects, of which 2257 have reliable spectroscopic redshifts. Of these, 1704 have redshifts in the range $0.8<z<1.5$, and nearly 800 are confirmed cluster members. Imaging spans the full optical and near-infrared wavelength range, at depths comparable to the UltraVISTA survey, and includes \textit{HST}/WFC3 F160W (GOGREEN) and F140W (GCLASS). This data release includes fully reduced images and spectra, with catalogues of advanced data products including redshifts, line strengths, star formation rates, stellar masses and rest-frame colours. Here we present an overview of the data, including an analysis of the spectroscopic completeness and redshift quality.
Submitted 28 September, 2020;
originally announced September 2020.
-
Evidence of runaway gas cooling in the absence of supermassive black hole feedback at the epoch of cluster formation
Authors:
J. Hlavacek-Larrondo,
C. L. Rhea,
T. Webb,
M. McDonald,
A. Muzzin,
G. Wilson,
K. Finner,
F. Valin,
N. Bonaventura,
M. Cooper,
A. C. Fabian,
M. -L. Gendron-Marsolais,
M. J. Jee,
C. Lidman,
M. Mezcua,
A. Noble,
H. R. Russell,
J. Surace,
A. Trudeau,
H. K. C. Yee
Abstract:
Cosmological simulations, as well as mounting evidence from observations, have shown that supermassive black holes play a fundamental role in regulating the formation of stars throughout cosmic time. This has been clearly demonstrated in the case of galaxy clusters in which powerful feedback from the central black hole is preventing the hot intracluster gas from cooling catastrophically, thus reducing the expected star formation rates by orders of magnitude. These conclusions have however been almost entirely based on nearby clusters. Based on new Chandra X-ray observations, we present the first observational evidence for massive, runaway cooling occurring in the absence of supermassive black hole feedback in the high-redshift galaxy cluster SpARCS104922.6+564032.5 ($z=1.709$). The hot intracluster gas appears to be fueling a massive burst of star formation ($\approx900$~M$_\odot$yr$^{-1}$) that is offset by dozens of kpc from the central galaxy. The burst is co-spatial with the coolest intracluster gas but not associated with any galaxy in the cluster. In less than 100 million years, such runaway cooling can form the same amount of stars as in the Milky Way. Intracluster stars are therefore not only produced by tidal stripping and the disruption of cluster galaxies, but can also be produced by runaway cooling of hot intracluster gas at early times. Overall, these observations show the dramatic impact when supermassive black hole feedback fails to operate in clusters. They indicate that in the highest overdensities such as clusters and proto-clusters, runaway cooling may be a new and important mechanism for fueling massive bursts of star formation in the early universe.
Submitted 30 July, 2020;
originally announced July 2020.
-
Learning Representations that Support Extrapolation
Authors:
Taylor W. Webb,
Zachary Dulberg,
Steven M. Frankland,
Alexander A. Petrov,
Randall C. O'Reilly,
Jonathan D. Cohen
Abstract:
Extrapolation -- the ability to make inferences that go beyond the scope of one's experiences -- is a hallmark of human intelligence. By contrast, the generalization exhibited by contemporary neural network algorithms is largely limited to interpolation between data points in their training corpora. In this paper, we consider the challenge of learning representations that support extrapolation. We introduce a novel visual analogy benchmark that allows the graded evaluation of extrapolation as a function of distance from the convex domain defined by the training data. We also introduce a simple technique, temporal context normalization, that encourages representations that emphasize the relations between objects. We find that this technique enables a significant improvement in the ability to extrapolate, considerably outperforming a number of competitive techniques.
Submitted 6 September, 2023; v1 submitted 9 July, 2020;
originally announced July 2020.
-
Constraining the Mass of the Emerging Galaxy Cluster SpARCS1049+56 at z=1.71 with Infrared Weak Lensing
Authors:
Kyle Finner,
M. James Jee,
Tracy Webb,
Gillian Wilson,
Saul Perlmutter,
Adam Muzzin,
Julie Hlavacek-Larrondo
Abstract:
In the hierarchical structure formation model of the universe, galaxy clusters are assembled through a series of mergers. Accordingly, it is expected that galaxy clusters in the early universe are actively forming and dynamically young. Located at a high redshift of z=1.71, SpARCS1049+56 offers a unique look into the galaxy cluster formation process. This cluster has been shown to be rich in cluster galaxies and to have intense star formation. Its high redshift pushes a weak-lensing analysis beyond the regime of the optical spectrum into that of the infrared. Equipped with deep Hubble Space Telescope Wide Field Camera 3 UVIS and IR observations, we present a weak-lensing characterization of SpARCS1049+56. As few IR weak-lensing studies have been performed, we discuss the details of PSF modeling and galaxy shape measurement for an IR weak-lensing procedure and the systematics that come with the territory. It will be critical to understand these systematics in future weak-lensing studies in the IR with the next generation space telescopes such as JWST, Euclid, and WFIRST. Through a careful analysis, the mass distribution of this young galaxy cluster is mapped and the convergence peak is detected at a $3.3\sigma$ level. The weak-lensing mass of the cluster is estimated to be $(3.5\pm1.2)\times10^{14}\ \text{M}_\odot$ and is consistent with the mass derived from a mass-richness scaling relation. This mass is extreme for a cluster at such a high redshift and suggests that SpARCS1049+56 is rare in the standard $\Lambda$CDM universe.
Submitted 27 February, 2020; v1 submitted 5 February, 2020;
originally announced February 2020.
-
Particle decays of levels in $^{11,12}$N and $^{12}$O investigated with the invariant-mass method
Authors:
T. B. Webb,
R. J. Charity,
J. M. Elson,
D. E. M. Hoff,
C. D. Pruitt,
L. G. Sobotka,
K. W. Brown,
J. Barney,
G. Cerizza,
J. Estee,
G. Jhang,
W. G. Lynch,
J. Manfredi,
P. Morfouace,
C. Santamaria,
S. Sweany,
M. B. Tsang,
T. Tsang,
S. M. Wang,
Y. Zhang,
K. Zhu,
S. A. Kuvin,
D. McNeel,
J. Smith,
A. H. Wuosmaa
, et al. (1 additional author not shown)
Abstract:
Particle-decaying states of the light nuclei $^{11,12}$N and $^{12}$O were studied using the invariant-mass method. The decay energies and intrinsic widths of a number of states were measured, and the momentum correlations of three-body decaying states were considered. A second 2$p$-decaying 2$^+$ state of $^{12}$O was observed for the first time, and a higher energy $^{12}$O state was observed in the 4$p$+2$α$ decay channel. This 4$p$+2$α$ channel also contains contributions from fission-like decay paths, including $^6$Be$_{g.s.}$+$^{6}$Be$_{g.s.}$. Analogs to these states in $^{12}$O were found in $^{12}$N in the 2$p$+$^{10}$B and 2$p$+$α$+$^6$Li channels. The momentum correlations for the prompt 2$p$ decay of $^{12}$O$_{g.s.}$ were found to be nearly identical to those of $^{16}$Ne$_{g.s.}$, and the correlations for the new 2$^+$ state were found to be consistent with sequential decay through excited states in $^{11}$N. The momentum correlations for the 2$^+_1$ state in $^{12}$O provide a new value for the $^{11}$N ground-state energy. The states in $^{12}$N/$^{12}$O that belong to the $A$=12 isobaric sextet do not deviate from the quadratic isobaric multiplet mass equation (IMME) form.
Submitted 10 April, 2020; v1 submitted 26 June, 2019;
originally announced June 2019.
-
Multiwavelength radio observations of a Brightest Cluster Galaxy at z=1.71: Detection of a modest Active Galactic Nucleus and evidence for extended star formation
Authors:
Ariane Trudeau,
Tracy Webb,
Julie Hlavacek-Larrondo,
Allison Noble,
Marie-Lou Gendron-Marsolais,
Christopher Lidman,
Mar Mezcua,
Adam Muzzin,
Gillian Wilson,
H. K. C. Yee
Abstract:
We present deep, multiwavelength radio observations of SpARCS104922.6+564032.5, a z = 1.71 galaxy cluster with a starbursting core. Observations were made with the Karl G. Jansky Very Large Array (JVLA) in 3 bands: 1-2 GHz, 4-8 GHz and 8-12 GHz. We detect a radio source coincident with the Brightest Cluster Galaxy (BCG) that has a spectral index of $α=0.44\pm 0.29$ and is indicative of emission from an Active Galactic Nucleus. The radio luminosity is consistent with the average luminosity of the lower redshift BCG sample, but the flux densities are 6σ below the predicted values of the star-forming Spectral Energy Distribution based on far infrared data. Our new fit fails to simultaneously describe the far infrared and radio fluxes. This, coupled with the fact that no other bright source is detected in the vicinity of the BCG, implies that the star formation region, traced by the infrared emission, is extended or clumpy and not located directly within the BCG. Thus, we suggest that the star-forming core might not be driven by a single major wet merger, but rather by several smaller galaxies stripped of their gas or by a displaced cooling flow, although more data are needed to confirm any of those scenarios.
Submitted 14 May, 2019;
originally announced May 2019.
-
First observation of unbound $^{11}$O, the mirror of the halo nucleus $^{11}$Li
Authors:
T. B. Webb,
S. M. Wang,
K. W. Brown,
R. J. Charity,
J. M. Elson,
J. Barney,
G. Cerizza,
Z. Chajecki,
J. Estee,
D. E. M. Hoff,
S. A. Kuvin,
W. G. Lynch,
J. Manfredi,
D. McNeel,
P. Morfouace,
W. Nazarewicz,
C. D. Pruitt,
C. Santamaria,
J. Smith,
L. G. Sobotka,
S. Sweany,
C. Y. Tsang,
M. B. Tsang,
A. H. Wuosmaa,
Y. Zhang
, et al. (1 additional author not shown)
Abstract:
The structure of the extremely proton-rich nucleus $^{11}_{~8}$O$_3$, the mirror of the two-neutron halo nucleus $^{11}_{~3}$Li$_8$, has been studied experimentally for the first time. Following two-neutron knockout reactions with a $^{13}$O beam, the $^{11}$O decay products were detected after two-proton emission and used to construct an invariant-mass spectrum. A broad peak of width $\sim$3\,MeV was observed. Within the Gamow coupled-channel approach, it was concluded that this peak is a multiplet with contributions from the four-lowest $^{11}$O resonant states: $J^π$=3/2$^-_1$, 3/2$^-_2$, 5/2$^+_1$, and 5/2$^+_2$. The widths and configurations of these states show strong, non-monotonic dependencies on the depth of the $p$-$^9$C potential. This unusual behavior is due to the presence of a broad threshold resonant state in $^{10}$N, which is an analog of the virtual state in $^{10}$Li in the presence of the Coulomb potential. After optimizing the model to the data, only a moderate isospin asymmetry between ground states of $^{11}$O and $^{11}$Li was found.
Submitted 20 March, 2019; v1 submitted 20 December, 2018;
originally announced December 2018.
-
Density wave probes cuprate quantum phase transition
Authors:
Tatiana A. Webb,
Michael C. Boyer,
Yi Yin,
Debanjan Chowdhury,
Yang He,
Takeshi Kondo,
T. Takeuchi,
H. Ikuta,
Eric W. Hudson,
Jennifer E. Hoffman,
Mohammad H. Hamidian
Abstract:
In cuprates, the strong correlations in proximity to the antiferromagnetic Mott insulating state give rise to an array of unconventional phenomena beyond high temperature superconductivity. Developing a complete description of the ground state evolution is crucial to decoding the complex phase diagram. Here we use the structure of broken translational symmetry, namely $d$-form factor charge modulations in (Bi,Pb)$_2$(Sr,La)$_2$CuO$_{6+δ}$, as a probe of the ground state reorganization that occurs at the transition from truncated Fermi arcs to a large Fermi surface. We use real space imaging of nanoscale electronic inhomogeneity as a tool to access a range of dopings within each sample, and we definitively validate the spectral gap $Δ$ as a proxy for local hole doping. From the $Δ$-dependence of the charge modulation wavevector, we discover a commensurate to incommensurate transition that is coincident with the Fermi surface transition from arcs to large hole pocket, demonstrating the qualitatively distinct nature of the electronic correlations governing the two sides of this quantum phase transition. Furthermore, the doping dependence of the incommensurate wavevector on the overdoped side is at odds with a simple Fermi surface driven instability.
Submitted 15 May, 2019; v1 submitted 14 November, 2018;
originally announced November 2018.
-
Resolving CO (2-1) in z~1.6 Gas-Rich Cluster Galaxies with ALMA: Rotating Molecular Gas Disks with Possible Signatures of Gas Stripping
Authors:
A. G. Noble,
A. Muzzin,
M. McDonald,
G. Rudnick,
J. Matharu,
M. C. Cooper,
R. Demarco,
C. Lidman,
J. Nantais,
E. van Kampen,
T. M. A. Webb,
G. Wilson,
H. K. C. Yee
Abstract:
We present the first spatially-resolved observations of molecular gas in a sample of cluster galaxies beyond z>0.1. Using ALMA, we detect CO (2-1) in 8 z~1.6 cluster galaxies, all within a single 70" primary beam, in under 3 hours of integration time. The cluster, SpARCS-J0225, is replete with gas-rich galaxies in close proximity. It thus affords an efficient multiplexing strategy to build up the first sample of resolved CO in distant galaxy clusters. Mapping out the kinematic structure and morphology of the molecular gas on 3.5 kpc scales reveals rotating gas disks in the majority of the galaxies, as evidenced by smooth velocity gradients. Detailed velocity maps also uncover kinematic peculiarities, including a central gas void, a merger, and a few one-sided gas tails. We compare the extent of the molecular gas component to that of the optical stellar component, measured with rest-frame optical HST imaging. We find that the cluster galaxies, while broadly consistent with a ratio of unity for stellar-to-gas effective radii, have a moderately larger ratio compared to the coeval field; this is consistent with the more pronounced trend in the low-redshift Universe. Thus, at first glance, the z~1.6 cluster galaxies generally look like galaxies infalling from the field, with typical main-sequence star formation rates and massive molecular gas reservoirs situated in rotating disks. However, there are potentially important differences from their field counterparts, including elevated gas fractions, slightly smaller CO disks, and possible asymmetric gas tails. Taken in tandem, these signatures are tentative evidence for gas-stripping in the z~1.6 cluster. However, the current sample size of spatially-resolved molecular gas in galaxies at high redshift is small, and verification of these trends will require much larger samples of both cluster and field galaxies.
Submitted 10 September, 2018;
originally announced September 2018.
-
Open-source automated chemical vapor deposition system for the production of two-dimensional nanomaterials
Authors:
Lizandra Williams-Godwin,
Dale Brown,
Richard Livingston,
Tyler Webb,
Lynn Karriem,
Elton Graugnard,
David Estrada
Abstract:
The study of two-dimensional (2D) materials is a rapidly growing area within nanomaterial research. However, the high equipment costs, which include the processing systems necessary for creating these materials, can be a barrier to entry for some researchers interested in studying these novel materials. Such process systems include those used for chemical vapor deposition. This article presents the first open-source design for an automated chemical vapor deposition system that can be built for less than a third of the cost of a similar commercial system. Our design can be easily customized and expanded on, depending upon the needs of the user. With a process chamber built as described, we demonstrate that a variety of 2D nanomaterials and their heterostructures can be grown via chemical vapor deposition. Specifically, our experimental results demonstrate the capability of this open-source design in producing high-quality 2D nanomaterials such as graphene and tungsten disulfide, which are at the forefront of research in emerging semiconductor devices, sensors, and energy storage applications.
Submitted 2 July, 2018;
originally announced July 2018.
-
The Evolution of Environmental Quenching Timescales to $z\sim1.6$
Authors:
R. Foltz,
G. Wilson,
A. Muzzin,
M. C. Cooper,
J. Nantais,
R. F. J. van der Burg,
P. Cerulo,
J. Chan,
S. P. Fillingham,
J. Surace,
T. Webb,
A. Noble,
M. Lacy,
M. McDonald,
G. Rudnick,
C. Lidman,
R. Demarco,
J. Hlavacek-Larrondo,
H. K. C. Yee,
S. Perlmutter,
B. Hayden
Abstract:
Using a sample of 4 galaxy clusters at $1.35 < z < 1.65$ and 10 galaxy clusters at $0.85 < z < 1.35$, we measure the environmental quenching timescale, $t_Q$, corresponding to the time required after a galaxy is accreted by a cluster for it to fully cease star formation. Cluster members are selected by a photometric-redshift criterion, and categorized as star-forming, quiescent, or intermediate according to their dust-corrected rest-frame colors and magnitudes. We employ a "delayed-then-rapid" quenching model that relates a simulated cluster mass accretion rate to the observed numbers of each type of galaxy in the cluster to constrain $t_Q$. For galaxies of mass $M_* \gtrsim 10^{10.5}~ \mathrm{M}_\odot$, we find a quenching timescale of $t_Q=$ 1.24 Gyr in the $z\sim1.5$ cluster sample, and $t_Q=$ 1.50 Gyr at $z\sim1$. Using values drawn from the literature, we compare the redshift evolution of $t_Q$ to timescales predicted for different physical quenching mechanisms. We find $t_Q$ to depend on host halo mass such that quenching occurs over faster timescales in clusters relative to groups, suggesting that properties of the host halo are responsible for quenching high-mass galaxies. Between $z=0$ and $z=1.5$, we find that $t_Q$ evolves faster than the molecular gas depletion timescale and slower than an SFR-outflow timescale, but is consistent with the evolution of the dynamical time. This suggests that environmental quenching in these galaxies is driven by the motion of satellites relative to the cluster environment, although due to uncertainties in the atomic gas budget at high redshift, we cannot rule out quenching due to simple gas depletion.
Submitted 14 March, 2018; v1 submitted 8 March, 2018;
originally announced March 2018.
-
Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning
Authors:
Scott Cyphers,
Arjun K. Bansal,
Anahita Bhiwandiwalla,
Jayaram Bobba,
Matthew Brookhart,
Avijit Chakraborty,
Will Constable,
Christian Convey,
Leona Cook,
Omar Kanawi,
Robert Kimball,
Jason Knight,
Nikolay Korovaiko,
Varun Kumar,
Yixing Lao,
Christopher R. Lishka,
Jaikrishnan Menon,
Jennifer Myers,
Sandeep Aswath Narayana,
Adam Procter,
Tristan J. Webb
Abstract:
The Deep Learning (DL) community sees many novel topologies published each year. Achieving high performance on each new topology remains challenging, as each requires some level of manual effort. This issue is compounded by the proliferation of frameworks and hardware platforms. The current approach, which we call "direct optimization", requires deep changes within each framework to improve the training performance for each hardware backend (CPUs, GPUs, FPGAs, ASICs) and requires $\mathcal{O}(fp)$ effort, where $f$ is the number of frameworks and $p$ is the number of platforms. While optimized kernels for deep-learning primitives are provided via libraries like Intel Math Kernel Library for Deep Neural Networks (MKL-DNN), there are several compiler-inspired ways in which performance can be further optimized. Building on our experience creating neon (a fast deep learning library on GPUs), we developed Intel nGraph, a soon-to-be open-sourced C++ library to simplify the realization of optimized deep learning performance across frameworks and hardware platforms. Initially-supported frameworks include TensorFlow, MXNet, and the Intel neon framework. Initial backends are Intel Architecture CPUs (CPU), the Intel(R) Nervana Neural Network Processor(R) (NNP), and NVIDIA GPUs. Currently supported compiler optimizations include efficient memory management and data layout abstraction. In this paper, we describe our overall architecture and its core components. In the future, we envision extending nGraph API support to a wider range of frameworks, hardware (including FPGAs and ASICs), and compiler optimizations (training versus inference optimizations, multi-node and multi-device scaling via efficient sub-graph partitioning, and HW-specific compounding of operations).
Submitted 29 January, 2018; v1 submitted 24 January, 2018;
originally announced January 2018.
-
Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks
Authors:
Urs Köster,
Tristan J. Webb,
Xin Wang,
Marcel Nassar,
Arjun K. Bansal,
William H. Constable,
Oğuz H. Elibol,
Scott Gray,
Stewart Hall,
Luke Hornof,
Amir Khosrowshahi,
Carey Kloss,
Ruby J. Pai,
Naveen Rao
Abstract:
Deep neural networks are commonly developed and trained in 32-bit floating point format. Significant gains in performance and energy efficiency could be realized by training and inference in numerical formats optimized for deep learning. Despite advances in limited precision inference in recent years, training of neural networks in low bit-width remains a challenging problem. Here we present the Flexpoint data format, aiming at a complete replacement of 32-bit floating point format training and inference, designed to support modern deep network topologies without modifications. Flexpoint tensors have a shared exponent that is dynamically adjusted to minimize overflows and maximize available dynamic range. We validate Flexpoint by training AlexNet, a deep residual network and a generative adversarial network, using a simulator implemented with the neon deep learning framework. We demonstrate that 16-bit Flexpoint closely matches 32-bit floating point in training all three models, without any need for tuning of model hyperparameters. Our results suggest Flexpoint as a promising numerical format for future hardware for training and inference.
Submitted 2 December, 2017; v1 submitted 6 November, 2017;
originally announced November 2017.
-
Detection of a Substantial Molecular Gas Reservoir in a brightest cluster galaxy at z = 1.7
Authors:
Tracy Webb,
James Lowenthal,
Min Yun,
Allison G. Noble,
Adam Muzzin,
Gillian Wilson,
H. K. C. Yee,
Ryan Cybulski
Abstract:
We report the detection of CO(2-1) emission coincident with the brightest cluster galaxy (BCG) of the high-redshift galaxy cluster SpARCS1049+56, with the Redshift Search Receiver (RSR) on the Large Millimetre Telescope (LMT). We confirm a spectroscopic redshift for the gas of z = 1.7091+/-0.0004, which is consistent with the systemic redshift of the cluster galaxies of z = 1.709. The line is well-fit by a single component Gaussian with a RSR resolution-corrected FWHM of 569+/-63 km/s. We see no evidence for multiple velocity components in the gas, as might be expected from the multiple image components seen in near-infrared imaging with the Hubble Space Telescope. We measure the integrated flux of the line to be 3.6+/-0.3 Jy km/s and, using alpha_CO = 0.8 Msun (K km s^-1 pc^2)^-1 we estimate a total molecular gas mass of 1.1+/-0.1x10^11 Msun and a M_H2/M_star ~ 0.4. This is the largest gas reservoir detected in a BCG above z > 1 to date. Given the infrared-estimated star formation rate of 860+/-130 Msun/yr, this corresponds to a gas depletion timescale of ~0.1Gyr. We discuss several possible mechanisms for depositing such a large gas reservoir to the cluster center -- e.g., a cooling flow, a major galaxy-galaxy merger or the stripping of gas from several galaxies -- but conclude that these LMT data are not sufficient to differentiate between them.
Submitted 5 June, 2017;
originally announced June 2017.
-
Galaxy Merger Candidates in High-Redshift Cluster Environments
Authors:
A. G. Delahaye,
T. M. A. Webb,
J. Nantais,
A. DeGroot,
G. Wilson,
A. Muzzin,
H. K. C. Yee,
R. Foltz,
A. G. Noble,
R. Demarco,
A. Tudorica,
M. C. Cooper,
C. Lidman,
S. Perlmutter,
B. Hayden,
K. Boone,
J. Surace
Abstract:
We compile a sample of spectroscopically- and photometrically-selected cluster galaxies from four high-redshift galaxy clusters ($1.59 < z < 1.71$) from the Spitzer Adaptation of the Red-Sequence Cluster Survey (SpARCS), and a comparison field sample selected from the UKIDSS Deep Survey. Using near-infrared imaging from the \textit{Hubble Space Telescope} we classify potential mergers involving massive ($M_* \geq 3\times 10^{10}\mathrm{M}_\odot$) cluster members by eye, based on morphological properties such as tidal distortions, double nuclei, and projected near neighbors within 20 kpc. With a catalogue of 23 spectroscopic and 32 photometric massive cluster members across the four clusters and 65 spectroscopic and 26 photometric comparable field galaxies, we find that after taking into account contamination from interlopers, $11.0 ^{+7.0}_{-5.6}\%$ of the cluster members are involved in potential mergers, compared to $24.7^{+5.3}_{-4.6}\%$ of the field galaxies. We see no evidence of merger enhancement in the central cluster environment with respect to the field, suggesting that galaxy-galaxy merging is not a stronger source of galaxy evolution in cluster environments compared to the field at these redshifts.
Submitted 30 May, 2017;
originally announced May 2017.
-
ALMA Observations of Gas-Rich Galaxies in z~1.6 Galaxy Clusters: Evidence for Higher Gas Fractions in High-Density Environments
Authors:
A. G. Noble,
M. McDonald,
A. Muzzin,
J. Nantais,
G. Rudnick,
E. van Kampen,
T. M. A. Webb,
G. Wilson,
H. K. C. Yee,
K. Boone,
M. C. Cooper,
A. DeGroot,
A. Delahaye,
R. Demarco,
R. Foltz,
B. Hayden,
C. Lidman,
A. Manilla-Robles,
S. Perlmutter
Abstract:
We present ALMA CO (2-1) detections in 11 gas-rich cluster galaxies at z~1.6, constituting the largest sample of molecular gas measurements in z>1.5 clusters to date. The observations span three galaxy clusters, derived from the Spitzer Adaptation of the Red-sequence Cluster Survey. We augment the >5sigma detections of the CO (2-1) fluxes with multi-band photometry, yielding stellar masses and infrared-derived star formation rates, to place some of the first constraints on molecular gas properties in z~1.6 cluster environments. We measure sizable gas reservoirs of 0.5-2x10^11 solar masses in these objects, with high gas fractions and long depletion timescales, averaging 62% and 1.4 Gyr, respectively. We compare our cluster galaxies to the scaling relations of the coeval field, in the context of how gas fractions and depletion timescales vary with respect to the star-forming main sequence. We find that our cluster galaxies lie systematically off the field scaling relations at z=1.6 toward enhanced gas fractions, at a level of ~4sigma, but have consistent depletion timescales. Exploiting CO detections in lower-redshift clusters from the literature, we investigate the evolution of the gas fraction in cluster galaxies, finding it to mimic the strong rise with redshift in the field. We emphasize the utility of detecting abundant gas-rich galaxies in high-redshift clusters, deeming them as crucial laboratories for future statistical studies.
Submitted 22 June, 2017; v1 submitted 8 May, 2017;
originally announced May 2017.
-
Red but not dead: Unveiling the Star-forming Far-infrared Spectral Energy Distribution of SpARCS Brightest Cluster Galaxies at 0 < z < 1.8
Authors:
N. R. Bonaventura,
T. M. A. Webb,
A. Muzzin,
A. Noble,
C. Lidman,
G. Wilson,
H. K. C. Yee,
J. Geach,
Y. Hezaveh,
D. Shupe,
J. Surace
Abstract:
We present the results of a Spitzer/Herschel infrared photometric analysis of the largest (716) and highest-redshift (z=1.8) sample of Brightest Cluster Galaxies (BCGs), those from the Spitzer Adaptation of the Red-Sequence Cluster Survey (SpARCS). Given the tension that exists between model predictions and recent observations of BCGs at z<2, we aim to uncover the dominant physical mechanism(s) guiding the stellar-mass buildup of this special class of galaxies, the most massive in the Universe uniquely residing at the centres of galaxy clusters. Through a comparison of their stacked, broadband, infrared spectral energy distributions (SEDs) to a variety of SED model templates in the literature, we identify the major sources of their infrared energy output, in multiple redshift bins between 0 < z < 1.8. We derive estimates of various BCG physical parameters from the stacked νLν SEDs, from which we infer a star-forming, as opposed to a 'red and dead' population of galaxies, producing tens to hundreds of solar masses per year down to z=0.5. This discovery challenges the accepted belief that BCGs should only passively evolve through a series of gas-poor, minor mergers since z~4 (De Lucia & Blaizot 2007), but agrees with the improved semi-analytic model of hierarchical structure formation of Tonini et al. (2012), which predicts star-forming BCGs throughout the epoch considered. We attribute the star formation inferred from the stacked infrared SEDs to both major and minor 'wet' (gas-rich) mergers, based on a lack of key signatures (to date) of the cluster cooling flows to which BCG star formation is typically attributed, as well as a number of observational and simulation-based studies that support this scenario.
Submitted 10 April, 2017;
originally announced April 2017.
-
Kinetics of dislocation annealing and the effect of trapped hydrogen, investigated with in-situ diffraction
Authors:
T. A. Webb,
C. J. Webb,
C. V. Tapia-Bastidas,
E. MacA. Gray
Abstract:
In-situ powder diffraction was used to study the annealing of dislocations in the archetypal hydrogen absorbers Pd and LaNi5. The relationship between dislocations and trapped hydrogen was explored using thermally induced desorption. It was found that the dislocations in Pd caused by hydrogen absorption anneal over a wide range of temperatures and that although they start to anneal below 250 $^\circ$C, temperatures well above 750 $^\circ$C are required to fully anneal the metal. It was shown that allowing further time at lower temperatures does not further anneal the metal. It is suggested that this is due to dislocation tangling and pinning, causing different temperatures to be required for different pinning defects. It was found that hydrogen trapped in LaNi5 is released in a wide range of temperatures and it was therefore concluded that hydrogen is trapped in the dislocation strain field and dislocation core as well as vacancies. The direct comparison of deuterium release and dislocation density showed no correlation, in agreement with previous indirect comparisons. Dislocations in LaNi5 were shown to anneal at temperatures as low as 150 $^\circ$C, in contrast to previous reports which suggested more than 500 $^\circ$C was required. This lower annealing temperature for dislocations at least partly explains why low temperature ageing increases the pressure hysteresis in hydrogen cycled LaNi5.
Submitted 18 September, 2016;
originally announced September 2016.
-
The SCUBA-2 Cosmology Legacy Survey: 850um maps, catalogues and number counts
Authors:
J. E. Geach,
J. S. Dunlop,
M. Halpern,
Ian Smail,
P. van der Werf,
D. M. Alexander,
O. Almaini,
I. Aretxaga,
V. Arumugam,
V. Asboth,
M. Banerji,
J. Beanlands,
P. N. Best,
A. W. Blain,
M. Birkinshaw,
E. L. Chapin,
S. C. Chapman,
C-C. Chen,
A. Chrysostomou,
C. Clarke,
D. L. Clements,
C. Conselice,
K. E. K. Coppin,
W. I. Cowley,
A. L. R. Danielson
, et al. (44 additional authors not shown)
Abstract:
We present a catalogue of nearly 3,000 submillimetre sources detected at 850um over ~5 square degrees surveyed as part of the James Clerk Maxwell Telescope (JCMT) SCUBA-2 Cosmology Legacy Survey (S2CLS). This is the largest survey of its kind at 850um, probing a meaningful cosmic volume at the peak of star formation activity and increasing the sample size of submillimetre galaxies selected at 850um by an order of magnitude. We describe the wide 850um survey component of S2CLS, which covers the key extragalactic survey fields: UKIDSS-UDS, COSMOS, Akari-NEP, Extended Groth Strip, Lockman Hole North, SSA22 and GOODS-North. The average 1-sigma depth of S2CLS is 1.2 mJy/beam, approaching the SCUBA-2 850um confusion limit, which we determine to be ~0.8 mJy/beam. We measure the single dish 850um number counts to unprecedented accuracy, reducing the Poisson errors on the differential counts to approximately 4% at S_850~3mJy. With several independent fields, we investigate field-to-field variance, finding that the number counts on 0.5-1 degree scales are generally within 50% of the S2CLS mean for S_850>3mJy, with scatter consistent with the Poisson and estimated cosmic variance uncertainties, although there is a marginal (2-sigma) density enhancement in the GOODS-North field. The observed number counts are in reasonable agreement with recent phenomenological and semi-analytic models. Finally, the large solid angle of S2CLS allows us to measure the bright-end counts: at S_850>10mJy there are approximately ten sources per square degree, and we detect the distinctive up-turn in the number counts indicative of the detection of local sources of 850um emission and strongly lensed high-redshift galaxies. Here we describe the data collection and reduction procedures and present calibrated maps and a catalogue of sources; these are made publicly available.
Submitted 13 July, 2016;
originally announced July 2016.
-
Dumbbell Defects in FeSe Films: A Scanning Tunneling Microscopy and First-Principles Investigation
Authors:
Dennis Huang,
Tatiana A. Webb,
Can-Li Song,
Cui-Zu Chang,
Jagadeesh S. Moodera,
Efthimios Kaxiras,
Jennifer E. Hoffman
Abstract:
The properties of iron-based superconductors (Fe-SCs) can be varied dramatically with the introduction of dopants and atomic defects. As a pressing example, FeSe, parent phase of the highest-$T_c$ Fe-SC, exhibits prevalent defects with atomic-scale "dumbbell" signatures as imaged by scanning tunneling microscopy (STM). These defects spoil superconductivity when their concentration exceeds 2.5%. Resolving their chemical identity is a prerequisite to applications such as nanoscale patterning of superconducting/nonsuperconducting regions in FeSe, as well as fundamental questions such as the mechanism of superconductivity and the path by which the defects destroy it. We use STM and density functional theory to characterize and identify the dumbbell defects. In contrast to previous speculations about Se adsorbates or substitutions, we find that an Fe-site vacancy is the most energetically favorable defect in Se-rich conditions, and reproduces our observed STM signature. Our calculations shed light more generally on the nature of Se capping, the removal of Fe vacancies via annealing, and their ordering into a $\sqrt{5}\times\sqrt{5}$ superstructure in FeSe and related alkali-doped compounds.
Submitted 22 June, 2016;
originally announced June 2016.
-
Evidence for a change in the dominant satellite galaxy quenching mechanism at z=1
Authors:
Michael L. Balogh,
Sean L. McGee,
Angus Mok,
Adam Muzzin,
Remco F. J. van der Burg,
Richard G. Bower,
Alexis Finoguenov,
Henk Hoekstra,
Chris Lidman,
John S. Mulchaey,
Allison Noble,
Laura C. Parker,
Masayuki Tanaka,
David J. Wilman,
Tracy Webb,
Gillian Wilson,
Howard K. C. Yee
Abstract:
We present an analysis of galaxies in groups and clusters at $0.8<z<1.2$, from the GCLASS and GEEC2 spectroscopic surveys. We compute a "conversion fraction" $f_{\rm convert}$ that represents the fraction of galaxies that were prematurely quenched by their environment. For massive galaxies, $M_{\rm star}>10^{10.3}M_\odot$, we find $f_{\rm convert}\sim 0.4$ in the groups and $\sim 0.6$ in the clusters, similar to comparable measurements at $z=0$. This means the time between first accretion into a more massive halo and final star formation quenching is $t_p\sim 2$ Gyr. This is substantially longer than the estimated time required for a galaxy's star formation rate to become zero once it starts to decline, suggesting there is a long delay time during which little differential evolution occurs. In contrast with local observations we find evidence that this delay timescale may depend on stellar mass, with $t_p$ approaching $t_{\rm Hubble}$ for $M_{\rm star}\sim 10^{9.5}M_\odot$. The result suggests that the delay time must not only be much shorter than it is today, but may also depend on stellar mass in a way that is not consistent with a simple evolution in proportion to the dynamical time. Instead, we find the data are well-matched by a model in which the decline in star formation is due to "overconsumption", the exhaustion of a gas reservoir through star formation and expulsion via modest outflows in the absence of cosmological accretion. Dynamical gas removal processes, which are likely dominant in quenching newly accreted satellites today, may play only a secondary role at $z=1$.
Submitted 23 November, 2015;
originally announced November 2015.
-
The Phase Space of z~1.2 SpARCS Clusters: Using Herschel to probe Dust Temperature as a Function of Environment and Accretion History
Authors:
A. G. Noble,
T. M. A. Webb,
H. K. C. Yee,
A. Muzzin,
G. Wilson,
R. F. J. van der Burg,
M. L. Balogh,
D. L. Shupe
Abstract:
We present a five-band Herschel study (100-500um) of three galaxy clusters at z~1.2 from the Spitzer Adaptation of the Red-Sequence Cluster Survey (SpARCS). With a sample of 120 spectroscopically-confirmed cluster members, we investigate the role of environment on galaxy properties utilizing the projected cluster phase space (line-of-sight velocity versus clustercentric radius), which probes the time-averaged galaxy density to which a galaxy has been exposed. We divide cluster galaxies into phase-space bins of (r/r200) x (v/sigma_v), tracing a sequence of accretion histories in phase space. Stacking optically star-forming cluster members on the Herschel maps, we measure average infrared star formation rates, and, for the first time in high-redshift galaxy clusters, dust temperatures for dynamically distinct galaxy populations---namely, recent infalls and those that were accreted onto the cluster at an earlier epoch. Proceeding from the infalling to virialized (central) regions of phase space, we find a steady decrease in the specific star formation rate and increase in the stellar age of star-forming cluster galaxies. We perform a probability analysis to investigate all acceptable infrared spectral energy distributions within the full parameter space and measure a ~4 sigma drop in the average dust temperature of cluster galaxies in an intermediate phase-space bin, compared to an otherwise flat trend with phase space. We suggest one plausible quenching mechanism which may be consistent with these trends, invoking ram-pressure stripping of the warmer dust for galaxies within this intermediate accretion phase.
Submitted 2 November, 2015;
originally announced November 2015.