-
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images
Authors:
Kazi Sajeed Mehrab,
M. Maruf,
Arka Daw,
Harish Babu Manogaran,
Abhilash Neog,
Mridul Khurana,
Bahadir Altintas,
Yasin Bakis,
Elizabeth G Campolongo,
Matthew J Thompson,
Xiaojun Wang,
Hilmar Lapp,
Wei-Lun Chao,
Paula M. Mabee,
Henry L. Bart Jr.,
Wasila Dahdul,
Anuj Karpatne
Abstract:
Fishes are integral to both ecological systems and economic sectors, and studying fish traits is crucial for understanding biodiversity patterns and macro-evolution trends. To enable the analysis of visual traits from fish images, we introduce the Fish-Visual Trait Analysis (Fish-Vista) dataset - a large, annotated collection of about 60K fish images spanning 1900 different species, supporting sev…
▽ More
Fishes are integral to both ecological systems and economic sectors, and studying fish traits is crucial for understanding biodiversity patterns and macro-evolution trends. To enable the analysis of visual traits from fish images, we introduce the Fish-Visual Trait Analysis (Fish-Vista) dataset - a large, annotated collection of about 60K fish images spanning 1900 different species, supporting several challenging and biologically relevant tasks including species classification, trait identification, and trait segmentation. These images have been curated through a sophisticated data processing pipeline applied to a cumulative set of images obtained from various museum collections. Fish-Vista provides fine-grained labels of various visual traits present in each image. It also offers pixel-level annotations of 9 different traits for 2427 fish images, facilitating additional trait segmentation and localization tasks. The ultimate goal of Fish-Vista is to provide a clean, carefully curated, high-resolution dataset that can serve as a foundation for accelerating biological discoveries using advances in AI. Finally, we provide a comprehensive analysis of state-of-the-art deep learning techniques on Fish-Vista.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
AtLAST Science Overview Report
Authors:
Mark Booth,
Pamela Klaassen,
Claudia Cicone,
Tony Mroczkowski,
Martin A. Cordiner,
Luca Di Mascolo,
Doug Johnstone,
Eelco van Kampen,
Minju M. Lee,
Daizhong Liu,
John Orlowski-Scherer,
Amélie Saintonge,
Matthew W. L. Smith,
Alexander Thelen,
Sven Wedemeyer,
Kazunori Akiyama,
Stefano Andreon,
Doris Arzoumanian,
Tom J. L. C. Bakx,
Caroline Bot,
Geoffrey Bower,
Roman Brajša,
Chian-Chou Chen,
Elisabete da Cunha,
David Eden
, et al. (59 additional authors not shown)
Abstract:
Submillimeter and millimeter wavelengths provide a unique view of the Universe, from the gas and dust that fills and surrounds galaxies to the chromosphere of our own Sun. Current single-dish facilities have presented a tantalising view of the brightest (sub-)mm sources, and interferometers have provided the exquisite resolution necessary to analyse the details in small fields, but there are still…
▽ More
Submillimeter and millimeter wavelengths provide a unique view of the Universe, from the gas and dust that fills and surrounds galaxies to the chromosphere of our own Sun. Current single-dish facilities have presented a tantalising view of the brightest (sub-)mm sources, and interferometers have provided the exquisite resolution necessary to analyse the details in small fields, but there are still many open questions that cannot be answered with current facilities. In this report we summarise the science that is guiding the design of the Atacama Large Aperture Submillimeter Telescope (AtLAST). We demonstrate how tranformational advances in topics including star formation in high redshift galaxies, the diffuse circumgalactic medium, Galactic ecology, cometary compositions and solar flares motivate the need for a 50m, single-dish telescope with a 1-2 degree field of view and a new generation of highly multiplexed continuum and spectral cameras. AtLAST will have the resolution to drastically lower the confusion limit compared to current single-dish facilities, whilst also being able to rapidly map large areas of the sky and detect extended, diffuse structures. Its high sensitivity and large field of view will open up the field of submillimeter transient science by increasing the probability of serendipitous detections. Finally, the science cases listed here motivate the need for a highly flexible operations model capable of short observations of individual targets, large surveys, monitoring programmes, target of opportunity observations and coordinated observations with other observatories. AtLAST aims to be a sustainable, upgradeable, multipurpose facility that will deliver orders of magnitude increases in sensitivity and mapping speeds over current and planned submillimeter observatories.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
A study of Galactic Plane Planck Galactic Cold Clumps observed by SCOPE and the JCMT Plane Survey
Authors:
D. J. Eden,
Tie Liu,
T. J. T. Moore,
J. Di Francesco,
G. Fuller,
Kee-Tae Kim,
Di Li,
S. -Y. Liu,
R. Plume,
Ken'ichi Tatematsu,
M. A. Thompson,
Y. Wu,
L. Bronfman,
H. M. Butner,
M. J. Currie,
G. Garay,
P. F. Goldsmith,
N. Hirano,
D. Johnstone,
M. Juvela,
S. -P. Lai,
C. W. Lee,
E. E. Mannfors,
F. Olguin,
K. Pattle
, et al. (10 additional authors not shown)
Abstract:
We have investigated the physical properties of Planck Galactic Cold Clumps (PGCCs) located in the Galactic Plane, using the JCMT Plane Survey (JPS) and the SCUBA-2 Continuum Observations of Pre-protostellar Evolution (SCOPE) survey. By utilising a suite of molecular-line surveys, velocities and distances were assigned to the compact sources within the PGCCs, placing them in a Galactic context. Th…
▽ More
We have investigated the physical properties of Planck Galactic Cold Clumps (PGCCs) located in the Galactic Plane, using the JCMT Plane Survey (JPS) and the SCUBA-2 Continuum Observations of Pre-protostellar Evolution (SCOPE) survey. By utilising a suite of molecular-line surveys, velocities and distances were assigned to the compact sources within the PGCCs, placing them in a Galactic context. The properties of these compact sources show no large-scale variations with Galactic environment. Investigating the star-forming content of the sample, we find that the luminosity-to-mass ratio (L/M) is an order of magnitude lower than in other Galactic studies, indicating that these objects are hosting lower levels of star formation. Finally, by comparing ATLASGAL sources that are associated or are not associated with PGCCs, we find that those associated with PGCCs are typically colder, denser, and have a lower L/M ratio, hinting that PGCCs are a distinct population of Galactic Plane sources.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
A manufacturable platform for photonic quantum computing
Authors:
Koen Alexander,
Andrea Bahgat,
Avishai Benyamini,
Dylan Black,
Damien Bonneau,
Stanley Burgos,
Ben Burridge,
Geoff Campbell,
Gabriel Catalano,
Alex Ceballos,
Chia-Ming Chang,
CJ Chung,
Fariba Danesh,
Tom Dauer,
Michael Davis,
Eric Dudley,
Ping Er-Xuan,
Josep Fargas,
Alessandro Farsi,
Colleen Fenrich,
Jonathan Frazer,
Masaya Fukami,
Yogeeswaran Ganesan,
Gary Gibson,
Mercedes Gimeno-Segovia
, et al. (70 additional authors not shown)
Abstract:
Whilst holding great promise for low noise, ease of operation and networking, useful photonic quantum computing has been precluded by the need for beyond-state-of-the-art components, manufactured by the millions. Here we introduce a manufacturable platform for quantum computing with photons. We benchmark a set of monolithically-integrated silicon photonics-based modules to generate, manipulate, ne…
▽ More
Whilst holding great promise for low noise, ease of operation and networking, useful photonic quantum computing has been precluded by the need for beyond-state-of-the-art components, manufactured by the millions. Here we introduce a manufacturable platform for quantum computing with photons. We benchmark a set of monolithically-integrated silicon photonics-based modules to generate, manipulate, network, and detect photonic qubits, demonstrating dual-rail photonic qubits with $99.98\% \pm 0.01\%$ state preparation and measurement fidelity, Hong-Ou-Mandel quantum interference between independent photon sources with $99.50\%\pm0.25\%$ visibility, two-qubit fusion with $99.22\%\pm0.12\%$ fidelity, and a chip-to-chip qubit interconnect with $99.72\%\pm0.04\%$ fidelity, not accounting for loss. In addition, we preview a selection of next generation technologies, demonstrating low-loss silicon nitride waveguides and components, fabrication-tolerant photon sources, high-efficiency photon-number-resolving detectors, low-loss chip-to-fiber coupling, and barium titanate electro-optic phase shifters.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Asteroid (101955) Bennu in the Laboratory: Properties of the Sample Collected by OSIRIS-REx
Authors:
Dante S. Lauretta,
Harold C. Connolly, Jr.,
Joseph E. Aebersold,
Conel M. O. D. Alexander,
Ronald-L. Ballouz,
Jessica J. Barnes,
Helena C. Bates,
Carina A. Bennett,
Laurinne Blanche,
Erika H. Blumenfeld,
Simon J. Clemett,
George D. Cody,
Daniella N. DellaGiustina,
Jason P. Dworkin,
Scott A. Eckley,
Dionysis I. Foustoukos,
Ian A. Franchi,
Daniel P. Glavin,
Richard C. Greenwood,
Pierre Haenecour,
Victoria E. Hamilton,
Dolores H. Hill,
Takahiro Hiroi,
Kana Ishimaru,
Fred Jourdan
, et al. (28 additional authors not shown)
Abstract:
On 24 September 2023, the NASA OSIRIS-REx mission dropped a capsule to Earth containing approximately 120 g of pristine carbonaceous regolith from Bennu. We describe the delivery and initial allocation of this asteroid sample and introduce its bulk physical, chemical, and mineralogical properties from early analyses. The regolith is very dark overall, with higher-reflectance inclusions and particl…
▽ More
On 24 September 2023, the NASA OSIRIS-REx mission dropped a capsule to Earth containing approximately 120 g of pristine carbonaceous regolith from Bennu. We describe the delivery and initial allocation of this asteroid sample and introduce its bulk physical, chemical, and mineralogical properties from early analyses. The regolith is very dark overall, with higher-reflectance inclusions and particles interspersed. Particle sizes range from sub-micron dust to a stone about 3.5 cm long. Millimeter-scale and larger stones typically have hummocky or angular morphologies. A subset of the stones appears mottled by brighter material that occurs as veins and crusts. Hummocky stones have the lowest densities and mottled stones have the highest. Remote sensing of the surface of Bennu detected hydrated phyllosilicates, magnetite, organic compounds, carbonates, and scarce anhydrous silicates, all of which the sample confirms. We also find sulfides, presolar grains, and, less expectedly, Na-rich phosphates, as well as other trace phases. The sample composition and mineralogy indicate substantial aqueous alteration and resemble those of Ryugu and the most chemically primitive, low-petrologic-type carbonaceous chondrites. Nevertheless, we find distinct hydrogen, nitrogen, and oxygen isotopic compositions, and some of the material we analyzed is enriched in fluid-mobile elements. Our findings underscore the value of sample return, especially for low-density material that may not readily survive atmospheric entry, and lay the groundwork for more comprehensive analyses.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Atacama Large Aperture Submillimeter Telescope (AtLAST) Science: Our Galaxy
Authors:
Pamela Klaassen,
Alessio Traficante,
Maria T. Beltrán,
Kate Pattle,
Mark Booth,
Joshua B. Lovell,
Jonathan P. Marshall,
Alvaro Hacar,
Brandt A. L. Gaches,
Caroline Bot,
Nicolas Peretto,
Thomas Stanke,
Doris Arzoumanian,
Ana Duarte Cabral,
Gaspard Duchêne,
David J. Eden,
Antonio Hales,
Jens Kauffmann,
Patricia Luppe,
Sebastian Marino,
Elena Redaelli,
Andrew J. Rigby,
Álvaro Sánchez-Monge,
Eugenio Schisano,
Dmitry A. Semenov
, et al. (16 additional authors not shown)
Abstract:
As we learn more about the multi-scale interstellar medium (ISM) of our Galaxy, we develop a greater understanding for the complex relationships between the large-scale diffuse gas and dust in Giant Molecular Clouds (GMCs), how it moves, how it is affected by the nearby massive stars, and which portions of those GMCs eventually collapse into star forming regions. The complex interactions of those…
▽ More
As we learn more about the multi-scale interstellar medium (ISM) of our Galaxy, we develop a greater understanding for the complex relationships between the large-scale diffuse gas and dust in Giant Molecular Clouds (GMCs), how it moves, how it is affected by the nearby massive stars, and which portions of those GMCs eventually collapse into star forming regions. The complex interactions of those gas, dust and stellar populations form what has come to be known as the ecology of our Galaxy. Because we are deeply embedded in the plane of our Galaxy, it takes up a significant fraction of the sky, with complex dust lanes scattered throughout the optically recognisable bands of the Milky Way. These bands become bright at (sub-)millimetre wavelengths, where we can study dust thermal emission and the chemical and kinematic signatures of the gas. To properly study such large-scale environments, requires deep, large area surveys that are not possible with current facilities. Moreover, where stars form, so too do planetary systems, growing from the dust and gas in circumstellar discs, to planets and planetesimal belts. Understanding the evolution of these belts requires deep imaging capable of studying belts around young stellar objects to Kuiper belt analogues around the nearest stars. Here we present a plan for observing the Galactic Plane and circumstellar environments to quantify the physical structure, the magnetic fields, the dynamics, chemistry, star formation, and planetary system evolution of the galaxy in which we live with AtLAST; a concept for a new, 50m single-dish sub-mm telescope with a large field of view which is the only type of facility that will allow us to observe our Galaxy deeply and widely enough to make a leap forward in our understanding of our local ecology.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Synthesizing study-specific controls using generative models on open access datasets for harmonized multi-study analyses
Authors:
Shruti P. Gadewar,
Alyssa H. Zhu,
Iyad Ba Gari,
Sunanda Somu,
Sophia I. Thomopoulos,
Paul M. Thompson,
Talia M. Nir,
Neda Jahanshad
Abstract:
Neuroimaging consortia can enhance reliability and generalizability of findings by pooling data across studies to achieve larger sample sizes. To adjust for site and MRI protocol effects, imaging datasets are often harmonized based on healthy controls. When data from a control group were not collected, statistical harmonization options are limited as patient characteristics and acquisition-related…
▽ More
Neuroimaging consortia can enhance reliability and generalizability of findings by pooling data across studies to achieve larger sample sizes. To adjust for site and MRI protocol effects, imaging datasets are often harmonized based on healthy controls. When data from a control group were not collected, statistical harmonization options are limited as patient characteristics and acquisition-related variables may be confounded. Here, in a multi-study neuroimaging analysis of Alzheimer's patients and controls, we tested whether it is possible to generate synthetic control MRIs. For one case-control study, we used a generative adversarial model for style-based harmonization to generate site-specific controls. Downstream feature extraction, statistical harmonization and group-level multi-study case-control and case-only analyses were performed twice, using either true or synthetic controls. All effect sizes using synthetic controls overlapped with those based on true study controls. This line of work may facilitate wider inclusion of case-only studies in multi-study consortia.
△ Less
Submitted 29 February, 2024;
originally announced March 2024.
-
QUEST-DMC: Background Modelling and Resulting Heat Deposit for a Superfluid Helium-3 Bolometer
Authors:
S. Autti,
A. Casey,
N. Eng,
N. Darvishi,
P. Franchini,
R. P. Haley,
P. J. Heikkinen,
A. Kemp,
E. Leason,
L. V. Levitin,
J. Monroe,
J. March-Russel,
M. T. Noble,
J. R. Prance,
X. Rojas,
T. Salmon,
J. Saunders,
R. Smith,
M. D. Thompson,
V. Tsepelin,
S. M. West,
L. Whitehead,
K. Zhang,
D. E. Zmeev
Abstract:
We report the results of radioactivity assays and heat leak calculations for a range of common cryogenic materials, considered for use in the QUEST-DMC superfluid 3He dark matter detector. The bolometer, instrumented with nanomechanical resonators, will be sensitive to energy deposits from dark matter interactions. Events from radioactive decays and cosmic rays constitute a significant background…
▽ More
We report the results of radioactivity assays and heat leak calculations for a range of common cryogenic materials, considered for use in the QUEST-DMC superfluid 3He dark matter detector. The bolometer, instrumented with nanomechanical resonators, will be sensitive to energy deposits from dark matter interactions. Events from radioactive decays and cosmic rays constitute a significant background and must be precisely modelled, using a combination of material screening and Monte Carlo simulations. However, the results presented here are of wider interest for experiments and quantum devices sensitive to minute heat leaks and spurious events, thus we present heat leak per unit mass or surface area for every material studied. This can inform material choices for other experiments, especially if underground operation is considered where the radiogenic backgrounds will dominate even at shallow depths.
△ Less
Submitted 19 May, 2024; v1 submitted 31 January, 2024;
originally announced February 2024.
-
The dynamic centres of infrared-dark clouds and the formation of cores
Authors:
Andrew J. Rigby,
Nicolas Peretto,
Michael Anderson,
Sarah E. Ragan,
Felix D. Priestley,
Gary A. Fuller,
Mark A. Thompson,
Alessio Traficante,
Elizabeth J. Watkins,
Gwenllian M. Williams
Abstract:
High-mass stars have an enormous influence on the evolution of the interstellar medium in galaxies, so it is important that we understand how they form. We examine the central clumps within a sample of seven infrared-dark clouds (IRDCs) with a range of masses and morphologies. We use 1 pc-scale observations from NOEMA and the IRAM 30-m telescope to trace dense cores with 2.8 mm continuum, and gas…
▽ More
High-mass stars have an enormous influence on the evolution of the interstellar medium in galaxies, so it is important that we understand how they form. We examine the central clumps within a sample of seven infrared-dark clouds (IRDCs) with a range of masses and morphologies. We use 1 pc-scale observations from NOEMA and the IRAM 30-m telescope to trace dense cores with 2.8 mm continuum, and gas kinematics in C$^{18}$O, HCO$^+$, HNC, and N$_2$H$^+$ ($J$=1$-$0). We supplement our continuum sample with six IRDCs observed at 2.9 mm with ALMA, and examine the relationships between core- and clump-scale properties. We have developed a fully-automated multiple-velocity component hyperfine line-fitting code called mwydyn which we employ to trace the dense gas kinematics in N$_2$H$^+$ (1$-$0), revealing highly complex and dynamic clump interiors. We find that parsec-scale clump mass is the most important factor driving the evolution; more massive clumps are able to concentrate more mass into their most massive cores - with a log-normally distributed efficiency of around 9% - in addition to containing the most dynamic gas. Distributions of linewidths within the most massive cores are similar to the ambient gas, suggesting that they are not dynamically decoupled, but are similarly chaotic. A number of studies have previously suggested that clumps are globally collapsing; in such a scenario, the observed kinematics of clump centres would be the direct result of gravity-driven mass inflows that become ever more complex as the clumps evolve, which in turn leads to the chaotic mass growth of their core populations.
△ Less
Submitted 31 January, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Experimenting with Large Language Models and vector embeddings in NASA SciX
Authors:
Sergi Blanco-Cuaresma,
Ioana Ciucă,
Alberto Accomazzi,
Michael J. Kurtz,
Edwin A. Henneken,
Kelly E. Lockhart,
Felix Grezes,
Thomas Allen,
Golnaz Shapurian,
Carolyn S. Grant,
Donna M. Thompson,
Timothy W. Hostetler,
Matthew R. Templeton,
Shinyi Chen,
Jennifer Koch,
Taylor Jacovich,
Daniel Chivvis,
Fernanda de Macedo Alves,
Jean-Claude Paquin,
Jennifer Bartlett,
Mugdha Polimera,
Stephanie Jarmak
Abstract:
Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a…
▽ More
Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed an experiment where we created semantic vectors for our large collection of abstracts and full-text content, and we designed a prompt system to ask questions using contextual chunks from our system. Based on a non-systematic human evaluation, the experiment shows a lower degree of hallucination and better responses when using Retrieval Augmented Generation. Further exploration is required to design new features and data augmentation processes at NASA SciX that leverages this technology while respecting the high level of trust and quality that the project holds.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Virtual Reality-Assisted Physiotherapy for Visuospatial Neglect Rehabilitation: A Proof-of-Concept Study
Authors:
Andrew Danso,
Patti Nijhuis,
Alessandro Ansani,
Martin Hartmann,
Gulnara Minkkinen,
Geoff Luck,
Joshua S. Bamford,
Sarah Faber,
Kat Agres,
Solange Glasser,
Teppo Särkämö,
Rebekah Rousi,
Marc R. Thompson
Abstract:
This study explores a VR-based intervention for Visuospatial neglect (VSN), a post-stroke condition. It aims to develop a VR task utilizing interactive visual-audio cues to improve sensory-motor training and assess its impact on VSN patients' engagement and performance. Collaboratively designed with physiotherapists, the VR task uses directional and auditory stimuli to alert and direct patients, t…
▽ More
This study explores a VR-based intervention for Visuospatial neglect (VSN), a post-stroke condition. It aims to develop a VR task utilizing interactive visual-audio cues to improve sensory-motor training and assess its impact on VSN patients' engagement and performance. Collaboratively designed with physiotherapists, the VR task uses directional and auditory stimuli to alert and direct patients, tested over 12 sessions with two individuals. Results show a consistent decrease in task completion variability and positive patient feedback, highlighting the VR task's potential for enhancing engagement and suggesting its feasibility in rehabilitation. The study underlines the significance of collaborative design in healthcare technology and advocates for further research with a larger sample size to confirm the benefits of VR in VSN treatment, as well as its applicability to other multimodal disorders.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
The SARAO MeerKAT 1.3 GHz Galactic Plane Survey
Authors:
S. Goedhart,
W. D. Cotton,
F. Camilo,
M. A. Thompson,
G. Umana,
M. Bietenholz,
P. A. Woudt,
L. D. Anderson,
C. Bordiu,
D. A. H. Buckley,
C. S. Buemi,
F. Bufano,
F. Cavallaro,
H. Chen,
J. O. Chibueze,
D. Egbo,
B. S. Frank,
M. G. Hoare,
A. Ingallinera,
T. Irabor,
R. C. Kraan-Korteweg,
S. Kurapati,
P. Leto,
S. Loru,
M. Mutale
, et al. (105 additional authors not shown)
Abstract:
We present the SARAO MeerKAT Galactic Plane Survey (SMGPS), a 1.3 GHz continuum survey of almost half of the Galactic Plane (251°$\le l \le$ 358°and 2°$\le l \le$ 61°at $|b| \le 1.5°$). SMGPS is the largest, most sensitive and highest angular resolution 1 GHz survey of the Plane yet carried out, with an angular resolution of 8" and a broadband RMS sensitivity of $\sim$10--20 $μ$ Jy/beam. Here we d…
▽ More
We present the SARAO MeerKAT Galactic Plane Survey (SMGPS), a 1.3 GHz continuum survey of almost half of the Galactic Plane (251°$\le l \le$ 358°and 2°$\le l \le$ 61°at $|b| \le 1.5°$). SMGPS is the largest, most sensitive and highest angular resolution 1 GHz survey of the Plane yet carried out, with an angular resolution of 8" and a broadband RMS sensitivity of $\sim$10--20 $μ$ Jy/beam. Here we describe the first publicly available data release from SMGPS which comprises data cubes of frequency-resolved images over 908--1656 MHz, power law fits to the images, and broadband zeroth moment integrated intensity images. A thorough assessment of the data quality and guidance for future usage of the data products are given. Finally, we discuss the tremendous potential of SMGPS by showcasing highlights of the Galactic and extragalactic science that it permits. These highlights include the discovery of a new population of non-thermal radio filaments; identification of new candidate supernova remnants, pulsar wind nebulae and planetary nebulae; improved radio/mid-IR classification of rare Luminous Blue Variables and discovery of associated extended radio nebulae; new radio stars identified by Bayesian cross-matching techniques; the realisation that many of the largest radio-quiet WISE HII region candidates are not true HII regions; and a large sample of previously undiscovered background HI galaxies in the Zone of Avoidance.
△ Less
Submitted 2 May, 2024; v1 submitted 12 December, 2023;
originally announced December 2023.
-
Accumulation and removal of Si impurities on $β-Ga_2O_3$ arising from ambient air exposure
Authors:
J. P. McCandless,
C. A. Gorsak,
V. Protasenko,
D. G. Schlom,
Michael O. Thompson,
H. G. Xing,
D. Jena,
H. P. Nair
Abstract:
Here we report that the source of Si impurities commonly observed on (010) $β-Ga_2O_3$ is from exposure of the surface to air. Moreover, we find that a 15 minute HF (49%) treatment reduces the Si density by approximately 1 order of magnitude on (010) $β-Ga_2O_3$ surfaces. This reduction in Si is critical for the elimination of the often observed parasitic conducting channel, which negatively affec…
▽ More
Here we report that the source of Si impurities commonly observed on (010) $β-Ga_2O_3$ is from exposure of the surface to air. Moreover, we find that a 15 minute HF (49%) treatment reduces the Si density by approximately 1 order of magnitude on (010) $β-Ga_2O_3$ surfaces. This reduction in Si is critical for the elimination of the often observed parasitic conducting channel, which negatively affects transport properties and lateral transistor performance. After the HF treatment the sample must be immediately put under vacuum, for the Si fully returns within 10 minutes of additional air exposure. Lastly, we demonstrate that performing a 30 minute HF (49%) treatment on the substrate before growth has no deleterious effect on the structure or on the epitaxy surface after subsequent $Ga_2O_3$ growth.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
BioCLIP: A Vision Foundation Model for the Tree of Life
Authors:
Samuel Stevens,
Jiaman Wu,
Matthew J Thompson,
Elizabeth G Campolongo,
Chan Hee Song,
David Edward Carlyn,
Li Dong,
Wasila M Dahdul,
Charles Stewart,
Tanya Berger-Wolf,
Wei-Lun Chao,
Yu Su
Abstract:
Images of the natural world, collected by a variety of cameras, from drones to individual phones, are increasingly abundant sources of biological information. There is an explosion of computational methods and tools, particularly computer vision, for extracting biologically relevant information from images for science and conservation. Yet most of these are bespoke approaches designed for a specif…
▽ More
Images of the natural world, collected by a variety of cameras, from drones to individual phones, are increasingly abundant sources of biological information. There is an explosion of computational methods and tools, particularly computer vision, for extracting biologically relevant information from images for science and conservation. Yet most of these are bespoke approaches designed for a specific task and are not easily adaptable or extendable to new questions, contexts, and datasets. A vision model for general organismal biology questions on images is of timely need. To approach this, we curate and release TreeOfLife-10M, the largest and most diverse ML-ready dataset of biology images. We then develop BioCLIP, a foundation model for the tree of life, leveraging the unique properties of biology captured by TreeOfLife-10M, namely the abundance and variety of images of plants, animals, and fungi, together with the availability of rich structured biological knowledge. We rigorously benchmark our approach on diverse fine-grained biology classification tasks and find that BioCLIP consistently and substantially outperforms existing baselines (by 16% to 17% absolute). Intrinsic evaluation reveals that BioCLIP has learned a hierarchical representation conforming to the tree of life, shedding light on its strong generalizability. https://imageomics.github.io/bioclip has models, data and code.
△ Less
Submitted 14 May, 2024; v1 submitted 30 November, 2023;
originally announced November 2023.
-
DenseNet and Support Vector Machine classifications of major depressive disorder using vertex-wise cortical features
Authors:
Vladimir Belov,
Tracy Erwin-Grabner,
Ling-Li Zeng,
Christopher R. K. Ching,
Andre Aleman,
Alyssa R. Amod,
Zeynep Basgoze,
Francesco Benedetti,
Bianca Besteher,
Katharina Brosch,
Robin Bülow,
Romain Colle,
Colm G. Connolly,
Emmanuelle Corruble,
Baptiste Couvy-Duchesne,
Kathryn Cullen,
Udo Dannlowski,
Christopher G. Davey,
Annemiek Dols,
Jan Ernsting,
Jennifer W. Evans,
Lukas Fisch,
Paola Fuentes-Claramonte,
Ali Saffet Gonul,
Ian H. Gotlib
, et al. (63 additional authors not shown)
Abstract:
Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, h…
▽ More
Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, has the potential to provide diagnostic and predictive biomarkers for MDD. However, previous attempts to demarcate MDD patients and healthy controls (HC) based on segmented cortical features via linear machine learning approaches have reported low accuracies. In this study, we used globally representative data from the ENIGMA-MDD working group containing an extensive sample of people with MDD (N=2,772) and HC (N=4,240), which allows a comprehensive analysis with generalizable results. Based on the hypothesis that integration of vertex-wise cortical features can improve classification performance, we evaluated the classification of a DenseNet and a Support Vector Machine (SVM), with the expectation that the former would outperform the latter. As we analyzed a multi-site sample, we additionally applied the ComBat harmonization tool to remove potential nuisance effects of site. We found that both classifiers exhibited close to chance performance (balanced accuracy DenseNet: 51%; SVM: 53%), when estimated on unseen sites. Slightly higher classification performance (balanced accuracy DenseNet: 58%; SVM: 55%) was found when the cross-validation folds contained subjects from all sites, indicating site effect. In conclusion, the integration of vertex-wise morphometric features and the use of the non-linear classifier did not lead to the differentiability between MDD and HC. Our results support the notion that MDD classification on this combination of features and classifiers is unfeasible.
△ Less
Submitted 18 November, 2023;
originally announced November 2023.
-
Long nanomechanical resonators with circular cross-section
Authors:
Samuli Autti,
Andrew Casey,
Marie Connelly,
Neda Darvishi,
Paolo Franchini,
James Gorman,
Richard P. Haley,
Petri J. Heikkinen,
Ashlea Kemp,
Elizabeth Leason,
John March-Russell,
Jocelyn Monroe,
Theo Noble,
George R. Pickett,
Jonathan R. Prance,
Xavier Rojas,
Tineke Salmon,
John Saunders,
Jack Slater,
Robert Smith,
Michael D. Thompson,
Stephen M. West,
Luke Whitehead,
Vladislav V. Zavjalov,
Kuang Zhang
, et al. (1 additional authors not shown)
Abstract:
Fabrication of superconducting nanomechanical resonators for quantum research, detectors and devices traditionally relies on a lithographic process, resulting in oscillators with sharp edges and a suspended length limited to a few 100 micrometres. We report a low-investment top-down approach to fabricating NbTi nanowire resonators with suspended lengths up to several millimetres and diameters down…
▽ More
Fabrication of superconducting nanomechanical resonators for quantum research, detectors and devices traditionally relies on a lithographic process, resulting in oscillators with sharp edges and a suspended length limited to a few 100 micrometres. We report a low-investment top-down approach to fabricating NbTi nanowire resonators with suspended lengths up to several millimetres and diameters down to 100 nanometres. The nanowires possess high critical currents and fields, making them a natural choice for magnetomotive actuation and sensing. This fabrication technique is independent of the substrate material, dimensions and layout and can readily be adapted to fabricate nanowire resonators from any metal or alloy with suitable ductility and yield strength. Our work thus opens access to a new class of nanomechanical devices with applications including microscopic and mesoscopic investigations of quantum fluids, detecting dark matter and fundamental materials research in one-dimensional superconductors in vacuum.
△ Less
Submitted 4 November, 2023;
originally announced November 2023.
-
Silicon Implantation and Annealing in $β$-Ga$_2$O$_3$: Role of Ambient, Temperature, and Time
Authors:
K. R. Gann,
N. Pieczulewski1,
C. A. Gorsak,
K. Heinselman,
T. J. Asel,
B. A. Noesges,
K. T. Smith,
D. M. Dryden,
H. G. Xing,
H. P. Nair,
D. A. Muller,
M. O. Thompson
Abstract:
Optimizing thermal anneals of Si-implanted $β$-Ga$_2$O$_3$ is critical for low resistance contacts and selective area doping. We report the impact of annealing ambient, temperature, and time on activation of room temperature ion-implanted Si in $β$-Ga$_2$O$_3$ at concentrations from 5x10$^{18}$ to 1x10$^{20}$ cm$^{-3}$, demonstrating full activation (>80% activation, mobilities >70 cm$^{2}$/Vs) wi…
▽ More
Optimizing thermal anneals of Si-implanted $β$-Ga$_2$O$_3$ is critical for low resistance contacts and selective area doping. We report the impact of annealing ambient, temperature, and time on activation of room temperature ion-implanted Si in $β$-Ga$_2$O$_3$ at concentrations from 5x10$^{18}$ to 1x10$^{20}$ cm$^{-3}$, demonstrating full activation (>80% activation, mobilities >70 cm$^{2}$/Vs) with contact resistances below 0.29 $Ω$-mm. Homoepitaxial $β$-Ga$_2$O$_3$ films, grown by plasma assisted MBE on Fe-doped (010) substrates, were implanted at multiple energies to yield 100 nm box profiles of 5x10$^{18}$, 5x10$^{19}$, and 1x10$^{20}$ cm$^{-3}$. Anneals were performed in a UHV-compatible quartz furnace at 1 bar with well-controlled gas composition. To maintain $β$-Ga$_2$O$_3$ stability, $p_{O2}$ must be greater than 10$^{-9}$ bar. Anneals up to $p_{O2}$ = 1 bar achieve full activation at 5x10$^{18}$ cm$^{-3}$, while 5x10$^{19}$ cm$^{-3}$ must be annealed with $p_{O2}$ <10$^{-4}$ bar and 1x10$^{20}$ cm$^{-3}$ requires $p_{O2}$ <10$^{-6}$ bar. Water vapor prevents activation and must be maintained below 10$^{-8}$ bar. Activation is achieved for anneal temperatures as low as 850 °C with mobility increasing with anneal temperature up to 1050 °C, though Si diffusion has been reported above 950 °C. At 950 °C, activation is maximized between 5 and 20 minutes with longer times resulting in decreased carrier activation (over-annealing). This over-annealing is significant for concentrations above 5x10$^{19}$ cm$^{-3}$ and occurs rapidly at 1x10$^{20}$ cm$^{-3}$. RBS (channeling) suggests damage recovery is seeded from remnant aligned $β$-Ga$_2$O$_3$ that remains after implantation; this conclusion is also supported by STEM showing retention of the $β$-phase with inclusions that resemble the $γ$-phase.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
QUEST-DMC superfluid $^3$He detector for sub-GeV dark matter
Authors:
S. Autti,
A. Casey,
N. Eng,
N. Darvishi,
P. Franchini,
R. P. Haley,
P. J. Heikkinen,
A. Jennings,
A. Kemp,
E. Leason,
L. V. Levitin,
J. Monroe,
J. March-Russel,
M. T. Noble,
J. R. Prance,
X. Rojas,
T. Salmon,
J. Saunders,
R. Smith,
M. D. Thompson,
V. Tsepelin,
S. M. West,
L. Whitehead,
V. V. Zavjalov,
D. E. Zmeev
Abstract:
The focus of dark matter searches to date has been on Weakly Interacting Massive Particles (WIMPs) in the GeV/$c^2$-TeV/$c^2$ mass range. The direct, indirect and collider searches in this mass range have been extensive but ultimately unsuccessful, providing a strong motivation for widening the search outside this range. Here we describe a new concept for a dark matter experiment, employing superf…
▽ More
The focus of dark matter searches to date has been on Weakly Interacting Massive Particles (WIMPs) in the GeV/$c^2$-TeV/$c^2$ mass range. The direct, indirect and collider searches in this mass range have been extensive but ultimately unsuccessful, providing a strong motivation for widening the search outside this range. Here we describe a new concept for a dark matter experiment, employing superfluid $^3$He as a detector for dark matter that is close to the mass of the proton, of order 1 GeV/$c^2$. The QUEST-DMC detector concept is based on quasiparticle detection in a bolometer cell by a nanomechanical resonator. In this paper we develop the energy measurement methodology and detector response model, simulate candidate dark matter signals and expected background interactions, and calculate the sensitivity of such a detector. We project that such a detector can reach sub-eV nuclear recoil energy threshold, opening up new windows on the parameter space of both spin-dependent and spin-independent interactions of light dark matter candidates.
△ Less
Submitted 14 March, 2024; v1 submitted 17 October, 2023;
originally announced October 2023.
-
Did Private Election Administration Funding Advantage Democrats in 2020?
Authors:
Apoorva Lal,
Daniel M Thompson
Abstract:
Private donors contributed more than $350 million to local election officials to support the administration of the 2020 election. Supporters argue these grants were neutral and necessary to maintain normal election operations during the pandemic, while critics worry these grants mostly went to Democratic strongholds and tilted election outcomes. These concerns have led twenty-four states to restri…
▽ More
Private donors contributed more than $350 million to local election officials to support the administration of the 2020 election. Supporters argue these grants were neutral and necessary to maintain normal election operations during the pandemic, while critics worry these grants mostly went to Democratic strongholds and tilted election outcomes. These concerns have led twenty-four states to restrict private election grants. How much did these grants shape the 2020 presidential election? To answer this question, we collect administrative data on private election administration grants and election outcomes. We then use new advances in synthetic control methods to compare presidential election results and turnout in counties that received grants to counties with identical average presidential election results and turnout before 2020. While counties that favor Democrats were much more likely to apply for a grant, we find that the grants did not have a noticeable effect on the presidential election. Our estimates of the average effect of receiving a grant on Democratic vote share range from 0.02 percentage points to 0.36 percentage points. Our estimates of the average effect of receiving a grant on turnout range from -0.03 percentage points to 0.13 percentage points. Across specifications, our 95% confidence intervals typically include negative effects, and our confidence intervals from all specifications fail to include effects on Democratic vote share larger than 0.58 percentage points and effects on turnout larger than 0.40 percentage points. We characterize the magnitude of our effects by asking how large they are compared to the margin by which Biden won the 2020 election. In simple bench-marking exercises, we find that the effects of the grants were likely too small to have changed the outcome of the 2020 presidential election.
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
Outgassing Composition of the Murchison Meteorite: Implications for Volatile Depletion of Planetesimals and Interior-atmosphere Connections for Terrestrial Exoplanets
Authors:
Maggie A. Thompson,
Myriam Telus,
Graham Harper Edwards,
Laura Schaefer,
Jasmeet Dhaliwal,
Brian Dreyer,
Jonathan J. Fortney,
Kyle Kim
Abstract:
Outgassing is a central process during the formation and evolution of terrestrial planets and their atmospheres both within and beyond the solar system. Although terrestrial planets' early atmospheres likely form via outgassing during planetary accretion, the connection between a planet's bulk composition and its initial atmospheric properties is not well understood. One way to inform this connect…
▽ More
Outgassing is a central process during the formation and evolution of terrestrial planets and their atmospheres both within and beyond the solar system. Although terrestrial planets' early atmospheres likely form via outgassing during planetary accretion, the connection between a planet's bulk composition and its initial atmospheric properties is not well understood. One way to inform this connection is to analyze the outgassing compositions of meteorites, and in particular carbonaceous chondrites, because they are some of the most volatile-rich, primitive materials (in terms of their bulk compositions) that are available for direct study. In addition, they may serve as compositional analogs for the building block materials of terrestrial planets in our solar system and around other Sun-like stars. This study builds upon previous outgassing experiments that monitored the abundances of volatile species (e.g., H2O, CO, and CO2) released from the Murchison meteorite. To gain a more complete understanding of Murchison's outgassing composition, we perform a series of heating experiments under atmospheric pressure (1 bar) and vacuum (1E-9 bar) conditions on samples of the Murchison meteorite and subsequent bulk element analysis to inform the outgassing trends of a suite of major elements in Murchison (e.g., Fe, Mg, Zn, and S). Under both pressure conditions, sulfur outgases significantly at the highest temperatures (800C - 1000C). For the samples heated under vacuum conditions, we also detect outgassing of zinc. Combined with prior outgassing experiments, this study provides important insights into the volatile depletion patterns of undifferentiated planetesimals and the early outgassing compositions of terrestrial exoplanets.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
Tackling the dimensions in imaging genetics with CLUB-PLS
Authors:
Andre Altmann,
Ana C Lawry Aguila,
Neda Jahanshad,
Paul M Thompson,
Marco Lorenzi
Abstract:
A major challenge in imaging genetics and similar fields is to link high-dimensional data in one domain, e.g., genetic data, to high dimensional data in a second domain, e.g., brain imaging data. The standard approach in the area are mass univariate analyses across genetic factors and imaging phenotypes. That entails executing one genome-wide association study (GWAS) for each pre-defined imaging m…
▽ More
A major challenge in imaging genetics and similar fields is to link high-dimensional data in one domain, e.g., genetic data, to high dimensional data in a second domain, e.g., brain imaging data. The standard approach in the area are mass univariate analyses across genetic factors and imaging phenotypes. That entails executing one genome-wide association study (GWAS) for each pre-defined imaging measure. Although this approach has been tremendously successful, one shortcoming is that phenotypes must be pre-defined. Consequently, effects that are not confined to pre-selected regions of interest or that reflect larger brain-wide patterns can easily be missed. In this work we introduce a Partial Least Squares (PLS)-based framework, which we term Cluster-Bootstrap PLS (CLUB-PLS), that can work with large input dimensions in both domains as well as with large sample sizes. One key factor of the framework is to use cluster bootstrap to provide robust statistics for single input features in both domains. We applied CLUB-PLS to investigating the genetic basis of surface area and cortical thickness in a sample of 33,000 subjects from the UK Biobank. We found 107 genome-wide significant locus-phenotype pairs that are linked to 386 different genes. We found that a vast majority of these loci could be technically validated at a high rate: using classic GWAS or Genome-Wide Inferred Statistics (GWIS) we found that 85 locus-phenotype pairs exceeded the genome-wide suggestive (P<1e-05) threshold.
△ Less
Submitted 19 September, 2023; v1 submitted 13 September, 2023;
originally announced September 2023.
-
Tracing Evolution in Massive Protostellar Objects (TEMPO) -- I: Fragmentation and emission properties of massive star-forming clumps in a luminosity limited ALMA sample
Authors:
A. Avison,
G. A. Fuller,
N. Asabre Frimpong,
S. Etoka,
M. Hoare,
B. M. Jones,
N. Peretto,
A. Traficante,
F. van der Tak,
J. E. Pineda,
M. Beltrán,
F. Wyrowski,
M. Thompson,
S. Lumsden,
Z. Nagy,
T. Hill,
S. Viti,
F. Fontani,
P. Schilke
Abstract:
The role of massive ($\geq$ 8M$_{\odot}$) stars in defining the energy budget and chemical enrichment of the interstellar medium in their host galaxy is significant. In this first paper from the Tracing Evolution in Massive Protostellar Objects (TEMPO) project we introduce a colour-luminosity selected (L$_*$ $\sim$ 3$\times10^3$ to 1$\times10^5$ L$_{\odot}$) sample of 38 massive star forming regio…
▽ More
The role of massive ($\geq$ 8M$_{\odot}$) stars in defining the energy budget and chemical enrichment of the interstellar medium in their host galaxy is significant. In this first paper from the Tracing Evolution in Massive Protostellar Objects (TEMPO) project we introduce a colour-luminosity selected (L$_*$ $\sim$ 3$\times10^3$ to 1$\times10^5$ L$_{\odot}$) sample of 38 massive star forming regions observed with ALMA at 1.3mm and explore the fragmentation, clustering and flux density properties of the sample. The TEMPO sample fields are each found to contain multiple fragments (between 2-15 per field). The flux density budget is split evenly (53%-47%) between fields where emission is dominated by a single high flux density fragment and those in which the combined flux density of fainter objects dominates. The fragmentation scales observed in most fields are not comparable with the thermal Jeans length, $λ_J$, being larger in the majority of cases, suggestive of some non-thermal mechanism. A tentative evolutionary trend is seen between luminosity of the clump and the `spectral line richness' of the TEMPO fields; with 6.7GHz maser associated fields found to be lower luminosity and more line rich. This work also describes a method of line-free continuum channel selection within ALMA data and a generalised approach used to distinguishing sources which are potentially star-forming from those which are not, utilising interferometric visibility properties.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis
Authors:
Nikhil J. Dhinagar,
Amit Singh,
Saket Ozarkar,
Ketaki Buwa,
Sophia I. Thomopoulos,
Conor Owens-Walton,
Emily Laltoo,
Yao-Liang Chen,
Philip Cook,
Corey McMillan,
Chih-Chien Tsai,
J-J Wang,
Yih-Ru Wu,
Paul M. Thompson
Abstract:
Transfer learning represents a recent paradigm shift in the way we build artificial intelligence (AI) systems. In contrast to training task-specific models, transfer learning involves pre-training deep learning models on a large corpus of data and minimally fine-tuning them for adaptation to specific tasks. Even so, for 3D medical imaging tasks, we do not know if it is best to pre-train models on…
▽ More
Transfer learning represents a recent paradigm shift in the way we build artificial intelligence (AI) systems. In contrast to training task-specific models, transfer learning involves pre-training deep learning models on a large corpus of data and minimally fine-tuning them for adaptation to specific tasks. Even so, for 3D medical imaging tasks, we do not know if it is best to pre-train models on natural images, medical images, or even synthetically generated MRI scans or video data. To evaluate these alternatives, here we benchmarked vision transformers (ViTs) and convolutional neural networks (CNNs), initialized with varied upstream pre-training approaches. These methods were then adapted to three unique downstream neuroimaging tasks with a range of difficulty: Alzheimer's disease (AD) and Parkinson's disease (PD) classification, "brain age" prediction. Experimental tests led to the following key observations: 1. Pre-training improved performance across all tasks including a boost of 7.4% for AD classification and 4.6% for PD classification for the ViT and 19.1% for PD classification and reduction in brain age prediction error by 1.26 years for CNNs, 2. Pre-training on large-scale video or synthetic MRI data boosted performance of ViTs, 3. CNNs were robust in limited-data settings, and in-domain pretraining enhanced their performances, 4. Pre-training improved generalization to out-of-distribution datasets and sites. Overall, we benchmarked different vision architectures, revealing the value of pre-training them with emerging datasets for model initialization. The resulting pre-trained models can be adapted to a range of downstream neuroimaging tasks, even when training data for the target task is limited.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Linking Symptom Inventories using Semantic Textual Similarity
Authors:
Eamonn Kennedy,
Shashank Vadlamani,
Hannah M Lindsey,
Kelly S Peterson,
Kristen Dams OConnor,
Kenton Murray,
Ronak Agarwal,
Houshang H Amiri,
Raeda K Andersen,
Talin Babikian,
David A Baron,
Erin D Bigler,
Karen Caeyenberghs,
Lisa Delano-Wood,
Seth G Disner,
Ekaterina Dobryakova,
Blessen C Eapen,
Rachel M Edelstein,
Carrie Esopenko,
Helen M Genova,
Elbert Geuze,
Naomi J Goodrich-Hunsaker,
Jordan Grafman,
Asta K Haberg,
Cooper B Hodges
, et al. (57 additional authors not shown)
Abstract:
An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores…
▽ More
An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores across previously incongruous symptom inventories. We tested the ability of four pre-trained STS models to screen thousands of symptom description pairs for related content - a challenging task typically requiring expert panels. Models were tasked to predict symptom severity across four different inventories for 6,607 participants drawn from 16 international data sources. The STS approach achieved 74.8% accuracy across five tasks, outperforming other models tested. This work suggests that incorporating contextual, semantic information can assist expert decision-making processes, yielding gains for both general and disease-specific clinical assessment.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Tensor products of multimatroids and a Brylawski-type formula for the transition polynomial
Authors:
Iain Moffatt,
Steven Noble,
Maya Thompson
Abstract:
Brylawski's tensor product formula expresses the Tutte polynomial of the tensor product of two graphs in terms of Tutte polynomials arising from the tensor factors. We are concerned with extensions of Brylawski's tensor product formula to the Bollobas-Riordan and transition polynomials of graphs embedded in surfaces. We give a tensor product formula for the multimatroid transition polynomial and s…
▽ More
Brylawski's tensor product formula expresses the Tutte polynomial of the tensor product of two graphs in terms of Tutte polynomials arising from the tensor factors. We are concerned with extensions of Brylawski's tensor product formula to the Bollobas-Riordan and transition polynomials of graphs embedded in surfaces. We give a tensor product formula for the multimatroid transition polynomial and show that Brylawski's formula and its topological analogues arise as specialisations of this more general result.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
Probabilistic Phase Labeling and Lattice Refinement for Autonomous Material Research
Authors:
Ming-Chiang Chang,
Sebastian Ament,
Maximilian Amsler,
Duncan R. Sutherland,
Lan Zhou,
John M. Gregoire,
Carla P. Gomes,
R. Bruce van Dover,
Michael O. Thompson
Abstract:
X-ray diffraction (XRD) is an essential technique to determine a material's crystal structure in high-throughput experimentation, and has recently been incorporated in artificially intelligent agents in autonomous scientific discovery processes. However, rapid, automated and reliable analysis method of XRD data matching the incoming data rate remains a major challenge. To address these issues, we…
▽ More
X-ray diffraction (XRD) is an essential technique to determine a material's crystal structure in high-throughput experimentation, and has recently been incorporated in artificially intelligent agents in autonomous scientific discovery processes. However, rapid, automated and reliable analysis method of XRD data matching the incoming data rate remains a major challenge. To address these issues, we present CrystalShift, an efficient algorithm for probabilistic XRD phase labeling that employs symmetry-constrained pseudo-refinement optimization, best-first tree search, and Bayesian model comparison to estimate probabilities for phase combinations without requiring phase space information or training. We demonstrate that CrystalShift provides robust probability estimates, outperforming existing methods on synthetic and experimental datasets, and can be readily integrated into high-throughput experimental workflows. In addition to efficient phase-mapping, CrystalShift offers quantitative insights into materials' structural parameters, which facilitate both expert evaluation and AI-based modeling of the phase space, ultimately accelerating materials identification and discovery.
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
SCOTCH -- Search for Clandestine Optically Thick Compact HIIs
Authors:
A. L. Patel,
J. S. Urquhart,
A. Y. Yang,
T. J. T Moore,
K. M. Menten,
M. A. Thompson,
M. G. Hoare,
T. Irabor,
S. L. Breen,
M. D. Smith
Abstract:
This study uses archival high frequency continuum data to expand the search for Hypercompact HII regions and determine the conditions at which they appear, as this stage high mass star formation is short-lived and rare. We use 23 GHz continuum data taken towards methanol masers, which are an excellent signpost for very young embedded high-mass protostars. We have searched for high-frequency, optic…
▽ More
This study uses archival high frequency continuum data to expand the search for Hypercompact HII regions and determine the conditions at which they appear, as this stage high mass star formation is short-lived and rare. We use 23 GHz continuum data taken towards methanol masers, which are an excellent signpost for very young embedded high-mass protostars. We have searched for high-frequency, optically thick radio sources to identify HC HII region candidates. The data cover 128 fields that include 141 methanol masers identified by the Methanol Multibeam (MMB) survey. We have detected 68 high-frequency radio sources and conducted a multi-wavelength analysis to determine their nature. This has identified 49 HII regions, 47 of which are embedded in dense clumps fourteen of which do not have a 5 GHz radio counterpart. We have identified 13 methanol maser sites that are coincident with radio sources that have a steep positive spectral index. The majority of these are not detected in the mid-infrared and have been classified as protostellar or young stellar objects in the literature and we therefore consider to be good HC HII region candidates, however, further work and higher resolution data are needed to confirm these candidates.
△ Less
Submitted 16 July, 2023;
originally announced July 2023.
-
Generalized Core Spanner Inexpressibility via Ehrenfeucht-Fraïssé Games for FC
Authors:
Sam M. Thompson,
Dominik D. Freydenberger
Abstract:
Despite considerable research on document spanners, little is known about the expressive power of generalized core spanners. In this paper, we use Ehrenfeucht-Fraïssé games to obtain general inexpressibility lemmas for the logic FC (a finite-model variant of the theory of concatenation). Applying these lemmas give inexpressibility results for FC that we lift to generalized core spanners. In partic…
▽ More
Despite considerable research on document spanners, little is known about the expressive power of generalized core spanners. In this paper, we use Ehrenfeucht-Fraïssé games to obtain general inexpressibility lemmas for the logic FC (a finite-model variant of the theory of concatenation). Applying these lemmas give inexpressibility results for FC that we lift to generalized core spanners. In particular, we give several relations that cannot be selected by generalized core spanners, thus demonstrating the effectiveness of the inexpressibility lemmas. As an immediate consequence, we also gain new insights into the expressive power of core spanners.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Estimating dynamic treatment regimes for ordinal outcomes with household interference: Application in household smoking cessation
Authors:
Cong Jiang,
Mary Thompson,
Michael Wallace
Abstract:
The focus of precision medicine is on decision support, often in the form of dynamic treatment regimes (DTRs), which are sequences of decision rules. At each decision point, the decision rules determine the next treatment according to the patient's baseline characteristics, the information on treatments and responses accrued by that point, and the patient's current health status, including symptom…
▽ More
The focus of precision medicine is on decision support, often in the form of dynamic treatment regimes (DTRs), which are sequences of decision rules. At each decision point, the decision rules determine the next treatment according to the patient's baseline characteristics, the information on treatments and responses accrued by that point, and the patient's current health status, including symptom severity and other measures. However, DTR estimation with ordinal outcomes is rarely studied, and rarer still in the context of interference - where one patient's treatment may affect another's outcome. In this paper, we introduce the weighted proportional odds model (WPOM): a regression-based, approximate doubly-robust approach to single-stage DTR estimation for ordinal outcomes. This method also accounts for the possibility of interference between individuals sharing a household through the use of covariate balancing weights derived from joint propensity scores. Examining different types of balancing weights, we verify the approximate double robustness of WPOM with our adjusted weights via simulation studies. We further extend WPOM to multi-stage DTR estimation with household interference, namely dWPOM (dynamic WPOM). Lastly, we demonstrate our proposed methodology in the analysis of longitudinal survey data from the Population Assessment of Tobacco and Health study, which motivates this work. Furthermore, considering interference, we provide optimal treatment strategies for households to achieve smoking cessation of the pair in the household.
△ Less
Submitted 20 December, 2023; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Incomplete Multimodal Learning for Complex Brain Disorders Prediction
Authors:
Reza Shirkavand,
Liang Zhan,
Heng Huang,
Li Shen,
Paul M. Thompson
Abstract:
Recent advancements in the acquisition of various brain data sources have created new opportunities for integrating multimodal brain data to assist in early detection of complex brain disorders. However, current data integration approaches typically need a complete set of biomedical data modalities, which may not always be feasible, as some modalities are only available in large-scale research coh…
▽ More
Recent advancements in the acquisition of various brain data sources have created new opportunities for integrating multimodal brain data to assist in early detection of complex brain disorders. However, current data integration approaches typically need a complete set of biomedical data modalities, which may not always be feasible, as some modalities are only available in large-scale research cohorts and are prohibitive to collect in routine clinical practice. Especially in studies of brain diseases, research cohorts may include both neuroimaging data and genetic data, but for practical clinical diagnosis, we often need to make disease predictions only based on neuroimages. As a result, it is desired to design machine learning models which can use all available data (different data could provide complementary information) during training but conduct inference using only the most common data modality. We propose a new incomplete multimodal data integration approach that employs transformers and generative adversarial networks to effectively exploit auxiliary modalities available during training in order to improve the performance of a unimodal model at inference. We apply our new method to predict cognitive degeneration and disease outcomes using the multimodal imaging genetic data from Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort. Experimental results demonstrate that our approach outperforms the related machine learning and deep learning methods by a significant margin.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
Thermodynamic route of Nb3Sn nucleation: Role of oxygen
Authors:
Zeming Sun,
Darrah K. Dare,
Zhaslan Baraissov,
David A. Muller,
Michael O. Thompson,
Matthias U. Liepe
Abstract:
Intermetallic Nb3Sn alloys have long been believed to form through Sn diffusion into Nb. However, our observations of significant oxygen content in Nb3Sn prompted an investigation of alternative formation mechanisms. Through experiments involving different oxide interfaces (clean HF-treated, native oxidized, and anodized), we demonstrate a thermodynamic route that fundamentally challenges the conv…
▽ More
Intermetallic Nb3Sn alloys have long been believed to form through Sn diffusion into Nb. However, our observations of significant oxygen content in Nb3Sn prompted an investigation of alternative formation mechanisms. Through experiments involving different oxide interfaces (clean HF-treated, native oxidized, and anodized), we demonstrate a thermodynamic route that fundamentally challenges the conventional Sn diffusion mechanism for Nb3Sn nucleation. Our results highlight the critical involvement of a SnOx intermediate phase. This new nucleation mechanism identifies the principles for growth optimization and new synthesis of high-quality Nb3Sn superconductors.
△ Less
Submitted 7 July, 2023; v1 submitted 8 May, 2023;
originally announced May 2023.
-
Surface oxides, carbides, and impurities on RF superconducting Nb and Nb3Sn: A comprehensive analysis
Authors:
Zeming Sun,
Zhaslan Baraissov,
Catherine A. Dukes,
Darrah K. Dare,
Thomas Oseroff,
Michael O. Thompson,
David A. Muller,
Matthias U. Liepe
Abstract:
Surface structures on radio-frequency (RF) superconductors are crucially important in determining their interaction with the RF field. Here we investigate the surface compositions, structural profiles, and valence distributions of oxides, carbides, and impurities on niobium (Nb) and niobium-tin (Nb3Sn) in situ under different processing conditions. We establish the underlying mechanisms of vacuum…
▽ More
Surface structures on radio-frequency (RF) superconductors are crucially important in determining their interaction with the RF field. Here we investigate the surface compositions, structural profiles, and valence distributions of oxides, carbides, and impurities on niobium (Nb) and niobium-tin (Nb3Sn) in situ under different processing conditions. We establish the underlying mechanisms of vacuum baking and nitrogen processing in Nb and demonstrate that carbide formation induced during high-temperature baking, regardless of gas environment, determines subsequent oxide formation upon air exposure or low-temperature baking, leading to modifications of the electron population profile. Our findings support the combined contribution of surface oxides and second-phase formation to the outcome of ultra-high vacuum baking (oxygen processing) and nitrogen processing. Also, we observe that vapor-diffused Nb3Sn contains thick metastable oxides, while electrochemically synthesized Nb3Sn only has a thin oxide layer. Our findings reveal fundamental mechanisms of baking and processing Nb and Nb3Sn surface structures for high-performance superconducting RF and quantum applications
△ Less
Submitted 16 October, 2023; v1 submitted 3 May, 2023;
originally announced May 2023.
-
A Comprehensive Corpus Callosum Segmentation Tool for Detecting Callosal Abnormalities and Genetic Associations from Multi Contrast MRIs
Authors:
Shruti P. Gadewar,
Elnaz Nourollahimoghadam,
Ravi R. Bhatt,
Abhinaav Ramesh,
Shayan Javid,
Iyad Ba Gari,
Alyssa H. Zhu,
Sophia Thomopoulos,
Paul M. Thompson,
Neda Jahanshad
Abstract:
Structural alterations of the midsagittal corpus callosum (midCC) have been associated with a wide range of brain disorders. The midCC is visible on most MRI contrasts and in many acquisitions with a limited field-of-view. Here, we present an automated tool for segmenting and assessing the shape of the midCC from T1w, T2w, and FLAIR images. We train a UNet on images from multiple public datasets t…
▽ More
Structural alterations of the midsagittal corpus callosum (midCC) have been associated with a wide range of brain disorders. The midCC is visible on most MRI contrasts and in many acquisitions with a limited field-of-view. Here, we present an automated tool for segmenting and assessing the shape of the midCC from T1w, T2w, and FLAIR images. We train a UNet on images from multiple public datasets to obtain midCC segmentations. A quality control algorithm is also built-in, trained on the midCC shape features. We calculate intraclass correlations (ICC) and average Dice scores in a test-retest dataset to assess segmentation reliability. We test our segmentation on poor quality and partial brain scans. We highlight the biological significance of our extracted features using data from over 40,000 individuals from the UK Biobank; we classify clinically defined shape abnormalities and perform genetic analyses.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
A hierarchical adaptive nonlinear model predictive control approach for maximizing tire force usage in autonomous vehicles
Authors:
James Dallas,
Michael Thompson,
Jonathan Y. M. Goh,
Avinash Balachandran
Abstract:
The ability to reliably maximize tire force usage would improve the safety of autonomous vehicles, especially in challenging edge cases. However, vehicle control near the limits of handling has many challenges, including robustly contending with tire force saturation, balancing model fidelity and computational efficiency, and coordinating inputs with the lower level chassis control system. This wo…
▽ More
The ability to reliably maximize tire force usage would improve the safety of autonomous vehicles, especially in challenging edge cases. However, vehicle control near the limits of handling has many challenges, including robustly contending with tire force saturation, balancing model fidelity and computational efficiency, and coordinating inputs with the lower level chassis control system. This work studies Nonlinear Model Predictive Control for limit handling, specifically adapting to changing tire-road conditions and maximally allocating tire force utilization. We present a novel hierarchical framework that combines a single-track model with longitudinal weight transfer dynamics in the predictive control layer, with lateral brake distribution occurring at the chassis control layer. This vehicle model is simultaneously used in an Unscented Kalman Filter for online friction estimation. Comparative experiments on a full-scale vehicle operating on a race track at up to 95% of maximum tire force usage demonstrate the overall practical effectiveness of this approach.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
The James Webb Space Telescope Mission
Authors:
Jonathan P. Gardner,
John C. Mather,
Randy Abbott,
James S. Abell,
Mark Abernathy,
Faith E. Abney,
John G. Abraham,
Roberto Abraham,
Yasin M. Abul-Huda,
Scott Acton,
Cynthia K. Adams,
Evan Adams,
David S. Adler,
Maarten Adriaensen,
Jonathan Albert Aguilar,
Mansoor Ahmed,
Nasif S. Ahmed,
Tanjira Ahmed,
Rüdeger Albat,
Loïc Albert,
Stacey Alberts,
David Aldridge,
Mary Marsha Allen,
Shaune S. Allen,
Martin Altenburg
, et al. (983 additional authors not shown)
Abstract:
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono…
▽ More
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astronomers will celebrate their accomplishments for the life of the mission, potentially as long as 20 years, and beyond. This report and the scientific discoveries that follow are extended thank-you notes to the 20,000 team members. The telescope is working perfectly, with much better image quality than expected. In this and accompanying papers, we give a brief history, describe the observatory, outline its objectives and current observing program, and discuss the inventions and people who made it possible. We cite detailed reports on the design and the measured performance on orbit.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
A Surface-Based Federated Chow Test Model for Integrating APOE Status, Tau Deposition Measure, and Hippocampal Surface Morphometry
Authors:
Jianfeng Wu,
Yi Su,
Yanxi Chen,
Wenhui Zhu,
Eric M. Reiman,
Richard J. Caselli,
Kewei Chen,
Paul M. Thompson,
Junwen Wang,
Yalin Wang
Abstract:
Background: Alzheimer's Disease (AD) is the most common type of age-related dementia, affecting 6.2 million people aged 65 or older according to CDC data. It is commonly agreed that discovering an effective AD diagnosis biomarker could have enormous public health benefits, potentially preventing or delaying up to 40% of dementia cases. Tau neurofibrillary tangles are the primary driver of downstre…
▽ More
Background: Alzheimer's Disease (AD) is the most common type of age-related dementia, affecting 6.2 million people aged 65 or older according to CDC data. It is commonly agreed that discovering an effective AD diagnosis biomarker could have enormous public health benefits, potentially preventing or delaying up to 40% of dementia cases. Tau neurofibrillary tangles are the primary driver of downstream neurodegeneration and subsequent cognitive impairment in AD, resulting in structural deformations such as hippocampal atrophy that can be observed in magnetic resonance imaging (MRI) scans. Objective: To build a surface-based model to 1) detect differences between APOE subgroups in patterns of tau deposition and hippocampal atrophy, and 2) use the extracted surface-based features to predict cognitive decline. Methods: Using data obtained from different institutions, we develop a surface-based federated Chow test model to study the synergistic effects of APOE, a previously reported significant risk factor of AD, and tau on hippocampal surface morphometry. Results: We illustrate that the APOE-specific morphometry features correlate with AD progression and better predict future AD conversion than other MRI biomarkers. For example, a strong association between atrophy and abnormal tau was identified in hippocampal subregion cornu ammonis 1 (CA1 subfield) and subiculum in e4 homozygote cohort. Conclusion: Our model allows for identifying MRI biomarkers for AD and cognitive decline prediction and may uncover a corner of the neural mechanism of the influence of APOE and tau deposition on hippocampal morphology.
△ Less
Submitted 31 March, 2023;
originally announced April 2023.
-
Improving Deep Dynamics Models for Autonomous Vehicles with Multimodal Latent Mapping of Surfaces
Authors:
Johan Vertens,
Nicolai Dorka,
Tim Welschehold,
Michael Thompson,
Wolfram Burgard
Abstract:
The safe deployment of autonomous vehicles relies on their ability to effectively react to environmental changes. This can require maneuvering on varying surfaces which is still a difficult problem, especially for slippery terrains. To address this issue we propose a new approach that learns a surface-aware dynamics model by conditioning it on a latent variable vector storing surface information a…
▽ More
The safe deployment of autonomous vehicles relies on their ability to effectively react to environmental changes. This can require maneuvering on varying surfaces which is still a difficult problem, especially for slippery terrains. To address this issue we propose a new approach that learns a surface-aware dynamics model by conditioning it on a latent variable vector storing surface information about the current location. A latent mapper is trained to update these latent variables during inference from multiple modalities on every traversal of the corresponding locations and stores them in a map. By training everything end-to-end with the loss of the dynamics model, we enforce the latent mapper to learn an update rule for the latent map that is useful for the subsequent dynamics model. We implement and evaluate our approach on a real miniature electric car. The results show that the latent map is updated to allow more accurate predictions of the dynamics model compared to a model without this information. We further show that by using this model, the driving performance can be improved on varying and challenging surfaces.
△ Less
Submitted 21 March, 2023;
originally announced March 2023.
-
GPT-4 Technical Report
Authors:
OpenAI,
Josh Achiam,
Steven Adler,
Sandhini Agarwal,
Lama Ahmad,
Ilge Akkaya,
Florencia Leoni Aleman,
Diogo Almeida,
Janko Altenschmidt,
Sam Altman,
Shyamal Anadkat,
Red Avila,
Igor Babuschkin,
Suchir Balaji,
Valerie Balcom,
Paul Baltescu,
Haiming Bao,
Mohammad Bavarian,
Jeff Belgum,
Irwan Bello,
Jake Berdine,
Gabriel Bernadett-Shapiro,
Christopher Berner,
Lenny Bogdonoff,
Oleg Boiko
, et al. (256 additional authors not shown)
Abstract:
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…
▽ More
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4.
△ Less
Submitted 4 March, 2024; v1 submitted 15 March, 2023;
originally announced March 2023.
-
Few-Shot Classification of Autism Spectrum Disorder using Site-Agnostic Meta-Learning and Brain MRI
Authors:
Nikhil J. Dhinagar,
Vignesh Santhalingam,
Katherine E. Lawrence,
Emily Laltoo,
Paul M. Thompson
Abstract:
For machine learning applications in medical imaging, the availability of training data is often limited, which hampers the design of radiological classifiers for subtle conditions such as autism spectrum disorder (ASD). Transfer learning is one method to counter this problem of low training data regimes. Here we explore the use of meta-learning for very low data regimes in the context of having p…
▽ More
For machine learning applications in medical imaging, the availability of training data is often limited, which hampers the design of radiological classifiers for subtle conditions such as autism spectrum disorder (ASD). Transfer learning is one method to counter this problem of low training data regimes. Here we explore the use of meta-learning for very low data regimes in the context of having prior data from multiple sites - an approach we term site-agnostic meta-learning. Inspired by the effectiveness of meta-learning for optimizing a model across multiple tasks, here we propose a framework to adapt it to learn across multiple sites. We tested our meta-learning model for classifying ASD versus typically developing controls in 2,201 T1-weighted (T1-w) MRI scans collected from 38 imaging sites as part of Autism Brain Imaging Data Exchange (ABIDE) [age: 5.2-64.0 years]. The method was trained to find a good initialization state for our model that can quickly adapt to data from new unseen sites by fine-tuning on the limited data that is available. The proposed method achieved an ROC-AUC=0.857 on 370 scans from 7 unseen sites in ABIDE using a few-shot setting of 2-way 20-shot i.e., 20 training samples per site. Our results outperformed a transfer learning baseline by generalizing across a wider range of sites as well as other related prior work. We also tested our model in a zero-shot setting on an independent test site without any additional fine-tuning. Our experiments show the promise of the proposed site-agnostic meta-learning framework for challenging neuroimaging tasks involving multi-site heterogeneity with limited availability of training data.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Efficiently Training Vision Transformers on Structural MRI Scans for Alzheimer's Disease Detection
Authors:
Nikhil J. Dhinagar,
Sophia I. Thomopoulos,
Emily Laltoo,
Paul M. Thompson
Abstract:
Neuroimaging of large populations is valuable to identify factors that promote or resist brain disease, and to assist diagnosis, subtyping, and prognosis. Data-driven models such as convolutional neural networks (CNNs) have increasingly been applied to brain images to perform diagnostic and prognostic tasks by learning robust features. Vision transformers (ViT) - a new class of deep learning archi…
▽ More
Neuroimaging of large populations is valuable to identify factors that promote or resist brain disease, and to assist diagnosis, subtyping, and prognosis. Data-driven models such as convolutional neural networks (CNNs) have increasingly been applied to brain images to perform diagnostic and prognostic tasks by learning robust features. Vision transformers (ViT) - a new class of deep learning architectures - have emerged in recent years as an alternative to CNNs for several computer vision applications. Here we tested variants of the ViT architecture for a range of desired neuroimaging downstream tasks based on difficulty, in this case for sex and Alzheimer's disease (AD) classification based on 3D brain MRI. In our experiments, two vision transformer architecture variants achieved an AUC of 0.987 for sex and 0.892 for AD classification, respectively. We independently evaluated our models on data from two benchmark AD datasets. We achieved a performance boost of 5% and 9-10% upon fine-tuning vision transformer models pre-trained on synthetic (generated by a latent diffusion model) and real MRI scans, respectively. Our main contributions include testing the effects of different ViT training strategies including pre-training, data augmentation and learning rate warm-ups followed by annealing, as pertaining to the neuroimaging domain. These techniques are essential for training ViT-like models for neuroimaging applications where training data is usually limited. We also analyzed the effect of the amount of training data utilized on the test-time performance of the ViT via data-model scaling curves.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Transferring Models Trained on Natural Images to 3D MRI via Position Encoded Slice Models
Authors:
Umang Gupta,
Tamoghna Chattopadhyay,
Nikhil Dhinagar,
Paul M. Thompson,
Greg Ver Steeg,
The Alzheimer's Disease Neuroimaging Initiative
Abstract:
Transfer learning has remarkably improved computer vision. These advances also promise improvements in neuroimaging, where training set sizes are often small. However, various difficulties arise in directly applying models pretrained on natural images to radiologic images, such as MRIs. In particular, a mismatch in the input space (2D images vs. 3D MRIs) restricts the direct transfer of models, of…
▽ More
Transfer learning has remarkably improved computer vision. These advances also promise improvements in neuroimaging, where training set sizes are often small. However, various difficulties arise in directly applying models pretrained on natural images to radiologic images, such as MRIs. In particular, a mismatch in the input space (2D images vs. 3D MRIs) restricts the direct transfer of models, often forcing us to consider only a few MRI slices as input. To this end, we leverage the 2D-Slice-CNN architecture of Gupta et al. (2021), which embeds all the MRI slices with 2D encoders (neural networks that take 2D image input) and combines them via permutation-invariant layers. With the insight that the pretrained model can serve as the 2D encoder, we initialize the 2D encoder with ImageNet pretrained weights that outperform those initialized and trained from scratch on two neuroimaging tasks -- brain age prediction on the UK Biobank dataset and Alzheimer's disease detection on the ADNI dataset. Further, we improve the modeling capabilities of 2D-Slice models by incorporating spatial information through position embeddings, which can improve the performance in some cases.
△ Less
Submitted 2 March, 2023;
originally announced March 2023.
-
ZrNb(CO) RF superconducting thin film with high critical temperature in the theoretical limit
Authors:
Zeming Sun,
Thomas Oseroff,
Zhaslan Baraissov,
Darrah K. Dare,
Katrina Howard,
Benjamin Francis,
Ajinkya C. Hire,
Nathan Sitaraman,
Tomas A. Arias,
Mark K. Transtrum,
Richard Hennig,
Michael O. Thompson,
David A. Muller,
Matthias U. Liepe
Abstract:
Superconducting radio-frequency (SRF) resonators are critical components for particle accelerator applications, such as free-electron lasers, and for emerging technologies in quantum computing. Developing advanced materials and their deposition processes to produce RF superconductors that yield nanoohms surface resistances is a key metric for the wider adoption of SRF technology. Here we report Zr…
▽ More
Superconducting radio-frequency (SRF) resonators are critical components for particle accelerator applications, such as free-electron lasers, and for emerging technologies in quantum computing. Developing advanced materials and their deposition processes to produce RF superconductors that yield nanoohms surface resistances is a key metric for the wider adoption of SRF technology. Here we report ZrNb(CO) RF superconducting films with high critical temperatures (Tc) achieved for the first time under ambient pressure. The attainment of a Tc near the theoretical limit for this material without applied pressure is promising for its use in practical applications. A range of Tc, likely arising from Zr doping variation, may allow a tunable superconducting coherence length that lowers the sensitivity to material defects when an ultra-low surface resistance is required. Our ZrNb(CO) films are synthesized using a low-temperature (100 - 200 C) electrochemical recipe combined with thermal annealing. The phase transformation as a function of annealing temperature and time is optimized by the evaporated Zr-Nb diffusion couples. Through phase control, we avoid hexagonal Zr phases that are equilibrium-stable but degrade Tc. X-ray and electron diffraction combined with photoelectron spectroscopy reveal a system containing cubic ZrNb mixed with rocksalt NbC and low-dielectric-loss ZrO2. We demonstrate proof-of-concept RF performance of ZrNb(CO) on an SRF sample test system. BCS resistance trends lower than reference Nb, while quench fields occur at approximately 35 mT. Our results demonstrate the potential of ZrNb(CO) thin films for particle accelerator and other SRF applications.
△ Less
Submitted 12 June, 2023; v1 submitted 28 February, 2023;
originally announced February 2023.
-
Curriculum Based Multi-Task Learning for Parkinson's Disease Detection
Authors:
Nikhil J. Dhinagar,
Conor Owens-Walton,
Emily Laltoo,
Christina P. Boyle,
Yao-Liang Chen,
Philip Cook,
Corey McMillan,
Chih-Chien Tsai,
J-J Wang,
Yih-Ru Wu,
Ysbrand van der Werf,
Paul M. Thompson
Abstract:
There is great interest in developing radiological classifiers for diagnosis, staging, and predictive modeling in progressive diseases such as Parkinson's disease (PD), a neurodegenerative disease that is difficult to detect in its early stages. Here we leverage severity-based meta-data on the stages of disease to define a curriculum for training a deep convolutional neural network (CNN). Typicall…
▽ More
There is great interest in developing radiological classifiers for diagnosis, staging, and predictive modeling in progressive diseases such as Parkinson's disease (PD), a neurodegenerative disease that is difficult to detect in its early stages. Here we leverage severity-based meta-data on the stages of disease to define a curriculum for training a deep convolutional neural network (CNN). Typically, deep learning networks are trained by randomly selecting samples in each mini-batch. By contrast, curriculum learning is a training strategy that aims to boost classifier performance by starting with examples that are easier to classify. Here we define a curriculum to progressively increase the difficulty of the training data corresponding to the Hoehn and Yahr (H&Y) staging system for PD (total N=1,012; 653 PD patients, 359 controls; age range: 20.0-84.9 years). Even with our multi-task setting using pre-trained CNNs and transfer learning, PD classification based on T1-weighted (T1-w) MRI was challenging (ROC AUC: 0.59-0.65), but curriculum training boosted performance (by 3.9%) compared to our baseline model. Future work with multimodal imaging may further boost performance.
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
Smooth, homogeneous, high-purity Nb3Sn superconducting RF resonant cavity by seed-free electrochemical synthesis
Authors:
Zeming Sun,
Zhaslan Baraissov,
Ryan D. Porter,
Liana Shpani,
Yu-Tsun Shao,
Thomas Oseroff,
Michael O. Thompson,
David A. Muller,
Matthias U. Liepe
Abstract:
Workbench-size particle accelerators, enabled by Nb3Sn-based superconducting radio-frequency (SRF) cavities, hold the potential of driving scientific discovery by offering a widely accessible and affordable source of high-energy electrons and X-rays. Thin-film Nb3Sn RF superconductors with high quality factors, high operation temperatures, and high-field potentials are critical for these devices.…
▽ More
Workbench-size particle accelerators, enabled by Nb3Sn-based superconducting radio-frequency (SRF) cavities, hold the potential of driving scientific discovery by offering a widely accessible and affordable source of high-energy electrons and X-rays. Thin-film Nb3Sn RF superconductors with high quality factors, high operation temperatures, and high-field potentials are critical for these devices. However, surface roughness, non-stoichiometry, and impurities in Nb3Sn deposited by conventional Sn-vapor diffusion prevent them from reaching their theoretical capabilities. Here we demonstrate a seed-free electrochemical synthesis that pushes the limit of chemical and physical properties in Nb3Sn. Utilization of electrochemical Sn pre-deposits reduces the roughness of converted Nb3Sn by five times compared to typical vapor-diffused Nb3Sn. Quantitative mappings using chemical and atomic probes confirm improved stoichiometry and minimized impurity concentrations in electrochemically synthesized Nb3Sn. We have successfully applied this Nb3Sn to the large-scale 1.3 GHz SRF cavity and demonstrated ultra-low BCS surface resistances at multiple operation temperatures, notably lower than vapor-diffused cavities. Our smooth, homogeneous, high-purity Nb3Sn provides the route toward high efficiency and high fields for SRF applications under helium-free cryogenic operations.
△ Less
Submitted 5 September, 2023; v1 submitted 3 February, 2023;
originally announced February 2023.
-
Kinetic Insights into Bridge Cleavage Pathways in Periodic Mesoporous Organosilicas
Authors:
Zeming Sun,
Aine Connolly,
Michael O. Thompson
Abstract:
Bridging functionalities in periodic mesoporous organosilicas (PMOs) enable new functionalities for a wide range of applications. Bridge cleavage is frequently observed during anneals required to form porous structures, yet the mechanism of these bridge cleavages has not been completely resolved. Here, we reveal these chemical transformations and their kinetic pathways on sub-millisecond timescale…
▽ More
Bridging functionalities in periodic mesoporous organosilicas (PMOs) enable new functionalities for a wide range of applications. Bridge cleavage is frequently observed during anneals required to form porous structures, yet the mechanism of these bridge cleavages has not been completely resolved. Here, we reveal these chemical transformations and their kinetic pathways on sub-millisecond timescales induced by laser heating. By varying anneal times and temperatures, the transformation dynamics of bridge cleavage and structural transformations, and their activation energies, are determined. The structural relaxation time for individual reactions and their effective local heating time are determined and compared, and results directly demonstrate the manipulation of different molecules through kinetic control of the sequence of reactions. By isolating and understanding the earliest stage of structural transformations, this study identifies the kinetic principles for new synthesis and post-processing routes to control individual molecules and reactions in PMOs and other material systems with multi-functionalities.
△ Less
Submitted 18 January, 2023; v1 submitted 15 January, 2023;
originally announced January 2023.
-
Red Emission from Copper-Vacancy Color Centers in Zinc Sulfide Colloidal Nanocrystals
Authors:
Sarah M. Thompson,
Cüneyt Şahin,
Shengsong Yang,
Michael E. Flatté,
Christopher B. Murray,
Lee C. Bassett,
Cherie R. Kagan
Abstract:
Copper-doped zinc sulfide (ZnS:Cu) exhibits down-conversion luminescence in the UV, visible, and IR regions of the electromagnetic spectrum; the visible red, green, and blue emission is referred to as R-Cu, G-Cu, and B-Cu, respectively. The sub-bandgap emission arises from optical transitions between localized electronic states created by point defects, making ZnS:Cu a prolific phosphor material a…
▽ More
Copper-doped zinc sulfide (ZnS:Cu) exhibits down-conversion luminescence in the UV, visible, and IR regions of the electromagnetic spectrum; the visible red, green, and blue emission is referred to as R-Cu, G-Cu, and B-Cu, respectively. The sub-bandgap emission arises from optical transitions between localized electronic states created by point defects, making ZnS:Cu a prolific phosphor material and an intriguing candidate material for quantum information science, where point defects excel as single-photon sources and spin qubits. Colloidal nanocrystals (NCs) of ZnS:Cu are particularly interesting as hosts for the creation, isolation, and measurement of quantum defects, since their size, composition, and surface chemistry can be precisely tailored for bio-sensing and opto-electronic applications. Here, we present a method for synthesizing colloidal ZnS:Cu NCs that emit primarily R-Cu, which has been proposed to arise from the Cu$_{Zn}$-V$_S$ complex, an impurity-vacancy point defect structure analogous to well-known quantum defects in other materials that produce favorable optical and spin dynamics. First principles calculations confirm the thermodynamic stability and electronic structure of Cu$_{Zn}$-V$_S$. Temperature- and time-dependent optical properties of ZnS:Cu NCs show blueshifting luminescence and an anomalous plateau in the intensity dependence as temperature is increased from 19 K to 290 K, for which we propose an empirical dynamical model based on thermally-activated coupling between two manifolds of states inside the ZnS bandgap. Understanding of R-Cu emission dynamics, combined with a controlled synthesis method for obtaining R-Cu centers in colloidal NC hosts, will greatly facilitate the development of Cu$_{Zn}$-V$_S$ and related complexes as quantum point defects in ZnS.
△ Less
Submitted 1 March, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Choosing statistical models to assess biological interaction as a departure from additivity of effects
Authors:
David M. Thompson,
Yan Daniel Zhao
Abstract:
Vanderweele and Knol define biological interaction as an instance wherein "two exposures physically interact to bring about the outcome." A hallmark of biological interaction is that the total effect, produced when factors act together, differs from the sum of effects when the factors operate independently. Epidemiologists construct statistical models to assess biological interaction. The form of…
▽ More
Vanderweele and Knol define biological interaction as an instance wherein "two exposures physically interact to bring about the outcome." A hallmark of biological interaction is that the total effect, produced when factors act together, differs from the sum of effects when the factors operate independently. Epidemiologists construct statistical models to assess biological interaction. The form of the statistical model determines whether it is suited to detecting departures from additivity of effects or for detecting departures from multiplicativity of effects. A consensus exists that biological interaction should be assessed as a departure from additivity of effects. This paper compares three statistical models' assessment of a data example that appears in several epidemiology textbooks to illustrate biological interaction in a binomial outcome. A linear binomial model quantifies departure from additivity in the data example in terms of differences in probabilities. It generates directly interpretable estimates and 95% confidence intervals for parameters including the interaction contrast (IC). Log binomial and logistic regression models detect no departure from multiplicativity in the data example. However, their estimates contribute to calculation of a "Relative Excess Risk Due to Interaction" (RERI), a measure of departure from additivity on a relative risk scale. The linear binomial model directly produces interpretable assessments of departures from additivity of effects and deserves wider use in research and in the teaching of epidemiology. Strategies exist to address the model's limitations.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
The Co-Ordinated Radio and Infrared Survey for High-Mass Star Formation. V. The CORNISH-South Survey and Catalogue
Authors:
T. Irabor,
M. G. Hoare,
M. Burton,
W. D. Cotton,
P. Diamond,
S. Dougherty,
S. P. Ellingsen,
R. Fender,
G. A. Fuller,
S. Garrington,
P. F. Goldsmith,
J. Green,
A. G. Gunn,
J. Jackson,
S. Kurtz,
S. L. Lumsden,
J. Marti,
I. McDonald,
S. Molinari,
T. J. Moore,
M. Mutale,
T. Muxlow,
T. OBrien,
R. D. Oudmaijer,
R. Paladini
, et al. (10 additional authors not shown)
Abstract:
We present the first high spatial resolution radio continuum survey of the southern Galactic plane. The CORNISH project has mapped the region defined by $295^{\circ} < l < 350^{\circ}$; $|b| < 1^{\circ}$ at 5.5-GHz, with a resolution of 2.5$^{''}$ (FWHM). As with the CORNISH-North survey, this is designed to primarily provide matching radio data to the Spitzer GLIMPSE survey region. The CORNISH-So…
▽ More
We present the first high spatial resolution radio continuum survey of the southern Galactic plane. The CORNISH project has mapped the region defined by $295^{\circ} < l < 350^{\circ}$; $|b| < 1^{\circ}$ at 5.5-GHz, with a resolution of 2.5$^{''}$ (FWHM). As with the CORNISH-North survey, this is designed to primarily provide matching radio data to the Spitzer GLIMPSE survey region. The CORNISH-South survey achieved a root mean square noise level of $\sim$ 0.11 mJy beam$^{-1}$, using the 6A configuration of the Australia Telescope Compact Array (ATCA). In this paper, we discuss the observations, data processing and measurements of the source properties. Above a 7$σ$ detection limit, 4701 sources were detected, and their ensemble properties show similar distributions with their northern counterparts. The catalogue is highly reliable and is complete to 90 per cent at a flux density level of 1.1 mJy. We developed a new way of measuring the integrated flux densities and angular sizes of non-Gaussian sources. The catalogue primarily provides positions, flux density measurements and angular sizes. All sources with IR counterparts at 8$μm$ have been visually classified, utilizing additional imaging data from optical, near-IR, mid-IR, far-IR and sub-millimetre galactic plane surveys. This has resulted in the detection of 524 H II regions of which 255 are ultra-compact H II regions, 287 planetary nebulae, 79 radio stars and 6 massive young stellar objects. The rest of the sources are likely to be extra-galactic. These data are particularly important in the characterization and population studies of compact ionized sources such as UCHII regions and PNe towards the Galactic mid-plane.
△ Less
Submitted 5 January, 2023;
originally announced January 2023.
-
Deletion-Contraction and the Surface Tutte Polynomial
Authors:
Iain Moffatt,
Maya Thompson
Abstract:
In this paper we unify two families of topological Tutte polynomials. The first family is that coming from the surface Tutte polynomial, a polynomial that arises in the theory of local flows and tensions. The second family arises from the canonical Tutte polynomials of Hopf algebras. Each family includes the Las Vergnas, Bollobás-Riordan, and Krushkal polynomials. As a consequence we determine a d…
▽ More
In this paper we unify two families of topological Tutte polynomials. The first family is that coming from the surface Tutte polynomial, a polynomial that arises in the theory of local flows and tensions. The second family arises from the canonical Tutte polynomials of Hopf algebras. Each family includes the Las Vergnas, Bollobás-Riordan, and Krushkal polynomials. As a consequence we determine a deletion-contraction definition of the surface Tutte polynomial and recursion relations for the number of local flows and tensions in an embedded graph.
△ Less
Submitted 25 January, 2024; v1 submitted 23 December, 2022;
originally announced December 2022.
-
Silicon-doped $β$-Ga$_2$O$_3$ films grown at 1 $μ$m/h by suboxide molecular-beam epitaxy
Authors:
Kathy Azizie,
Felix V. E. Hensling,
Cameron A. Gorsak,
Yunjo Kim,
Daniel M. Dryden,
M. K. Indika Senevirathna,
Selena Coye,
Shun-Li Shang,
Jacob Steele,
Patrick Vogt,
Nicholas A. Parker,
Yorick A. Birkhölzer,
Jonathan P. McCandless,
Debdeep Jena,
Huili G. Xing,
Zi-Kui Liu,
Michael D. Williams,
Andrew J. Green,
Kelson Chabak,
Adam T. Neal,
Shin Mou,
Michael O. Thompson,
Hari P. Nair,
Darrell G. Schlom
Abstract:
We report the use of suboxide molecular-beam epitaxy (S-MBE) to grow $β$-Ga$_2$O$_3$ at a growth rate of ~1 $μ$m/h with control of the silicon doping concentration from 5x10$^{16}$ to 10$^{19}$ cm$^{-3}$. In S-MBE, pre-oxidized gallium in the form of a molecular beam that is 99.98\% Ga$_2$O, i.e., gallium suboxide, is supplied. Directly supplying Ga2O to the growth surface bypasses the rate-limiti…
▽ More
We report the use of suboxide molecular-beam epitaxy (S-MBE) to grow $β$-Ga$_2$O$_3$ at a growth rate of ~1 $μ$m/h with control of the silicon doping concentration from 5x10$^{16}$ to 10$^{19}$ cm$^{-3}$. In S-MBE, pre-oxidized gallium in the form of a molecular beam that is 99.98\% Ga$_2$O, i.e., gallium suboxide, is supplied. Directly supplying Ga2O to the growth surface bypasses the rate-limiting first step of the two-step reaction mechanism involved in the growth of $β$-Ga$_2$O$_3$ by conventional MBE. As a result, a growth rate of ~1 $μ$m/h is readily achieved at a relatively low growth temperature (T$_{sub}$ = 525 $^\circ$C), resulting in films with high structural perfection and smooth surfaces (rms roughness of < 2 nm on ~1 $μ$m thick films). Silicon-containing oxide sources (SiO and SiO$_2$) producing an SiO suboxide molecular beam are used to dope the $β$-Ga$_2$O$_3$ layers. Temperature-dependent Hall effect measurements on a 1 $μ$m thick film with a mobile carrier concentration of 2.7x10$^{17}$ cm$^{-3}$ reveal a room-temperature mobility of 124 cm$^2$ V$^{-1}$ s$^{-1}$ that increases to 627 cm$^2$ V$^{-1}$ s$^{-1}$ at 76 K; the silicon dopants are found to exhibit an activation energy of 27 meV. We also demonstrate working MESFETs made from these silicon-doped $β$-Ga$_2$O$_3$ films grown by S-MBE at growth rates of ~1 $μ$m/h.
△ Less
Submitted 22 December, 2022;
originally announced December 2022.