subscribe to arXiv mailings

arXiv:2406.19998 [pdf, other]

Detectable signals of post-Born lensing curl B-modes

Authors: Mathew Robertson, Giulio Fabbian, Julien Carron, Antony Lewis

Abstract: Curl lensing, also known as lensing field-rotation or shear B-modes, is a distinct post-Born observable caused by two lensing deflections at different redshifts (lens-lens coupling). For the Cosmic Microwave Background (CMB), the field-rotation is approximately four orders of magnitude smaller than the CMB lensing convergence. Direct detection is therefore challenging for near-future CMB experimen… ▽ More Curl lensing, also known as lensing field-rotation or shear B-modes, is a distinct post-Born observable caused by two lensing deflections at different redshifts (lens-lens coupling). For the Cosmic Microwave Background (CMB), the field-rotation is approximately four orders of magnitude smaller than the CMB lensing convergence. Direct detection is therefore challenging for near-future CMB experiments such as the Simons Observatory (SO) or CMB `Stage-4' (CMB-S4). Instead, the curl can be probed in cross-correlation between a direct reconstruction and a template formed using pairs of large-scale structure (LSS) tracers to emulate the lens-lens coupling. In this paper, we derive a new estimator for the optimal curl template specifically adapted for curved-sky applications, and test it against non-Gaussian complications using N-body cosmology simulations. We find non-foreground biases to the curl cross-spectrum are purely Gaussian at the sensitivity of SO. However, higher-order curl contractions induce non-Gaussian bias at the order of $1σ$ for CMB-S4 using quadratic estimators (QE). Maximum a-Posteriori (MAP) lensing estimators significantly reduce biases for both SO and CMB-S4, in agreement with our analytic predictions. We also show that extragalactic foregrounds in the CMB can bias curl measurements at order of the signal, and evaluate a variety of mitigation strategies to control these biases for SO-like experiments. Near-future observations will be able to measure post-Born lensing curl B-modes. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Comments: 18 pages, 16 figures

arXiv:2406.16900 [pdf, other]

Utilizing Weak-to-Strong Consistency for Semi-Supervised Glomeruli Segmentation

Authors: Irina Zhang, Jim Denholm, Azam Hamidinekoo, Oskar Ålund, Christopher Bagnall, Joana Palés Huix, Michal Sulikowski, Ortensia Vito, Arthur Lewis, Robert Unwin, Magnus Soderberg, Nikolay Burlutskiy, Talha Qaiser

Abstract: Accurate segmentation of glomerulus instances attains high clinical significance in the automated analysis of renal biopsies to aid in diagnosing and monitoring kidney disease. Analyzing real-world histopathology images often encompasses inter-observer variability and requires a labor-intensive process of data annotation. Therefore, conventional supervised learning approaches generally achieve sub… ▽ More Accurate segmentation of glomerulus instances attains high clinical significance in the automated analysis of renal biopsies to aid in diagnosing and monitoring kidney disease. Analyzing real-world histopathology images often encompasses inter-observer variability and requires a labor-intensive process of data annotation. Therefore, conventional supervised learning approaches generally achieve sub-optimal performance when applied to external datasets. Considering these challenges, we present a semi-supervised learning approach for glomeruli segmentation based on the weak-to-strong consistency framework validated on multiple real-world datasets. Our experimental results on 3 independent datasets indicate superior performance of our approach as compared with existing supervised baseline models such as U-Net and SegFormer. △ Less

Submitted 30 May, 2024; originally announced June 2024.

Comments: accepted to MIDL'24

arXiv:2406.08583 [pdf, other]

Defining a Reference Architecture for Edge Systems in Highly-Uncertain Environments

Authors: Kevin Pitstick, Marc Novakouski, Grace A. Lewis, Ipek Ozkaya

Abstract: Increasing rate of progress in hardware and artificial intelligence (AI) solutions is enabling a range of software systems to be deployed closer to their users, increasing application of edge software system paradigms. Edge systems support scenarios in which computation is placed closer to where data is generated and needed, and provide benefits such as reduced latency, bandwidth optimization, and… ▽ More Increasing rate of progress in hardware and artificial intelligence (AI) solutions is enabling a range of software systems to be deployed closer to their users, increasing application of edge software system paradigms. Edge systems support scenarios in which computation is placed closer to where data is generated and needed, and provide benefits such as reduced latency, bandwidth optimization, and higher resiliency and availability. Users who operate in highly-uncertain and resource-constrained environments, such as first responders, law enforcement, and soldiers, can greatly benefit from edge systems to support timelier decision making. Unfortunately, understanding how different architecture approaches for edge systems impact priority quality concerns is largely neglected by industry and research, yet crucial for national and local safety, optimal resource utilization, and timely decision making. Much of industry is focused on the hardware and networking aspects of edge systems, with very little attention to the software that enables edge capabilities. This paper presents our work to fill this gap, defining a reference architecture for edge systems in highly-uncertain environments, and showing examples of how it has been implemented in practice. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: Paper accepted and presented at ESA 2024, the 1st Workshop on Edge Software Architectures, co-located with ICSA 2024, the 21st International Conference on Software Architecture

arXiv:2406.08575 [pdf, ps, other]

Using Quality Attribute Scenarios for ML Model Test Case Generation

Authors: Rachel Brower-Sinning, Grace A. Lewis, Sebastían Echeverría, Ipek Ozkaya

Abstract: Testing of machine learning (ML) models is a known challenge identified by researchers and practitioners alike. Unfortunately, current practice for ML model testing prioritizes testing for model performance, while often neglecting the requirements and constraints of the ML-enabled system that integrates the model. This limited view of testing leads to failures during integration, deployment, and o… ▽ More Testing of machine learning (ML) models is a known challenge identified by researchers and practitioners alike. Unfortunately, current practice for ML model testing prioritizes testing for model performance, while often neglecting the requirements and constraints of the ML-enabled system that integrates the model. This limited view of testing leads to failures during integration, deployment, and operations, contributing to the difficulties of moving models from development to production. This paper presents an approach based on quality attribute (QA) scenarios to elicit and define system- and model-relevant test cases for ML models. The QA-based approach described in this paper has been integrated into MLTE, a process and tool to support ML model test and evaluation. Feedback from users of MLTE highlights its effectiveness in testing beyond model performance and identifying failures early in the development process. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: Paper accepted and presented in SAML 2024, the 3rd International Workshop on Software Architecture and Machine Learning, co-located with ICSA 2024, the 21st IEEE International Conference on Software Architecture

arXiv:2406.03913 [pdf, other]

Recognizing weighted means in geodesic spaces

Authors: Ariel Goodwin, Adrian S. Lewis, Genaro Lopez-Acedo, Adriana Nicolae

Abstract: Geodesic metric spaces support a variety of averaging constructions for given finite sets. Computing such averages has generated extensive interest in diverse disciplines. Here we consider the inverse problem of recognizing computationally whether or not a given point is such an average, exactly or approximately. In nonpositively curved spaces, several averaging notions, including the usual weight… ▽ More Geodesic metric spaces support a variety of averaging constructions for given finite sets. Computing such averages has generated extensive interest in diverse disciplines. Here we consider the inverse problem of recognizing computationally whether or not a given point is such an average, exactly or approximately. In nonpositively curved spaces, several averaging notions, including the usual weighted barycenter, produce the same "mean set". In such spaces, at points where the tangent cone is a Euclidean space, the recognition problem reduces to Euclidean projection onto a polytope. Hadamard manifolds comprise one example. Another consists of CAT(0) cubical complexes, at relative-interior points: the recognition problem is harder for general points, but we present an efficient semidefinite-programming-based algorithm. △ Less

Submitted 6 June, 2024; originally announced June 2024.

MSC Class: 90C48; 57Z25; 65K10; 49M29 ACM Class: G.1.6

arXiv:2406.01754 [pdf, other]

Validating Automated Resonance Evaluation with Synthetic Data

Authors: Oleksii Zivenko, Noah A. W. Walton, William Fritsch, Jacob Forbes, Amanda M. Lewis, Aaron Clark, Jesse M. Brown, Vladimir Sobes

Abstract: The integrity and precision of nuclear data are crucial for a broad spectrum of applications, from national security and nuclear reactor design to medical diagnostics, where the associated uncertainties can significantly impact outcomes. A substantial portion of uncertainty in nuclear data originates from the subjective biases in the evaluation process, a crucial phase in the nuclear data producti… ▽ More The integrity and precision of nuclear data are crucial for a broad spectrum of applications, from national security and nuclear reactor design to medical diagnostics, where the associated uncertainties can significantly impact outcomes. A substantial portion of uncertainty in nuclear data originates from the subjective biases in the evaluation process, a crucial phase in the nuclear data production pipeline. Recent advancements indicate that automation of certain routines can mitigate these biases, thereby standardizing the evaluation process, reducing uncertainty and enhancing reproducibility. This article contributes to developing a framework for automated evaluation techniques testing, emphasizing automated fitting methods that do not require the user to provide any prior information. This approach simplifies the process and reduces the manual effort needed in the initial evaluation stage. It highlights the capability of the framework to validate and optimize subroutines, targeting the performance analysis and optimization of the fitting procedure using high-fidelity synthetic data (labeled experimental data) and the concept of a fully controlled computational experiment. An error metric is introduced to provide a clear and intuitive measure of the fitting quality by quantifying the accuracy and performance across the specified energy. This metric sets a scale for comparison and optimization of routines or hyperparameter selection, improving the entire evaluation process methodology and increasing reproducibility and objectivity. △ Less

Submitted 18 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: 39 pages, 12 figures; As a follow-up to arXiv:2303.09698 and arXiv:2402.14122

MSC Class: 65K10; 62H12 ACM Class: J.2; G.1.6

arXiv:2405.12655 [pdf, other]

Lipschitz minimization and the Goldstein modulus

Authors: Siyu Kong, Adrian S. Lewis

Abstract: Goldstein's 1977 idealized iteration for minimizing a Lipschitz objective fixes a distance - the step size - and relies on a certain approximate subgradient. That "Goldstein subgradient" is the shortest convex combination of objective gradients at points within that distance of the current iterate. A recent implementable Goldstein-style algorithm allows a remarkable complexity analysis (Zhang et a… ▽ More Goldstein's 1977 idealized iteration for minimizing a Lipschitz objective fixes a distance - the step size - and relies on a certain approximate subgradient. That "Goldstein subgradient" is the shortest convex combination of objective gradients at points within that distance of the current iterate. A recent implementable Goldstein-style algorithm allows a remarkable complexity analysis (Zhang et al. 2020), and a more sophisticated variant (Davis and Jiang, 2022) leverages typical objective geometry to force near-linear convergence. To explore such methods, we introduce a new modulus, based on Goldstein subgradients, that robustly measures the slope of a Lipschitz function. We relate near-linear convergence of Goldstein-style methods to linear growth of this modulus at minimizers. We illustrate the idea computationally with a simple heuristic for Lipschitz minimization. △ Less

Submitted 21 May, 2024; originally announced May 2024.

MSC Class: 90C56; 49J52; 65Y20 ACM Class: G.1.6

arXiv:2405.01968 [pdf, other]

Convex optimization on CAT(0) cubical complexes

Authors: Ariel Goodwin, Adrian S. Lewis, Genaro Lopez-Acedo, Adriana Nicolae

Abstract: We consider geodesically convex optimization problems involving distances to a finite set of points $A$ in a CAT(0) cubical complex. Examples include the minimum enclosing ball problem, the weighted mean and median problems, and the feasibility and projection problems for intersecting balls with centers in $A$. We propose a decomposition approach relying on standard Euclidean cutting plane algorit… ▽ More We consider geodesically convex optimization problems involving distances to a finite set of points $A$ in a CAT(0) cubical complex. Examples include the minimum enclosing ball problem, the weighted mean and median problems, and the feasibility and projection problems for intersecting balls with centers in $A$. We propose a decomposition approach relying on standard Euclidean cutting plane algorithms. The cutting planes are readily derivable from efficient algorithms for computing geodesics in the complex. △ Less

Submitted 3 May, 2024; originally announced May 2024.

MSC Class: 90C48; 52A41; 57Z25; 65K05 ACM Class: F.2.1

arXiv:2404.16797 [pdf, other]

Spherical bispectrum expansion and quadratic estimators

Authors: Julien Carron, Antony Lewis

Abstract: We describe a general expansion of spherical (full-sky) bispectra into a set of orthogonal modes. For squeezed shapes, the basis separates physically-distinct signals and is dominated by the lowest moments. In terms of reduced bispectra, we identify a set of discrete polynomials that are pairwise orthogonal with respect to the relevant Wigner 3j symbol, and reduce to Chebyshev polynomials in the f… ▽ More We describe a general expansion of spherical (full-sky) bispectra into a set of orthogonal modes. For squeezed shapes, the basis separates physically-distinct signals and is dominated by the lowest moments. In terms of reduced bispectra, we identify a set of discrete polynomials that are pairwise orthogonal with respect to the relevant Wigner 3j symbol, and reduce to Chebyshev polynomials in the flat-sky (high-momentum) limit for both parity-even and parity-odd cases. For squeezed shapes, the flat-sky limit is equivalent to previous moment expansions used for CMB bispectra and quadratic estimators, but in general reduces to a distinct expansion in the angular dependence of triangles at fixed total side length (momentum). We use the full-sky expansion to construct a tower of orthogonal CMB lensing quadratic estimators and construct estimators that are immune to foregrounds like point sources or noise inhomogeneities. In parity-even combinations (such as the lensing gradient mode from $TT$, or the lensing curl mode from $EB$) the leading two modes can be identified with information from the magnification and shear respectively, whereas the parity-odd combinations are shear-only. Although not directly separable, we show that these estimators can nonetheless be evaluated numerically sufficiently easily. △ Less

Submitted 25 April, 2024; originally announced April 2024.

Comments: 10 pages and the same of appendices, 8 figures

arXiv:2404.08893 [pdf, other]

Early detection of disease outbreaks and non-outbreaks using incidence data

Authors: Shan Gao, Amit K. Chakraborty, Russell Greiner, Mark A. Lewis, Hao Wang

Abstract: Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a… ▽ More Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a Susceptible-Infected-Recovered model for slowly changing, noisy disease dynamics. Outbreak sequences give a transcritical bifurcation within a specified future time window, whereas non-outbreak (null bifurcation) sequences do not. We identified incipient differences in time series of infectives leading to future outbreaks and non-outbreaks. These differences are reflected in 22 statistical features and 5 early warning signal indicators. Classifier performance, given by the area under the receiver-operating curve, ranged from 0.99 for large expanding windows of training data to 0.7 for small rolling windows. Real-world performances of classifiers were tested on two empirical datasets, COVID-19 data from Singapore and SARS data from Hong Kong, with two classifiers exhibiting high accuracy. In summary, we showed that there are statistical features that distinguish outbreak and non-outbreak sequences long before outbreaks occur. We could detect these differences in synthetic and real-world data sets, well before potential outbreaks occur. △ Less

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2404.02188 [pdf, other]

Data availability and requirements relevant for the Ariel space mission and other exoplanet atmosphere applications

Authors: Katy L. Chubb, Séverine Robert, Clara Sousa-Silva, Sergei N. Yurchenko, Nicole F. Allard, Vincent Boudon, Jeanna Buldyreva, Benjamin Bultel, Athena Coustenis, Aleksandra Foltynowicz, Iouli E. Gordon, Robert J. Hargreaves, Christiane Helling, Christian Hill, Helgi Rafn Hrodmarsson, Tijs Karman, Helena Lecoq-Molinos, Alessandra Migliorini, Michaël Rey, Cyril Richard, Ibrahim Sadiek, Frédéric Schmidt, Andrei Sokolov, Stefania Stefani, Jonathan Tennyson , et al. (30 additional authors not shown)

Abstract: The goal of this white paper is to provide a snapshot of the data availability and data needs primarily for the Ariel space mission, but also for related atmospheric studies of exoplanets and brown dwarfs. It covers the following data-related topics: molecular and atomic line lists, line profiles, computed cross-sections and opacities, collision-induced absorption and other continuum data, optical… ▽ More The goal of this white paper is to provide a snapshot of the data availability and data needs primarily for the Ariel space mission, but also for related atmospheric studies of exoplanets and brown dwarfs. It covers the following data-related topics: molecular and atomic line lists, line profiles, computed cross-sections and opacities, collision-induced absorption and other continuum data, optical properties of aerosols and surfaces, atmospheric chemistry, UV photodissociation and photoabsorption cross-sections, and standards in the description and format of such data. These data aspects are discussed by addressing the following questions for each topic, based on the experience of the "data-provider" and "data-user" communities: (1) what are the types and sources of currently available data, (2) what work is currently in progress, and (3) what are the current and anticipated data needs. We present a GitHub platform for Ariel-related data, with the goal to provide a go-to place for both data-users and data-providers, for the users to make requests for their data needs and for the data-providers to link to their available data. Our aim throughout the paper is to provide practical information on existing sources of data whether in databases, theoretical, or literature sources. △ Less

Submitted 2 April, 2024; originally announced April 2024.

Comments: 58 pages, submitted to RAS Techniques and Instruments (RASTI). The authors welcome feedback: corresponding author emails can be found as footnotes on page 2

arXiv:2403.16233 [pdf, other]

An early warning indicator trained on stochastic disease-spreading models with different noises

Authors: Amit K. Chakraborty, Shan Gao, Reza Miry, Pouria Ramazi, Russell Greiner, Mark A. Lewis, Hao Wang

Abstract: The timely detection of disease outbreaks through reliable early warning signals (EWSs) is indispensable for effective public health mitigation strategies. Nevertheless, the intricate dynamics of real-world disease spread, often influenced by diverse sources of noise and limited data in the early stages of outbreaks, pose a significant challenge in developing reliable EWSs, as the performance of e… ▽ More The timely detection of disease outbreaks through reliable early warning signals (EWSs) is indispensable for effective public health mitigation strategies. Nevertheless, the intricate dynamics of real-world disease spread, often influenced by diverse sources of noise and limited data in the early stages of outbreaks, pose a significant challenge in developing reliable EWSs, as the performance of existing indicators varies with extrinsic and intrinsic noises. Here, we address the challenge of modeling disease when the measurements are corrupted by additive white noise, multiplicative environmental noise, and demographic noise into a standard epidemic mathematical model. To navigate the complexities introduced by these noise sources, we employ a deep learning algorithm that provides EWS in infectious disease outbreak by training on noise-induced disease-spreading models. The indicator's effectiveness is demonstrated through its application to real-world COVID-19 cases in Edmonton and simulated time series derived from diverse disease spread models affected by noise. Notably, the indicator captures an impending transition in a time series of disease outbreaks and outperforms existing indicators. This study contributes to advancing early warning capabilities by addressing the intricate dynamics inherent in real-world disease spread, presenting a promising avenue for enhancing public health preparedness and response efforts. △ Less

Submitted 24 March, 2024; originally announced March 2024.

arXiv:2403.15749 [pdf, other]

Horoballs and the subgradient method

Authors: Adrian S. Lewis, Genaro Lopez-Acedo, Adriana Nicolae

Abstract: To explore convex optimization on Hadamard spaces, we consider an iteration in the style of a subgradient algorithm. Traditionally, such methods assume that the underlying spaces are manifolds and that the objectives are geodesically convex: the methods are described using tangent spaces and exponential maps. By contrast, our iteration applies in a general Hadamard space, is framed in the underlyi… ▽ More To explore convex optimization on Hadamard spaces, we consider an iteration in the style of a subgradient algorithm. Traditionally, such methods assume that the underlying spaces are manifolds and that the objectives are geodesically convex: the methods are described using tangent spaces and exponential maps. By contrast, our iteration applies in a general Hadamard space, is framed in the underlying space itself, and relies instead on horospherical convexity of the objective level sets. For this restricted class of objectives, we prove a complexity result of the usual form. Notably, the complexity does not depend on a lower bound on the space curvature. We illustrate our subgradient algorithm on the minimal enclosing ball problem in Hadamard spaces. △ Less

Submitted 2 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

MSC Class: 90C48; 65Y20; 49M29 ACM Class: G.1.6

arXiv:2403.14486 [pdf, other]

doi 10.1016/j.anucene.2024.110717

Incorrect Resonance Escape Probability in Monte Carlo Codes due to the Threshold Approximation of Temperature-Dependent Scattering

Authors: Gabriel Lentchner, William Fritsch, Robert Crowder, Noah Walton, Amanda Lewis, Ondrej Chvala, Kevin Clarno, Vladimir Sobes

Abstract: Monte Carlo-transport codes are designed to simulate the complex neutron transport physics associated with nuclear systems. These codes are tasked with simulating phenomena such as temperature effects on cross-sections, thermo-physical effects, reaction rates, and kinematics. It is not computationally possible to simulate the physics of a system exactly. However, many of the approximations made by… ▽ More Monte Carlo-transport codes are designed to simulate the complex neutron transport physics associated with nuclear systems. These codes are tasked with simulating phenomena such as temperature effects on cross-sections, thermo-physical effects, reaction rates, and kinematics. It is not computationally possible to simulate the physics of a system exactly. However, many of the approximations made by modern simulation codes have been well validated. This article investigates an impactful simulation error caused by an approximation made in many Monte Carlo-transport codes. The approximation that target-at-rest is valid for neutrons at energies 400 times that of the thermal energy of the target particle is found to be inaccurate in certain scenarios. This paper identifies such cases, notably TRISO [1] fuel and instances where fuel infiltrates the pores of graphite in Molten Salt Reactors. The breakdown of this approximation occurs particularly when there exists a small length scale between fuel, a material with absorption resonances, and moderator, a scattering material. When threshold values are too small, resonance escape probabilities can deviate by as much as 1% per resonance, forming a baseline defect. Furthermore, two distinct anomalies were observed upon temperature variation, directly attributed to the transition between target-at-rest and target-in-motion physics. Equations provided in this study offer predictions for the temperature ranges within which these anomalies occur, based on system temperature and threshold value. The recommendations put forth in this paper advocate for incorporating the threshold value as a user-defined variable in transport Monte Carlo codes employing this approximation. Additionally, users are advised to conduct convergence studies to ensure that the chosen threshold value is sufficiently high to mitigate the influence of baseline defects and anomalies. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: 27 pages, 33 figures

arXiv:2402.14122 [pdf, other]

Automated Resonance Identification in Nuclear Data Evaluation

Authors: Noah A. W. Walton, Oleksii Zivenko, William Fritsch, Jacob Forbes, Amanda Lewis, Jesse Brown, Vlad Sobes

Abstract: Global and national efforts to deliver high-quality nuclear data to users have a broad impact across applications such as national security, reactor operation, basic science, medical fields, and more. Cross section evaluation is a large part this effort as it combines theory and experiment to produce suggested values and uncertainty for reaction probabilities. In most isotopes, the cross section e… ▽ More Global and national efforts to deliver high-quality nuclear data to users have a broad impact across applications such as national security, reactor operation, basic science, medical fields, and more. Cross section evaluation is a large part this effort as it combines theory and experiment to produce suggested values and uncertainty for reaction probabilities. In most isotopes, the cross section exhibits resonant behavior in what is called the resonance region of incident neutron energy. Resonance region evaluation is a specialized type of nuclear data evaluation that can require significant, manual effort and months of time from expert scientists. In this article, non-convex, non-linear optimization methods are combined with concepts of inferential statistics to infer a set of optimized resonance models from experimental data in an automated manner that is not dependent on prior evaluation(s). This methodology aims to enhance the workflow of a resonance evaluator by minimizing time, effort, and prior biases while improving reproducibility and document-ability, addressing widely recognized challenges in the field. △ Less

Submitted 21 February, 2024; originally announced February 2024.

arXiv:2402.07964 [pdf, other]

Can smartphone apps reveal fishing catch rates and durations?

Authors: Azar T. Tayebi, Julia S. Schmid, Sean Simmons, Mark S. Poesch, Mark A. Lewis, Pouria Ramazi

Abstract: Data on angler behaviour are conventionally collected by creel surveys. An innovative, cost-effective method is the use of smartphone applications for recreational anglers. Correlations were found between these citizen-reported data and the data from creel surveys. It is, however, unclear whether angler behaviour measured from the two sources is directly related, or the citizen-reported informatio… ▽ More Data on angler behaviour are conventionally collected by creel surveys. An innovative, cost-effective method is the use of smartphone applications for recreational anglers. Correlations were found between these citizen-reported data and the data from creel surveys. It is, however, unclear whether angler behaviour measured from the two sources is directly related, or the citizen-reported information can be obtained mainly from other "intermediate" variables. We used Bayesian networks to investigate this question for two management-related quantities, daily catch rate, and fishing duration, sourced from creel surveys and the MyCatch smartphone application in two river systems in Alberta, Canada. Environmental variables and website views of the waterbodies were included as possible intermediate variables. We found direct relationships between mean catch rates from creel surveys and smartphone applications. In contrast, the daily mean fishing durations were only indirectly related to intermediate variables "wind speed", "degree days" and "solar radiation". The study provides insight into the potential use of citizen-reported data to understand angler behaviour on a large scale. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: 10 pages, 1 table, and 1 figure

arXiv:2402.06678 [pdf, other]

Can machine learning predict citizen-reported angler behavior?

Authors: Julia S. Schmid, Sean Simmons, Mark A. Lewis, Mark S. Poesch, Pouria Ramazi

Abstract: Prediction of angler behaviors, such as catch rates and angler pressure, is essential to maintaining fish populations and ensuring angler satisfaction. Angler behavior can partly be tracked by online platforms and mobile phone applications that provide fishing activities reported by recreational anglers. Moreover, angler behavior is known to be driven by local site attributes. Here, the prediction… ▽ More Prediction of angler behaviors, such as catch rates and angler pressure, is essential to maintaining fish populations and ensuring angler satisfaction. Angler behavior can partly be tracked by online platforms and mobile phone applications that provide fishing activities reported by recreational anglers. Moreover, angler behavior is known to be driven by local site attributes. Here, the prediction of citizen-reported angler behavior was investigated by machine-learning methods using auxiliary data on the environment, socioeconomics, fisheries management objectives, and events at a freshwater body. The goal was to determine whether auxiliary data alone could predict the reported behavior. Different spatial and temporal extents and temporal resolutions were considered. Accuracy scores averaged 88% for monthly predictions at single water bodies and 86% for spatial predictions on a day in a specific region across Canada. At other resolutions and scales, the models only achieved low prediction accuracy of around 60%. The study represents a first attempt at predicting angler behavior in time and space at a large scale and establishes a foundation for potential future expansions in various directions. △ Less

Submitted 7 February, 2024; originally announced February 2024.

Comments: 36 pages, 10 figures, 4 tables (including supplementary information)

arXiv:2402.05761 [pdf, other]

Growth history and quasar bias evolution at z < 3 from Quaia

Authors: G. Piccirilli, G. Fabbian, D. Alonso, K. Storey-Fisher, J. Carron, A. Lewis, C. García-García

Abstract: We make use of the Gaia-Unwise quasar catalogue, Quaia, to constrain the growth history out to high redshifts from the clustering of quasars and their cross-correlation with maps of the Cosmic Microwave Background (CMB) lensing convergence. Considering three tomographic bins, centered at redshifts $\bar{z}_i = [0.69, 1.59, 2.72]$, we reconstruct the evolution of the amplitude of matter fluctuation… ▽ More We make use of the Gaia-Unwise quasar catalogue, Quaia, to constrain the growth history out to high redshifts from the clustering of quasars and their cross-correlation with maps of the Cosmic Microwave Background (CMB) lensing convergence. Considering three tomographic bins, centered at redshifts $\bar{z}_i = [0.69, 1.59, 2.72]$, we reconstruct the evolution of the amplitude of matter fluctuations $σ_8(z)$ over the last $\sim12$ billion years of cosmic history. In particular, we make one of the highest-redshift measurements of $σ_8$ ($σ_8(z=2.72)=0.22\pm 0.06$), finding it to be in good agreement (at the $\sim1σ$ level) with the value predicted by $Λ$CDM using CMB data from Planck. We also used the data to study the evolution of the linear quasar bias for this sample, finding values similar to those of other quasar samples, although with a less steep evolution at high redshifts. Finally, we study the potential impact of foreground contamination in the CMB lensing maps and, although we find evidence of contamination in cross-correlations at $z\sim1.7$ we are not able to clearly pinpoint its origin as being Galactic or extragalactic. Nevertheless, we determine that the impact of this contamination on our results is negligible. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: 28 pages, 11 figures

arXiv:2312.14882 [pdf, ps, other]

Sampling and estimation on manifolds using the Langevin diffusion

Authors: Karthik Bharath, Alexander Lewis, Akash Sharma, Michael V Tretyakov

Abstract: Error bounds are derived for sampling and estimation using a discretization of an intrinsically defined Langevin diffusion with invariant measure $\text{d}μ_φ\propto e^{-φ} \mathrm{dvol}_g $ on a compact Riemannian manifold. Two estimators of linear functionals of $μ_φ$ based on the discretized Markov process are considered: a time-averaging estimator based on a single trajectory and an ensemble-a… ▽ More Error bounds are derived for sampling and estimation using a discretization of an intrinsically defined Langevin diffusion with invariant measure $\text{d}μ_φ\propto e^{-φ} \mathrm{dvol}_g $ on a compact Riemannian manifold. Two estimators of linear functionals of $μ_φ$ based on the discretized Markov process are considered: a time-averaging estimator based on a single trajectory and an ensemble-averaging estimator based on multiple independent trajectories. Imposing no restrictions beyond a nominal level of smoothness on $φ$, first-order error bounds, in discretization step size, on the bias and variance/mean-square error of both estimators are derived. The order of error matches the optimal rate in Euclidean and flat spaces, and leads to a first-order bound on distance between the invariant measure $μ_φ$ and a stationary measure of the discretized Markov process. This order is preserved even upon using retractions when exponential maps are unavailable in closed form, thus enhancing practicality of the proposed algorithms. Generality of the proof techniques, which exploit links between two partial differential equations and the semigroup of operators corresponding to the Langevin diffusion, renders them amenable for the study of a more general class of sampling algorithms related to the Langevin diffusion. Conditions for extending analysis to the case of non-compact manifolds are discussed. Numerical illustrations with distributions, log-concave and otherwise, on the manifolds of positive and negative curvature elucidate on the derived bounds and demonstrate practical utility of the sampling algorithm. △ Less

Submitted 15 June, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

arXiv:2312.09827 [pdf, other]

doi 10.1103/PhysRevC.109.054910

Identified charged-hadron production in $p$$+$Al, $^3$He$+$Au, and Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and in U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV

Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, J. Alexander, M. Alfred, V. Andrieux, K. Aoki, N. Apadula, H. Asano, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, X. Bai, N. S. Bandara, B. Bannier, K. N. Barish, S. Bathe, V. Baublis , et al. (456 additional authors not shown)

Abstract: The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interprete… ▽ More The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interpreted in terms of radially expanding thermalized systems. The particle ratios of $K/π$ and $p/π$ have been measured in different centrality ranges of large (Cu$+$Au, U$+$U) and small ($p$$+$Al, $^3$He$+$Au) collision systems. The values of $K/π$ ratios measured in all considered collision systems were found to be consistent with those measured in $p$$+$$p$ collisions. However the values of $p/π$ ratios measured in large collision systems reach the values of $\approx0.6$, which is $\approx2$ times larger than in $p$$+$$p$ collisions. These results can be qualitatively understood in terms of the baryon enhancement expected from hadronization by recombination. Identified charged-hadron nuclear-modification factors ($R_{AB}$) are also presented. Enhancement of proton $R_{AB}$ values over meson $R_{AB}$ values was observed in central $^3$He$+$Au, Cu$+$Au, and U$+$U collisions. The proton $R_{AB}$ values measured in $p$$+$Al collision system were found to be consistent with $R_{AB}$ values of $φ$, $π^\pm$, $K^\pm$, and $π^0$ mesons, which may indicate that the size of the system produced in $p$$+$Al collisions is too small for recombination to cause a noticeable increase in proton production. △ Less

Submitted 22 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

Comments: 480 authors from 78 institutions, 18 pages, 6 tables, 16 figures. v2 is version accepted for publication in Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. C 109, 054910 (2024)

arXiv:2310.12293 [pdf, ps, other]

Topologies for geometric flows and continuous dependence on parameters

Authors: Andrew D. Lewis, Yanlei Zhang

Abstract: We study time- and parameter-dependent ordinary differential equations in the geometric setting of vector fields and their flows. Various degrees of regularities in state are considered, including Lipschitz, finitely diferentiable, smooth, and holomorphic. A suitable topology for the space of flows is derived using geometric descriptions of suitable topologies for vector fields. A new kind of cont… ▽ More We study time- and parameter-dependent ordinary differential equations in the geometric setting of vector fields and their flows. Various degrees of regularities in state are considered, including Lipschitz, finitely diferentiable, smooth, and holomorphic. A suitable topology for the space of flows is derived using geometric descriptions of suitable topologies for vector fields. A new kind of continuous dependence is proved, that of the fixed time local flow on the parameter in a general topological space. △ Less

Submitted 18 October, 2023; originally announced October 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2202.00741

arXiv:2310.09668 [pdf, other]

Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs

Authors: Chenyang Yang, Rishabh Rustogi, Rachel Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu

Abstract: Current model testing work has mostly focused on creating test cases. Identifying what to test is a step that is largely ignored and poorly supported. We propose Weaver, an interactive tool that supports requirements elicitation for guiding model testing. Weaver uses large language models to generate knowledge bases and recommends concepts from them interactively, allowing testers to elicit requir… ▽ More Current model testing work has mostly focused on creating test cases. Identifying what to test is a step that is largely ignored and poorly supported. We propose Weaver, an interactive tool that supports requirements elicitation for guiding model testing. Weaver uses large language models to generate knowledge bases and recommends concepts from them interactively, allowing testers to elicit requirements for further testing. Weaver provides rich external knowledge to testers and encourages testers to systematically explore diverse concepts beyond their own biases. In a user study, we show that both NLP experts and non-experts identified more, as well as more diverse concepts worth testing when using Weaver. Collectively, they found more than 200 failing test cases for stance detection with zero-shot ChatGPT. Our case studies further show that Weaver can help practitioners test models in real-world settings, where developers define more nuanced application scenarios (e.g., code understanding and transcript summarization) using LLMs. △ Less

Submitted 14 October, 2023; originally announced October 2023.

arXiv:2310.04254 [pdf, other]

High-yield atmospheric water capture via bioinspired material segregation

Authors: Yiwei Gao, Santiago Ricoy, Addison Cobb, Ryan Phung, Areianna Lewis, Aaron Sahm, Nathan Ortiz, Sameer Rao, H. Jeremy Cho

Abstract: Atmospheric water harvesting is urgently needed given increasing global water scarcity. Current sorbent-based devices that cycle between water capture and release have low harvesting rates. We envision a radically different multi-material architecture with segregated and simultaneous capture and release. This way, proven fast-release mechanisms that approach theoretical limits can be incorporated;… ▽ More Atmospheric water harvesting is urgently needed given increasing global water scarcity. Current sorbent-based devices that cycle between water capture and release have low harvesting rates. We envision a radically different multi-material architecture with segregated and simultaneous capture and release. This way, proven fast-release mechanisms that approach theoretical limits can be incorporated; however, no capture mechanism exists to supply liquid adequately for release. Inspired by tree frogs and airplants, our capture approach transports water through a hydrogel membrane ``skin'' into a liquid desiccant. We report an extraordinarily high capture rate of 5.50 $\text{kg}\,\text{m}^{-2}\,\text{d}^{-1}$ at a low humidity of 35%, limited by the convection of air to the device. At higher humidities, we demonstrate up to 16.9 $\text{kg}\,\text{m}^{-2}\,\text{d}^{-1}$, exceeding theoretical limits for release. Simulated performance of a hypothetical one-square-meter device shows that water could be supplied to two to three people in dry environments. This work is a significant step toward providing new resources to water-scarce regions. △ Less

Submitted 6 October, 2023; originally announced October 2023.

Comments: 22 pages, 23 figures

arXiv:2310.03802 [pdf, other]

Not So Fast Kepler-1513: A Perturbing Planetary Interloper in the Exomoon Corridor

Authors: Daniel A. Yahalomi, David Kipping, David Nesvorný, Paul A. Dalba, Paul Benni, Ceiligh Cacho-Negrete, Karen Collins, Joel T. Earwicker, John Arban Lewis, Kim K. McLeod, Richard P. Schwarz, Gavin Wang

Abstract: Transit Timing Variations (TTVs) can be induced by a range of physical phenomena, including planet-planet interactions, planet-moon interactions, and stellar activity. Recent work has shown that roughly half of moons would induce fast TTVs with a short period in the range of two-to-four orbits of its host planet around the star. An investigation of the Kepler TTV data in this period range identifi… ▽ More Transit Timing Variations (TTVs) can be induced by a range of physical phenomena, including planet-planet interactions, planet-moon interactions, and stellar activity. Recent work has shown that roughly half of moons would induce fast TTVs with a short period in the range of two-to-four orbits of its host planet around the star. An investigation of the Kepler TTV data in this period range identified one primary target of interest, Kepler-1513 b. Kepler-1513 b is a $8.05^{+0.58}_{-0.40}$ $R_\oplus$ planet orbiting a late G-type dwarf at $0.53^{+0.04}_{-0.03}$ AU. Using Kepler photometry, this initial analysis showed that Kepler-1513 b's TTVs were consistent with a moon. Here, we report photometric observations of two additional transits nearly a decade after the last Kepler transit using both ground-based observations and space-based photometry with TESS. These new transit observations introduce a previously undetected long period TTV, in addition to the original short period TTV signal. Using the complete transit dataset, we investigate whether a non-transiting planet, a moon, or stellar activity could induce the observed TTVs. We find that only a non-transiting perturbing planet can reproduce the observed TTVs. We additionally perform transit origami on the Kepler photometry, which independently applies pressure against a moon hypothesis. Specifically, we find that Kepler-1513 b's TTVs are consistent with an exterior non-transiting $\sim$Saturn mass planet, Kepler-1513 c, on a wide orbit, $\sim$5$\%$ outside a 5:1 period ratio with Kepler-1513 b. This example introduces a previously unidentified cause for planetary interlopers in the exomoon corridor, namely an insufficient baseline of observations. △ Less

Submitted 5 October, 2023; originally announced October 2023.

Comments: 20 pages, 13 figures. Accepted to MNRAS. Code available at https://github.com/dyahalomi/Kepler1513

arXiv:2309.11121 [pdf, ps, other]

A canonical treatment of line bundles over general projective spaces

Authors: Andrew D. Lewis

Abstract: Projective spaces for finite-dimensional vector spaces over general fields are considered. The geometry of these spaces and the theory of line bundles over these spaces is presented. Particularly, the space of global regular sections of these bundles is examined. Care is taken in two directions: (1) places where algebraic closedness of the field are important are pointed out; (2) basis free constr… ▽ More Projective spaces for finite-dimensional vector spaces over general fields are considered. The geometry of these spaces and the theory of line bundles over these spaces is presented. Particularly, the space of global regular sections of these bundles is examined. Care is taken in two directions: (1) places where algebraic closedness of the field are important are pointed out; (2) basis free constructions are used exclusively. △ Less

Submitted 20 September, 2023; originally announced September 2023.

MSC Class: 14-01

arXiv:2309.10471 [pdf, ps, other]

Generalised subbundles and distributions: A comprehensive review

Authors: Andrew D. Lewis

Abstract: Distributions, i.e., subsets of tangent bundles formed by piecing together subspaces of tangent spaces, are commonly encountered in the theory and application of differential geometry. Indeed, the theory of distributions is a fundamental part of mechanics and control theory. The theory of distributions is presented in a systematic way, and self-contained proofs are given of some of the major res… ▽ More Distributions, i.e., subsets of tangent bundles formed by piecing together subspaces of tangent spaces, are commonly encountered in the theory and application of differential geometry. Indeed, the theory of distributions is a fundamental part of mechanics and control theory. The theory of distributions is presented in a systematic way, and self-contained proofs are given of some of the major results. Parts of the theory are presented in the context of generalised subbundles of vector bundles. Special emphasis is placed on understanding the rôle of sheaves and understanding the distinctions between the smooth or finitely differentiable cases and the real analytic case. The Orbit Theorem and applications, including Frobenius's Theorem and theorems on the equivalence of families of vector fields, are considered in detail. Examples illustrate the phenomenon that can occur with generalised subbundles and distributions. △ Less

Submitted 19 September, 2023; originally announced September 2023.

MSC Class: 58A30

arXiv:2309.08908 [pdf, ps, other]

Should we fly in the Lebesgue-designed airplane? -- The correct defence of the Lebesgue integral

Authors: Andrew D. Lewis

Abstract: It is well-known that the Lebesgue integral generalises the Riemann integral. However, as is also well-known but less frequently well-explained, this generalisation alone is not the reason why the Lebesgue integral is important and needs to be a part of the arsenal of any mathematician, pure or applied. Those who understand the correct reasons for the importance of the Lebesgue integral realise th… ▽ More It is well-known that the Lebesgue integral generalises the Riemann integral. However, as is also well-known but less frequently well-explained, this generalisation alone is not the reason why the Lebesgue integral is important and needs to be a part of the arsenal of any mathematician, pure or applied. Those who understand the correct reasons for the importance of the Lebesgue integral realise there are at least two crucial differences between the Riemann and Lebesgue theories. One is the difference between the Dominated Convergence Theorem in the two theories, and another is the completeness of the normed vector spaces of integrable functions. Here topological interpretations are provided for the differences in the Dominated Convergence Theorems, and explicit counterexamples are given which illustrate the deficiencies of the Riemann integral. Also illustrated are the deleterious consequences of the defects in the Riemann integral on Fourier transform theory if one restricts to Riemann integrable functions. △ Less

Submitted 16 September, 2023; originally announced September 2023.

MSC Class: 28-01

arXiv:2309.07190 [pdf, ps, other]

A top nine list: Most popular induced matrix norms

Authors: Andrew D. Lewis

Abstract: Explicit formulae are given for the nine possible induced matrix norms corresponding to the 1-, 2-, and $\infty$-norms for Euclidean space. The complexity of computing these norms is investigated. Explicit formulae are given for the nine possible induced matrix norms corresponding to the 1-, 2-, and $\infty$-norms for Euclidean space. The complexity of computing these norms is investigated. △ Less

Submitted 13 September, 2023; originally announced September 2023.

MSC Class: 15A60

arXiv:2307.16081 [pdf, other]

Roll Up Your Sleeves: Working with a Collaborative and Engaging Task-Oriented Dialogue System

Authors: Lingbo Mo, Shijie Chen, Ziru Chen, Xiang Deng, Ashley Lewis, Sunit Singh, Samuel Stevens, Chang-You Tai, Zhen Wang, Xiang Yue, Tianshu Zhang, Yu Su, Huan Sun

Abstract: We introduce TacoBot, a user-centered task-oriented digital assistant designed to guide users through complex real-world tasks with multiple steps. Covering a wide range of cooking and how-to tasks, we aim to deliver a collaborative and engaging dialogue experience. Equipped with language understanding, dialogue management, and response generation components supported by a robust search engine, Ta… ▽ More We introduce TacoBot, a user-centered task-oriented digital assistant designed to guide users through complex real-world tasks with multiple steps. Covering a wide range of cooking and how-to tasks, we aim to deliver a collaborative and engaging dialogue experience. Equipped with language understanding, dialogue management, and response generation components supported by a robust search engine, TacoBot ensures efficient task assistance. To enhance the dialogue experience, we explore a series of data augmentation strategies using LLMs to train advanced neural models continuously. TacoBot builds upon our successful participation in the inaugural Alexa Prize TaskBot Challenge, where our team secured third place among ten competing teams. We offer TacoBot as an open-source framework that serves as a practical example for deploying task-oriented dialogue systems. △ Less

Submitted 29 July, 2023; originally announced July 2023.

arXiv:2307.07371 [pdf, other]

Two-Way Quantum Time Transfer: A Method for Daytime Space-Earth Links

Authors: Randy Lafler, Mark L. Eickhoff, Scott C. Newey, Yamil Nieves Gonzalez, Kurt E. Stoltenburg, J. Frank Camacho, Mark A. Harris, Denis W. Oesch, Adrian J. Lewis, R. Nicholas Lanning

Abstract: High-precision remote clock synchronization is crucial for many classical and quantum network applications. Evaluating options for space-Earth links, we find that traditional solutions may not produce the desired synchronization for low Earth orbits and unnecessarily complicate quantum-networking architectures. Demonstrating an alternative, we use commercial off-the-shelf quantum-photon sources an… ▽ More High-precision remote clock synchronization is crucial for many classical and quantum network applications. Evaluating options for space-Earth links, we find that traditional solutions may not produce the desired synchronization for low Earth orbits and unnecessarily complicate quantum-networking architectures. Demonstrating an alternative, we use commercial off-the-shelf quantum-photon sources and detection equipment to synchronize two remote clocks across our freespace testbed utilizing a method called two-way quantum time transfer (QTT). We reach picosecond-scale timing precision under very lossy and noisy channel conditions representative of daytime space-Earth links and software-emulated satellite motion. This work demonstrates how QTT is potentially relevant for daytime space-Earth quantum networking and/or providing high-precision timing in GPS-denied environments. △ Less

Submitted 9 April, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

Comments: arXiv admin note: text overlap with arXiv:2211.00737

arXiv:2305.14954 [pdf, other]

Weakly nonlinear analysis of a two-species non-local advection-diffusion system

Authors: Valeria Giunta, Thomas Hillen, Mark A. Lewis, Jonathan R. Potts

Abstract: Nonlocal interactions are ubiquitous in nature and play a central role in many biological systems. In this paper, we perform a bifurcation analysis of a widely-applicable advection-diffusion model with nonlocal advection terms describing the species movements generated by inter-species interactions. We use linear analysis to assess the stability of the constant steady state, then weakly nonlinear… ▽ More Nonlocal interactions are ubiquitous in nature and play a central role in many biological systems. In this paper, we perform a bifurcation analysis of a widely-applicable advection-diffusion model with nonlocal advection terms describing the species movements generated by inter-species interactions. We use linear analysis to assess the stability of the constant steady state, then weakly nonlinear analysis to recover the shape and stability of non-homogeneous solutions. Since the system arises from a conservation law, the resulting amplitude equations consist of a Ginzburg-Landau equation coupled with an equation for the zero mode. In particular, this means that supercritical branches from the Ginzburg-Landau equation need not be stable. Indeed, we find that, depending on the parameters, bifurcations can be subcritical (always unstable), stable supercritical, or unstable supercritical. We show numerically that, when small amplitude patterns are unstable, the system exhibits large amplitude patterns and hysteresis, even in supercritical regimes. Finally, we construct bifurcation diagrams by combining our analysis with a previous study of the minimisers of the associated energy functional. Through this approach we reveal parameter regions in which stable small amplitude patterns coexist with strongly modulated solutions. △ Less

Submitted 24 May, 2023; originally announced May 2023.

MSC Class: 35C20; 35B32; 35B36; 35Q92

arXiv:2305.03208 [pdf, ps, other]

The complexity of first-order optimization methods from a metric perspective

Authors: Adrian S. Lewis, Tonghua Tian

Abstract: A central tool for understanding first-order optimization algorithms is the Kurdyka-Lojasiewicz inequality. Standard approaches to such methods rely crucially on this inequality to leverage sufficient decrease conditions involving gradients or subgradients. However, the KL property fundamentally concerns not subgradients but rather "slope", a purely metric notion. By highlighting this view, and av… ▽ More A central tool for understanding first-order optimization algorithms is the Kurdyka-Lojasiewicz inequality. Standard approaches to such methods rely crucially on this inequality to leverage sufficient decrease conditions involving gradients or subgradients. However, the KL property fundamentally concerns not subgradients but rather "slope", a purely metric notion. By highlighting this view, and avoiding any use of subgradients, we present a simple and concise complexity analysis for first-order optimization algorithms on metric spaces. This subgradient-free perspective also frames a short and focused proof of the KL property for nonsmooth semi-algebraic functions. △ Less

Submitted 4 May, 2023; originally announced May 2023.

MSC Class: 90C48; 49J52; 65Y20; 14P10 ACM Class: G.1.6

arXiv:2304.09931 [pdf, ps, other]

doi 10.1007/s11538-024-01290-4

Revealing the unseen: Likely half of the Americans relied on others' experience when deciding on taking the COVID-19 vaccine

Authors: Azadeh Aghaeeyan, Pouria Ramazi, Mark A. Lewis

Abstract: Efficient coverage for newly developed vaccines requires knowing which groups of individuals will accept the vaccine immediately and which will take longer to accept or never accept. Of those who may eventually accept the vaccine, there are two main types: success-based learners, basing their decisions on others' satisfaction, and myopic rationalists, attending to their own immediate perceived ben… ▽ More Efficient coverage for newly developed vaccines requires knowing which groups of individuals will accept the vaccine immediately and which will take longer to accept or never accept. Of those who may eventually accept the vaccine, there are two main types: success-based learners, basing their decisions on others' satisfaction, and myopic rationalists, attending to their own immediate perceived benefit. We used COVID-19 vaccination data to fit a mechanistic model capturing the distinct effects of the two types on the vaccination progress. We estimated that 47 percent of Americans behaved as myopic rationalist with a high variations across the jurisdictions, from 31 percent in Mississippi to 76 percent in Vermont. The proportion was correlated with the vaccination coverage, proportion of votes in favor of Democrats in 2020 presidential election, and education score. △ Less

Submitted 11 August, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

Comments: The population was narrowed to those of 12 years and older

Journal ref: Bull Math Biol 86, 72 (2024)

arXiv:2304.09057 [pdf, other]

doi 10.1063/5.0154710

Predicting the Electronic Density Response of Condensed-Phase Systems to Electric Field Perturbations

Authors: Alan M Lewis, Paolo Lazzaroni, Mariana Rossi

Abstract: We present a local and transferable machine learning approach capable of predicting the real-space density response of both molecules and periodic systems to external homogeneous electric fields. The new method, SALTER, builds on the Symmetry-Adapted Gaussian Process Regression SALTED framework. SALTER requires only a small, but necessary, modification to the descriptors used to represent the atom… ▽ More We present a local and transferable machine learning approach capable of predicting the real-space density response of both molecules and periodic systems to external homogeneous electric fields. The new method, SALTER, builds on the Symmetry-Adapted Gaussian Process Regression SALTED framework. SALTER requires only a small, but necessary, modification to the descriptors used to represent the atomic environments. We present the performance of the method on isolated water molecules, bulk water and a naphthalene crystal. Root mean square errors of the predicted density response lie at or below 10% with barely more than 100 training structures. Derived quantities, such as polarizability tensors and even Raman spectra further derived from these tensors show a good agreement with those calculated directly from quantum mechanical methods. Therefore, SALTER shows excellent performance when predicting derived quantities, while retaining all of the information contained in the full electronic response. This method is thus capable of learning vector fields in a chemical context and serves as a landmark for further developments. △ Less

Submitted 18 April, 2023; originally announced April 2023.

Comments: Main text: 7 pages, 5 figures. SI: 5 pages, 4 figures

arXiv:2303.13313 [pdf, other]

How to detect lensing rotation

Authors: Mathew Robertson, Antony Lewis

Abstract: Gravitational lensing rotation of images is predicted to be negligible at linear order in density perturbations, but can be produced by the post-Born lens-lens coupling at second order. This rotation is somewhat enhanced for Cosmic Microwave Background (CMB) lensing due to the large source path length, but remains small and very challenging to detect directly by CMB lensing reconstruction alone. W… ▽ More Gravitational lensing rotation of images is predicted to be negligible at linear order in density perturbations, but can be produced by the post-Born lens-lens coupling at second order. This rotation is somewhat enhanced for Cosmic Microwave Background (CMB) lensing due to the large source path length, but remains small and very challenging to detect directly by CMB lensing reconstruction alone. We show the rotation may be detectable at high significance as a cross-correlation signal between the curl reconstructed with Simons Observatory (SO) or CMB-S4 data, and a template constructed from quadratic combinations of large-scale structure (LSS) tracers. Equivalently, the lensing rotation-tracer-tracer bispectrum can also be detected, where LSS tracers considered include the CMB lensing convergence, galaxy density, and the Cosmic Infrared Background (CIB), or optimal combinations thereof. We forecast that an optimal combination of these tracers can probe post-Born rotation at the level of $5.7σ$-$6.1σ$ with SO and $13.6σ$-$14.7σ$ for CMB-S4, depending on whether standard quadratic estimators or maximum a posteriori iterative methods are deployed. We also show possible improvement up to $21.3σ$ using a CMB-S4 deep patch observation with polarization-only iterative lensing reconstruction. However, these cross-correlation signals have non-zero bias because the rotation template is quadratic in the tracers, and exists even if the lensing is rotation free. We estimate this bias analytically, and test it using simple null-hypothesis simulations to confirm that the bias remains subdominant to the rotation signal of interest. Detection and then measurement of the lensing rotation cross-spectrum is therefore a realistic target for future observations. △ Less

Submitted 28 June, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

Comments: 15 pages + appendix, 11 figures, 4 tables. Minor corrections (results unchanged): sign clarification in Eq 2.13, fixing typos in Eq 2.15 and Eq 5.4

arXiv:2303.12899 [pdf, other]

Disentangling centrality bias and final-state effects in the production of high-$p_T$ $π^0$ using direct $γ$ in $d$$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

Authors: N. J. Abdulameer, U. Acharya, C. Aidala, Y. Akiba, M. Alfred, K. Aoki, N. Apadula, C. Ayuso, V. Babintsev, K. N. Barish, S. Bathe, A. Bazilevsky, R. Belmont, A. Berdnikov, Y. Berdnikov, L. Bichon, B. Blankenship, D. S. Blau, M. Boer, J. S. Bok, V. Borisov, M. L. Brooks, J. Bryslawskyj, V. Bumazhnov, C. Butler , et al. (253 additional authors not shown)

Abstract: PHENIX presents a simultaneous measurement of the production of direct $γ$ and $π^0$ in $d$$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV over a $p_T$ range of 7.5 to 18 GeV/$c$ for different event samples selected by event activity, i.e. charged-particle multiplicity detected at forward rapidity. Direct-photon yields are used to empirically estimate the contribution of hard-scattering processes i… ▽ More PHENIX presents a simultaneous measurement of the production of direct $γ$ and $π^0$ in $d$$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV over a $p_T$ range of 7.5 to 18 GeV/$c$ for different event samples selected by event activity, i.e. charged-particle multiplicity detected at forward rapidity. Direct-photon yields are used to empirically estimate the contribution of hard-scattering processes in the different event samples. Using this estimate, the average nuclear-modification factor $R_{d\rm Au,EXP}^{γ^{\rm dir}}$ is $0.925{\pm}0.023({\rm stat}){\pm}0.15^{\rm (scale)}$, consistent with unity for minimum-bias (MB) $d$$+$Au events. For event classes with moderate event activity, $R_{d\rm Au,EXP}^{γ^{\rm dir}}$ is consistent with the MB value within 5\% uncertainty. These results confirm that the previously observed enhancement of high-$p_T$ $π^0$ production found in small-system collisions with low event activity is a result of a bias in interpreting event activity within the Glauber framework. In contrast, for the top 5\% of events with the highest event activity, $R_{d\rm Au,EXP}^{γ^{\rm dir}}$ is suppressed by 20\% relative to the MB value with a significance of $4.5σ$, which may be due to final-state effects. △ Less

Submitted 22 March, 2023; originally announced March 2023.

Comments: 279 authors from 69 institutions, 8 pages, 3 figures, v1 is version submitted to Physical Review Letters. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

arXiv:2303.12095 [pdf, other]

Interpretable histopathology-based prediction of disease relevant features in Inflammatory Bowel Disease biopsies using weakly-supervised deep learning

Authors: Ricardo Mokhtari, Azam Hamidinekoo, Daniel Sutton, Arthur Lewis, Bastian Angermann, Ulf Gehrmann, Pal Lundin, Hibret Adissu, Junmei Cairns, Jessica Neisen, Emon Khan, Daniel Marks, Nia Khachapuridze, Talha Qaiser, Nikolay Burlutskiy

Abstract: Crohn's Disease (CD) and Ulcerative Colitis (UC) are the two main Inflammatory Bowel Disease (IBD) types. We developed deep learning models to identify histological disease features for both CD and UC using only endoscopic labels. We explored fine-tuning and end-to-end training of two state-of-the-art self-supervised models for predicting three different endoscopic categories (i) CD vs UC (AUC=0.8… ▽ More Crohn's Disease (CD) and Ulcerative Colitis (UC) are the two main Inflammatory Bowel Disease (IBD) types. We developed deep learning models to identify histological disease features for both CD and UC using only endoscopic labels. We explored fine-tuning and end-to-end training of two state-of-the-art self-supervised models for predicting three different endoscopic categories (i) CD vs UC (AUC=0.87), (ii) normal vs lesional (AUC=0.81), (iii) low vs high disease severity score (AUC=0.80). We produced visual attention maps to interpret what the models learned and validated them with the support of a pathologist, where we observed a strong association between the models' predictions and histopathological inflammatory features of the disease. Additionally, we identified several cases where the model incorrectly predicted normal samples as lesional but were correct on the microscopic level when reviewed by the pathologist. This tendency of histological presentation to be more severe than endoscopic presentation was previously published in the literature. In parallel, we utilised a model trained on the Colon Nuclei Identification and Counting (CoNIC) dataset to predict and explore 6 cell populations. We observed correlation between areas enriched with the predicted immune cells in biopsies and the pathologist's feedback on the attention maps. Finally, we identified several cell level features indicative of disease severity in CD and UC. These models can enhance our understanding about the pathology behind IBD and can shape our strategies for patient stratification in clinical trials. △ Less

Submitted 16 May, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

Comments: Accepted to the Medical Imaging with Deep Learning (MIDL'23)

arXiv:2303.10838 [pdf, other]

Deceptive Reinforcement Learning in Model-Free Domains

Authors: Alan Lewis, Tim Miller

Abstract: This paper investigates deceptive reinforcement learning for privacy preservation in model-free and continuous action space domains. In reinforcement learning, the reward function defines the agent's objective. In adversarial scenarios, an agent may need to both maximise rewards and keep its reward function private from observers. Recent research presented the ambiguity model (AM), which selects a… ▽ More This paper investigates deceptive reinforcement learning for privacy preservation in model-free and continuous action space domains. In reinforcement learning, the reward function defines the agent's objective. In adversarial scenarios, an agent may need to both maximise rewards and keep its reward function private from observers. Recent research presented the ambiguity model (AM), which selects actions that are ambiguous over a set of possible reward functions, via pre-trained $Q$-functions. Despite promising results in model-based domains, our investigation shows that AM is ineffective in model-free domains due to misdirected state space exploration. It is also inefficient to train and inapplicable in continuous action space domains. We propose the deceptive exploration ambiguity model (DEAM), which learns using the deceptive policy during training, leading to targeted exploration of the state space. DEAM is also applicable in continuous action spaces. We evaluate DEAM in discrete and continuous action space path planning environments. DEAM achieves similar performance to an optimal model-based version of AM and outperforms a model-free version of AM in terms of path cost, deceptiveness and training efficiency. These results extend to the continuous domain. △ Less

Submitted 19 March, 2023; originally announced March 2023.

Comments: 8 pages, 1 reference page, 4 appendix pages, Accepted into International Conference on Automated Planning and Scheduling (ICAPS) 2023

arXiv:2303.08805 [pdf, other]

doi 10.1103/PhysRevLett.131.063401

Spin Squeezing by Rydberg Dressing in an Array of Atomic Ensembles

Authors: Jacob A. Hines, Shankari V. Rajagopal, Gabriel L. Moreau, Michael D. Wahrman, Neomi A. Lewis, Ognjen Marković, Monika Schleier-Smith

Abstract: We report on the creation of an array of spin-squeezed ensembles of cesium atoms via Rydberg dressing, a technique that offers optical control over local interactions between neutral atoms. We optimize the coherence of the interactions by a stroboscopic dressing sequence that suppresses super-Poissonian loss. We thereby prepare squeezed states of $N=200$ atoms with a metrological squeezing paramet… ▽ More We report on the creation of an array of spin-squeezed ensembles of cesium atoms via Rydberg dressing, a technique that offers optical control over local interactions between neutral atoms. We optimize the coherence of the interactions by a stroboscopic dressing sequence that suppresses super-Poissonian loss. We thereby prepare squeezed states of $N=200$ atoms with a metrological squeezing parameter $ξ^2 = 0.77(9)$ quantifying the reduction in phase variance below the standard quantum limit. We realize metrological gain across three spatially separated ensembles in parallel, with the strength of squeezing controlled by the local intensity of the dressing light. Our method can be applied to enhance the precision of tests of fundamental physics based on arrays of atomic clocks and to enable quantum-enhanced imaging of electromagnetic fields. △ Less

Submitted 23 August, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

Comments: 18 pages, 11 figures, typos corrected, edits for clarity

Journal ref: Phys. Rev. Lett. 131, 063401 (2023)

arXiv:2303.07191 [pdf, other]

doi 10.1103/PhysRevD.108.072016

Transverse single-spin asymmetry of charged hadrons at forward and backward rapidity in polarized $p$+$p$, $p$+Al, and $p$+Au collisions at $\sqrt{s_{NN}}=200$ GeV}

Authors: N. J. Abdulameer, U. Acharya, C. Aidala, Y. Akiba, M. Alfred, V. Andrieux, N. Apadula, H. Asano, B. Azmoun, V. Babintsev, N. S. Bandara, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, R. Belmont, A. Berdnikov, Y. Berdnikov, L. Bichon, B. Blankenship, D. S. Blau, J. S. Bok, V. Borisov, M. L. Brooks, J. Bryslawskyj , et al. (297 additional authors not shown)

Abstract: Reported here are transverse single-spin asymmetries ($A_{N}$) in the production of charged hadrons as a function of transverse momentum ($p_T$) and Feynman-$x$ ($x_F$) in polarized $p^{\uparrow}$+$p$, $p^{\uparrow}$+Al, and $p^{\uparrow}$+Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV. The measurements have been performed at forward and backward rapidity ($1.4<|η|<2.4$) over the range of… ▽ More Reported here are transverse single-spin asymmetries ($A_{N}$) in the production of charged hadrons as a function of transverse momentum ($p_T$) and Feynman-$x$ ($x_F$) in polarized $p^{\uparrow}$+$p$, $p^{\uparrow}$+Al, and $p^{\uparrow}$+Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV. The measurements have been performed at forward and backward rapidity ($1.4<|η|<2.4$) over the range of $1.5<p_{T}<7.0~{\rm GeV}/c$ and $0.04<|x_{F}|<0.2$. A nonzero asymmetry is observed for positively charged hadrons at forward rapidity ($x_F>0$) in $p^{\uparrow}$+$p$ collisions, whereas the $p^{\uparrow}$+Al and $p^{\uparrow}$+Au results show smaller asymmetries. This finding provides new opportunities to investigate the origin of transverse single-spin asymmetries and a tool to study nuclear effects in $p$+$A$ collisions. △ Less

Submitted 31 October, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

Comments: 322 authors from 70 institutions, 13 pages, 9 figures, 13 tables, one appendix, 2015 data. v2 is version accepted for publication in Phys. Rev. D. HEPData tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. D 108, 072016 (2023)

arXiv:2303.07190 [pdf, other]

Transverse single-spin asymmetry of midrapidity $π^{0}$ and $η$ mesons in $p$+Au and $p$+Al collisions at $\sqrt{s_{_{NN}}}=$ 200 GeV

Authors: N. J. Abdulameer, U. Acharya, C. Aidala, Y. Akiba, M. Alfred, V. Andrieux, N. Apadula, H. Asano, B. Azmoun, V. Babintsev, N. S. Bandara, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, R. Belmont, A. Berdnikov, Y. Berdnikov, L. Bichon, B. Blankenship, D. S. Blau, J. S. Bok, V. Borisov, M. L. Brooks, J. Bryslawskyj , et al. (297 additional authors not shown)

Abstract: Presented are the first measurements of the transverse single-spin asymmetries ($A_N$) for neutral pions and eta mesons in $p$+Au and $p$+Al collisions at $\sqrt{s_{_{NN}}}=200$ GeV in the pseudorapidity range $|η|<$0.35 with the PHENIX detector at the Relativistic Heavy Ion Collider. The asymmetries are consistent with zero, similar to those for midrapidity neutral pions and eta mesons produced i… ▽ More Presented are the first measurements of the transverse single-spin asymmetries ($A_N$) for neutral pions and eta mesons in $p$+Au and $p$+Al collisions at $\sqrt{s_{_{NN}}}=200$ GeV in the pseudorapidity range $|η|<$0.35 with the PHENIX detector at the Relativistic Heavy Ion Collider. The asymmetries are consistent with zero, similar to those for midrapidity neutral pions and eta mesons produced in $p$+$p$ collisions. These measurements show no evidence of additional effects that could potentially arise from the more complex partonic environment present in proton-nucleus collisions. △ Less

Submitted 6 June, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

Comments: 322 authors from 70 institutions, 8 pages, 2 figures, 1 table, 2015 data. v2 is version accepted for publication in Phys. Rev. D. HEPData tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

arXiv:2303.01998 [pdf, other]

MLTEing Models: Negotiating, Evaluating, and Documenting Model and System Qualities

Authors: Katherine R. Maffey, Kyle Dotterrer, Jennifer Niemann, Iain Cruickshank, Grace A. Lewis, Christian Kästner

Abstract: Many organizations seek to ensure that machine learning (ML) and artificial intelligence (AI) systems work as intended in production but currently do not have a cohesive methodology in place to do so. To fill this gap, we propose MLTE (Machine Learning Test and Evaluation, colloquially referred to as "melt"), a framework and implementation to evaluate ML models and systems. The framework compiles… ▽ More Many organizations seek to ensure that machine learning (ML) and artificial intelligence (AI) systems work as intended in production but currently do not have a cohesive methodology in place to do so. To fill this gap, we propose MLTE (Machine Learning Test and Evaluation, colloquially referred to as "melt"), a framework and implementation to evaluate ML models and systems. The framework compiles state-of-the-art evaluation techniques into an organizational process for interdisciplinary teams, including model developers, software engineers, system owners, and other stakeholders. MLTE tooling supports this process by providing a domain-specific language that teams can use to express model requirements, an infrastructure to define, generate, and collect ML evaluation metrics, and the means to communicate results. △ Less

Submitted 3 March, 2023; originally announced March 2023.

Comments: Accepted to the NIER Track of the 45th International Conference on Software Engineering (ICSE 2023)

arXiv:2302.12911 [pdf, other]

doi 10.1103/PhysRevD.107.103505

CMB constraints on the early universe independent of late time cosmology

Authors: Pablo Lemos, Antony Lewis

Abstract: The CMB is a powerful probe of early-universe physics but is only observed after passing through large-scale structure, which changes the observed spectra in important model-dependent ways. This is of particular concern given recent claims of significant discrepancies with low redshift data sets when a standard $Λ$CDM model is assumed. By using empirical measurements of the CMB lensing reconstruct… ▽ More The CMB is a powerful probe of early-universe physics but is only observed after passing through large-scale structure, which changes the observed spectra in important model-dependent ways. This is of particular concern given recent claims of significant discrepancies with low redshift data sets when a standard $Λ$CDM model is assumed. By using empirical measurements of the CMB lensing reconstruction, combined with weak priors on the smoothness of the lensing spectrum, foregrounds, and shape of any additional integrated Sachs-Wolfe effect, we show how the early-universe parameters can be constrained from CMB observations almost independently of the late-time evolution. This provides a way to test new models for early-universe physics, and measure early-universe parameters, independently of late-time cosmology. Using the empirical measurement of lensing keeps the size of the effect of late-time modelling uncertainty under control, leading to only modest increases in error bars of most early-universe parameters compared to assuming a full evolution model. We provide robust constraints on early-$Λ$CDM model parameters using the latest Planck PR4 data and show that with future data marginalizing over a single lensing amplitude parameter is sufficient to remove sensitivity to late-time cosmological model only if the spectral shape matches predictions. △ Less

Submitted 16 May, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

Comments: 9 pages, 5 figures

Journal ref: Phys. Rev. D 107, 103505 (2023)

arXiv:2302.04999 [pdf, other]

Ablation Study on Features in Learning-based Joints Calibration of Cable-driven Surgical Robots

Authors: Haonan Peng, Andrew Lewis, Blake Hannaford

Abstract: With worldwide implementation, millions of surgeries are assisted by surgical robots. The cable-drive mechanism on many surgical robots allows flexible, light, and compact arms and tools. However, the slack and stretch of the cables and the backlash of the gears introduce inevitable errors from motor poses to joint poses, and thus forwarded to the pose and orientation of the end-effector. In this… ▽ More With worldwide implementation, millions of surgeries are assisted by surgical robots. The cable-drive mechanism on many surgical robots allows flexible, light, and compact arms and tools. However, the slack and stretch of the cables and the backlash of the gears introduce inevitable errors from motor poses to joint poses, and thus forwarded to the pose and orientation of the end-effector. In this paper, a learning-based calibration using a deep neural network is proposed, which reduces the unloaded pose RMSE of joints 1, 2, 3 to 0.3003 deg, 0.2888 deg, 0.1565 mm, and loaded pose RMSE of joints 1, 2, 3 to 0.4456 deg, 0.3052 deg, 0.1900 mm, respectively. Then, removal ablation and inaccurate ablation are performed to study which features of the DNN model contribute to the calibration accuracy. The results suggest that raw joint poses and motor torques are the most important features. For joint poses, the removal ablation shows that DNN model can derive this information from end-effector pose and orientation. For motor torques, the direction is much more important than amplitude. △ Less

Submitted 14 February, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

arXiv:2302.03588 [pdf, ps, other]

Basic convex analysis in metric spaces with bounded curvature

Authors: Adrian S. Lewis, Genaro López-Acedo, Adriana Nicolae

Abstract: Differentiable structure ensures that many of the basics of classical convex analysis extend naturally from Euclidean space to Riemannian manifolds. Without such structure, however, extensions are more challenging. Nonetheless, in Alexandrov spaces with curvature bounded above (but possibly positive), we develop several basic building blocks. We define subgradients via projection and the normal co… ▽ More Differentiable structure ensures that many of the basics of classical convex analysis extend naturally from Euclidean space to Riemannian manifolds. Without such structure, however, extensions are more challenging. Nonetheless, in Alexandrov spaces with curvature bounded above (but possibly positive), we develop several basic building blocks. We define subgradients via projection and the normal cone, prove their existence, and relate them to the classical affine minorant property. Then, in what amounts to a simple calculus or duality result, we develop a necessary optimality condition for minimizing the sum of two convex functions. △ Less

Submitted 24 November, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

arXiv:2302.02005 [pdf, other]

DeepAstroUDA: Semi-Supervised Universal Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection

Authors: A. Ćiprijanović, A. Lewis, K. Pedro, S. Madireddy, B. Nord, G. N. Perdue, S. M. Wild

Abstract: Artificial intelligence methods show great promise in increasing the quality and speed of work with large astronomical datasets, but the high complexity of these methods leads to the extraction of dataset-specific, non-robust features. Therefore, such methods do not generalize well across multiple datasets. We present a universal domain adaptation method, \textit{DeepAstroUDA}, as an approach to o… ▽ More Artificial intelligence methods show great promise in increasing the quality and speed of work with large astronomical datasets, but the high complexity of these methods leads to the extraction of dataset-specific, non-robust features. Therefore, such methods do not generalize well across multiple datasets. We present a universal domain adaptation method, \textit{DeepAstroUDA}, as an approach to overcome this challenge. This algorithm performs semi-supervised domain adaptation and can be applied to datasets with different data distributions and class overlaps. Non-overlapping classes can be present in any of the two datasets (the labeled source domain, or the unlabeled target domain), and the method can even be used in the presence of unknown classes. We apply our method to three examples of galaxy morphology classification tasks of different complexities ($3$-class and $10$-class problems), with anomaly detection: 1) datasets created after different numbers of observing years from a single survey (LSST mock data of $1$ and $10$ years of observations); 2) data from different surveys (SDSS and DECaLS); and 3) data from observing fields with different depths within one survey (wide field and Stripe 82 deep field of SDSS). For the first time, we demonstrate the successful use of domain adaptation between very discrepant observational datasets. \textit{DeepAstroUDA} is capable of bridging the gap between two astronomical surveys, increasing classification accuracy in both domains (up to $40\%$ on the unlabeled data), and making model performance consistent across datasets. Furthermore, our method also performs well as an anomaly detection algorithm and successfully clusters unknown class samples even in the unlabeled target dataset. △ Less

Submitted 22 March, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

Comments: Accepted in Machine Learning Science and Technology (MLST); 24 pages, 14 figures

Report number: FERMILAB-PUB-23-034-CSAID

arXiv:2301.09815 [pdf, ps, other]

Mixed Effects Random Forests for Personalised Predictions of Clinical Depression Severity

Authors: Robert A. Lewis, Asma Ghandeharioun, Szymon Fedor, Paola Pedrelli, Rosalind Picard, David Mischoulon

Abstract: This work demonstrates how mixed effects random forests enable accurate predictions of depression severity using multimodal physiological and digital activity data collected from an 8-week study involving 31 patients with major depressive disorder. We show that mixed effects random forests outperform standard random forests and personal average baselines when predicting clinical Hamilton Depressio… ▽ More This work demonstrates how mixed effects random forests enable accurate predictions of depression severity using multimodal physiological and digital activity data collected from an 8-week study involving 31 patients with major depressive disorder. We show that mixed effects random forests outperform standard random forests and personal average baselines when predicting clinical Hamilton Depression Rating Scale scores (HDRS_17). Compared to the latter baseline, accuracy is significantly improved for each patient by an average of 0.199-0.276 in terms of mean absolute error (p<0.05). This is noteworthy as these simple baselines frequently outperform machine learning methods in mental health prediction tasks. We suggest that this improved performance results from the ability of the mixed effects random forest to personalise model parameters to individuals in the dataset. However, we find that these improvements pertain exclusively to scenarios where labelled patient data are available to the model at training time. Investigating methods that improve accuracy when generalising to new patients is left as important future work. △ Less

Submitted 23 January, 2023; originally announced January 2023.

Comments: 9 pages

arXiv:2301.08492 [pdf, other]

WASP-39b: exo-Saturn with patchy cloud composition, moderate metallicity, and underdepleted S/O

Authors: Ludmila Carone, David A. Lewis, Dominic Samra, Aaron D. Schneider, Christiane Helling

Abstract: WASP-39b is one of the first extrasolar giant gas planets that has been observed within the JWST ERS program. Fundamental properties that may enable the link to exoplanet formation differ amongst retrieval methods, for example metallicity and mineral ratios. In this work, the formation of clouds in the atmosphere of WASP-39b is explored to investigate how inhomogeneous cloud properties (particle… ▽ More WASP-39b is one of the first extrasolar giant gas planets that has been observed within the JWST ERS program. Fundamental properties that may enable the link to exoplanet formation differ amongst retrieval methods, for example metallicity and mineral ratios. In this work, the formation of clouds in the atmosphere of WASP-39b is explored to investigate how inhomogeneous cloud properties (particle sizes, material composition, opacity) may be for this intermediately warm gaseous exoplanet. WASP-39b's atmosphere has a comparable day-night temperature median with sufficiently low temperatures that clouds may form globally. The presence of clouds on WASP-39b can explain observations without resorting to a high (> 100x solar) metallicity atmosphere for a reduced vertical mixing efficiency. The assessment of mineral ratios shows an under-depletion of S/O due to condensation compared to C/O, Mg/O, Si/O, Fe/O ratios. Vertical patchiness due to heterogeneous cloud composition challenges simple cloud models. An equal mixture of silicates and metal oxides is expected to characterise the cloud top. Further, optical properties of Fe and Mg silicates in the mid-infrared differ significantly which will impact the interpretation of JWST observations. We conclude that WASP-39b's atmosphere contains clouds and the underdepletion of S/O by atmospheric condensation processes suggest the use of sulphur gas species as a possible link to primordial element abundances. Over-simplified cloud models do not capture the complex nature of mixed-condensate clouds in exoplanet atmospheres. The clouds in the observable upper atmosphere of WASP-39b are a mixture of different silicates and metal oxides. The use of constant particles sizes and/or one-material cloud particles alone to interpret spectra may not be sufficient to capture the full complexity available through JWST observations. △ Less

Submitted 20 January, 2023; originally announced January 2023.

Comments: 21 pages, 18 figures, submitted to A&A on 22. November 2022, in review since 8. December 2022

arXiv:2211.06409 [pdf, other]

Capabilities for Better ML Engineering

Authors: Chenyang Yang, Rachel Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu

Abstract: In spite of machine learning's rapid growth, its engineering support is scattered in many forms, and tends to favor certain engineering stages, stakeholders, and evaluation preferences. We envision a capability-based framework, which uses fine-grained specifications for ML model behaviors to unite existing efforts towards better ML engineering. We use concrete scenarios (model design, debugging, a… ▽ More In spite of machine learning's rapid growth, its engineering support is scattered in many forms, and tends to favor certain engineering stages, stakeholders, and evaluation preferences. We envision a capability-based framework, which uses fine-grained specifications for ML model behaviors to unite existing efforts towards better ML engineering. We use concrete scenarios (model design, debugging, and maintenance) to articulate capabilities' broad applications across various different dimensions, and their impact on building safer, more generalizable and more trustworthy models that reflect human needs. Through preliminary experiments, we show capabilities' potential for reflecting model generalizability, which can provide guidance for ML engineering process. We discuss challenges and opportunities for capabilities' integration into ML engineering. △ Less

Submitted 10 February, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

arXiv:2211.00677 [pdf, other]

Semi-Supervised Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection

Authors: Aleksandra Ćiprijanović, Ashia Lewis, Kevin Pedro, Sandeep Madireddy, Brian Nord, Gabriel N. Perdue, Stefan M. Wild

Abstract: In the era of big astronomical surveys, our ability to leverage artificial intelligence algorithms simultaneously for multiple datasets will open new avenues for scientific discovery. Unfortunately, simply training a deep neural network on images from one data domain often leads to very poor performance on any other dataset. Here we develop a Universal Domain Adaptation method DeepAstroUDA, capabl… ▽ More In the era of big astronomical surveys, our ability to leverage artificial intelligence algorithms simultaneously for multiple datasets will open new avenues for scientific discovery. Unfortunately, simply training a deep neural network on images from one data domain often leads to very poor performance on any other dataset. Here we develop a Universal Domain Adaptation method DeepAstroUDA, capable of performing semi-supervised domain alignment that can be applied to datasets with different types of class overlap. Extra classes can be present in any of the two datasets, and the method can even be used in the presence of unknown classes. For the first time, we demonstrate the successful use of domain adaptation on two very different observational datasets (from SDSS and DECaLS). We show that our method is capable of bridging the gap between two astronomical surveys, and also performs well for anomaly detection and clustering of unknown data in the unlabeled dataset. We apply our model to two examples of galaxy morphology classification tasks with anomaly detection: 1) classifying spiral and elliptical galaxies with detection of merging galaxies (three classes including one unknown anomaly class); 2) a more granular problem where the classes describe more detailed morphological properties of galaxies, with the detection of gravitational lenses (ten classes including one unknown anomaly class). △ Less

Submitted 11 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

Comments: 3 figures, 1 table; accepted to Machine Learning and the Physical Sciences - Workshop at the 36th conference on Neural Information Processing Systems (NeurIPS)

Report number: FERMILAB-CONF-22-791-SCD

Showing 1–50 of 469 results for author: Lewis, A