-
EngineBench: Flow Reconstruction in the Transparent Combustion Chamber III Optical Engine
Authors:
Samuel J. Baker,
Michael A. Hobley,
Isabel Scherl,
Xiaohang Fang,
Felix C. P. Leach,
Martin H. Davy
Abstract:
We present EngineBench, the first machine learning (ML) oriented database to use high quality experimental data for the study of turbulent flows inside combustion machinery. Prior datasets for ML in fluid mechanics are synthetic or use overly simplistic geometries. EngineBench is comprised of real-world particle image velocimetry (PIV) data that captures the turbulent airflow patterns in a special…
▽ More
We present EngineBench, the first machine learning (ML) oriented database to use high quality experimental data for the study of turbulent flows inside combustion machinery. Prior datasets for ML in fluid mechanics are synthetic or use overly simplistic geometries. EngineBench is comprised of real-world particle image velocimetry (PIV) data that captures the turbulent airflow patterns in a specially-designed optical engine. However, in PIV data from internal flows, such as from engines, it is often challenging to achieve a full field of view and large occlusions can be present. In order to design optimal combustion systems, insight into the turbulent flows in these obscured areas is needed, which can be provided via inpainting models. Here we propose a novel inpainting task using random edge gaps, a technique that emphasises realism by introducing occlusions at random sizes and orientations at the edges of the PIV images. We test five ML methods on random edge gaps using pixel-wise, vector-based, and multi-scale performance metrics. We find that UNet-based models are more accurate than the industry-norm non-parametric approach and the context encoder at this task on both small and large gap sizes. The dataset and inpainting task presented in this paper support the development of more general-purpose pre-trained ML models for engine design problems. The method comparisons allow for more informed selection of ML models for problems in experimental flow diagnostics. All data and code are publicly available at https://eng.ox.ac.uk/tpsrg/research/enginebench/.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Simulating optically-active spin defects with a quantum computer
Authors:
Jack S. Baker,
Pablo A. M. Casares,
Modjtaba Shokrian Zini,
Jaydeep Thik,
Debasish Banerjee,
Chen Ling,
Alain Delgado,
Juan Miguel Arrazola
Abstract:
There is a pressing need for more accurate computational simulations of the opto-electronic properties of defects in materials to aid in the development of quantum sensing platforms. In this work, we explore how quantum computers could be effectively utilized for this purpose. Specifically, we develop fault-tolerant quantum algorithms to simulate optically active defect states and their radiative…
▽ More
There is a pressing need for more accurate computational simulations of the opto-electronic properties of defects in materials to aid in the development of quantum sensing platforms. In this work, we explore how quantum computers could be effectively utilized for this purpose. Specifically, we develop fault-tolerant quantum algorithms to simulate optically active defect states and their radiative emission rates. We employ quantum defect embedding theory to translate the Hamiltonian of a defect-containing supercell into a smaller, effective Hamiltonian that accounts for dielectric screening effects. Our approach integrates block-encoding of the dipole operator with quantum phase estimation to selectively sample the optically active excited states that exhibit the largest dipole transition amplitudes. We also provide estimates of the quantum resources required to simulate a negatively-charged boron vacancy in a hexagonal boron nitride cluster. We conclude by offering a forward-looking perspective on the potential of quantum computers to enhance quantum sensor capabilities and identify specific scenarios where quantum computing can resolve problems traditionally challenging for classical computers.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Evaluating Algorithmic Bias in Models for Predicting Academic Performance of Filipino Students
Authors:
Valdemar Švábenský,
Mélina Verger,
Maria Mercedes T. Rodrigo,
Clarence James G. Monterozo,
Ryan S. Baker,
Miguel Zenon Nicanor Lerias Saavedra,
Sébastien Lallé,
Atsushi Shimada
Abstract:
Algorithmic bias is a major issue in machine learning models in educational contexts. However, it has not yet been studied thoroughly in Asian learning contexts, and only limited work has considered algorithmic bias based on regional (sub-national) background. As a step towards addressing this gap, this paper examines the population of 5,986 students at a large university in the Philippines, inves…
▽ More
Algorithmic bias is a major issue in machine learning models in educational contexts. However, it has not yet been studied thoroughly in Asian learning contexts, and only limited work has considered algorithmic bias based on regional (sub-national) background. As a step towards addressing this gap, this paper examines the population of 5,986 students at a large university in the Philippines, investigating algorithmic bias based on students' regional background. The university used the Canvas learning management system (LMS) in its online courses across a broad range of domains. Over the period of three semesters, we collected 48.7 million log records of the students' activity in Canvas. We used these logs to train binary classification models that predict student grades from the LMS activity. The best-performing model reached AUC of 0.75 and weighted F1-score of 0.79. Subsequently, we examined the data for bias based on students' region. Evaluation using three metrics: AUC, weighted F1-score, and MADD showed consistent results across all demographic groups. Thus, no unfairness was observed against a particular student group in the grade predictions.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Discovery of a dormant 33 solar-mass black hole in pre-release Gaia astrometry
Authors:
Gaia Collaboration,
P. Panuzzo,
T. Mazeh,
F. Arenou,
B. Holl,
E. Caffau,
A. Jorissen,
C. Babusiaux,
P. Gavras,
J. Sahlmann,
U. Bastian,
Ł. Wyrzykowski,
L. Eyer,
N. Leclerc,
N. Bauchet,
A. Bombrun,
N. Mowlavi,
G. M. Seabroke,
D. Teyssier,
E. Balbinot,
A. Helmi,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne
, et al. (390 additional authors not shown)
Abstract:
Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is exp…
▽ More
Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is expected to uncover many Galactic wide-binary systems containing dormant BHs, which may not have been detected before. The study of this population will provide new information on the BH-mass distribution in binaries and shed light on their formation mechanisms and progenitors. As part of the validation efforts in preparation for the fourth Gaia data release (DR4), we analysed the preliminary astrometric binary solutions, obtained by the Gaia Non-Single Star pipeline, to verify their significance and to minimise false-detection rates in high-mass-function orbital solutions. The astrometric binary solution of one source, Gaia BH3, implies the presence of a 32.70 \pm 0.82 M\odot BH in a binary system with a period of 11.6 yr. Gaia radial velocities independently validate the astrometric orbit. Broad-band photometric and spectroscopic data show that the visible component is an old, very metal-poor giant of the Galactic halo, at a distance of 590 pc. The BH in the Gaia BH3 system is more massive than any other Galactic stellar-origin BH known thus far. The low metallicity of the star companion supports the scenario that metal-poor massive stars are progenitors of the high-mass BHs detected by gravitational-wave telescopes. The Galactic orbit of the system and its metallicity indicate that it might belong to the Sequoia halo substructure. Alternatively, and more plausibly, it could belong to the ED-2 stream, which likely originated from a globular cluster that had been disrupted by the Milky Way.
△ Less
Submitted 19 April, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
On Fixing the Right Problems in Predictive Analytics: AUC Is Not the Problem
Authors:
Ryan S. Baker,
Nigel Bosch,
Stephen Hutt,
Andres F. Zambrano,
Alex J. Bowers
Abstract:
Recently, ACM FAccT published an article by Kwegyir-Aggrey and colleagues (2023), critiquing the use of AUC ROC in predictive analytics in several domains. In this article, we offer a critique of that article. Specifically, we highlight technical inaccuracies in that paper's comparison of metrics, mis-specification of the interpretation and goals of AUC ROC, the article's use of the accuracy metri…
▽ More
Recently, ACM FAccT published an article by Kwegyir-Aggrey and colleagues (2023), critiquing the use of AUC ROC in predictive analytics in several domains. In this article, we offer a critique of that article. Specifically, we highlight technical inaccuracies in that paper's comparison of metrics, mis-specification of the interpretation and goals of AUC ROC, the article's use of the accuracy metric as a gold standard for comparison to AUC ROC, and the article's application of critiques solely to AUC ROC for concerns that would apply to the use of any metric. We conclude with a re-framing of the very valid concerns raised in that article, and discuss how the use of AUC ROC can remain a valid and appropriate practice in a well-informed predictive analytics approach taking those concerns into account. We conclude by discussing the combined use of multiple metrics, including machine learning bias metrics, and AUC ROC's place in such an approach. Like broccoli, AUC ROC is healthy, but also like broccoli, researchers and practitioners in our field shouldn't eat a diet of only AUC ROC.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Comparison of Three Programming Error Measures for Explaining Variability in CS1 Grades
Authors:
Valdemar Švábenský,
Maciej Pankiewicz,
Jiayi Zhang,
Elizabeth B. Cloude,
Ryan S. Baker,
Eric Fouh
Abstract:
Programming courses can be challenging for first year university students, especially for those without prior coding experience. Students initially struggle with code syntax, but as more advanced topics are introduced across a semester, the difficulty in learning to program shifts to learning computational thinking (e.g., debugging strategies). This study examined the relationships between student…
▽ More
Programming courses can be challenging for first year university students, especially for those without prior coding experience. Students initially struggle with code syntax, but as more advanced topics are introduced across a semester, the difficulty in learning to program shifts to learning computational thinking (e.g., debugging strategies). This study examined the relationships between students' rate of programming errors and their grades on two exams. Using an online integrated development environment, data were collected from 280 students in a Java programming course. The course had two parts. The first focused on introductory procedural programming and culminated with exam 1, while the second part covered more complex topics and object-oriented programming and ended with exam 2. To measure students' programming abilities, 51095 code snapshots were collected from students while they completed assignments that were autograded based on unit tests. Compiler and runtime errors were extracted from the snapshots, and three measures -- Error Count, Error Quotient and Repeated Error Density -- were explored to identify the best measure explaining variability in exam grades. Models utilizing Error Quotient outperformed the models using the other two measures, in terms of the explained variability in grades and Bayesian Information Criterion. Compiler errors were significant predictors of exam 1 grades but not exam 2 grades; only runtime errors significantly predicted exam 2 grades. The findings indicate that leveraging Error Quotient with multiple error types (compiler and runtime) may be a better measure of students' introductory programming abilities, though still not explaining most of the observed variability.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Convergence of Iterative Quadratic Programming for Robust Fixed-Endpoint Transfer of Bilinear Systems
Authors:
Luke S. Baker,
Andre Luiz P. de Lima,
Anatoly Zlotnik,
Jr-Shin Li
Abstract:
We present a computational method for open-loop minimum-norm control synthesis for fixed-endpoint transfer of bilinear ensemble systems that are indexed by two continuously varying parameters. We suppose that one ensemble parameter scales the homogeneous, linear part of the dynamics, and the second parameter scales the effect of the applied control inputs on the inhomogeneous, bilinear dynamics. T…
▽ More
We present a computational method for open-loop minimum-norm control synthesis for fixed-endpoint transfer of bilinear ensemble systems that are indexed by two continuously varying parameters. We suppose that one ensemble parameter scales the homogeneous, linear part of the dynamics, and the second parameter scales the effect of the applied control inputs on the inhomogeneous, bilinear dynamics. This class of dynamical systems is motivated by robust quantum control pulse synthesis, where the ensemble parameters correspond to uncertainty in the free Hamiltonian and inhomogeneity in the control Hamiltonian, respectively. Our computational method is based on polynomial approximation of the ensemble state in parameter space and discretization of the evolution equations in the time domain using a product of matrix exponentials corresponding to zero-order hold controls over the time intervals. The dynamics are successively linearized about control and trajectory iterates to formulate a sequence of quadratic programs for computing perturbations to the control that successively improve the objective until the iteration converges. We use a two-stage computation to first ensure transfer to the desired terminal state, and then minimize the norm of the control function. The method is demonstrated for the canonical uniform transfer problem for the Bloch system that appears in nuclear magnetic resonance, as well as the matter-wave splitting problem for the Raman-Nath system that appears in ultra-cold atom interferometry.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Navigating Compiler Errors with AI Assistance -- A Study of GPT Hints in an Introductory Programming Course
Authors:
Maciej Pankiewicz,
Ryan S. Baker
Abstract:
We examined the efficacy of AI-assisted learning in an introductory programming course at the university level by using a GPT-4 model to generate personalized hints for compiler errors within a platform for automated assessment of programming assignments. The control group had no access to GPT hints. In the experimental condition GPT hints were provided when a compiler error was detected, for the…
▽ More
We examined the efficacy of AI-assisted learning in an introductory programming course at the university level by using a GPT-4 model to generate personalized hints for compiler errors within a platform for automated assessment of programming assignments. The control group had no access to GPT hints. In the experimental condition GPT hints were provided when a compiler error was detected, for the first half of the problems in each module. For the latter half of the module, hints were disabled. Students highly rated the usefulness of GPT hints. In affect surveys, the experimental group reported significantly higher levels of focus and lower levels of confrustion (confusion and frustration) than the control group. For the six most commonly occurring error types we observed mixed results in terms of performance when access to GPT hints was enabled for the experimental group. However, in the absence of GPT hints, the experimental group's performance surpassed the control group for five out of the six error types.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1092 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…
▽ More
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
△ Less
Submitted 14 June, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
On the strong separation condition for self-similar iterated function systems with random translations
Authors:
Simon Baker,
Derong Kong,
Zhiqiang Wang
Abstract:
Given a self-similar iterated function system $Φ=\{ φ_i(x)=ρ_i O_i x+t_i \}_{i=1}^m$ acting on $\mathbb{R}^d$, we can generate a parameterised family of iterated function systems by replacing each $t_i$ with a random vector in $\mathbb{R}^d$. In this paper we study whether a Lebesgue typical member of this family will satisfy the strong separation condition. Our main results show that if the simil…
▽ More
Given a self-similar iterated function system $Φ=\{ φ_i(x)=ρ_i O_i x+t_i \}_{i=1}^m$ acting on $\mathbb{R}^d$, we can generate a parameterised family of iterated function systems by replacing each $t_i$ with a random vector in $\mathbb{R}^d$. In this paper we study whether a Lebesgue typical member of this family will satisfy the strong separation condition. Our main results show that if the similarity dimension of $Φ$ is sufficiently small, then a Lebesgue typical member of this family will satisfy the strong separation condition.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
Train Small, Model Big: Scalable Physics Simulators via Reduced Order Modeling and Domain Decomposition
Authors:
Seung Whan Chung,
Youngsoo Choi,
Pratanu Roy,
Thomas Moore,
Thomas Roy,
Tiras Y. Lin,
Du Y. Nguyen,
Christopher Hahn,
Eric B. Duoss,
Sarah E. Baker
Abstract:
Numerous cutting-edge scientific technologies originate at the laboratory scale, but transitioning them to practical industry applications is a formidable challenge. Traditional pilot projects at intermediate scales are costly and time-consuming. An alternative, the E-pilot, relies on high-fidelity numerical simulations, but even these simulations can be computationally prohibitive at larger scale…
▽ More
Numerous cutting-edge scientific technologies originate at the laboratory scale, but transitioning them to practical industry applications is a formidable challenge. Traditional pilot projects at intermediate scales are costly and time-consuming. An alternative, the E-pilot, relies on high-fidelity numerical simulations, but even these simulations can be computationally prohibitive at larger scales. To overcome these limitations, we propose a scalable, physics-constrained reduced order model (ROM) method. ROM identifies critical physics modes from small-scale unit components, projecting governing equations onto these modes to create a reduced model that retains essential physics details. We also employ Discontinuous Galerkin Domain Decomposition (DG-DD) to apply ROM to unit components and interfaces, enabling the construction of large-scale global systems without data at such large scales. This method is demonstrated on the Poisson and Stokes flow equations, showing that it can solve equations about $15 - 40$ times faster with only $\sim$ $1\%$ relative error. Furthermore, ROM takes one order of magnitude less memory than the full order model, enabling larger scale predictions at a given memory limitation.
△ Less
Submitted 5 December, 2023;
originally announced January 2024.
-
Polynomial Fourier decay for fractal measures and their pushforwards
Authors:
Simon Baker,
Amlan Banaji
Abstract:
We prove that the pushforwards of a very general class of fractal measures $μ$ on $\mathbb{R}^d$ under a large family of non-linear maps $F \colon \mathbb{R}^d \to \mathbb{R}$ exhibit polynomial Fourier decay: there exist $C,η>0$ such that $|\widehat{Fμ}(ξ)|\leq C|ξ|^{-η}$ for all $ξ\neq 0$. Using this, we prove that if $Φ= \{ \varphi_a \colon [0,1] \to [0,1] \}_{a \in \mathcal{A}}$ is an iterated…
▽ More
We prove that the pushforwards of a very general class of fractal measures $μ$ on $\mathbb{R}^d$ under a large family of non-linear maps $F \colon \mathbb{R}^d \to \mathbb{R}$ exhibit polynomial Fourier decay: there exist $C,η>0$ such that $|\widehat{Fμ}(ξ)|\leq C|ξ|^{-η}$ for all $ξ\neq 0$. Using this, we prove that if $Φ= \{ \varphi_a \colon [0,1] \to [0,1] \}_{a \in \mathcal{A}}$ is an iterated function system consisting of analytic contractions, and there exists $a \in \mathcal{A}$ such that $\varphi_a$ is not an affine map, then every non-atomic self-conformal measure for $Φ$ has polynomial Fourier decay; this result was obtained simultaneously by Algom, Rodriguez Hertz, and Wang. We prove applications related to the Fourier uniqueness problem, Fractal Uncertainty Principles, and normal numbers in fractal sets.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Using Think-Aloud Data to Understand Relations between Self-Regulation Cycle Characteristics and Student Performance in Intelligent Tutoring Systems
Authors:
Conrad Borchers,
Jiayi Zhang,
Ryan S. Baker,
Vincent Aleven
Abstract:
Numerous studies demonstrate the importance of self-regulation during learning by problem-solving. Recent work in learning analytics has largely examined students' use of SRL concerning overall learning gains. Limited research has related SRL to in-the-moment performance differences among learners. The present study investigates SRL behaviors in relationship to learners' moment-by-moment performan…
▽ More
Numerous studies demonstrate the importance of self-regulation during learning by problem-solving. Recent work in learning analytics has largely examined students' use of SRL concerning overall learning gains. Limited research has related SRL to in-the-moment performance differences among learners. The present study investigates SRL behaviors in relationship to learners' moment-by-moment performance while working with intelligent tutoring systems for stoichiometry chemistry. We demonstrate the feasibility of labeling SRL behaviors based on AI-generated think-aloud transcripts, identifying the presence or absence of four SRL categories (processing information, planning, enacting, and realizing errors) in each utterance. Using the SRL codes, we conducted regression analyses to examine how the use of SRL in terms of presence, frequency, cyclical characteristics, and recency relate to student performance on subsequent steps in multi-step problems. A model considering students' SRL cycle characteristics outperformed a model only using in-the-moment SRL assessment. In line with theoretical predictions, students' actions during earlier, process-heavy stages of SRL cycles exhibited lower moment-by-moment correctness during problem-solving than later SRL cycle stages. We discuss system re-design opportunities to add SRL support during stages of processing and paths forward for using machine learning to speed research depending on the assessment of SRL based on transcription of think-aloud data.
△ Less
Submitted 9 December, 2023;
originally announced December 2023.
-
Explainable AI is Responsible AI: How Explainability Creates Trustworthy and Socially Responsible Artificial Intelligence
Authors:
Stephanie Baker,
Wei Xiang
Abstract:
Artificial intelligence (AI) has been clearly established as a technology with the potential to revolutionize fields from healthcare to finance - if developed and deployed responsibly. This is the topic of responsible AI, which emphasizes the need to develop trustworthy AI systems that minimize bias, protect privacy, support security, and enhance transparency and accountability. Explainable AI (XA…
▽ More
Artificial intelligence (AI) has been clearly established as a technology with the potential to revolutionize fields from healthcare to finance - if developed and deployed responsibly. This is the topic of responsible AI, which emphasizes the need to develop trustworthy AI systems that minimize bias, protect privacy, support security, and enhance transparency and accountability. Explainable AI (XAI) has been broadly considered as a building block for responsible AI (RAI), with most of the literature considering it as a solution for improved transparency. This work proposes that XAI and responsible AI are significantly more deeply entwined. In this work, we explore state-of-the-art literature on RAI and XAI technologies. Based on our findings, we demonstrate that XAI can be utilized to ensure fairness, robustness, privacy, security, and transparency in a wide range of contexts. Our findings lead us to conclude that XAI is an essential foundation for every pillar of RAI.
△ Less
Submitted 3 December, 2023;
originally announced December 2023.
-
Cultural Bias and Cultural Alignment of Large Language Models
Authors:
Yan Tao,
Olga Viberg,
Ryan S. Baker,
Rene F. Kizilcec
Abstract:
Culture fundamentally shapes people's reasoning, behavior, and communication. As people increasingly use generative artificial intelligence (AI) to expedite and automate personal and professional tasks, cultural values embedded in AI models may bias people's authentic expression and contribute to the dominance of certain cultures. We conduct a disaggregated evaluation of cultural bias for five wid…
▽ More
Culture fundamentally shapes people's reasoning, behavior, and communication. As people increasingly use generative artificial intelligence (AI) to expedite and automate personal and professional tasks, cultural values embedded in AI models may bias people's authentic expression and contribute to the dominance of certain cultures. We conduct a disaggregated evaluation of cultural bias for five widely used large language models (OpenAI's GPT-4o/4-turbo/4/3.5-turbo/3) by comparing the models' responses to nationally representative survey data. All models exhibit cultural values resembling English-speaking and Protestant European countries. We test cultural prompting as a control strategy to increase cultural alignment for each country/territory. For recent models (GPT-4, 4-turbo, 4o), this improves the cultural alignment of the models' output for 71-81% of countries and territories. We suggest using cultural prompting and ongoing evaluation to reduce cultural bias in the output of generative AI.
△ Less
Submitted 26 June, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
Boundary Control for Suppressing Chaotic Response to Dynamic Hydrogen Blending in a Gas Pipeline
Authors:
Luke S. Baker,
Anatoly Zlotnik
Abstract:
It is known that periodic forcing of nonlinear flows can result in a chaotic response under certain conditions. Such non-periodic and chaotic solutions have been observed in simulations of heterogeneous gas flow in a pipeline with periodic, time-varying boundary conditions. In this paper, we examine a proportional feedback law for boundary control of a parabolic partial differential equation syste…
▽ More
It is known that periodic forcing of nonlinear flows can result in a chaotic response under certain conditions. Such non-periodic and chaotic solutions have been observed in simulations of heterogeneous gas flow in a pipeline with periodic, time-varying boundary conditions. In this paper, we examine a proportional feedback law for boundary control of a parabolic partial differential equation system that represents the flow of two gases through a pipe. We demonstrate that periodic variation of the mass fraction of the lighter gas at the pipe inlet can result in the chaotic propagation of gas pressure waves, and show that appropriate flow control can suppress this response. We examine phase space solutions for the single pipe system subject to boundary control, and use numerical experiments to characterize conditions for the controller gain to suppress chaos.
△ Less
Submitted 16 May, 2024; v1 submitted 7 November, 2023;
originally announced November 2023.
-
The CeBrA demonstrator for particle-$γ$ coincidence experiments at the FSU Super-Enge Split-Pole Spectrograph
Authors:
A. L. Conley,
B. Kelly,
M. Spieker,
R. Aggarwal,
S. Ajayi,
L. T. Baby,
S. Baker,
C. Benetti,
I. Conroy,
P. D. Cottle,
I. B. D`Amato,
P. DeRosa,
J. Esparza,
S. Genty,
K. Hanselman,
I. Hay,
M. Heinze,
D. Houlihan,
M. I. Khawaja,
P. S. Kielb,
A. N. Kuchera,
G. W. McCann,
A. B. Morelock,
E. Lopez-Saavedra,
R. Renom
, et al. (8 additional authors not shown)
Abstract:
We report on a highly selective experimental setup for particle-$γ$ coincidence experiments at the Super-Enge Split-Pole Spectrograph (SE-SPS) of the John D. Fox Superconducting Linear Accelerator Laboratory at Florida State University (FSU) using fast CeBr$_3$ scintillators for $γ$-ray detection. Specifically, we report on the results of characterization tests for the first five CeBr$_3$ scintill…
▽ More
We report on a highly selective experimental setup for particle-$γ$ coincidence experiments at the Super-Enge Split-Pole Spectrograph (SE-SPS) of the John D. Fox Superconducting Linear Accelerator Laboratory at Florida State University (FSU) using fast CeBr$_3$ scintillators for $γ$-ray detection. Specifically, we report on the results of characterization tests for the first five CeBr$_3$ scintillation detectors of the CeBr$_3$ Array (CeBrA) with respect to energy resolution and timing characteristics. We also present results from the first particle-$γ$ coincidence experiments successfully performed with the CeBrA demonstrator and the FSU SE-SPS. We show that with the new setup, $γ$-decay branching ratios and particle-$γ$ angular correlations can be measured very selectively using narrow excitation energy gates, which are possible thanks to the excellent particle energy resolution of the SE-SPS. In addition, we highlight that nuclear level lifetimes in the nanoseconds regime can be determined by measuring the time difference between particle detection with the SE-SPS focal-plane scintillator and $γ$-ray detection with the fast CeBrA detectors. Selective excitation energy gates with the SE-SPS exclude any feeding contributions to these lifetimes.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
Gaia Focused Product Release: Sources from Service Interface Function image analysis -- Half a million new sources in omega Centauri
Authors:
Gaia Collaboration,
K. Weingrill,
A. Mints,
J. Castañeda,
Z. Kostrzewa-Rutkowska,
M. Davidson,
F. De Angeli,
J. Hernández,
F. Torra,
M. Ramos-Lerate,
C. Babusiaux,
M. Biermann,
C. Crowley,
D. W. Evans,
L. Lindegren,
J. M. Martín-Fleitas,
L. Palaversa,
D. Ruz Mieres,
K. Tisanić,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
A. Barbier
, et al. (378 additional authors not shown)
Abstract:
Gaia's readout window strategy is challenged by very dense fields in the sky. Therefore, in addition to standard Gaia observations, full Sky Mapper (SM) images were recorded for nine selected regions in the sky. A new software pipeline exploits these Service Interface Function (SIF) images of crowded fields (CFs), making use of the availability of the full two-dimensional (2D) information. This ne…
▽ More
Gaia's readout window strategy is challenged by very dense fields in the sky. Therefore, in addition to standard Gaia observations, full Sky Mapper (SM) images were recorded for nine selected regions in the sky. A new software pipeline exploits these Service Interface Function (SIF) images of crowded fields (CFs), making use of the availability of the full two-dimensional (2D) information. This new pipeline produced half a million additional Gaia sources in the region of the omega Centauri ($ω$ Cen) cluster, which are published with this Focused Product Release. We discuss the dedicated SIF CF data reduction pipeline, validate its data products, and introduce their Gaia archive table. Our aim is to improve the completeness of the {\it Gaia} source inventory in a very dense region in the sky, $ω$ Cen. An adapted version of {\it Gaia}'s Source Detection and Image Parameter Determination software located sources in the 2D SIF CF images. We validated the results by comparing them to the public {\it Gaia} DR3 catalogue and external Hubble Space Telescope data. With this Focused Product Release, 526\,587 new sources have been added to the {\it Gaia} catalogue in $ω$ Cen. Apart from positions and brightnesses, the additional catalogue contains parallaxes and proper motions, but no meaningful colour information. While SIF CF source parameters generally have a lower precision than nominal {\it Gaia} sources, in the cluster centre they increase the depth of the combined catalogue by three magnitudes and improve the source density by a factor of ten. This first SIF CF data publication already adds great value to the {\it Gaia} catalogue. It demonstrates what to expect for the fourth {\it Gaia} catalogue, which will contain additional sources for all nine SIF CF regions.
△ Less
Submitted 8 November, 2023; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Gaia Focused Product Release: A catalogue of sources around quasars to search for strongly lensed quasars
Authors:
Gaia Collaboration,
A. Krone-Martins,
C. Ducourant,
L. Galluccio,
L. Delchambre,
I. Oreshina-Slezak,
R. Teixeira,
J. Braine,
J. -F. Le Campion,
F. Mignard,
W. Roux,
A. Blazere,
L. Pegoraro,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
A. Barbier,
M. Biermann,
O. L. Creevey,
D. W. Evans,
L. Eyer,
R. Guerra
, et al. (376 additional authors not shown)
Abstract:
Context. Strongly lensed quasars are fundamental sources for cosmology. The Gaia space mission covers the entire sky with the unprecedented resolution of $0.18$" in the optical, making it an ideal instrument to search for gravitational lenses down to the limiting magnitude of 21. Nevertheless, the previous Gaia Data Releases are known to be incomplete for small angular separations such as those ex…
▽ More
Context. Strongly lensed quasars are fundamental sources for cosmology. The Gaia space mission covers the entire sky with the unprecedented resolution of $0.18$" in the optical, making it an ideal instrument to search for gravitational lenses down to the limiting magnitude of 21. Nevertheless, the previous Gaia Data Releases are known to be incomplete for small angular separations such as those expected for most lenses. Aims. We present the Data Processing and Analysis Consortium GravLens pipeline, which was built to analyse all Gaia detections around quasars and to cluster them into sources, thus producing a catalogue of secondary sources around each quasar. We analysed the resulting catalogue to produce scores that indicate source configurations that are compatible with strongly lensed quasars. Methods. GravLens uses the DBSCAN unsupervised clustering algorithm to detect sources around quasars. The resulting catalogue of multiplets is then analysed with several methods to identify potential gravitational lenses. We developed and applied an outlier scoring method, a comparison between the average BP and RP spectra of the components, and we also used an extremely randomised tree algorithm. These methods produce scores to identify the most probable configurations and to establish a list of lens candidates. Results. We analysed the environment of 3 760 032 quasars. A total of 4 760 920 sources, including the quasars, were found within 6" of the quasar positions. This list is given in the Gaia archive. In 87\% of cases, the quasar remains a single source, and in 501 385 cases neighbouring sources were detected. We propose a list of 381 lensed candidates, of which we identified 49 as the most promising. Beyond these candidates, the associate tables in this Focused Product Release allow the entire community to explore the unique Gaia data for strong lensing studies further.
△ Less
Submitted 10 October, 2023;
originally announced October 2023.
-
Gaia Focused Product Release: Radial velocity time series of long-period variables
Authors:
Gaia Collaboration,
Gaia Collaboration,
M. Trabucchi,
N. Mowlavi,
T. Lebzelter,
I. Lecoeur-Taibi,
M. Audard,
L. Eyer,
P. García-Lario,
P. Gavras,
B. Holl,
G. Jevardat de Fombelle,
K. Nienartowicz,
L. Rimoldini,
P. Sartoretti,
R. Blomme,
Y. Frémat,
O. Marchal,
Y. Damerdji,
A. G. A. Brown,
A. Guerrier,
P. Panuzzo,
D. Katz,
G. M. Seabroke,
K. Benson
, et al. (382 additional authors not shown)
Abstract:
The third Gaia Data Release (DR3) provided photometric time series of more than 2 million long-period variable (LPV) candidates. Anticipating the publication of full radial-velocity (RV) in DR4, this Focused Product Release (FPR) provides RV time series for a selection of LPVs with high-quality observations. We describe the production and content of the Gaia catalog of LPV RV time series, and the…
▽ More
The third Gaia Data Release (DR3) provided photometric time series of more than 2 million long-period variable (LPV) candidates. Anticipating the publication of full radial-velocity (RV) in DR4, this Focused Product Release (FPR) provides RV time series for a selection of LPVs with high-quality observations. We describe the production and content of the Gaia catalog of LPV RV time series, and the methods used to compute variability parameters published in the Gaia FPR. Starting from the DR3 LPVs catalog, we applied filters to construct a sample of sources with high-quality RV measurements. We modeled their RV and photometric time series to derive their periods and amplitudes, and further refined the sample by requiring compatibility between the RV period and at least one of the $G$, $G_{\rm BP}$, or $G_{\rm RP}$ photometric periods. The catalog includes RV time series and variability parameters for 9\,614 sources in the magnitude range $6\lesssim G/{\rm mag}\lesssim 14$, including a flagged top-quality subsample of 6\,093 stars whose RV periods are fully compatible with the values derived from the $G$, $G_{\rm BP}$, and $G_{\rm RP}$ photometric time series. The RV time series contain a mean of 24 measurements per source taken unevenly over a duration of about three years. We identify the great most sources (88%) as genuine LPVs, with about half of them showing a pulsation period and the other half displaying a long secondary period. The remaining 12% consists of candidate ellipsoidal binaries. Quality checks against RVs available in the literature show excellent agreement. We provide illustrative examples and cautionary remarks. The publication of RV time series for almost 10\,000 LPVs constitutes, by far, the largest such database available to date in the literature. The availability of simultaneous photometric measurements gives a unique added value to the Gaia catalog (abridged)
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
On the cardinality and dimension of the slices of Okamoto's functions
Authors:
Simon Baker,
George Bender
Abstract:
The graphs of Okamoto's functions, denoted by $K_q$, are self-affine fractal curves contained in $[0,1]^2$, parameterised by $q \in (1,2)$. In this paper we consider the cardinality and dimension of the intersection of these curves with horizontal lines. Our first theorem proves that if $q$ is sufficiently close to $2$, then $K_q$ admits a horizontal slice with exactly three elements. Our second t…
▽ More
The graphs of Okamoto's functions, denoted by $K_q$, are self-affine fractal curves contained in $[0,1]^2$, parameterised by $q \in (1,2)$. In this paper we consider the cardinality and dimension of the intersection of these curves with horizontal lines. Our first theorem proves that if $q$ is sufficiently close to $2$, then $K_q$ admits a horizontal slice with exactly three elements. Our second theorem proves that if a horizontal slice of $K_q$ contains an uncountable number of elements then it has positive Hausdorff dimension provided $q$ is in a certain subset of $(1,2)$. Finally, we prove that if $q$ is a $k$-Bonacci number for some $k \in \mathbb{N}_{\geq 3}$, then the set of $y \in [0,1]$ such that the horizontal slice at height $y$ has $(2m+1)$ elements has positive Hausdorff dimension for any $m \in \mathbb{N}$. We also show that, under the same assumption on $q$, there is some horizontal slice whose cardinality is countably infinite.
△ Less
Submitted 3 November, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Grad DFT: a software library for machine learning enhanced density functional theory
Authors:
Pablo A. M. Casares,
Jack S. Baker,
Matija Medvidovic,
Roberto dos Reis,
Juan Miguel Arrazola
Abstract:
Density functional theory (DFT) stands as a cornerstone method in computational quantum chemistry and materials science due to its remarkable versatility and scalability. Yet, it suffers from limitations in accuracy, particularly when dealing with strongly correlated systems. To address these shortcomings, recent work has begun to explore how machine learning can expand the capabilities of DFT; an…
▽ More
Density functional theory (DFT) stands as a cornerstone method in computational quantum chemistry and materials science due to its remarkable versatility and scalability. Yet, it suffers from limitations in accuracy, particularly when dealing with strongly correlated systems. To address these shortcomings, recent work has begun to explore how machine learning can expand the capabilities of DFT; an endeavor with many open questions and technical challenges. In this work, we present Grad DFT: a fully differentiable JAX-based DFT library, enabling quick prototyping and experimentation with machine learning-enhanced exchange-correlation energy functionals. Grad DFT employs a pioneering parametrization of exchange-correlation functionals constructed using a weighted sum of energy densities, where the weights are determined using neural networks. Moreover, Grad DFT encompasses a comprehensive suite of auxiliary functions, notably featuring a just-in-time compilable and fully differentiable self-consistent iterative procedure. To support training and benchmarking efforts, we additionally compile a curated dataset of experimental dissociation energies of dimers, half of which contain transition metal atoms characterized by strong electronic correlations. The software library is tested against experimental results to study the generalization capabilities of a neural functional across potential energy surfaces and atomic species, as well as the effect of training data noise on the resulting model accuracy.
△ Less
Submitted 11 December, 2023; v1 submitted 22 September, 2023;
originally announced September 2023.
-
Breaking Free with AI: The Deconfinement Transition
Authors:
Christian Ermann,
Stephen Baker,
Mohamed M. Anber
Abstract:
Employing supervised machine learning techniques, we investigate the deconfinement phase transition within $4$-dimensional $SU(2)$ Yang-Mills (YM) theory, compactified on a small circle and endowed with center-stabilizing potential. This exploration encompasses scenarios both without and with matter in either the fundamental or adjoint representations. Central to our study is a profound duality re…
▽ More
Employing supervised machine learning techniques, we investigate the deconfinement phase transition within $4$-dimensional $SU(2)$ Yang-Mills (YM) theory, compactified on a small circle and endowed with center-stabilizing potential. This exploration encompasses scenarios both without and with matter in either the fundamental or adjoint representations. Central to our study is a profound duality relationship, intricately mapping the YM theory onto an XY-spin model with $\mathbb Z_p$-preserving perturbations. The parameter $p$ embodies the essence of the matter representation, with values of $p=1$ and $p=4$ for fundamental and adjoint representations, respectively, while $p=2$ corresponds to pure YM theory. The logistic regression method struggles to produce satisfactory results, particularly in predicting the transition temperature. Contrarily, convolutional neural networks (CNNs) exhibit remarkable prowess, effectively foreseeing critical temperatures in cases where $p=2$ and $p=4$. Furthermore, by harnessing CNNs, we compute critical exponents at the transition, aligning favorably with computations grounded in conventional order parameters. Taking our investigation a step further, we use CNNs to lend meaning to phases within YM theory with fundamental matter. Notably, this theory lacks conventional order parameters. Interestingly, CNNs manage to predict a transition temperature in this context. However, the fragility of this prediction under variations in the boundaries of the training window undermines its utility as a robust order parameter. This outcome underscores the constraints inherent in employing supervised machine learning techniques as innovative substitutes for traditional order parameters.
△ Less
Submitted 13 September, 2023;
originally announced September 2023.
-
Towards Generalizable Detection of Urgency of Discussion Forum Posts
Authors:
Valdemar Švábenský,
Ryan S. Baker,
Andrés Zambrano,
Yishan Zou,
Stefan Slater
Abstract:
Students who take an online course, such as a MOOC, use the course's discussion forum to ask questions or reach out to instructors when encountering an issue. However, reading and responding to students' questions is difficult to scale because of the time needed to consider each message. As a result, critical issues may be left unresolved, and students may lose the motivation to continue in the co…
▽ More
Students who take an online course, such as a MOOC, use the course's discussion forum to ask questions or reach out to instructors when encountering an issue. However, reading and responding to students' questions is difficult to scale because of the time needed to consider each message. As a result, critical issues may be left unresolved, and students may lose the motivation to continue in the course. To help address this problem, we build predictive models that automatically determine the urgency of each forum post, so that these posts can be brought to instructors' attention. This paper goes beyond previous work by predicting not just a binary decision cut-off but a post's level of urgency on a 7-point scale. First, we train and cross-validate several models on an original data set of 3,503 posts from MOOCs at University of Pennsylvania. Second, to determine the generalizability of our models, we test their performance on a separate, previously published data set of 29,604 posts from MOOCs at Stanford University. While the previous work on post urgency used only one data set, we evaluated the prediction across different data sets and courses. The best-performing model was a support vector regressor trained on the Universal Sentence Encoder embeddings of the posts, achieving an RMSE of 1.1 on the training set and 1.4 on the test set. Understanding the urgency of forum posts enables instructors to focus their time more effectively and, as a result, better support student learning.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Large Language Models (GPT) for automating feedback on programming assignments
Authors:
Maciej Pankiewicz,
Ryan S. Baker
Abstract:
Addressing the challenge of generating personalized feedback for programming assignments is demanding due to several factors, like the complexity of code syntax or different ways to correctly solve a task. In this experimental study, we automated the process of feedback generation by employing OpenAI's GPT-3.5 model to generate personalized hints for students solving programming assignments on an…
▽ More
Addressing the challenge of generating personalized feedback for programming assignments is demanding due to several factors, like the complexity of code syntax or different ways to correctly solve a task. In this experimental study, we automated the process of feedback generation by employing OpenAI's GPT-3.5 model to generate personalized hints for students solving programming assignments on an automated assessment platform. Students rated the usefulness of GPT-generated hints positively. The experimental group (with GPT hints enabled) relied less on the platform's regular feedback but performed better in terms of percentage of successful submissions across consecutive attempts for tasks, where GPT hints were enabled. For tasks where the GPT feedback was made unavailable, the experimental group needed significantly less time to solve assignments. Furthermore, when GPT hints were unavailable, students in the experimental condition were initially less likely to solve the assignment correctly. This suggests potential over-reliance on GPT-generated feedback. However, students in the experimental condition were able to correct reasonably rapidly, reaching the same percentage correct after seven submission attempts. The availability of GPT hints did not significantly impact students' affective state.
△ Less
Submitted 30 June, 2023;
originally announced July 2023.
-
Spectral gaps and Fourier dimension for self-conformal sets with overlaps
Authors:
Simon Baker,
Tuomas Sahlsten
Abstract:
We prove a uniform spectral gap for complex transfer operators near the critical line associated to overlapping $C^2$ iterated function systems on the real line satisfying a Uniform Non-Integrability (UNI) condition. Our work extends that of Naud (2005) on spectral gaps for nonlinear Cantor sets to allow overlaps. The proof builds a new method to reduce the problem of the lack of Markov structure…
▽ More
We prove a uniform spectral gap for complex transfer operators near the critical line associated to overlapping $C^2$ iterated function systems on the real line satisfying a Uniform Non-Integrability (UNI) condition. Our work extends that of Naud (2005) on spectral gaps for nonlinear Cantor sets to allow overlaps. The proof builds a new method to reduce the problem of the lack of Markov structure to average contraction of products of random Dolgopyat operators. This approach is inspired by a disintegration technique developed by Algom, the first author and Shmerkin in the study of normal numbers. As a consequence of the method of the second author and Stevens, our spectral gap result implies that the Fourier transform of any non-atomic self-conformal measure decays to zero at a polynomial rate for any $C^{2}$ iterated function system satisfying UNI. This latter result leads to Fractal Uncertainty Principles with arbitrary overlaps.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
Quantum-Classical Multiple Kernel Learning
Authors:
Ara Ghukasyan,
Jack S. Baker,
Oktay Goktas,
Juan Carrasquilla,
Santosh Kumar Radha
Abstract:
As quantum computers become increasingly practical, so does the prospect of using quantum computation to improve upon traditional algorithms. Kernel methods in machine learning is one area where such improvements could be realized in the near future. Paired with kernel methods like support-vector machines, small and noisy quantum computers can evaluate classically-hard quantum kernels that capture…
▽ More
As quantum computers become increasingly practical, so does the prospect of using quantum computation to improve upon traditional algorithms. Kernel methods in machine learning is one area where such improvements could be realized in the near future. Paired with kernel methods like support-vector machines, small and noisy quantum computers can evaluate classically-hard quantum kernels that capture unique notions of similarity in data. Taking inspiration from techniques in classical machine learning, this work investigates simulated quantum kernels in the context of multiple kernel learning (MKL). We consider pairwise combinations of several classical-classical, quantum-quantum, and quantum-classical kernels in an empirical investigation of their classification performance with support-vector machines. We also introduce a novel approach, which we call QCC-net (quantum-classical-convex neural network), for optimizing the weights of base kernels together with any kernel parameters. We show this approach to be effective for enhancing various performance metrics in an MKL setting. Looking at data with an increasing number of features (up to 13 dimensions), we find parameter training to be important for successfully weighting kernels in some combinations. Using the optimal kernel weights as indicators of relative utility, we find growing contributions from trainable quantum kernels in quantum-classical kernel combinations as the number of features increases. We observe the opposite trend for combinations containing simpler, non-parametric quantum kernels.
△ Less
Submitted 28 May, 2023;
originally announced May 2023.
-
Linear System Analysis and Optimal Control of Natural Gas Dynamics in Pipeline Networks
Authors:
Luke S. Baker,
Sachin Shivakumar,
Dieter Armbruster,
Rodrigo B. Platte,
Anatoly Zlotnik
Abstract:
We derive a linear system of ordinary differential equations (ODEs) to approximate the dynamics of natural gas in pipeline networks. Although a closed-form expression of the eigenvalues of the state matrix does not generally exist, the poles of an irrational transfer function corresponding to the linearized partial differential equations are used to approximate the eigenvalues of the ODE system. O…
▽ More
We derive a linear system of ordinary differential equations (ODEs) to approximate the dynamics of natural gas in pipeline networks. Although a closed-form expression of the eigenvalues of the state matrix does not generally exist, the poles of an irrational transfer function corresponding to the linearized partial differential equations are used to approximate the eigenvalues of the ODE system. Our analysis qualitatively demonstrates that the eigenvalues of the state matrix of the entire network system are "pipeline separable" in the sense that the eigenvalues are dominated by the individual pipeline parameters and not the incidence connectivity of the network graph. The linear system is used as the dynamic constraints of a linear optimal control problem (OCP) to design the control actions of compressor units to minimize the energy that they expend. The motivation of this work is to reduce the computational complexity of optimizing gas dynamics in large networks to meet the unpredictable and highly variable demand from electric generators. The linear and corresponding nonlinear OCPs are discretized in time to obtain linear and nonlinear optimization problems, which are demonstrated on a test network to illustrate the validity of linear programming. Moreover, an analytical bound on the error between the solutions of the linear and nonlinear flow dynamics is presented using Lyapunov functions and verified computationally by plotting the error against the size of the flow variation around the steady-state solution.
△ Less
Submitted 24 May, 2023; v1 submitted 11 May, 2023;
originally announced May 2023.
-
Parallel hybrid quantum-classical machine learning for kernelized time-series classification
Authors:
Jack S. Baker,
Gilchan Park,
Kwangmin Yu,
Ara Ghukasyan,
Oktay Goktas,
Santosh Kumar Radha
Abstract:
Supervised time-series classification garners widespread interest because of its applicability throughout a broad application domain including finance, astronomy, biosensors, and many others. In this work, we tackle this problem with hybrid quantum-classical machine learning, deducing pairwise temporal relationships between time-series instances using a time-series Hamiltonian kernel (TSHK). A TSH…
▽ More
Supervised time-series classification garners widespread interest because of its applicability throughout a broad application domain including finance, astronomy, biosensors, and many others. In this work, we tackle this problem with hybrid quantum-classical machine learning, deducing pairwise temporal relationships between time-series instances using a time-series Hamiltonian kernel (TSHK). A TSHK is constructed with a sum of inner products generated by quantum states evolved using a parameterized time evolution operator. This sum is then optimally weighted using techniques derived from multiple kernel learning. Because we treat the kernel weighting step as a differentiable convex optimization problem, our method can be regarded as an end-to-end learnable hybrid quantum-classical-convex neural network, or QCC-net, whose output is a data set-generalized kernel function suitable for use in any kernelized machine learning technique such as the support vector machine (SVM). Using our TSHK as input to a SVM, we classify univariate and multivariate time-series using quantum circuit simulators and demonstrate the efficient parallel deployment of the algorithm to 127-qubit superconducting quantum processors using quantum multi-programming.
△ Less
Submitted 17 February, 2024; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Quantitative recurrence and the shrinking target problem for overlapping iterated function systems
Authors:
Simon Baker,
Henna Koivusalo
Abstract:
In this paper we study quantitative recurrence and the shrinking target problem for dynamical systems coming from overlapping iterated function systems. Such iterated function systems have the important property that a point often has several distinct choices of forward orbit. As is demonstrated in this paper, this non-uniqueness leads to different behaviour to that observed in the traditional set…
▽ More
In this paper we study quantitative recurrence and the shrinking target problem for dynamical systems coming from overlapping iterated function systems. Such iterated function systems have the important property that a point often has several distinct choices of forward orbit. As is demonstrated in this paper, this non-uniqueness leads to different behaviour to that observed in the traditional setting where every point has a unique forward orbit.
We prove several almost sure results on the Lebesgue measure of the set of points satisfying a given recurrence rate, and on the Lebesgue measure of the set of points returning to a shrinking target infinitely often. In certain cases, when the Lebesgue measure is zero, we also obtain Hausdorff dimension bounds. One interesting aspect of our approach is that it allows us to handle targets that are not simply balls, but may have a more exotic geometry.
△ Less
Submitted 29 January, 2024; v1 submitted 2 May, 2023;
originally announced May 2023.
-
Transitions from Monotonicity to Chaos in Gas Mixture Dynamics in Pipeline Networks
Authors:
Luke S. Baker,
Saif R. Kazi,
Anatoly Zlotnik
Abstract:
The blending of hydrogen generated using clean energy into natural gas pipeline networks is proposed in order to utilize existing energy systems for their planned lifetimes while reducing their reliance on fossil fuels. We formulate a system of partial differential equations (PDEs) that govern the flow dynamics of mixtures of gases in pipeline networks under the influence of time-varying compresso…
▽ More
The blending of hydrogen generated using clean energy into natural gas pipeline networks is proposed in order to utilize existing energy systems for their planned lifetimes while reducing their reliance on fossil fuels. We formulate a system of partial differential equations (PDEs) that govern the flow dynamics of mixtures of gases in pipeline networks under the influence of time-varying compressor and regulator control actions. The formulation is derived for general gas networks that can inject or withdraw arbitrary time-varying mixtures of gases into or from the network at arbitrarily specified nodes. The PDE formulation is discretized in space to form a nonlinear control system that is used to prove that homogeneous mixtures are well-behaved and heterogeneous mixtures may be ill-behaved in the sense of monotone-ordering of solutions. We use numerical simulations to compute interfaces in the parameter region of sinusoidal boundary conditions that delimit monotonic, periodic, and chaotic system responses. The interfaces suggest that any solution in the monotonic response region is not chaotic and will eventually approach a periodic orbit. The results are demonstrated using examples for a single pipeline and a small test network.
△ Less
Submitted 24 July, 2023; v1 submitted 30 March, 2023;
originally announced March 2023.
-
SwinVFTR: A Novel Volumetric Feature-learning Transformer for 3D OCT Fluid Segmentation
Authors:
Sharif Amit Kamran,
Khondker Fariha Hossain,
Alireza Tavakkoli,
Salah A. Baker,
Stewart Lee Zuckerbrod
Abstract:
Accurately segmenting fluid in 3D volumetric optical coherence tomography (OCT) images is a crucial yet challenging task for detecting eye diseases. Traditional autoencoding-based segmentation approaches have limitations in extracting fluid regions due to successive resolution loss in the encoding phase and the inability to recover lost information in the decoding phase. Although current transform…
▽ More
Accurately segmenting fluid in 3D volumetric optical coherence tomography (OCT) images is a crucial yet challenging task for detecting eye diseases. Traditional autoencoding-based segmentation approaches have limitations in extracting fluid regions due to successive resolution loss in the encoding phase and the inability to recover lost information in the decoding phase. Although current transformer-based models for medical image segmentation addresses this limitation, they are not designed to be applied out-of-the-box for 3D OCT volumes, which have a wide-ranging channel-axis size based on different vendor device and extraction technique. To address these issues, we propose SwinVFTR, a new transformer-based architecture designed for precise fluid segmentation in 3D volumetric OCT images. We first utilize a channel-wise volumetric sampling for training on OCT volumes with varying depths (B-scans). Next, the model uses a novel shifted window transformer block in the encoder to achieve better localization and segmentation of fluid regions. Additionally, we propose a new volumetric attention block for spatial and depth-wise attention, which improves upon traditional residual skip connections. Consequently, utilizing multi-class dice loss, the proposed architecture outperforms other existing architectures on the three publicly available vendor-specific OCT datasets, namely Spectralis, Cirrus, and Topcon, with mean dice scores of 0.72, 0.59, and 0.68, respectively. Additionally, SwinVFTR outperforms other architectures in two additional relevant metrics, mean intersection-over-union (Mean-IOU) and structural similarity measure (SSIM).
△ Less
Submitted 17 March, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
A Quantum-Inspired Binary Optimization Algorithm for Representative Selection
Authors:
Anna G. Hughes,
Jack S. Baker,
Santosh Kumar Radha
Abstract:
Advancements in quantum computing are fuelling emerging applications across disciplines, including finance, where quantum and quantum-inspired algorithms can now make market predictions, detect fraud, and optimize portfolios. Expanding this toolbox, we propose the selector algorithm: a method for selecting the most representative subset of data from a larger dataset. The selected subset includes d…
▽ More
Advancements in quantum computing are fuelling emerging applications across disciplines, including finance, where quantum and quantum-inspired algorithms can now make market predictions, detect fraud, and optimize portfolios. Expanding this toolbox, we propose the selector algorithm: a method for selecting the most representative subset of data from a larger dataset. The selected subset includes data points that simultaneously meet the two requirements of being maximally close to neighboring data points and maximally far from more distant data points where the precise notion of distance is given by any kernel or generalized similarity function. The cost function encoding the above requirements naturally presents itself as a Quadratic Unconstrained Binary Optimization (QUBO) problem, which is well-suited for quantum optimization algorithms - including quantum annealing. While the selector algorithm has applications in multiple areas, it is particularly useful in finance, where it can be used to build a diversified portfolio from a more extensive selection of assets. After experimenting with synthetic datasets, we show two use cases for the selector algorithm with real data: (1) approximately reconstructing the NASDAQ 100 index using a subset of stocks, and (2) diversifying a portfolio of cryptocurrencies. In our analysis of use case (2), we compare the performance of two quantum annealers provided by D-Wave Systems.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.
-
SWIN-SFTNet : Spatial Feature Expansion and Aggregation using Swin Transformer For Whole Breast micro-mass segmentation
Authors:
Sharif Amit Kamran,
Khondker Fariha Hossain,
Alireza Tavakkoli,
George Bebis,
Sal Baker
Abstract:
Incorporating various mass shapes and sizes in training deep learning architectures has made breast mass segmentation challenging. Moreover, manual segmentation of masses of irregular shapes is time-consuming and error-prone. Though Deep Neural Network has shown outstanding performance in breast mass segmentation, it fails in segmenting micro-masses. In this paper, we propose a novel U-net-shaped…
▽ More
Incorporating various mass shapes and sizes in training deep learning architectures has made breast mass segmentation challenging. Moreover, manual segmentation of masses of irregular shapes is time-consuming and error-prone. Though Deep Neural Network has shown outstanding performance in breast mass segmentation, it fails in segmenting micro-masses. In this paper, we propose a novel U-net-shaped transformer-based architecture, called Swin-SFTNet, that outperforms state-of-the-art architectures in breast mammography-based micro-mass segmentation. Firstly to capture the global context, we designed a novel Spatial Feature Expansion and Aggregation Block(SFEA) that transforms sequential linear patches into a structured spatial feature. Next, we combine it with the local linear features extracted by the swin transformer block to improve overall accuracy. We also incorporate a novel embedding loss that calculates similarities between linear feature embeddings of the encoder and decoder blocks. With this approach, we achieve higher segmentation dice over the state-of-the-art by 3.10% on CBIS-DDSM, 3.81% on InBreast, and 3.13% on CBIS pre-trained model on the InBreast test data set.
△ Less
Submitted 16 November, 2022;
originally announced November 2022.
-
Quantum Variational Rewinding for Time Series Anomaly Detection
Authors:
Jack S. Baker,
Haim Horowitz,
Santosh Kumar Radha,
Stenio Fernandes,
Colin Jones,
Noorain Noorani,
Vladimir Skavysh,
Philippe Lamontangne,
Barry C. Sanders
Abstract:
Electron dynamics, financial markets and nuclear fission reactors, though seemingly unrelated, all produce observable characteristics evolving with time. Within this broad scope, departures from normal temporal behavior range from academically interesting to potentially catastrophic. New algorithms for time series anomaly detection (TAD) are therefore certainly in demand. With the advent of newly…
▽ More
Electron dynamics, financial markets and nuclear fission reactors, though seemingly unrelated, all produce observable characteristics evolving with time. Within this broad scope, departures from normal temporal behavior range from academically interesting to potentially catastrophic. New algorithms for time series anomaly detection (TAD) are therefore certainly in demand. With the advent of newly accessible quantum processing units (QPUs), exploring a quantum approach to TAD is now relevant and is the topic of this work. Our approach - Quantum Variational Rewinding, or, QVR - trains a family of parameterized unitary time-devolution operators to cluster normal time series instances encoded within quantum states. Unseen time series are assigned an anomaly score based upon their distance from the cluster center, which, beyond a given threshold, classifies anomalous behavior. After a first demonstration with a simple and didactic case, QVR is used to study the real problem of identifying anomalous behavior in cryptocurrency market data. Finally, multivariate time series from the cryptocurrency use case are studied using IBM's Falcon r5.11H family of superconducting transmon QPUs, where anomaly score errors resulting from hardware noise are shown to be reducible by as much as 20% using advanced error mitigation techniques.
△ Less
Submitted 2 November, 2022; v1 submitted 28 October, 2022;
originally announced October 2022.
-
Exploring players' experience of humor and snark in a grade 3-6 history practices game
Authors:
David J. Gagnon,
Ryan S. Baker,
Sarah Gagnon,
Luke Swanson,
Nick Spevacek,
Juliana Andres,
Erik Harpstead,
Jennifer Scianna,
Stefan Slater,
Maria O. C. Z. San Pedro
Abstract:
In this paper we use an existing history learning game with an active audience as a research platform for exploring how humor and "snarkiness" in the dialog script affect students' progression and attitudes about the game. We conducted a 2x2 randomized experiment with 11,804 anonymous 3rd-6th grade students. Using one-way ANOVA and Kruskall-Wallis tests, we find that changes to the script produced…
▽ More
In this paper we use an existing history learning game with an active audience as a research platform for exploring how humor and "snarkiness" in the dialog script affect students' progression and attitudes about the game. We conducted a 2x2 randomized experiment with 11,804 anonymous 3rd-6th grade students. Using one-way ANOVA and Kruskall-Wallis tests, we find that changes to the script produced measurable results in the self-reported perceived humor of the game and the likeability of the player character. Different scripts did not produce significant differences in player completion of the game, or how much of the game was played. Perceived humor and enjoyment of the game and its main character contributed significantly to progress in the game, as did self-perceived reading skill.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
The COVID-19 Pandemic and the Future of Telecommuting in the United States
Authors:
Deborah Salon,
Laura Mirtich,
Matthew Wigginton Bhagat-Conway,
Adam Costello,
Ehsan Rahimi,
Abolfazl,
Mohammadian,
Rishabh Singh Chauhan,
Sybil Derrible,
Denise da Silva Baker,
Ram M. Pendyala
Abstract:
This study focuses on an important transport-related long-term effect of the COVID-19 pandemic in the United States: an increase in telecommuting. Analyzing a nationally representative panel survey of adults, we find that 40-50% of workers expect to telecommute at least a few times per month post-pandemic, up from 24% pre-COVID. If given the option, 90-95% of those who first telecommuted during th…
▽ More
This study focuses on an important transport-related long-term effect of the COVID-19 pandemic in the United States: an increase in telecommuting. Analyzing a nationally representative panel survey of adults, we find that 40-50% of workers expect to telecommute at least a few times per month post-pandemic, up from 24% pre-COVID. If given the option, 90-95% of those who first telecommuted during the pandemic plan to continue the practice regularly. We also find that new telecommuters are demographically similar to pre-COVID telecommuters. Both pre- and post-COVID, higher educational attainment and income, together with certain job categories, largely determine whether workers have the option to telecommute. Despite growth in telecommuting, approximately half of workers expect to remain unable to telecommute and between 2/3 and 3/4 of workers expect their post-pandemic telecommuting patterns to be unchanged from their pre-COVID patterns. This limits the contribution telecommuting can make to reducing peak hour transport demand.
△ Less
Submitted 30 September, 2022;
originally announced October 2022.
-
Recurrence rates for shifts of finite type
Authors:
Demi Allen,
Simon Baker,
Balázs Bárány
Abstract:
Let $Σ_{A}$ be a topologically mixing shift of finite type, let $σ:Σ_{A}\toΣ_{A}$ be the usual left-shift, and let $μ$ be the Gibbs measure for a Hölder continuous potential that is not cohomologous to a constant. In this paper we study recurrence rates for the dynamical system $(Σ_{A},σ)$ that hold $μ$-almost surely. In particular, given a function $ψ:\mathbb{N}\to \mathbb{N}$ we are interested i…
▽ More
Let $Σ_{A}$ be a topologically mixing shift of finite type, let $σ:Σ_{A}\toΣ_{A}$ be the usual left-shift, and let $μ$ be the Gibbs measure for a Hölder continuous potential that is not cohomologous to a constant. In this paper we study recurrence rates for the dynamical system $(Σ_{A},σ)$ that hold $μ$-almost surely. In particular, given a function $ψ:\mathbb{N}\to \mathbb{N}$ we are interested in the following set $$R_ψ=\{{\texttt i}\in Σ_{A}:i_{n+1}\ldots i_{n+ψ(n)+1}=i_1\ldots i_{ψ(n)}\textrm{ for infinitely many }n\in\mathbb{N}\}.$$
We provide sufficient conditions for $μ(R_ψ)=1$ and sufficient conditions for $μ(R_ψ)=0$. As a corollary of these results, we discover a new critical threshold where the measure of $R_ψ$ transitions from zero to one. This threshold was previously unknown even in the special case of a non-uniform Bernoulli measure defined on the full shift. The proofs of our results combine ideas from Probability Theory and Thermodynamic Formalism. In our final section we apply our results to the study of dynamics on self-similar sets.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
Gaia Data Release 3: Summary of the content and survey properties
Authors:
Gaia Collaboration,
A. Vallenari,
A. G. A. Brown,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
L. Eyer,
R. Guerra,
A. Hutton,
C. Jordi,
S. A. Klioner,
U. L. Lammers,
L. Lindegren,
X. Luri,
F. Mignard,
C. Panem,
D. Pourbaix,
S. Randich,
P. Sartoretti,
C. Soubiran
, et al. (431 additional authors not shown)
Abstract:
We present the third data release of the European Space Agency's Gaia mission, GDR3. The GDR3 catalogue is the outcome of the processing of raw data collected with the Gaia instruments during the first 34 months of the mission by the Gaia Data Processing and Analysis Consortium. The GDR3 catalogue contains the same source list, celestial positions, proper motions, parallaxes, and broad band photom…
▽ More
We present the third data release of the European Space Agency's Gaia mission, GDR3. The GDR3 catalogue is the outcome of the processing of raw data collected with the Gaia instruments during the first 34 months of the mission by the Gaia Data Processing and Analysis Consortium. The GDR3 catalogue contains the same source list, celestial positions, proper motions, parallaxes, and broad band photometry in the G, G$_{BP}$, and G$_{RP}$ pass-bands already present in the Early Third Data Release. GDR3 introduces an impressive wealth of new data products. More than 33 million objects in the ranges $G_{rvs} < 14$ and $3100 <T_{eff} <14500 $, have new determinations of their mean radial velocities based on data collected by Gaia. We provide G$_{rvs}$ magnitudes for most sources with radial velocities, and a line broadening parameter is listed for a subset of these. Mean Gaia spectra are made available to the community. The GDR3 catalogue includes about 1 million mean spectra from the radial velocity spectrometer, and about 220 million low-resolution blue and red prism photometer BPRP mean spectra. The results of the analysis of epoch photometry are provided for some 10 million sources across 24 variability types. GDR3 includes astrophysical parameters and source class probabilities for about 470 million and 1500 million sources, respectively, including stars, galaxies, and quasars. Orbital elements and trend parameters are provided for some $800\,000$ astrometric, spectroscopic and eclipsing binaries. More than $150\,000$ Solar System objects, including new discoveries, with preliminary orbital solutions and individual epoch observations are part of this release. Reflectance spectra derived from the epoch BPRP spectral data are published for about 60\,000 asteroids. Finally, an additional data set is provided, namely the Gaia Andromeda Photometric Survey (abridged)
△ Less
Submitted 30 July, 2022;
originally announced August 2022.
-
Solid confirmation of the broad DIB around 864.8 nm using stacked Gaia-RVS spectra
Authors:
H. Zhao,
M. Schultheis,
T. Zwitter,
C. A. L. Bailer-Jones,
P. Panuzzo,
P. Sartoretti,
G. M. Seabroke,
A. Recio-Blanco,
P. de Laverny,
G. Kordopatis,
O. L. Creevey,
T. E. Dharmawardena,
Y. Frémat,
R. Sordo,
R. Drimmel,
D. J. Marshall,
P. A. Palicio,
G. Contursi,
M. A. Álvarez,
S. Baker,
K. Benson,
M. Cropper,
C. Dolding,
H. E. Huckle,
M. Smith
, et al. (4 additional authors not shown)
Abstract:
Studies of the correlation between different diffuse interstellar bands (DIBs) are important for exploring their origins. However, the Gaia-RVS spectral window between 846 and 870 nm contains few DIBs, the strong DIB at 862 nm being the only convincingly confirmed one. Here we attempt to confirm the existence of a broad DIB around 864.8 nm and estimate its characteristics using the stacked Gaia-RV…
▽ More
Studies of the correlation between different diffuse interstellar bands (DIBs) are important for exploring their origins. However, the Gaia-RVS spectral window between 846 and 870 nm contains few DIBs, the strong DIB at 862 nm being the only convincingly confirmed one. Here we attempt to confirm the existence of a broad DIB around 864.8 nm and estimate its characteristics using the stacked Gaia-RVS spectra of a large number of stars. We study the correlations between the two DIBs at 862 nm and 864.8 nm, as well as the interstellar extinction. We obtained spectra of the interstellar medium absorption by subtracting the stellar components using templates constructed from real spectra at high Galactic latitudes with low extinctions. We then stacked the ISM spectra in Galactic coordinates, pixelized by the HEALPix scheme, to measure the DIBs. The stacked spectrum is modeled by the profiles of the two DIBs, Gaussian for $λ$862 and Lorentzian for $λ$864.8, and a linear continuum. We obtain 8458 stacked spectra in total, of which 1103 (13%) have reliable fitting results after applying numerous conservative filters. This work is the first of its kind to fit and measure $λ$862 and $λ$864.8 simultaneously in cool-star spectra. We find that the EWs and CDs of the two DIBs are well correlated with each other. The full width at half maximum (FWHM) of $λ$864.8 is estimated as $1.62 \pm 0.33$ nm which compares to $0.55 \pm 0.06$ nm for $λ$862. We also measure the vacuum rest-frame wavelength of $λ$864.8 to be $λ_0 = 864.53 \pm 0.14$ nm, smaller than previous estimates. We find a solid confirmation of the existence of the DIB around 864.8 nm based on an exploration of its correlation with $λ$862 and estimation of its FWHM. $λ$862 correlates better with E(BP-RP) than $λ$864.8.
△ Less
Submitted 7 October, 2022; v1 submitted 24 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Reflectance spectra of Solar System small bodies
Authors:
Gaia Collaboration,
L. Galluccio,
M. Delbo,
F. De Angeli,
T. Pauwels,
P. Tanga,
F. Mignard,
A. Cellino,
A. G. A. Brown,
K. Muinonen,
A. Penttila,
S. Jordan,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
L. Eyer,
R. Guerra,
A. Hutton,
C. Jordi
, et al. (422 additional authors not shown)
Abstract:
The Gaia mission of the European Space Agency (ESA) has been routinely observing Solar System objects (SSOs) since the beginning of its operations in August 2014. The Gaia data release three (DR3) includes, for the first time, the mean reflectance spectra of a selected sample of 60 518 SSOs, primarily asteroids, observed between August 5, 2014, and May 28, 2017. Each reflectance spectrum was deriv…
▽ More
The Gaia mission of the European Space Agency (ESA) has been routinely observing Solar System objects (SSOs) since the beginning of its operations in August 2014. The Gaia data release three (DR3) includes, for the first time, the mean reflectance spectra of a selected sample of 60 518 SSOs, primarily asteroids, observed between August 5, 2014, and May 28, 2017. Each reflectance spectrum was derived from measurements obtained by means of the Blue and Red photometers (BP/RP), which were binned in 16 discrete wavelength bands. We describe the processing of the Gaia spectral data of SSOs, explaining both the criteria used to select the subset of asteroid spectra published in Gaia DR3, and the different steps of our internal validation procedures. In order to further assess the quality of Gaia SSO reflectance spectra, we carried out external validation against SSO reflectance spectra obtained from ground-based and space-borne telescopes and available in the literature. For each selected SSO, an epoch reflectance was computed by dividing the calibrated spectrum observed by the BP/RP at each transit on the focal plane by the mean spectrum of a solar analogue. The latter was obtained by averaging the Gaia spectral measurements of a selected sample of stars known to have very similar spectra to that of the Sun. Finally, a mean of the epoch reflectance spectra was calculated in 16 spectral bands for each SSO. The agreement between Gaia mean reflectance spectra and those available in the literature is good for bright SSOs, regardless of their taxonomic spectral class. We identify an increase in the spectral slope of S-type SSOs with increasing phase angle. Moreover, we show that the spectral slope increases and the depth of the 1 um absorption band decreases for increasing ages of S-type asteroid families.
△ Less
Submitted 24 June, 2022;
originally announced June 2022.
-
Feature Representation Learning for Robust Retinal Disease Detection from Optical Coherence Tomography Images
Authors:
Sharif Amit Kamran,
Khondker Fariha Hossain,
Alireza Tavakkoli,
Stewart Lee Zuckerbrod,
Salah A. Baker
Abstract:
Ophthalmic images may contain identical-looking pathologies that can cause failure in automated techniques to distinguish different retinal degenerative diseases. Additionally, reliance on large annotated datasets and lack of knowledge distillation can restrict ML-based clinical support systems' deployment in real-world environments. To improve the robustness and transferability of knowledge, an e…
▽ More
Ophthalmic images may contain identical-looking pathologies that can cause failure in automated techniques to distinguish different retinal degenerative diseases. Additionally, reliance on large annotated datasets and lack of knowledge distillation can restrict ML-based clinical support systems' deployment in real-world environments. To improve the robustness and transferability of knowledge, an enhanced feature-learning module is required to extract meaningful spatial representations from the retinal subspace. Such a module, if used effectively, can detect unique disease traits and differentiate the severity of such retinal degenerative pathologies. In this work, we propose a robust disease detection architecture with three learning heads, i) A supervised encoder for retinal disease classification, ii) An unsupervised decoder for the reconstruction of disease-specific spatial information, and iii) A novel representation learning module for learning the similarity between encoder-decoder feature and enhancing the accuracy of the model. Our experimental results on two publicly available OCT datasets illustrate that the proposed model outperforms existing state-of-the-art models in terms of accuracy, interpretability, and robustness for out-of-distribution retinal disease detection.
△ Less
Submitted 31 July, 2022; v1 submitted 24 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Properties of the line broadening parameter derived with the Radial Velocity Spectrometer (RVS)
Authors:
Y. Frémat,
F. Royer,
O. Marchal,
R. Blomme,
P. Sartoretti,
A. Guerrier,
P. Panuzzo,
D. Katz,
G. M. Seabroke,
F. Thévenin,
M. Cropper,
K. Benson,
Y. Damerdji,
R. Haigron,
A. Lobel,
M. Smith,
S. G. Baker,
L. Chemin,
M. David,
C. Dolding,
E. Gosset,
K. Janßen,
G. Jasniewicz,
G. Plum,
N. Samaras
, et al. (16 additional authors not shown)
Abstract:
The third release of the Gaia catalogue contains the radial velocities for 33,812,183 stars having effective temperatures ranging from 3100 K to 14,500 K. The measurements are based on the comparison of the observed RVS spectrum (wavelength coverage: 846--870 nm, median resolving power: 11,500) to synthetic data broadened to the adequate Along-Scan Line Spread Function. The additional line-broaden…
▽ More
The third release of the Gaia catalogue contains the radial velocities for 33,812,183 stars having effective temperatures ranging from 3100 K to 14,500 K. The measurements are based on the comparison of the observed RVS spectrum (wavelength coverage: 846--870 nm, median resolving power: 11,500) to synthetic data broadened to the adequate Along-Scan Line Spread Function. The additional line-broadening, fitted as it would only be due to axial rotation, is also produced by the pipeline and is available in the catalogue (field name gaia_source:vbroad). To describe the properties of the line-broadening information extracted from the RVS and published in the catalogue, as well as to analyse the limitations imposed by the adopted method, wavelength range, and instrument. We use simulations to express the link existing between the line broadening measurement provided in Gaia Data Release 3 and Vsin(i). We then compare the observed values to the measurements published by various catalogues and surveys (GALAH, APOGEE, LAMOST, ...). While we recommend being cautious in the interpretation of the vbroad measurement, we also find a reasonable global agreement between the Gaia Data Release 3 line broadening values and those found in the other catalogues. We discuss and establish the validity domain of the published vbroad values. The estimate tends to be overestimated at the lower vsini end, and at $T_\mathrm{eff}>7500\,\mathrm{K}$ its quality and significance degrade rapidly when $G_\mathrm{RVS}>10$. Despite all the known and reported limitations, the Gaia Data Release 3 line broadening catalogue contains the measurements obtained for 3,524,677 stars with $T_\mathrm{eff}$\ ranging from 3500 to 14,500 K, and $G_\mathrm{RVS}<12$. It gathers the largest stellar sample ever considered for the purpose, and allows a first mapping of the \Gaia\ line broadening parameter across the HR diagram.
△ Less
Submitted 27 June, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Mapping the asymmetric disc of the Milky Way
Authors:
Gaia Collaboration,
R. Drimmel,
M. Romero-Gomez,
L. Chemin,
P. Ramos,
E. Poggio,
V. Ripepi,
R. Andrae,
R. Blomme,
T. Cantat-Gaudin,
A. Castro-Ginard,
G. Clementini,
F. Figueras,
M. Fouesneau,
Y. Fremat,
K. Jardine,
S. Khanna,
A. Lobel,
D. J. Marshall,
T. Muraveva,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou
, et al. (431 additional authors not shown)
Abstract:
With the most recent Gaia data release the number of sources with complete 6D phase space information (position and velocity) has increased to well over 33 million stars, while stellar astrophysical parameters are provided for more than 470 million sources, in addition to the identification of over 11 million variable stars. Using the astrophysical parameters and variability classifications provid…
▽ More
With the most recent Gaia data release the number of sources with complete 6D phase space information (position and velocity) has increased to well over 33 million stars, while stellar astrophysical parameters are provided for more than 470 million sources, in addition to the identification of over 11 million variable stars. Using the astrophysical parameters and variability classifications provided in Gaia DR3, we select various stellar populations to explore and identify non-axisymmetric features in the disc of the Milky Way in both configuration and velocity space. Using more about 580 thousand sources identified as hot OB stars, together with 988 known open clusters younger than 100 million years, we map the spiral structure associated with star formation 4-5 kpc from the Sun. We select over 2800 Classical Cepheids younger than 200 million years, which show spiral features extending as far as 10 kpc from the Sun in the outer disc. We also identify more than 8.7 million sources on the red giant branch (RGB), of which 5.7 million have line-of-sight velocities, allowing the velocity field of the Milky Way to be mapped as far as 8 kpc from the Sun, including the inner disc. The spiral structure revealed by the young populations is consistent with recent results using Gaia EDR3 astrometry and source lists based on near infrared photometry, showing the Local (Orion) arm to be at least 8 kpc long, and an outer arm consistent with what is seen in HI surveys, which seems to be a continuation of the Perseus arm into the third quadrant. Meanwhile, the subset of RGB stars with velocities clearly reveals the large scale kinematic signature of the bar in the inner disc, as well as evidence of streaming motions in the outer disc that might be associated with spiral arms or bar resonances. (abridged)
△ Less
Submitted 5 August, 2022; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Pulsations in main sequence OBAF-type stars
Authors:
Gaia Collaboration,
J. De Ridder,
V. Ripepi,
C. Aerts,
L. Palaversa,
L. Eyer,
B. Holl,
M. Audard,
L. Rimoldini,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
R. Guerra,
A. Hutton,
C. Jordi,
S. A. Klioner,
U. L. Lammers,
L. Lindegren
, et al. (423 additional authors not shown)
Abstract:
The third Gaia data release provides photometric time series covering 34 months for about 10 million stars. For many of those stars, a characterisation in Fourier space and their variability classification are also provided. This paper focuses on intermediate- to high-mass (IHM) main sequence pulsators M >= 1.3 Msun) of spectral types O, B, A, or F, known as beta Cep, slowly pulsating B (SPB), del…
▽ More
The third Gaia data release provides photometric time series covering 34 months for about 10 million stars. For many of those stars, a characterisation in Fourier space and their variability classification are also provided. This paper focuses on intermediate- to high-mass (IHM) main sequence pulsators M >= 1.3 Msun) of spectral types O, B, A, or F, known as beta Cep, slowly pulsating B (SPB), delta Sct, and gamma Dor stars. These stars are often multi-periodic and display low amplitudes, making them challenging targets to analyse with sparse time series. All datasets used in this analysis are part of the Gaia DR3 data release. The photometric time series were used to perform a Fourier analysis, while the global astrophysical parameters necessary for the empirical instability strips were taken from the Gaia DR3 gspphot tables, and the vsini data were taken from the Gaia DR3 esphs tables. We show that for nearby OBAF-type pulsators, the Gaia DR3 data are precise and accurate enough to pinpoint them in the Hertzsprung-Russell diagram. We find empirical instability strips covering broader regions than theoretically predicted. In particular, our study reveals the presence of fast rotating gravity-mode pulsators outside the strips, as well as the co-existence of rotationally modulated variables inside the strips as reported before in the literature. We derive an extensive period-luminosity relation for delta Sct stars and provide evidence that the relation features different regimes depending on the oscillation period. Finally, we demonstrate how stellar rotation attenuates the amplitude of the dominant oscillation mode of delta Sct stars.
△ Less
Submitted 16 August, 2022; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3 Properties and validation of the radial velocities
Authors:
D. Katz,
P. Sartoretti,
A. Guerrier,
P. Panuzzo,
G. M. Seabroke,
F. Thévenin,
M. Cropper,
K. Benson,
R. Blomme,
R. Haigron,
O. Marchal,
M. Smith,
S. Baker,
L. Chemin,
Y. Damerdji,
M. David,
C. Dolding,
Y. Frémat,
E. Gosset,
K. Janßen,
G. Jasniewicz,
A. Lobel,
G. Plum,
N. Samaras,
O. Snaith
, et al. (25 additional authors not shown)
Abstract:
Gaia Data Release 3 (Gaia DR3) contains the second release of the combined radial velocities. It is based on the spectra collected during the first 34 months of the nominal mission. The longer time baseline and the improvements of the pipeline made it possible to push the processing limit, from Grvs = 12 in Gaia DR2, to Grvs = 14 mag. In this article, we describe the new functionalities implemente…
▽ More
Gaia Data Release 3 (Gaia DR3) contains the second release of the combined radial velocities. It is based on the spectra collected during the first 34 months of the nominal mission. The longer time baseline and the improvements of the pipeline made it possible to push the processing limit, from Grvs = 12 in Gaia DR2, to Grvs = 14 mag. In this article, we describe the new functionalities implemented for Gaia DR3, the quality filters applied during processing and post-processing and the properties and performance of the published velocities. For Gaia DR3, several functionalities were upgraded or added. (Abridged) Gaia DR3 contains the combined radial velocities of 33 812 183 stars. With respect to Gaia DR2, the interval of temperature has been expanded from Teff \in [3600, 6750] K to Teff \in [3100, 14500] K for the bright stars ( Grvs \leq 12 mag) and [3100, 6750] K for the fainter stars. The radial velocities sample a significant part of the Milky Way: they reach a few kilo-parsecs beyond the Galactic centre in the disc and up to about 10-15 kpc vertically into the inner halo. The median formal precision of the velocities is of 1.3 km/s at Grvs = 12 and 6.4 km/s at Grvs = 14 mag. The velocity zero point exhibits a small systematic trend with magnitude starting around Grvs = 11 mag and reaching about 400 m/s at Grvs = 14 mag. A correction formula is provided, which can be applied to the published data. The Gaia DR3 velocity scale is in satisfactory agreement with APOGEE, GALAH, GES and RAVE, with systematic differences that mostly do not exceed a few hundreds m/s. The properties of the radial velocities are also illustrated with specific objects: open clusters, globular clusters as well as the Large Magellanic Cloud (LMC). For example, the precision of the data allows to map the line-of-sight rotational velocities of the globular cluster 47 Tuc and of the LMC.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: A Golden Sample of Astrophysical Parameters
Authors:
Gaia Collaboration,
O. L. Creevey,
L. M. Sarro,
A. Lobel,
E. Pancino,
R. Andrae,
R. L. Smart,
G. Clementini,
U. Heiter,
A. J. Korn,
M. Fouesneau,
Y. Frémat,
F. De Angeli,
A. Vallenari,
D. L. Harrison,
F. Thévenin,
C. Reylé,
R. Sordo,
A. Garofalo,
A. G. A. Brown,
L. Eyer,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux
, et al. (423 additional authors not shown)
Abstract:
Gaia Data Release 3 (DR3) provides a wealth of new data products for the astronomical community to exploit, including astrophysical parameters for a half billion stars. In this work we demonstrate the high quality of these data products and illustrate their use in different astrophysical contexts. We query the astrophysical parameter tables along with other tables in Gaia DR3 to derive the samples…
▽ More
Gaia Data Release 3 (DR3) provides a wealth of new data products for the astronomical community to exploit, including astrophysical parameters for a half billion stars. In this work we demonstrate the high quality of these data products and illustrate their use in different astrophysical contexts. We query the astrophysical parameter tables along with other tables in Gaia DR3 to derive the samples of the stars of interest. We validate our results by using the Gaia catalogue itself and by comparison with external data. We have produced six homogeneous samples of stars with high quality astrophysical parameters across the HR diagram for the community to exploit. We first focus on three samples that span a large parameter space: young massive disk stars (~3M), FGKM spectral type stars (~3M), and UCDs (~20K). We provide these sources along with additional information (either a flag or complementary parameters) as tables that are made available in the Gaia archive. We furthermore identify 15740 bone fide carbon stars, 5863 solar-analogues, and provide the first homogeneous set of stellar parameters of the Spectro Photometric Standard Stars. We use a subset of the OBA sample to illustrate its usefulness to analyse the Milky Way rotation curve. We then use the properties of the FGKM stars to analyse known exoplanet systems. We also analyse the ages of some unseen UCD-companions to the FGKM stars. We additionally predict the colours of the Sun in various passbands (Gaia, 2MASS, WISE) using the solar-analogue sample.
△ Less
Submitted 12 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: G_RVS photometry from the RVS spectra
Authors:
P. Sartoretti,
O. Marchal,
C. Babusiaux,
C. Jordi,
A. Guerrier,
P. Panuzzo,
D. Katz,
G. M. Seabroke,
F. Thévenin,
M. Cropper,
K. Benson,
R. Blomme,
R. Haigron,
M. Smith,
S. Baker,
L. Chemin,
M. David,
C. Dolding,
Y. Frémat,
K. Janssen,
G. Jasniewicz,
A. Lobel,
G. Plum,
N. Samaras,
O. Snaith
, et al. (16 additional authors not shown)
Abstract:
Gaia Data Release 3 (DR3) contains the first release of magnitudes estimated from the integration of Radial Velocity Spectrometer (RVS) spectra for a sample of about 32.2 million stars brighter than G_RVS~14 mag (or G~15 mag). In this paper, we describe the data used and the approach adopted to derive and validate the G_RVS magnitudes published in DR3. We also provide estimates of the G_RVS passba…
▽ More
Gaia Data Release 3 (DR3) contains the first release of magnitudes estimated from the integration of Radial Velocity Spectrometer (RVS) spectra for a sample of about 32.2 million stars brighter than G_RVS~14 mag (or G~15 mag). In this paper, we describe the data used and the approach adopted to derive and validate the G_RVS magnitudes published in DR3. We also provide estimates of the G_RVS passband and associated G_RVS zero-point. We derived G_RVS photometry from the integration of RVS spectra over the wavelength range from 846 to 870 nm. We processed these spectra following a procedure similar to that used for DR2, but incorporating several improvements that allow a better estimation of G_RVS. These improvements pertain to the stray-light background estimation, the line spread function calibration, and the detection of spectra contaminated by nearby relatively bright sources. We calibrated the G_RVS zero-point every 30 hours based on the reference magnitudes of constant stars from the Hipparcos catalogue, and used them to transform the integrated flux of the cleaned and calibrated spectra into epoch magnitudes. The G_RVS magnitude of a star published in DR3 is the median of the epoch magnitudes for that star. We estimated the G_RVS passband by comparing the RVS spectra of 108 bright stars with their flux-calibrated spectra from external spectrophotometric libraries. The G_RVS magnitude provides information that is complementary to that obtained from the G, G_BP, and G_RP magnitudes, which is useful for constraining stellar metallicity and interstellar extinction. The median precision of G_RVS measurements ranges from about 0.006 mag for the brighter stars (i.e. with 3.5 < G_RVS < 6.5 mag) to 0.125 mag at the faint end. The derived G_RVS passband shows that the effective transmittance of the RVS is approximately 1.23 times better than the pre-launch estimate.
△ Less
Submitted 12 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: The extragalactic content
Authors:
Gaia Collaboration,
C. A. L. Bailer-Jones,
D. Teyssier,
L. Delchambre,
C. Ducourant,
D. Garabato,
D. Hatzidimitriou,
S. A. Klioner,
L. Rimoldini,
I. Bellas-Velidis,
R. Carballo,
M. I. Carnerero,
C. Diener,
M. Fouesneau,
L. Galluccio,
P. Gavras,
A. Krone-Martins,
C. M. Raiteri,
R. Teixeira,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux
, et al. (422 additional authors not shown)
Abstract:
The Gaia Galactic survey mission is designed and optimized to obtain astrometry, photometry, and spectroscopy of nearly two billion stars in our Galaxy. Yet as an all-sky multi-epoch survey, Gaia also observes several million extragalactic objects down to a magnitude of G~21 mag. Due to the nature of the Gaia onboard selection algorithms, these are mostly point-source-like objects. Using data prov…
▽ More
The Gaia Galactic survey mission is designed and optimized to obtain astrometry, photometry, and spectroscopy of nearly two billion stars in our Galaxy. Yet as an all-sky multi-epoch survey, Gaia also observes several million extragalactic objects down to a magnitude of G~21 mag. Due to the nature of the Gaia onboard selection algorithms, these are mostly point-source-like objects. Using data provided by the satellite, we have identified quasar and galaxy candidates via supervised machine learning methods, and estimate their redshifts using the low resolution BP/RP spectra. We further characterise the surface brightness profiles of host galaxies of quasars and of galaxies from pre-defined input lists. Here we give an overview of the processing of extragalactic objects, describe the data products in Gaia DR3, and analyse their properties. Two integrated tables contain the main results for a high completeness, but low purity (50-70%), set of 6.6 million candidate quasars and 4.8 million candidate galaxies. We provide queries that select purer sub-samples of these containing 1.9 million probable quasars and 2.9 million probable galaxies (both 95% purity). We also use high quality BP/RP spectra of 43 thousand high probability quasars over the redshift range 0.05-4.36 to construct a composite quasar spectrum spanning restframe wavelengths from 72-100 nm.
△ Less
Submitted 12 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Stellar multiplicity, a teaser for the hidden treasure
Authors:
Gaia Collaboration,
F. Arenou,
C. Babusiaux,
M. A. Barstow,
S. Faigler,
A. Jorissen,
P. Kervella,
T. Mazeh,
N. Mowlavi,
P. Panuzzo,
J. Sahlmann,
S. Shahaf,
A. Sozzetti,
N. Bauchet,
Y. Damerdji,
P. Gavras,
P. Giacobbe,
E. Gosset,
J. -L. Halbwachs,
B. Holl,
M. G. Lattanzi,
N. Leclerc,
T. Morel,
D. Pourbaix,
P. Re Fiorentin
, et al. (425 additional authors not shown)
Abstract:
The Gaia DR3 Catalogue contains for the first time about eight hundred thousand solutions with either orbital elements or trend parameters for astrometric, spectroscopic and eclipsing binaries, and combinations of them. This paper aims to illustrate the huge potential of this large non-single star catalogue. Using the orbital solutions together with models of the binaries, a catalogue of tens of t…
▽ More
The Gaia DR3 Catalogue contains for the first time about eight hundred thousand solutions with either orbital elements or trend parameters for astrometric, spectroscopic and eclipsing binaries, and combinations of them. This paper aims to illustrate the huge potential of this large non-single star catalogue. Using the orbital solutions together with models of the binaries, a catalogue of tens of thousands of stellar masses, or lower limits, partly together with consistent flux ratios, has been built. Properties concerning the completeness of the binary catalogues are discussed, statistical features of the orbital elements are explained and a comparison with other catalogues is performed. Illustrative applications are proposed for binaries across the H-R diagram. The binarity is studied in the RGB/AGB and a search for genuine SB1 among long-period variables is performed. The discovery of new EL CVn systems illustrates the potential of combining variability and binarity catalogues. Potential compact object companions are presented, mainly white dwarf companions or double degenerates, but one candidate neutron star is also presented. Towards the bottom of the main sequence, the orbits of previously-suspected binary ultracool dwarfs are determined and new candidate binaries are discovered. The long awaited contribution of Gaia to the analysis of the substellar regime shows the brown dwarf desert around solar-type stars using true, rather than minimum, masses, and provides new important constraints on the occurrence rates of substellar companions to M dwarfs. Several dozen new exoplanets are proposed, including two with validated orbital solutions and one super-Jupiter orbiting a white dwarf, all being candidates requiring confirmation. Beside binarity, higher order multiple systems are also found.
△ Less
Submitted 11 June, 2022;
originally announced June 2022.