-
Periodic Orbits in Fermi-Pasta-Ulam-Tsingou Systems
Authors:
Nachiket Karve,
Nathan Rose,
David Campbell
Abstract:
The FPUT paradox is the phenomenon whereby a one-dimensional chain of oscillators with nonlinear couplings shows non-ergodic behavior. The trajectory of the system in phase space, with a long wavelength initial condition, closely follows that of the Toda model over short times, as both systems seem to relax quickly to a non-thermal, metastable state. Over longer times, resonances in the FPUT spect…
▽ More
The FPUT paradox is the phenomenon whereby a one-dimensional chain of oscillators with nonlinear couplings shows non-ergodic behavior. The trajectory of the system in phase space, with a long wavelength initial condition, closely follows that of the Toda model over short times, as both systems seem to relax quickly to a non-thermal, metastable state. Over longer times, resonances in the FPUT spectrum drive the system towards equilibrium, away from the Toda trajectory. Similar resonances are observed in $q$-breather spectra, suggesting that $q$-breathers are involved in the route towards thermalization. In this article we investigate such resonances and show that they occur due to exact overlaps of $q$-breather frequencies of the type $mΩ_1 = Ω_k$. The resonances appear as peaks in the energy spectrum. Further, they give rise to new composite periodic orbits, which exist simultaneously with the original $q$-breathers. We find that such resonances are absent in integrable systems, as a consequence of the (infinite number of) conservation laws associated with integrability.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image
Authors:
Stanislaw Szymanowicz,
Eldar Insafutdinov,
Chuanxia Zheng,
Dylan Campbell,
João F. Henriques,
Christian Rupprecht,
Andrea Vedaldi
Abstract:
In this paper, we propose Flash3D, a method for scene reconstruction and novel view synthesis from a single image which is both very generalisable and efficient. For generalisability, we start from a "foundation" model for monocular depth estimation and extend it to a full 3D shape and appearance reconstructor. For efficiency, we base this extension on feed-forward Gaussian Splatting. Specifically…
▽ More
In this paper, we propose Flash3D, a method for scene reconstruction and novel view synthesis from a single image which is both very generalisable and efficient. For generalisability, we start from a "foundation" model for monocular depth estimation and extend it to a full 3D shape and appearance reconstructor. For efficiency, we base this extension on feed-forward Gaussian Splatting. Specifically, we predict a first layer of 3D Gaussians at the predicted depth, and then add additional layers of Gaussians that are offset in space, allowing the model to complete the reconstruction behind occlusions and truncations. Flash3D is very efficient, trainable on a single GPU in a day, and thus accessible to most researchers. It achieves state-of-the-art results when trained and tested on RealEstate10k. When transferred to unseen datasets like NYU it outperforms competitors by a large margin. More impressively, when transferred to KITTI, Flash3D achieves better PSNR than methods trained specifically on that dataset. In some instances, it even outperforms recent methods that use multiple views as input. Code, models, demo, and more results are available at https://www.robots.ox.ac.uk/~vgg/research/flash3d/.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Using Contrastive Learning with Generative Similarity to Learn Spaces that Capture Human Inductive Biases
Authors:
Raja Marjieh,
Sreejan Kumar,
Declan Campbell,
Liyi Zhang,
Gianluca Bencomo,
Jake Snell,
Thomas L. Griffiths
Abstract:
Humans rely on strong inductive biases to learn from few examples and abstract useful information from sensory data. Instilling such biases in machine learning models has been shown to improve their performance on various benchmarks including few-shot learning, robustness, and alignment. However, finding effective training procedures to achieve that goal can be challenging as psychologically-rich…
▽ More
Humans rely on strong inductive biases to learn from few examples and abstract useful information from sensory data. Instilling such biases in machine learning models has been shown to improve their performance on various benchmarks including few-shot learning, robustness, and alignment. However, finding effective training procedures to achieve that goal can be challenging as psychologically-rich training data such as human similarity judgments are expensive to scale, and Bayesian models of human inductive biases are often intractable for complex, realistic domains. Here, we address this challenge by introducing a Bayesian notion of generative similarity whereby two datapoints are considered similar if they are likely to have been sampled from the same distribution. This measure can be applied to complex generative processes, including probabilistic programs. We show that generative similarity can be used to define a contrastive learning objective even when its exact form is intractable, enabling learning of spatial embeddings that express specific inductive biases. We demonstrate the utility of our approach by showing how it can be used to capture human inductive biases for geometric shapes, and to better distinguish different abstract drawing styles that are parameterized by probabilistic programs.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Stale Diffusion: Hyper-realistic 5D Movie Generation Using Old-school Methods
Authors:
Joao F. Henriques,
Dylan Campbell,
Tengda Han
Abstract:
Two years ago, Stable Diffusion achieved super-human performance at generating images with super-human numbers of fingers. Following the steady decline of its technical novelty, we propose Stale Diffusion, a method that solidifies and ossifies Stable Diffusion in a maximum-entropy state. Stable Diffusion works analogously to a barn (the Stable) from which an infinite set of horses have escaped (th…
▽ More
Two years ago, Stable Diffusion achieved super-human performance at generating images with super-human numbers of fingers. Following the steady decline of its technical novelty, we propose Stale Diffusion, a method that solidifies and ossifies Stable Diffusion in a maximum-entropy state. Stable Diffusion works analogously to a barn (the Stable) from which an infinite set of horses have escaped (the Diffusion). As the horses have long left the barn, our proposal may be seen as antiquated and irrelevant. Nevertheless, we vigorously defend our claim of novelty by identifying as early adopters of the Slow Science Movement, which will produce extremely important pearls of wisdom in the future. Our speed of contributions can also be seen as a quasi-static implementation of the recent call to pause AI experiments, which we wholeheartedly support. As a result of a careful archaeological expedition to 18-months-old Git commit histories, we found that naturally-accumulating errors have produced a novel entropy-maximising Stale Diffusion method, that can produce sleep-inducing hyper-realistic 5D video that is as good as one's imagination.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Authors:
Nathaniel Li,
Alexander Pan,
Anjali Gopal,
Summer Yue,
Daniel Berrios,
Alice Gatti,
Justin D. Li,
Ann-Kathrin Dombrowski,
Shashwat Goel,
Long Phan,
Gabriel Mukobi,
Nathan Helm-Burger,
Rassin Lababidi,
Lennart Justen,
Andrew B. Liu,
Michael Chen,
Isabelle Barrass,
Oliver Zhang,
Xiaoyuan Zhu,
Rishub Tamirisa,
Bhrugu Bharathi,
Adam Khoja,
Zhenqi Zhao,
Ariel Herbert-Voss,
Cort B. Breuer
, et al. (32 additional authors not shown)
Abstract:
The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are developing evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe…
▽ More
The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are developing evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing further research into mitigating risk. Furthermore, they focus on only a few, highly specific pathways for malicious use. To fill these gaps, we publicly release the Weapons of Mass Destruction Proxy (WMDP) benchmark, a dataset of 3,668 multiple-choice questions that serve as a proxy measurement of hazardous knowledge in biosecurity, cybersecurity, and chemical security. WMDP was developed by a consortium of academics and technical consultants, and was stringently filtered to eliminate sensitive information prior to public release. WMDP serves two roles: first, as an evaluation for hazardous knowledge in LLMs, and second, as a benchmark for unlearning methods to remove such hazardous knowledge. To guide progress on unlearning, we develop RMU, a state-of-the-art unlearning method based on controlling model representations. RMU reduces model performance on WMDP while maintaining general capabilities in areas such as biology and computer science, suggesting that unlearning may be a concrete path towards reducing malicious use from LLMs. We release our benchmark and code publicly at https://wmdp.ai
△ Less
Submitted 15 May, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
A Relational Inductive Bias for Dimensional Abstraction in Neural Networks
Authors:
Declan Campbell,
Jonathan D. Cohen
Abstract:
The human cognitive system exhibits remarkable flexibility and generalization capabilities, partly due to its ability to form low-dimensional, compositional representations of the environment. In contrast, standard neural network architectures often struggle with abstract reasoning tasks, overfitting, and requiring extensive data for training. This paper investigates the impact of the relational b…
▽ More
The human cognitive system exhibits remarkable flexibility and generalization capabilities, partly due to its ability to form low-dimensional, compositional representations of the environment. In contrast, standard neural network architectures often struggle with abstract reasoning tasks, overfitting, and requiring extensive data for training. This paper investigates the impact of the relational bottleneck -- a mechanism that focuses processing on relations among inputs -- on the learning of factorized representations conducive to compositional coding and the attendant flexibility of processing. We demonstrate that such a bottleneck not only improves generalization and learning efficiency, but also aligns network performance with human-like behavioral biases. Networks trained with the relational bottleneck developed orthogonal representations of feature dimensions latent in the dataset, reflecting the factorized structure thought to underlie human cognitive flexibility. Moreover, the relational network mimics human biases towards regularity without pre-specified symbolic primitives, suggesting that the bottleneck fosters the emergence of abstract representations that confer flexibility akin to symbols.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Zeptonewton and Attotesla per Centimeter Metrology With Coupled Oscillators
Authors:
Ian Bouche,
Josh Javor,
Abhishek Som,
David K. Campbell,
David J. Bishop
Abstract:
We present the coupled oscillator: a new mechanism for signal amplification with widespread application in metrology. We introduce the mechanical theory of this framework, and support it by way of simulations. We present a particular implementation of coupled oscillators: a microelectromechanical system (MEMS) that uses one large (~100mm) N52 magnet coupled magnetically to a small (~0.25mm), oscil…
▽ More
We present the coupled oscillator: a new mechanism for signal amplification with widespread application in metrology. We introduce the mechanical theory of this framework, and support it by way of simulations. We present a particular implementation of coupled oscillators: a microelectromechanical system (MEMS) that uses one large (~100mm) N52 magnet coupled magnetically to a small (~0.25mm), oscillating N52 magnet, providing a force resolution of 200zN measured over 1s in a noiseless environment. We show that the same system is able to resolve magnetic gradients of 130aT/cm at a single point (within 500um). This technology therefore has the potential to revolutionize force and magnetic gradient sensing, including high-impact areas such cardiac and brain imaging.
△ Less
Submitted 22 May, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
An Empirical Study Into What Matters for Calibrating Vision-Language Models
Authors:
Weijie Tu,
Weijian Deng,
Dylan Campbell,
Stephen Gould,
Tom Gedeon
Abstract:
Vision-Language Models (VLMs) have emerged as the dominant approach for zero-shot recognition, adept at handling diverse scenarios and significant distribution changes. However, their deployment in risk-sensitive areas requires a deeper understanding of their uncertainty estimation capabilities, a relatively uncharted area. In this study, we explore the calibration properties of VLMs across differ…
▽ More
Vision-Language Models (VLMs) have emerged as the dominant approach for zero-shot recognition, adept at handling diverse scenarios and significant distribution changes. However, their deployment in risk-sensitive areas requires a deeper understanding of their uncertainty estimation capabilities, a relatively uncharted area. In this study, we explore the calibration properties of VLMs across different architectures, datasets, and training strategies. In particular, we analyze the uncertainty estimation performance of VLMs when calibrated in one domain, label set or hierarchy level, and tested in a different one. Our findings reveal that while VLMs are not inherently calibrated for uncertainty, temperature scaling significantly and consistently improves calibration, even across shifts in distribution and changes in label set. Moreover, VLMs can be calibrated with a very small set of examples. Through detailed experimentation, we highlight the potential applications and importance of our insights, aiming for more reliable and effective use of VLMs in critical, real-world scenarios.
△ Less
Submitted 14 June, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Non-interpenetration of rods derived by $Γ$-limits
Authors:
Barbora Benešová,
Daniel Campbell,
Stanislav Hencl,
Martin Kružík
Abstract:
Ensuring non-interpenetration of matter is a fundamental prerequisite when modeling the deformation response of solid materials. In this contribution, we thoroughly examine how this requirement, equivalent to the injectivity of deformations within bulk structures, manifests itself in dimensional-reduction problems. Specifically, we focus on the case of rods embedded in a two-dimensional plane. Our…
▽ More
Ensuring non-interpenetration of matter is a fundamental prerequisite when modeling the deformation response of solid materials. In this contribution, we thoroughly examine how this requirement, equivalent to the injectivity of deformations within bulk structures, manifests itself in dimensional-reduction problems. Specifically, we focus on the case of rods embedded in a two-dimensional plane. Our results focus on $Γ$-limits of energy functionals that enforce an admissible deformation to be a homeomorphism. These $Γ$-limits are evaluated along a passage from the bulk configuration to the rod arrangement. The proofs rely on the equivalence between the weak and strong closures of the set of homeomorphisms from $\mathbb{R}$ to $\mathbb{R}^2$, a result that is of independent interest and that we establish in this paper, too.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Human-Like Geometric Abstraction in Large Pre-trained Neural Networks
Authors:
Declan Campbell,
Sreejan Kumar,
Tyler Giallanza,
Thomas L. Griffiths,
Jonathan D. Cohen
Abstract:
Humans possess a remarkable capacity to recognize and manipulate abstract structure, which is especially apparent in the domain of geometry. Recent research in cognitive science suggests neural networks do not share this capacity, concluding that human geometric abilities come from discrete symbolic structure in human mental representations. However, progress in artificial intelligence (AI) sugges…
▽ More
Humans possess a remarkable capacity to recognize and manipulate abstract structure, which is especially apparent in the domain of geometry. Recent research in cognitive science suggests neural networks do not share this capacity, concluding that human geometric abilities come from discrete symbolic structure in human mental representations. However, progress in artificial intelligence (AI) suggests that neural networks begin to demonstrate more human-like reasoning after scaling up standard architectures in both model size and amount of training data. In this study, we revisit empirical results in cognitive science on geometric visual processing and identify three key biases in geometric visual processing: a sensitivity towards complexity, regularity, and the perception of parts and relations. We test tasks from the literature that probe these biases in humans and find that large pre-trained neural network models used in AI demonstrate more human-like abstract geometric processing.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction
Authors:
Sreejan Kumar,
Raja Marjieh,
Byron Zhang,
Declan Campbell,
Michael Y. Hu,
Umang Bhatt,
Brenden Lake,
Thomas L. Griffiths
Abstract:
Humans extract useful abstractions of the world from noisy sensory data. Serial reproduction allows us to study how people construe the world through a paradigm similar to the game of telephone, where one person observes a stimulus and reproduces it for the next to form a chain of reproductions. Past serial reproduction experiments typically employ a single sensory modality, but humans often commu…
▽ More
Humans extract useful abstractions of the world from noisy sensory data. Serial reproduction allows us to study how people construe the world through a paradigm similar to the game of telephone, where one person observes a stimulus and reproduces it for the next to form a chain of reproductions. Past serial reproduction experiments typically employ a single sensory modality, but humans often communicate abstractions of the world to each other through language. To investigate the effect language on the formation of abstractions, we implement a novel multimodal serial reproduction framework by asking people who receive a visual stimulus to reproduce it in a linguistic format, and vice versa. We ran unimodal and multimodal chains with both humans and GPT-4 and find that adding language as a modality has a larger effect on human reproductions than GPT-4's. This suggests human visual and linguistic representations are more dissociable than those of GPT-4.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
A survey for variable young stars with small telescopes: VIII -- Properties of 1687 Gaia selected members in 21 nearby clusters
Authors:
Dirk Froebrich,
Aleks Scholz,
Justyn Campbell-White,
Siegfried Vanaverbeke,
Carys Herbert,
Jochen Eislöffel,
Thomas Urtly,
Timothy P. Long,
Ivan L. Walton,
Klaas Wiersema,
Nick J. Quinn,
Tony Rodda,
Juan-Luis González-Carballo,
Mario Morales Aimar,
Rafael Castillo García,
Francisco C. Soldán Alfaro,
Faustino García de la Cuesta,
Domenico Licchelli,
Alex Escartin Perez,
José Luis Salto González,
Marc Deldem,
Stephen R. L. Futcher,
Tim Nelson,
Shawn Dvorak,
Dawid Moździerski
, et al. (38 additional authors not shown)
Abstract:
The Hunting Outbursting Young Stars (HOYS) project performs long-term, optical, multi-filter, high cadence monitoring of 25 nearby young clusters and star forming regions. Utilising Gaia DR3 data we have identified about 17000 potential young stellar members in 45 coherent astrometric groups in these fields. Twenty one of them are clear young groups or clusters of stars within one kiloparsec and t…
▽ More
The Hunting Outbursting Young Stars (HOYS) project performs long-term, optical, multi-filter, high cadence monitoring of 25 nearby young clusters and star forming regions. Utilising Gaia DR3 data we have identified about 17000 potential young stellar members in 45 coherent astrometric groups in these fields. Twenty one of them are clear young groups or clusters of stars within one kiloparsec and they contain 9143 Gaia selected potential members. The cluster distances, proper motions and membership numbers are determined. We analyse long term (about 7yr) V, R, and I-band light curves from HOYS for 1687 of the potential cluster members. One quarter of the stars are variable in all three optical filters, and two thirds of these have light curves that are symmetric around the mean. Light curves affected by obscuration from circumstellar materials are more common than those affected by accretion bursts, by a factor of 2-4. The variability fraction in the clusters ranges from 10 to almost 100 percent, and correlates positively with the fraction of stars with detectable inner disks, indicating that a lot of variability is driven by the disk. About one in six variables shows detectable periodicity, mostly caused by magnetic spots. Two thirds of the periodic variables with disk excess emission are slow rotators, and amongst the stars without disk excess two thirds are fast rotators - in agreement with rotation being slowed down by the presence of a disk.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
SCENES: Subpixel Correspondence Estimation With Epipolar Supervision
Authors:
Dominik A. Kloepfer,
João F. Henriques,
Dylan Campbell
Abstract:
Extracting point correspondences from two or more views of a scene is a fundamental computer vision problem with particular importance for relative camera pose estimation and structure-from-motion. Existing local feature matching approaches, trained with correspondence supervision on large-scale datasets, obtain highly-accurate matches on the test sets. However, they do not generalise well to new…
▽ More
Extracting point correspondences from two or more views of a scene is a fundamental computer vision problem with particular importance for relative camera pose estimation and structure-from-motion. Existing local feature matching approaches, trained with correspondence supervision on large-scale datasets, obtain highly-accurate matches on the test sets. However, they do not generalise well to new datasets with different characteristics to those they were trained on, unlike classic feature extractors. Instead, they require finetuning, which assumes that ground-truth correspondences or ground-truth camera poses and 3D structure are available. We relax this assumption by removing the requirement of 3D structure, e.g., depth maps or point clouds, and only require camera pose information, which can be obtained from odometry. We do so by replacing correspondence losses with epipolar losses, which encourage putative matches to lie on the associated epipolar line. While weaker than correspondence supervision, we observe that this cue is sufficient for finetuning existing models on new data. We then further relax the assumption of known camera poses by using pose estimates in a novel bootstrapping approach. We evaluate on highly challenging datasets, including an indoor drone dataset and an outdoor smartphone camera dataset, and obtain state-of-the-art results without strong supervision.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
Point-wise characterizations of limits of planar Sobolev homeomorphisms and their quasi-monotonicity
Authors:
Daniel Campbell
Abstract:
We present three novel classifications of the weak sequential (and strong) limits in $W^{1,p}$ of planar diffeomorphisms. We introduce a concept called the QM condition which is a kind of separation property for pre-images of closed connected sets and show that $u$ satisfies this property exactly when it is the limit of Sobolev homeomorphisms. Further, we prove that…
▽ More
We present three novel classifications of the weak sequential (and strong) limits in $W^{1,p}$ of planar diffeomorphisms. We introduce a concept called the QM condition which is a kind of separation property for pre-images of closed connected sets and show that $u$ satisfies this property exactly when it is the limit of Sobolev homeomorphisms. Further, we prove that $u\in W^{1,p}_{\operatorname{id}}((-1,1)^2,\mathbb{R}^2)$ is the limit of a sequence of homeomorphisms exactly when there are classically monotone mappings $g_δ:[-1,1]^2\to \mathbb{R}^2$ and very small open sets $U_δ$ such that $g_δ = u$ on $[-1,1]^2 \setminus U_δ$. Also, we introduce the so-called three curve condition, which is in some sense reminiscent of the NCL condition of \cite{CPR} but for $u^{-1}$ instead of for $u$, and prove that a map is the $W^{1,p}$ limit of planar Sobolev homeomorphisms exactly when it satisfies this property. This improves on results in \cite{DPP} answering the question from \cite{IO2}.
△ Less
Submitted 22 January, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
Principles for Optimizing Quantum Transduction in Piezo-Optomechanical Systems
Authors:
James Schneeloch,
Erin Sheridan,
A. Matthew Smith,
Christopher C. Tison,
Daniel L. Campbell,
Matthew D. LaHaye,
Michael L. Fanto,
Paul M. Alsing
Abstract:
Two-way microwave-optical quantum transduction is an essential capability to connect distant superconducting qubits via optical fiber, and to enable quantum networking at a large scale. In Blésin, Tian, Bhave, and Kippenberg's article, ``Quantum coherent microwave-optical transduction using high overtone bulk acoustic resonances" (Phys. Rev. A, 104, 052601 (2021)), they lay out a quantum transduct…
▽ More
Two-way microwave-optical quantum transduction is an essential capability to connect distant superconducting qubits via optical fiber, and to enable quantum networking at a large scale. In Blésin, Tian, Bhave, and Kippenberg's article, ``Quantum coherent microwave-optical transduction using high overtone bulk acoustic resonances" (Phys. Rev. A, 104, 052601 (2021)), they lay out a quantum transduction system that accomplishes this by combining a piezoelectric interaction to convert microwave photons to GHz-scale phonons, and an optomechanical interaction to up-convert those phonons into telecom-band photons using a pump laser set to an adjacent telecom-band tone. In this work, we discuss these coupling interactions from first principles in order to discover what device parameters matter most in determining the transduction efficiency of this new platform, and to discuss strategies toward system optimization for near-unity transduction efficiency, as well as how noise impacts the transduction process.
In addition, we address the post-transduction challenge of separating single photons of the transduced light from a classically bright pump only a few GHz away in frequency by proposing a novel optomechanical coupling mechanism using phonon-photon four-wave mixing via stress-induced optical nonlinearity and its thermodynamic connection to higher-orders of electrostriction. Where this process drives transduction by consuming pairs instead of individual pump photons, it will allow a clean separation of the transduced light from the classically bright pump driving the transduction process.
△ Less
Submitted 13 March, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images
Authors:
Yiqun Zhang,
Zhenyue Qin,
Yang Liu,
Dylan Campbell
Abstract:
We introduce a pipeline to address anatomical inaccuracies in Stable Diffusion generated hand images. The initial step involves constructing a specialized dataset, focusing on hand anomalies, to train our models effectively. A finetuned detection model is pivotal for precise identification of these anomalies, ensuring targeted correction. Body pose estimation aids in understanding hand orientation…
▽ More
We introduce a pipeline to address anatomical inaccuracies in Stable Diffusion generated hand images. The initial step involves constructing a specialized dataset, focusing on hand anomalies, to train our models effectively. A finetuned detection model is pivotal for precise identification of these anomalies, ensuring targeted correction. Body pose estimation aids in understanding hand orientation and positioning, crucial for accurate anomaly correction. The integration of ControlNet and InstructPix2Pix facilitates sophisticated inpainting and pixel-level transformation, respectively. This dual approach allows for high-fidelity image adjustments. This comprehensive approach ensures the generation of images with anatomically accurate hands, closely resembling real-world appearances. Our experimental results demonstrate the pipeline's efficacy in enhancing hand image realism in Stable Diffusion outputs. We provide an online demo at https://fixhand.yiqun.io
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
An Empirical Model For Intrinsic Alignments: Insights From Cosmological Simulations
Authors:
Nicholas Van Alfen,
Duncan Campbell,
Jonathan Blazek,
C. Danielle Leonard,
Francois Lanusse,
Andrew Hearin,
Rachel Mandelbaum,
The LSST Dark Energy Science Collaboration
Abstract:
We extend current models of the halo occupation distribution (HOD) to include a flexible, empirical framework for the forward modeling of the intrinsic alignment (IA) of galaxies. A primary goal of this work is to produce mock galaxy catalogs for the purpose of validating existing models and methods for the mitigation of IA in weak lensing measurements. This technique can also be used to produce n…
▽ More
We extend current models of the halo occupation distribution (HOD) to include a flexible, empirical framework for the forward modeling of the intrinsic alignment (IA) of galaxies. A primary goal of this work is to produce mock galaxy catalogs for the purpose of validating existing models and methods for the mitigation of IA in weak lensing measurements. This technique can also be used to produce new, simulation-based predictions for IA and galaxy clustering. Our model is probabilistically formulated, and rests upon the assumption that the orientations of galaxies exhibit a correlation with their host dark matter (sub)halo orientation or with their position within the halo. We examine the necessary components and phenomenology of such a model by considering the alignments between (sub)halos in a cosmological dark matter only simulation. We then validate this model for a realistic galaxy population in a set of simulations in the IllustrisTNG suite. We create an HOD mock with TNG-like correlations using our method, constraining the associated IA model parameters, with the $χ^2_{\rm dof}$ between our model's correlations and those of Illustris matching as closely as 1.4 and 1.1 for orientation--position and orientation--orientation correlation functions, respectively. By modeling the misalignment between galaxies and their host halo, we show that the 3-dimensional two-point position and orientation correlation functions of simulated (sub)halos and galaxies can be accurately reproduced from quasi-linear scales down to $0.1~h^{-1}{\rm Mpc}$. We also find evidence for environmental influence on IA within a halo. Our publicly-available software provides a key component enabling efficient determination of Bayesian posteriors on IA model parameters using observational measurements of galaxy-orientation correlation functions in the highly nonlinear regime.
△ Less
Submitted 3 June, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models
Authors:
Zhaoyuan Yang,
Zhengyang Yu,
Zhiwei Xu,
Jaskirat Singh,
Jing Zhang,
Dylan Campbell,
Peter Tu,
Richard Hartley
Abstract:
We present a diffusion-based image morphing approach with perceptually-uniform sampling (IMPUS) that produces smooth, direct and realistic interpolations given an image pair. The embeddings of two images may lie on distinct conditioned distributions of a latent diffusion model, especially when they have significant semantic difference. To bridge this gap, we interpolate in the locally linear and c…
▽ More
We present a diffusion-based image morphing approach with perceptually-uniform sampling (IMPUS) that produces smooth, direct and realistic interpolations given an image pair. The embeddings of two images may lie on distinct conditioned distributions of a latent diffusion model, especially when they have significant semantic difference. To bridge this gap, we interpolate in the locally linear and continuous text embedding space and Gaussian latent space. We first optimize the endpoint text embeddings and then map the images to the latent space using a probability flow ODE. Unlike existing work that takes an indirect morphing path, we show that the model adaptation yields a direct path and suppresses ghosting artifacts in the interpolated images. To achieve this, we propose a heuristic bottleneck constraint based on a novel relative perceptual path diversity score that automatically controls the bottleneck size and balances the diversity along the path with its directness. We also propose a perceptually-uniform sampling technique that enables visually smooth changes between the interpolated images. Extensive experiments validate that our IMPUS can achieve smooth, direct, and realistic image morphing and is adaptable to several other generative tasks.
△ Less
Submitted 16 March, 2024; v1 submitted 12 November, 2023;
originally announced November 2023.
-
Likelihood-based Out-of-Distribution Detection with Denoising Diffusion Probabilistic Models
Authors:
Joseph Goodier,
Neill D. F. Campbell
Abstract:
Out-of-Distribution detection between dataset pairs has been extensively explored with generative models. We show that likelihood-based Out-of-Distribution detection can be extended to diffusion models by leveraging the fact that they, like other likelihood-based generative models, are dramatically affected by the input sample complexity. Currently, all Out-of-Distribution detection methods with D…
▽ More
Out-of-Distribution detection between dataset pairs has been extensively explored with generative models. We show that likelihood-based Out-of-Distribution detection can be extended to diffusion models by leveraging the fact that they, like other likelihood-based generative models, are dramatically affected by the input sample complexity. Currently, all Out-of-Distribution detection methods with Diffusion Models are reconstruction-based. We propose a new likelihood ratio for Out-of-Distribution detection with Deep Denoising Diffusion Models, which we call the Complexity Corrected Likelihood Ratio. Our likelihood ratio is constructed using Evidence Lower-Bound evaluations from an individual model at various noising levels. We present results that are comparable to state-of-the-art Out-of-Distribution detection methods with generative models.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Zigzag materials: selective interchain couplings control the coexistence of one-dimensional physics and deviations from it
Authors:
J. M. P. Carmelo,
P. D. Sacramento,
T. Stauber,
D. K. Campbell
Abstract:
The coexistence in the low-temperature spin-conducting phases of the zigzag materials BaCo2V2O8 and SrCo2V2O8 of one-dimensional (1D) physics with important deviations from it is not well understood. The studies of this paper account for an important selection rule that follows from interchain spin states being coupled more strongly within the spin dynamical structure factor of such zigzag materia…
▽ More
The coexistence in the low-temperature spin-conducting phases of the zigzag materials BaCo2V2O8 and SrCo2V2O8 of one-dimensional (1D) physics with important deviations from it is not well understood. The studies of this paper account for an important selection rule that follows from interchain spin states being coupled more strongly within the spin dynamical structure factor of such zigzag materials whenever they are connected by a specific symmetry operation of the underlying lattice. In the case of excited states, this symmetry operation is only a symmetry in spin-space ifno electronic spin flip is performed within the generation of such states. Our results on both the role of selective interchain couplings in protecting the 1D physics and being behind deviations from it and on the dynamical properties being controlled by scattering of singlet pairs of physical spins 1/2 open the door to a key advance in the understanding of the physics of the spin chains in BaCo2V2O8 and SrCo2V2O8.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
LoCUS: Learning Multiscale 3D-consistent Features from Posed Images
Authors:
Dominik A. Kloepfer,
Dylan Campbell,
João F. Henriques
Abstract:
An important challenge for autonomous agents such as robots is to maintain a spatially and temporally consistent model of the world. It must be maintained through occlusions, previously-unseen views, and long time horizons (e.g., loop closure and re-identification). It is still an open question how to train such a versatile neural representation without supervision. We start from the idea that the…
▽ More
An important challenge for autonomous agents such as robots is to maintain a spatially and temporally consistent model of the world. It must be maintained through occlusions, previously-unseen views, and long time horizons (e.g., loop closure and re-identification). It is still an open question how to train such a versatile neural representation without supervision. We start from the idea that the training objective can be framed as a patch retrieval problem: given an image patch in one view of a scene, we would like to retrieve (with high precision and recall) all patches in other views that map to the same real-world location. One drawback is that this objective does not promote reusability of features: by being unique to a scene (achieving perfect precision/recall), a representation will not be useful in the context of other scenes. We find that it is possible to balance retrieval and reusability by constructing the retrieval set carefully, leaving out patches that map to far-away locations. Similarly, we can easily regulate the scale of the learned features (e.g., points, objects, or rooms) by adjusting the spatial tolerance for considering a retrieval to be positive. We optimize for (smooth) Average Precision (AP), in a single unified ranking-based objective. This objective also doubles as a criterion for choosing landmarks or keypoints, as patches with high AP. We show results creating sparse, multi-scale, semantic spatial maps composed of highly identifiable landmarks, with applications in landmark retrieval, localization, semantic segmentation and instance segmentation.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Relational Constraints On Neural Networks Reproduce Human Biases towards Abstract Geometric Regularity
Authors:
Declan Campbell,
Sreejan Kumar,
Tyler Giallanza,
Jonathan D. Cohen,
Thomas L. Griffiths
Abstract:
Uniquely among primates, humans possess a remarkable capacity to recognize and manipulate abstract structure in the service of task goals across a broad range of behaviors. One illustration of this is in the visual perception of geometric forms. Studies have shown a uniquely human bias toward geometric regularity, with task performance enhanced for more regular and symmetric forms compared to thei…
▽ More
Uniquely among primates, humans possess a remarkable capacity to recognize and manipulate abstract structure in the service of task goals across a broad range of behaviors. One illustration of this is in the visual perception of geometric forms. Studies have shown a uniquely human bias toward geometric regularity, with task performance enhanced for more regular and symmetric forms compared to their geometrically irregular counterparts. Such studies conclude that this behavior implies the existence of discrete symbolic structure in human mental representations, and that replicating such behavior in neural network architectures will require mechanisms for symbolic processing. In this study, we argue that human biases towards geometric regularity can be reproduced in neural networks, without explicitly providing them with symbolic machinery, by augmenting them with an architectural constraint that enables the system to discover and manipulate relational structure. When trained with the appropriate curriculum, this model exhibits human-like biases towards symmetry and regularity in two distinct tasks involving abstract geometric reasoning. Our findings indicate that neural networks, when equipped with the necessary training objectives and architectural elements, can exhibit human-like regularity biases and generalization. This approach provides insights into the neural mechanisms underlying geometric reasoning and offers an alternative to prevailing symbolic "Language of Thought" models in this domain.
△ Less
Submitted 29 September, 2023;
originally announced September 2023.
-
The Robust Semantic Segmentation UNCV2023 Challenge Results
Authors:
Xuanlong Yu,
Yi Zuo,
Zitao Wang,
Xiaowen Zhang,
Jiaxuan Zhao,
Yuting Yang,
Licheng Jiao,
Rui Peng,
Xinyi Wang,
Junpei Zhang,
Kexin Zhang,
Fang Liu,
Roberto Alcover-Couso,
Juan C. SanMiguel,
Marcos Escudero-Viñolo,
Hanlin Tian,
Kenta Matsui,
Tianhao Wang,
Fahmy Adan,
Zhitong Gao,
Xuming He,
Quentin Bouniot,
Hossein Moghaddam,
Shyam Nandan Rai,
Fabio Cermelli
, et al. (12 additional authors not shown)
Abstract:
This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty q…
▽ More
This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty quantification methodologies presented at prominent conferences in the fields of computer vision and machine learning and journals over the past few years. Within this document, the challenge is introduced, shedding light on its purpose and objectives, which primarily revolved around enhancing the robustness of semantic segmentation in urban scenes under varying natural adversarial conditions. The report then delves into the top-performing solutions. Moreover, the document aims to provide a comprehensive overview of the diverse solutions deployed by all participants. By doing so, it seeks to offer readers a deeper insight into the array of strategies that can be leveraged to effectively handle the inherent uncertainties associated with autonomous driving and semantic segmentation, especially within urban environments.
△ Less
Submitted 27 September, 2023;
originally announced September 2023.
-
The Relational Bottleneck as an Inductive Bias for Efficient Abstraction
Authors:
Taylor W. Webb,
Steven M. Frankland,
Awni Altabaa,
Simon Segert,
Kamesh Krishnamurthy,
Declan Campbell,
Jacob Russin,
Tyler Giallanza,
Zack Dulberg,
Randall O'Reilly,
John Lafferty,
Jonathan D. Cohen
Abstract:
A central challenge for cognitive science is to explain how abstract concepts are acquired from limited experience. This has often been framed in terms of a dichotomy between connectionist and symbolic cognitive models. Here, we highlight a recently emerging line of work that suggests a novel reconciliation of these approaches, by exploiting an inductive bias that we term the relational bottleneck…
▽ More
A central challenge for cognitive science is to explain how abstract concepts are acquired from limited experience. This has often been framed in terms of a dichotomy between connectionist and symbolic cognitive models. Here, we highlight a recently emerging line of work that suggests a novel reconciliation of these approaches, by exploiting an inductive bias that we term the relational bottleneck. In that approach, neural networks are constrained via their architecture to focus on relations between perceptual inputs, rather than the attributes of individual inputs. We review a family of models that employ this approach to induce abstractions in a data-efficient manner, emphasizing their potential as candidate models for the acquisition of abstract concepts in the human mind and brain.
△ Less
Submitted 1 May, 2024; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Exploring Predicate Visual Context in Detecting Human-Object Interactions
Authors:
Frederic Z. Zhang,
Yuhui Yuan,
Dylan Campbell,
Zhuoyao Zhong,
Stephen Gould
Abstract:
Recently, the DETR framework has emerged as the dominant approach for human--object interaction (HOI) research. In particular, two-stage transformer-based HOI detectors are amongst the most performant and training-efficient approaches. However, these often condition HOI classification on object features that lack fine-grained contextual information, eschewing pose and orientation information in fa…
▽ More
Recently, the DETR framework has emerged as the dominant approach for human--object interaction (HOI) research. In particular, two-stage transformer-based HOI detectors are amongst the most performant and training-efficient approaches. However, these often condition HOI classification on object features that lack fine-grained contextual information, eschewing pose and orientation information in favour of visual cues about object identity and box extremities. This naturally hinders the recognition of complex or ambiguous interactions. In this work, we study these issues through visualisations and carefully designed experiments. Accordingly, we investigate how best to re-introduce image features via cross-attention. With an improved query design, extensive exploration of keys and values, and box pair positional embeddings as spatial guidance, our model with enhanced predicate visual context (PViC) outperforms state-of-the-art methods on the HICO-DET and V-COCO benchmarks, while maintaining low training cost.
△ Less
Submitted 7 November, 2023; v1 submitted 11 August, 2023;
originally announced August 2023.
-
Robotic Vision for Human-Robot Interaction and Collaboration: A Survey and Systematic Review
Authors:
Nicole Robinson,
Brendan Tidd,
Dylan Campbell,
Dana Kulić,
Peter Corke
Abstract:
Robotic vision for human-robot interaction and collaboration is a critical process for robots to collect and interpret detailed information related to human actions, goals, and preferences, enabling robots to provide more useful services to people. This survey and systematic review presents a comprehensive analysis on robotic vision in human-robot interaction and collaboration over the last 10 yea…
▽ More
Robotic vision for human-robot interaction and collaboration is a critical process for robots to collect and interpret detailed information related to human actions, goals, and preferences, enabling robots to provide more useful services to people. This survey and systematic review presents a comprehensive analysis on robotic vision in human-robot interaction and collaboration over the last 10 years. From a detailed search of 3850 articles, systematic extraction and evaluation was used to identify and explore 310 papers in depth. These papers described robots with some level of autonomy using robotic vision for locomotion, manipulation and/or visual communication to collaborate or interact with people. This paper provides an in-depth analysis of current trends, common domains, methods and procedures, technical processes, data sets and models, experimental testing, sample populations, performance metrics and future challenges. This manuscript found that robotic vision was often used in action and gesture recognition, robot movement in human spaces, object handover and collaborative actions, social communication and learning from demonstration. Few high-impact and novel techniques from the computer vision field had been translated into human-robot interaction and collaboration. Overall, notable advancements have been made on how to develop and deploy robots to assist people.
△ Less
Submitted 28 July, 2023;
originally announced July 2023.
-
Probabilistic and Semantic Descriptions of Image Manifolds and Their Applications
Authors:
Peter Tu,
Zhaoyuan Yang,
Richard Hartley,
Zhiwei Xu,
Jing Zhang,
Yiwei Fu,
Dylan Campbell,
Jaskirat Singh,
Tianyu Wang
Abstract:
This paper begins with a description of methods for estimating image probability density functions that reflects the observation that such data is usually constrained to lie in restricted regions of the high-dimensional image space-not every pattern of pixels is an image. It is common to say that images lie on a lower-dimensional manifold in the high-dimensional space. However, it is not the case…
▽ More
This paper begins with a description of methods for estimating image probability density functions that reflects the observation that such data is usually constrained to lie in restricted regions of the high-dimensional image space-not every pattern of pixels is an image. It is common to say that images lie on a lower-dimensional manifold in the high-dimensional space. However, it is not the case that all points on the manifold have an equal probability of being images. Images are unevenly distributed on the manifold, and our task is to devise ways to model this distribution as a probability distribution. We therefore consider popular generative models. For our purposes, generative/probabilistic models should have the properties of 1) sample generation: the possibility to sample from this distribution with the modelled density function, and 2) probability computation: given a previously unseen sample from the dataset of interest, one should be able to compute its probability, at least up to a normalising constant. To this end, we investigate the use of methods such as normalising flow and diffusion models. We then show how semantic interpretations are used to describe points on the manifold. To achieve this, we consider an emergent language framework that uses variational encoders for a disentangled representation of points that reside on a given manifold. Trajectories between points on a manifold can then be described as evolving semantic descriptions. We also show that such probabilistic descriptions (bounded) can be used to improve semantic consistency by constructing defences against adversarial attacks. We evaluate our methods with improved semantic robustness and OoD detection capability, explainable and editable semantic interpolation, and improved classification accuracy under patch attacks. We also discuss the limitation in diffusion models.
△ Less
Submitted 11 November, 2023; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Rethinking Polyp Segmentation from an Out-of-Distribution Perspective
Authors:
Ge-Peng Ji,
Jing Zhang,
Dylan Campbell,
Huan Xiong,
Nick Barnes
Abstract:
Unlike existing fully-supervised approaches, we rethink colorectal polyp segmentation from an out-of-distribution perspective with a simple but effective self-supervised learning approach. We leverage the ability of masked autoencoders -- self-supervised vision transformers trained on a reconstruction task -- to learn in-distribution representations; here, the distribution of healthy colon images.…
▽ More
Unlike existing fully-supervised approaches, we rethink colorectal polyp segmentation from an out-of-distribution perspective with a simple but effective self-supervised learning approach. We leverage the ability of masked autoencoders -- self-supervised vision transformers trained on a reconstruction task -- to learn in-distribution representations; here, the distribution of healthy colon images. We then perform out-of-distribution reconstruction and inference, with feature space standardisation to align the latent distribution of the diverse abnormal samples with the statistics of the healthy samples. We generate per-pixel anomaly scores for each image by calculating the difference between the input and reconstructed images and use this signal for out-of-distribution (ie, polyp) segmentation. Experimental results on six benchmarks show that our model has excellent segmentation performance and generalises across datasets. Our code is publicly available at https://github.com/GewelsJI/Polyp-OOD.
△ Less
Submitted 13 June, 2023;
originally announced June 2023.
-
Experimental study of developing free-falling annular flow in a large-scale vertical pipe
Authors:
Yunpeng Xue,
Colin Stewart,
David Kelly,
David Campbell,
Michael Gormley
Abstract:
Annular flow is the primary characteristic of unsteady wastewater flow, which initiates entrained air and sets up the air pressure regime within the system - an important design consideration. This paper reports on an experimental investigation of free-falling annular flow in a vertical pipe with different inlets at extended flow ranges up to Re = 3 x 10e4, similar to those in Building Drainage Sy…
▽ More
Annular flow is the primary characteristic of unsteady wastewater flow, which initiates entrained air and sets up the air pressure regime within the system - an important design consideration. This paper reports on an experimental investigation of free-falling annular flow in a vertical pipe with different inlets at extended flow ranges up to Re = 3 x 10e4, similar to those in Building Drainage Systems (BDS). In the experimental setup, a vertical pipe system (5 m) was used to record velocity profiles and film thickness in the developing region through Particle Image Velocimetry (PIV) measurements. Entrained droplets were collected through a separator, and the entrainment fraction was calculated at different flow conditions. The study reports on the development process of the film velocity and thickness along the vertical pipe, which agrees well with empirical predictions. The results of the droplet entrainment of a vertical annular flow show the development process to the steady state. Additionally, a Tee-junction inlet in drainage system generates a higher and different entrainment profile.
△ Less
Submitted 19 October, 2023; v1 submitted 11 June, 2023;
originally announced June 2023.
-
Charge Order Evolution of Superconducting BaNi2As2 Under High Pressure
Authors:
John Collini,
Daniel J. Campbell,
Daniel Sneed,
Prathum Saraf,
Christopher Eckberg,
Jason Jeffries,
Nicholas Butch,
Johnpierre Paglione
Abstract:
BaNi$_2$As$_2$, a non-magnetic superconductor counterpart to BaFe$_2$As$_2$, has been shown to develop nematic order, multiple charge orders, and a dramatic six-fold enhancement of superconductivity via isovalent chemical substitution of Sr for Ba. Here we present high pressure single-crystal and powder x-ray diffraction measurements of BaNi$_2$As$_2$ to study the effects of tuning lattice density…
▽ More
BaNi$_2$As$_2$, a non-magnetic superconductor counterpart to BaFe$_2$As$_2$, has been shown to develop nematic order, multiple charge orders, and a dramatic six-fold enhancement of superconductivity via isovalent chemical substitution of Sr for Ba. Here we present high pressure single-crystal and powder x-ray diffraction measurements of BaNi$_2$As$_2$ to study the effects of tuning lattice density on the evolution of charge order in this system. Single-crystal X-ray experiments track the evolution of the incommensurate (Q=0.28) and commensurate (Q=0.33 and Q=0.5) charge orders, and the tetragonal-triclinic distortion as a function of temperature up to pressures of 10.4 GPa, and powder diffraction experiments at 300 K provide lattice parameters up to 17 GPa. We find that applying pressure to BaNi$_2$As$_2$ produces a similar evolution of structural and charge-ordered phases as found as a function of chemical pressure in Ba$_{1-x}$Sr$_{x}$Ni$_2$As$_2$ , with coexisting commensurate charge orders appearing on increasing pressure. These phases also exhibit a similar abrupt cutoff at a critical pressure of (9 $\pm$ 0.5) GPa, where powder diffraction experiments indicate a collapse of the tetragonal structure at higher temperatures. We discuss the relationship between this collapsed tetragonal phase and the discontinuous phase boundary observed at the optimal substitution value for superconductivity in Ba$_{1-x}$Sr$_{x}$Ni$_2$As$_2$
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Application of Generalized Periodic Anderson Hamiltonians to the Superconducting Nickelates
Authors:
Abhishek Som,
Nahom K. Yirga,
David K. Campbell
Abstract:
We study the extent to which a three-dimensional dispersing Periodic Anderson Model (PAM) can explain the emergence of novel superconductivity in the Infinite-Layer Nickelate compounds. By going beyond frequently used 2D models, the 3D dispersing PAM allows us to incorporate effects of finite out-of-plane hopping and orbital hybridization in describing these systems. Using an unbiased functional R…
▽ More
We study the extent to which a three-dimensional dispersing Periodic Anderson Model (PAM) can explain the emergence of novel superconductivity in the Infinite-Layer Nickelate compounds. By going beyond frequently used 2D models, the 3D dispersing PAM allows us to incorporate effects of finite out-of-plane hopping and orbital hybridization in describing these systems. Using an unbiased functional Renormalization Group (fRG) approach, we show that $d_{x^2-y^2}$ superconductivity arises in a series of 3D {\it {ab-initio}} models of the Nickelates ({\it {e.g.}}$\mathrm{RNiO_2}$), where R is a rare earth element. We the study the impact of going beyond the Ni-d orbital by including the R-$d_{z^2}$ and the interstitial-$s$ as hybridizing conducting bands. We explore the dependence of the models on key parameters, including the local Hubbard coupling, doping and temperature. We find the hybridization with the interstitial-$s$ band driving a 3D $d_{z^2-r^2}$-type superconductivity while out of plane hopping primarily enhances an $s$-wave superconducting order.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
A new room-temperature equation of state of Bi up to 260 GPa
Authors:
Daniel J. Campbell,
Daniel T. Sneed,
Earl F. O'Bannon III,
Per Söderlind,
Zsolt Jenei
Abstract:
At room temperature, bismuth undergoes several structural transitions with increasing pressure before taking on a body-centered cubic (bcc) phase at approximately 8 GPa. The bcc structure is stable to the highest measured pressure and its simplicity, along with its high compressibility and atomic number, make it an enticing choice as a pressure calibrant. We present three data sets on the compress…
▽ More
At room temperature, bismuth undergoes several structural transitions with increasing pressure before taking on a body-centered cubic (bcc) phase at approximately 8 GPa. The bcc structure is stable to the highest measured pressure and its simplicity, along with its high compressibility and atomic number, make it an enticing choice as a pressure calibrant. We present three data sets on the compression of bismuth in a diamond anvil cell in a neon pressure medium, up to a maximum pressure of about 260 GPa. The use of a soft pressure medium reduces deviatoric stress when compared to previous work. With an expanded pressure range, higher point density, and a decreased uniaxial stress component, we are able to provide more reliable equation of state (EOS) parameters. We also conduct density functional theory (DFT) electronic-structure calculations that confirm the stability of the bcc phase at high pressure.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
A survey for variable young stars with small telescopes: VI -- Analysis of the outbursting Be stars NSW284, Gaia19eyy, and VES263
Authors:
Dirk Froebrich,
Lynne A. Hillenbrand,
Carys Herbert,
Kishalay De,
Jochen Eislöffel,
Justyn Campbell-White,
Ruhee Kahar,
Franz-Josef Hambsch,
Thomas Urtly,
Adam Popowicz,
Krzysztof Bernacki,
Andrzej Malcher,
Slawomir Lasota,
Jerzy Fiolka,
Piotr Jozwik-Wabik,
Franky Dubois,
Ludwig Logie,
Steve Rau,
Mark Phillips,
George Fleming,
Rafael Gonzalez Farfán,
Francisco C. Soldán Alfaro,
Tim Nelson,
Stephen R. L. Futcher,
Samantha M. Rolfe
, et al. (22 additional authors not shown)
Abstract:
This paper is one in a series reporting results from small telescope observations of variable young stars. Here, we study the repeating outbursts of three likely Be stars based on long-term optical, near-infrared, and mid-infrared photometry for all three objects, along with follow-up spectra for two of the three. The sources are characterised as rare, truly regularly outbursting Be stars. We inte…
▽ More
This paper is one in a series reporting results from small telescope observations of variable young stars. Here, we study the repeating outbursts of three likely Be stars based on long-term optical, near-infrared, and mid-infrared photometry for all three objects, along with follow-up spectra for two of the three. The sources are characterised as rare, truly regularly outbursting Be stars. We interpret the photometric data within a framework for modelling light curve morphology, and find that the models correctly predict the burst shapes, including their larger amplitudes and later peaks towards longer wavelengths. We are thus able to infer the start and end times of mass loading into the circumstellar disks of these stars. The disk sizes are typically 3-6 times the areas of the central star. The disk temperatures are ~40%, and the disk luminosities are ~10% of those of the central Be star, respectively. The available spectroscopy is consistent with inside-out evolution of the disk. Higher excitation lines have larger velocity widths in their double-horned shaped emission profiles. Our observations and analysis support the decretion disk model for outbursting Be stars.
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
The Swansong of the Galactic Center Source X7: An Extreme Example of Tidal Evolution near the Supermassive Black Hole
Authors:
Anna Ciurlo,
Randall D. Campbell,
Mark R. Morris,
Tuan Do,
Andrea M. Ghez,
Eric E. Becklin,
Rory O. Bentley,
Devin S. Chu,
Abhimat K. Gautam,
Yash A. Gursahani,
Aurelien Hees,
Kelly Kosmo O'Neil,
Jessica R. Lu,
Gregory D. Martinez,
Smadar Naoz,
Shoko Sakai,
Rainer Schoedel
Abstract:
We present two decades of new high-angular-resolution near-infrared data from the W. M. Keck Observatory that reveal extreme evolution in X7, an elongated dust and gas feature, presently located half an arcsecond from the Galactic Center supermassive black hole. With both spectro-imaging observations of Br-γ line-emission and Lp (3.8 μm) imaging data, we provide the first estimate of its orbital p…
▽ More
We present two decades of new high-angular-resolution near-infrared data from the W. M. Keck Observatory that reveal extreme evolution in X7, an elongated dust and gas feature, presently located half an arcsecond from the Galactic Center supermassive black hole. With both spectro-imaging observations of Br-γ line-emission and Lp (3.8 μm) imaging data, we provide the first estimate of its orbital parameters and quantitative characterization of the evolution of its morphology and mass. We find that the leading edge of X7 appears to be on a mildly eccentric (e~0.3), relatively short-period (170 years) orbit and is headed towards periapse passage, estimated to occur in ~2036. Furthermore, our kinematic measurements rule out the earlier suggestion that X7 is associated with the stellar source S0-73 or with any other point source that has overlapped with X7 during our monitoring period. Over the course of our observations, X7 has (1) become more elongated, with a current length-to-width ratio of 9, (2) maintained a very consistent long-axis orientation (position angle of 50 deg), (3) inverted its radial velocity differential from tip to tail from -50 to +80 km/sec, and (4) sustained its total brightness (12.8 Lp magnitudes at the leading edge) and color temperature (425 K), which suggest a constant mass of ~50 MEarth. We present a simple model showing that these results are compatible with the expected effect of tidal forces exerted on it by the central black hole and we propose that X7 is the gas and dust recently ejected from a grazing collision in a binary system.
△ Less
Submitted 16 January, 2023;
originally announced January 2023.
-
Common Subcontracting and Airline Prices
Authors:
Gaurab Aryal,
Dennis J. Campbell,
Federico Ciliberto,
Ekaterina A. Khmelnitskaya
Abstract:
In the US airline industry, independent regional airlines fly passengers on behalf of several national airlines across different markets, giving rise to $\textit{common subcontracting}$. On the one hand, we find that subcontracting is associated with lower prices, consistent with the notion that regional airlines tend to fly passengers at lower costs than major airlines. On the other hand, we find…
▽ More
In the US airline industry, independent regional airlines fly passengers on behalf of several national airlines across different markets, giving rise to $\textit{common subcontracting}$. On the one hand, we find that subcontracting is associated with lower prices, consistent with the notion that regional airlines tend to fly passengers at lower costs than major airlines. On the other hand, we find that $\textit{common}$ subcontracting is associated with higher prices. These two countervailing effects suggest that the growth of regional airlines can have anticompetitive implications for the industry.
△ Less
Submitted 23 December, 2023; v1 submitted 14 January, 2023;
originally announced January 2023.
-
Classification of area-strict limits of planar BV homeomorphisms
Authors:
Daniel Campbell,
Aapo Kauranen,
Emanuela Radici
Abstract:
We present a classification of area-strict limits of planar $BV$ homeomorphisms. This class of mappings allows for cavitations and fractures but fulfil a suitable generalization of the INV condition. As pointed out by J. Ball [4], these features are expected in limit configurations of elastic deformations. In [12], De Philippis and Pratelli introduced the \emph{no-crossing} condition which charact…
▽ More
We present a classification of area-strict limits of planar $BV$ homeomorphisms. This class of mappings allows for cavitations and fractures but fulfil a suitable generalization of the INV condition. As pointed out by J. Ball [4], these features are expected in limit configurations of elastic deformations. In [12], De Philippis and Pratelli introduced the \emph{no-crossing} condition which characterizes the $W^{1,p}$ closure of planar homeomorphisms. In the current paper we show that a suitable version of this concept is equivalent with a map, $f$, being the area-strict limit of BV homeomorphisms. This extends our results from [10], where we proved that the \emph{no-crossing BV} condition for a BV map was equivalent with the map being the m-strict limit of homeomorphisms (i.e. $f_k$ converges $w^*$ to $f$ and $|D_1f_k|(Ω)+|D_2f_k|(Ω) \to |D_1f|(Ω)+|D_2f|(Ω)$). Further we show that the \emph{no-crossing BV} condition is equivalent with a seemingly stronger version of the same condition.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Minimal Extension for the $α$-Manhattan norm
Authors:
Daniel Campbell,
Aapo Kauranen,
Emanuela Radici
Abstract:
Let $\partial \mathcal{Q}$ be the boundary of a convex polygon in $\mathbb{R}^2$, $e_α= (\cosα, \sin α)$ and $e_α^{\bot} = (-\sinα, \cos α)$ be a basis of $\mathbb{R}^2$ for some $α\in[0,2π)$ and $φ:\partial\mathcal{Q} \to\mathbb{R}^2$ be a continuous, finitely piecewise linear injective map. We construct a finitely piecewise affine homeomorphism $v: \mathcal{Q} \to \mathbb{R}^2$ coinciding with…
▽ More
Let $\partial \mathcal{Q}$ be the boundary of a convex polygon in $\mathbb{R}^2$, $e_α= (\cosα, \sin α)$ and $e_α^{\bot} = (-\sinα, \cos α)$ be a basis of $\mathbb{R}^2$ for some $α\in[0,2π)$ and $φ:\partial\mathcal{Q} \to\mathbb{R}^2$ be a continuous, finitely piecewise linear injective map. We construct a finitely piecewise affine homeomorphism $v: \mathcal{Q} \to \mathbb{R}^2$ coinciding with $φ$ on $\partial \mathcal{Q}$ such that the following property holds: $|\langle Dv, e_α\rangle|(\mathcal{Q})$ (resp. $\langle Dv, e_α^{\bot}\rangle|(\mathcal{Q})$) is as close as we want to $\inf |\langle Du, e_α\rangle|(\mathcal{Q})$ (resp. $\inf |\langle Du, e_α^{\bot}\rangle|(\mathcal{Q})$) where the infimum is meant over the class of all $BV$ homeomorphisms $u$ extending $φ$ inside $\mathcal{Q}$. This result extends that already proven in [14] in the shape of the domain.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Galaxies on graph neural networks: towards robust synthetic galaxy catalogs with deep generative models
Authors:
Yesukhei Jagvaral,
Francois Lanusse,
Sukhdeep Singh,
Rachel Mandelbaum,
Siamak Ravanbakhsh,
Duncan Campbell
Abstract:
The future astronomical imaging surveys are set to provide precise constraints on cosmological parameters, such as dark energy. However, production of synthetic data for these surveys, to test and validate analysis methods, suffers from a very high computational cost. In particular, generating mock galaxy catalogs at sufficiently large volume and high resolution will soon become computationally un…
▽ More
The future astronomical imaging surveys are set to provide precise constraints on cosmological parameters, such as dark energy. However, production of synthetic data for these surveys, to test and validate analysis methods, suffers from a very high computational cost. In particular, generating mock galaxy catalogs at sufficiently large volume and high resolution will soon become computationally unreachable. In this paper, we address this problem with a Deep Generative Model to create robust mock galaxy catalogs that may be used to test and develop the analysis pipelines of future weak lensing surveys. We build our model on a custom built Graph Convolutional Networks, by placing each galaxy on a graph node and then connecting the graphs within each gravitationally bound system. We train our model on a cosmological simulation with realistic galaxy populations to capture the 2D and 3D orientations of galaxies. The samples from the model exhibit comparable statistical properties to those in the simulations. To the best of our knowledge, this is the first instance of a generative model on graphs in an astrophysical/cosmological context.
△ Less
Submitted 11 December, 2022;
originally announced December 2022.
-
Compressed Sensing MRI Reconstruction Regularized by VAEs with Structured Image Covariance
Authors:
Margaret Duff,
Ivor J. A. Simpson,
Matthias J. Ehrhardt,
Neill D. F. Campbell
Abstract:
Objective: This paper investigates how generative models, trained on ground-truth images, can be used \changes{as} priors for inverse problems, penalizing reconstructions far from images the generator can produce. The aim is that learned regularization will provide complex data-driven priors to inverse problems while still retaining the control and insight of a variational regularization method. M…
▽ More
Objective: This paper investigates how generative models, trained on ground-truth images, can be used \changes{as} priors for inverse problems, penalizing reconstructions far from images the generator can produce. The aim is that learned regularization will provide complex data-driven priors to inverse problems while still retaining the control and insight of a variational regularization method. Moreover, unsupervised learning, without paired training data, allows the learned regularizer to remain flexible to changes in the forward problem such as noise level, sampling pattern or coil sensitivities in MRI.
Approach: We utilize variational autoencoders (VAEs) that generate not only an image but also a covariance uncertainty matrix for each image. The covariance can model changing uncertainty dependencies caused by structure in the image, such as edges or objects, and provides a new distance metric from the manifold of learned images.
Main results: We evaluate these novel generative regularizers on retrospectively sub-sampled real-valued MRI measurements from the fastMRI dataset. We compare our proposed learned regularization against other unlearned regularization approaches and unsupervised and supervised deep learning methods.
Significance: Our results show that the proposed method is competitive with other state-of-the-art methods and behaves consistently with changing sampling patterns and noise levels.
△ Less
Submitted 16 June, 2023; v1 submitted 26 October, 2022;
originally announced October 2022.
-
Analysing Training-Data Leakage from Gradients through Linear Systems and Gradient Matching
Authors:
Cangxiong Chen,
Neill D. F. Campbell
Abstract:
Recent works have demonstrated that it is possible to reconstruct training images and their labels from gradients of an image-classification model when its architecture is known. Unfortunately, there is still an incomplete theoretical understanding of the efficacy and failure of these gradient-leakage attacks. In this paper, we propose a novel framework to analyse training-data leakage from gradie…
▽ More
Recent works have demonstrated that it is possible to reconstruct training images and their labels from gradients of an image-classification model when its architecture is known. Unfortunately, there is still an incomplete theoretical understanding of the efficacy and failure of these gradient-leakage attacks. In this paper, we propose a novel framework to analyse training-data leakage from gradients that draws insights from both analytic and optimisation-based gradient-leakage attacks. We formulate the reconstruction problem as solving a linear system from each layer iteratively, accompanied by corrections using gradient matching. Under this framework, we claim that the solubility of the reconstruction problem is primarily determined by that of the linear system at each layer. As a result, we are able to partially attribute the leakage of the training data in a deep network to its architecture. We also propose a metric to measure the level of security of a deep learning model against gradient-based attacks on the training data.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
Current Paths in an Atomic Precision Advanced Manufactured Device Imaged by Nitrogen-Vacancy Diamond Magnetic Microscopy
Authors:
Luca Basso,
Pauli Kehayias,
Jacob Henshaw,
Maziar Saleh Ziabari,
Heejun Byeon,
Michael P. Lilly,
Ezra Bussmann,
Deanna M. Campbell,
Shashank Misra,
Andrew M. Mounce
Abstract:
The recently-developed ability to control phosphorous-doping of silicon at an atomic level using scanning tunneling microscopy (STM), a technique known as atomic-precision-advanced-manufacturing (APAM), has allowed us to tailor electronic devices with atomic precision, and thus has emerged as a way to explore new possibilities in Si electronics. In these applications, critical questions include wh…
▽ More
The recently-developed ability to control phosphorous-doping of silicon at an atomic level using scanning tunneling microscopy (STM), a technique known as atomic-precision-advanced-manufacturing (APAM), has allowed us to tailor electronic devices with atomic precision, and thus has emerged as a way to explore new possibilities in Si electronics. In these applications, critical questions include where current flow is actually occurring in or near APAM structures as well as whether leakage currents are present. In general, detection and mapping of current flow in APAM structures are valuable diagnostic tools to obtain reliable devices in digital-enhanced applications. In this paper, we performed nitrogen-vacancy (NV) wide-field magnetic imaging of stray magnetic fields from surface current densities flowing in an APAM test device over a mm-field of view with μm-resolution. To do this, we integrated a diamond having a surface NV ensemble with the device (patterned in two parallel mm-sized ribbons), then mapped the magnetic field from the DC current injected in the APAM device in a home-built NV wide-field microscope. The 2D magnetic field maps were used to reconstruct the surface current density, allowing us to obtain information on current paths, device failures such as choke points where current flow is impeded, and current leakages outside the APAM-defined P-doped regions. Analysis on the current density reconstructed map showed a projected sensitivity of ~0.03 A/m, corresponding to a smallest detectable current in the 200 μm-wide APAM ribbon of ~6 μA. These results demonstrate the failure analysis capability of NV wide-field magnetometry for APAM materials, opening the possibility to investigate other cutting-edge microelectronic devices.
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
Modular tunable coupler for superconducting qubits
Authors:
Daniel L. Campbell,
Archana Kamal,
Leonardo Ranzani,
Michael Senatore,
Matthew LaHaye
Abstract:
The development of modular and versatile quantum interconnect hardware is a key next step in the scaling of quantum information platforms to larger size and greater functionality. For superconducting quantum systems, fast and well-controlled tunable circuit couplers will be paramount for achieving high fidelity and resource efficient connectivity, whether for performing two-qubit gate operations,…
▽ More
The development of modular and versatile quantum interconnect hardware is a key next step in the scaling of quantum information platforms to larger size and greater functionality. For superconducting quantum systems, fast and well-controlled tunable circuit couplers will be paramount for achieving high fidelity and resource efficient connectivity, whether for performing two-qubit gate operations, encoding or decoding a quantum data bus, or interfacing across modalities. Here we propose a versatile and internally-tunable double-transmon coupler (DTC) architecture that implements tunable coupling via flux-controlled interference in a three-junction dcSQUID. Crucially, the DTC possesses an internally defined zero-coupling state that is independent of the coupled data qubits or circuit resonators. This makes it particular attractive as a modular and versatile design element for realizing fast and robust linear coupling in several applications such as high-fidelity two-qubit gate operations, qubit readout, and quantum bus interfacing.
△ Less
Submitted 1 May, 2023; v1 submitted 13 July, 2022;
originally announced July 2022.
-
SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data
Authors:
Eldar Insafutdinov,
Dylan Campbell,
João F. Henriques,
Andrea Vedaldi
Abstract:
We present a method for the accurate 3D reconstruction of partly-symmetric objects. We build on the strengths of recent advances in neural reconstruction and rendering such as Neural Radiance Fields (NeRF). A major shortcoming of such approaches is that they fail to reconstruct any part of the object which is not clearly visible in the training image, which is often the case for in-the-wild images…
▽ More
We present a method for the accurate 3D reconstruction of partly-symmetric objects. We build on the strengths of recent advances in neural reconstruction and rendering such as Neural Radiance Fields (NeRF). A major shortcoming of such approaches is that they fail to reconstruct any part of the object which is not clearly visible in the training image, which is often the case for in-the-wild images and videos. When evidence is lacking, structural priors such as symmetry can be used to complete the missing information. However, exploiting such priors in neural rendering is highly non-trivial: while geometry and non-reflective materials may be symmetric, shadows and reflections from the ambient scene are not symmetric in general. To address this, we apply a soft symmetry constraint to the 3D geometry and material properties, having factored appearance into lighting, albedo colour and reflectivity. We evaluate our method on the recently introduced CO3D dataset, focusing on the car category due to the challenge of reconstructing highly-reflective materials. We show that it can reconstruct unobserved regions with high fidelity and render high-quality novel view images.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Phonon Induced Instabilities in Correlated Electron Hamiltonians
Authors:
Nahom K. Yirga,
Ka-Ming Tam,
David K. Campbell
Abstract:
Studies of Hamiltonians modeling the coupling between electrons as well as to local phonon excitations have been fundamental in capturing the novel ordering seen in many quasi-one dimensional condensed matter systems. Extending studies of such Hamiltonians to quasi-two dimensional systems is of great current interest, as electron-phonon couplings are predicted to play a major role in the stabiliza…
▽ More
Studies of Hamiltonians modeling the coupling between electrons as well as to local phonon excitations have been fundamental in capturing the novel ordering seen in many quasi-one dimensional condensed matter systems. Extending studies of such Hamiltonians to quasi-two dimensional systems is of great current interest, as electron-phonon couplings are predicted to play a major role in the stabilization or enhancement of novel phases in 2D material systems. In this work, we study model systems that describe the interplay between the Hubbard coupling and the phonon modes in the Holstein (H) and Su-Schrieffer-Heeger (SSH) Hamiltonians using the functional renormalization group (fRG). For both types of electron phonon couplings, we find the predicted charge density wave phases in competition with anti-ferromagnetic ($AF$) ordering. As the system is doped, the transition shifts, with both orders showing incommensurate peaks. We compare the evolution of the quasiparticle weight for the Holstein model with that of the SSH model as the systems transition from antiferromagnetic to charge-ordered ground states. Finally, we calculate the self-energy of the phonon and capture the impact of charge ordering on the phonon modes.
△ Less
Submitted 8 June, 2022;
originally announced June 2022.
-
Stable and high quality electron beams from staged laser and plasma wakefield accelerators
Authors:
F. M. Foerster,
A. Döpp,
F. Haberstroh,
K. v. Grafenstein,
D. Campbell,
Y. -Y. Chang,
S. Corde,
J. P. Couperus Cabadağ,
A. Debus,
M. F. Gilljohann,
A. F. Habib,
T. Heinemann,
B. Hidding,
A. Irman,
F. Irshad,
A. Knetsch,
O. Kononenko,
A. Martinez de la Ossa,
A. Nutter,
R. Pausch,
G. Schilling,
A. Schletter,
S. Schöbel,
U. Schramm,
E. Travac
, et al. (2 additional authors not shown)
Abstract:
We present experimental results on a plasma wakefield accelerator (PWFA) driven by high-current electron beams from a laser wakefield accelerator (LWFA). In this staged setup stable and high quality (low divergence and low energy spread) electron beams are generated at an optically-generated hydrodynamic shock in the PWFA. The energy stability of the beams produced by that arrangement in the PWFA…
▽ More
We present experimental results on a plasma wakefield accelerator (PWFA) driven by high-current electron beams from a laser wakefield accelerator (LWFA). In this staged setup stable and high quality (low divergence and low energy spread) electron beams are generated at an optically-generated hydrodynamic shock in the PWFA. The energy stability of the beams produced by that arrangement in the PWFA stage is comparable to both single-stage laser accelerators and plasma wakefield accelerators driven by conventional accelerators. Simulations support that the intrinsic insensitivity of PWFAs to driver energy fluctuations can be exploited to overcome stability limitations of state-of-the-art laser wakefield accelerators when adding a PWFA stage. Furthermore, we demonstrate the generation of electron bunches with energy spread and divergence superior to single-stage LW-FAs, resulting in bunches with dense phase space and an angular-spectral charge density beyond the initial drive beam parameters. These results unambiguously show that staged LWFA-PWFA can help to tailor the electron-beam quality for certain applications and to reduce the influence of fluctuating laser drivers on the electron-beam stability. This encourages further development of this new class of staged wakefield acceleration as a viable scheme towards compact, high-quality electron beam sources.
△ Less
Submitted 1 June, 2022;
originally announced June 2022.
-
Galaxies and Halos on Graph Neural Networks: Deep Generative Modeling Scalar and Vector Quantities for Intrinsic Alignment
Authors:
Yesukhei Jagvaral,
François Lanusse,
Sukhdeep Singh,
Rachel Mandelbaum,
Siamak Ravanbakhsh,
Duncan Campbell
Abstract:
In order to prepare for the upcoming wide-field cosmological surveys, large simulations of the Universe with realistic galaxy populations are required. In particular, the tendency of galaxies to naturally align towards overdensities, an effect called intrinsic alignments (IA), can be a major source of systematics in the weak lensing analysis. As the details of galaxy formation and evolution releva…
▽ More
In order to prepare for the upcoming wide-field cosmological surveys, large simulations of the Universe with realistic galaxy populations are required. In particular, the tendency of galaxies to naturally align towards overdensities, an effect called intrinsic alignments (IA), can be a major source of systematics in the weak lensing analysis. As the details of galaxy formation and evolution relevant to IA cannot be simulated in practice on such volumes, we propose as an alternative a Deep Generative Model. This model is trained on the IllustrisTNG-100 simulation and is capable of sampling the orientations of a population of galaxies so as to recover the correct alignments. In our approach, we model the cosmic web as a set of graphs, where the graphs are constructed for each halo, and galaxy orientations as a signal on those graphs. The generative model is implemented on a Generative Adversarial Network architecture and uses specifically designed Graph-Convolutional Networks sensitive to the relative 3D positions of the vertices. Given (sub)halo masses and tidal fields, the model is able to learn and predict scalar features such as galaxy and dark matter subhalo shapes; and more importantly, vector features such as the 3D orientation of the major axis of the ellipsoid and the complex 2D ellipticities. For correlations of 3D orientations the model is in good quantitative agreement with the measured values from the simulation, except for at very small and transition scales. For correlations of 2D ellipticities, the model is in good quantitative agreement with the measured values from the simulation on all scales. Additionally, the model is able to capture the dependence of IA on mass, morphological type and central/satellite type.
△ Less
Submitted 22 July, 2022; v1 submitted 14 April, 2022;
originally announced April 2022.
-
Injectivity in second-gradient Nonlinear Elasticity
Authors:
D. Campbell,
S. Hencl,
A. Menovschikov,
S. Schwarzacher
Abstract:
We study injectivity for models of Nonlinear Elasticity that involve the second gradient. We assume that $Ω\subset\mathbb{R}^n$ is a domain, $f\in W^{2,q}(Ω,\mathbb{R}^n)$ satisfies $|J_f|^{-a}\in L^1$ and that $f$ equals a given homeomorphism on $\partial Ω$. Under suitable conditions on $q$ and $a$ we show that $f$ must be a homeomorphism. As a main new tool we find an optimal condition for $a$…
▽ More
We study injectivity for models of Nonlinear Elasticity that involve the second gradient. We assume that $Ω\subset\mathbb{R}^n$ is a domain, $f\in W^{2,q}(Ω,\mathbb{R}^n)$ satisfies $|J_f|^{-a}\in L^1$ and that $f$ equals a given homeomorphism on $\partial Ω$. Under suitable conditions on $q$ and $a$ we show that $f$ must be a homeomorphism. As a main new tool we find an optimal condition for $a$ and $q$ that imply that $\mathcal{H}^{n-1}(\{J_f=0\})=0$ and hence $J_f$ cannot change sign. We further specify in dependence of $q$ and $a$ the maximal Hausdorff dimension $d$ of the critical set $\{J_f=0\}$. The sharpness of our conditions for $d$ is demonstrated by constructing respective counterexamples.
△ Less
Submitted 12 April, 2022;
originally announced April 2022.
-
A 23 MW data centre is all you need
Authors:
Samuel Albanie,
Dylan Campbell,
João F. Henriques
Abstract:
The field of machine learning has achieved striking progress in recent years, witnessing breakthrough results on language modelling, protein folding and nitpickingly fine-grained dog breed classification. Some even succeeded at playing computer games and board games, a feat both of engineering and of setting their employers' expectations. The central contribution of this work is to carefully exami…
▽ More
The field of machine learning has achieved striking progress in recent years, witnessing breakthrough results on language modelling, protein folding and nitpickingly fine-grained dog breed classification. Some even succeeded at playing computer games and board games, a feat both of engineering and of setting their employers' expectations. The central contribution of this work is to carefully examine whether this progress, and technology more broadly, can be expected to continue indefinitely. Through a rigorous application of statistical theory and failure to extrapolate beyond the training data, we answer firmly in the negative and provide details: technology will peak at 3:07 am (BST) on 20th July, 2032. We then explore the implications of this finding, discovering that individuals awake at this ungodly hour with access to a sufficiently powerful computer possess an opportunity for myriad forms of long-term linguistic 'lock in'. All we need is a large (>> 1W) data centre to seize this pivotal moment. By setting our analogue alarm clocks, we propose a tractable algorithm to ensure that, for the future of humanity, the British spelling of colour becomes the default spelling across more than 80% of the global word processing software market.
△ Less
Submitted 31 March, 2022;
originally announced March 2022.
-
Learning Structured Gaussians to Approximate Deep Ensembles
Authors:
Ivor J. A. Simpson,
Sara Vicente,
Neill D. F. Campbell
Abstract:
This paper proposes using a sparse-structured multivariate Gaussian to provide a closed-form approximator for the output of probabilistic ensemble models used for dense image prediction tasks. This is achieved through a convolutional neural network that predicts the mean and covariance of the distribution, where the inverse covariance is parameterised by a sparsely structured Cholesky matrix. Simi…
▽ More
This paper proposes using a sparse-structured multivariate Gaussian to provide a closed-form approximator for the output of probabilistic ensemble models used for dense image prediction tasks. This is achieved through a convolutional neural network that predicts the mean and covariance of the distribution, where the inverse covariance is parameterised by a sparsely structured Cholesky matrix. Similarly to distillation approaches, our single network is trained to maximise the probability of samples from pre-trained probabilistic models, in this work we use a fixed ensemble of networks. Once trained, our compact representation can be used to efficiently draw spatially correlated samples from the approximated output distribution. Importantly, this approach captures the uncertainty and structured correlations in the predictions explicitly in a formal distribution, rather than implicitly through sampling alone. This allows direct introspection of the model, enabling visualisation of the learned structure. Moreover, this formulation provides two further benefits: estimation of a sample probability, and the introduction of arbitrary spatial conditioning at test time. We demonstrate the merits of our approach on monocular depth estimation and show that the advantages of our approach are obtained with comparable quantitative performance.
△ Less
Submitted 29 March, 2022;
originally announced March 2022.
-
Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image Matching
Authors:
Yujiao Shi,
Xin Yu,
Liu Liu,
Dylan Campbell,
Piotr Koniusz,
Hongdong Li
Abstract:
We address the problem of ground-to-satellite image geo-localization, that is, estimating the camera latitude, longitude and orientation (azimuth angle) by matching a query image captured at the ground level against a large-scale database with geotagged satellite images. Our prior arts treat the above task as pure image retrieval by selecting the most similar satellite reference image matching the…
▽ More
We address the problem of ground-to-satellite image geo-localization, that is, estimating the camera latitude, longitude and orientation (azimuth angle) by matching a query image captured at the ground level against a large-scale database with geotagged satellite images. Our prior arts treat the above task as pure image retrieval by selecting the most similar satellite reference image matching the ground-level query image. However, such an approach often produces coarse location estimates because the geotag of the retrieved satellite image only corresponds to the image center while the ground camera can be located at any point within the image. To further consolidate our prior research findings, we present a novel geometry-aware geo-localization method. Our new method is able to achieve the fine-grained location of a query image, up to pixel size precision of the satellite image, once its coarse location and orientation have been determined. Moreover, we propose a new geometry-aware image retrieval pipeline to improve the coarse localization accuracy. Apart from a polar transform in our conference work, this new pipeline also maps satellite image pixels to the ground-level plane in the ground-view via a geometry-constrained projective transform to emphasize informative regions, such as road structures, for cross-view geo-localization. Extensive quantitative and qualitative experiments demonstrate the effectiveness of our newly proposed framework. We also significantly improve the performance of coarse localization results compared to the state-of-the-art in terms of location recalls.
△ Less
Submitted 26 March, 2022;
originally announced March 2022.