-
Specialist vision-language models for clinical ophthalmology
Authors:
Robbie Holland,
Thomas R. P. Taylor,
Christopher Holmes,
Sophie Riedl,
Julia Mai,
Maria Patsiamanidi,
Dimitra Mitsopoulou,
Paul Hager,
Philip Müller,
Hendrik P. N. Scholl,
Hrvoje Bogunović,
Ursula Schmidt-Erfurth,
Daniel Rueckert,
Sobha Sivaprasad,
Andrew J. Lotery,
Martin J. Menten
Abstract:
Clinicians spend a significant amount of time reviewing medical images and transcribing their findings regarding patient diagnosis, referral and treatment in text form. Vision-language models (VLMs), which automatically interpret images and summarize their findings as text, have enormous potential to alleviate clinical workloads and increase patient access to high-quality medical care. While found…
▽ More
Clinicians spend a significant amount of time reviewing medical images and transcribing their findings regarding patient diagnosis, referral and treatment in text form. Vision-language models (VLMs), which automatically interpret images and summarize their findings as text, have enormous potential to alleviate clinical workloads and increase patient access to high-quality medical care. While foundational models have stirred considerable interest in the medical community, it is unclear whether their general capabilities translate to real-world clinical utility. In this work, we show that foundation VLMs markedly underperform compared to practicing ophthalmologists on specialist tasks crucial to the care of patients with age-related macular degeneration (AMD). To address this, we initially identified the essential capabilities required for image-based clinical decision-making, and then developed a curriculum to selectively train VLMs in these skills. The resulting model, RetinaVLM, can be instructed to write reports that significantly outperform those written by leading foundation medical VLMs in disease staging (F1 score of 0.63 vs. 0.11) and patient referral (0.67 vs. 0.39), and approaches the diagnostic performance of junior ophthalmologists (who achieve 0.77 and 0.78 on the respective tasks). Furthermore, in a reader study involving two senior ophthalmologists with up to 32 years of experience, RetinaVLM's reports were found to be similarly correct (78.6% vs. 82.1%) and complete (both 78.6%) as reports written by junior ophthalmologists with up to 10 years of experience. These results demonstrate that our curriculum-based approach provides a blueprint for specializing generalist foundation medical VLMs to handle real-world clinical tasks.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge
Authors:
Hongwei Bran Li,
Fernando Navarro,
Ivan Ezhov,
Amirhossein Bayat,
Dhritiman Das,
Florian Kofler,
Suprosanna Shit,
Diana Waldmannstetter,
Johannes C. Paetzold,
Xiaobin Hu,
Benedikt Wiestler,
Lucas Zimmer,
Tamaz Amiranashvili,
Chinmay Prabhakar,
Christoph Berger,
Jonas Weidner,
Michelle Alonso-Basant,
Arif Rashid,
Ujjwal Baid,
Wesam Adel,
Deniz Ali,
Bhakti Baheti,
Yingbin Bai,
Ishaan Bhatt,
Sabri Can Cetindag
, et al. (55 additional authors not shown)
Abstract:
Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…
▽ More
Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the development and evaluation of automated segmentation algorithms. Accurately modeling and quantifying this variability is essential for enhancing the robustness and clinical applicability of these algorithms. We report the set-up and summarize the benchmark results of the Quantification of Uncertainties in Biomedical Image Quantification Challenge (QUBIQ), which was organized in conjunction with International Conferences on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2020 and 2021. The challenge focuses on the uncertainty quantification of medical image segmentation which considers the omnipresence of inter-rater variability in imaging datasets. The large collection of images with multi-rater annotations features various modalities such as MRI and CT; various organs such as the brain, prostate, kidney, and pancreas; and different image dimensions 2D-vs-3D. A total of 24 teams submitted different solutions to the problem, combining various baseline models, Bayesian neural networks, and ensemble model techniques. The obtained results indicate the importance of the ensemble models, as well as the need for further research to develop efficient 3D methods for uncertainty quantification methods in 3D segmentation tasks.
△ Less
Submitted 24 June, 2024; v1 submitted 19 March, 2024;
originally announced May 2024.
-
The role of excitation vector fields and all-polarisation state control of cavity magnonics
Authors:
Alban Joseph,
Jayakrishnan M. P. Nair,
Mawgan A. Smith,
Rory Holland,
Luke J. McLellan,
Isabella Boventer,
Tim Wolz,
Dmytro A. Bozhko,
Benedetta Flebus,
Martin P. Weides,
Rair Macedo
Abstract:
Recently the field of cavity magnonics, a field focused on controlling the interaction between magnons and confined microwave photons within microwave resonators, has drawn significant attention as it offers a platform for enabling advancements in quantum- and spin-based technologies. Here, we introduce excitation vector fields, whose polarisation and profile can be easily tuned in a two-port cavi…
▽ More
Recently the field of cavity magnonics, a field focused on controlling the interaction between magnons and confined microwave photons within microwave resonators, has drawn significant attention as it offers a platform for enabling advancements in quantum- and spin-based technologies. Here, we introduce excitation vector fields, whose polarisation and profile can be easily tuned in a two-port cavity setup, thus acting as an effective experimental knob to explore the coupled dynamics of cavity magnon-polaritons. Moreover, we develop theoretical models that accurately predict and reproduce the experimental results for any polarisation state and field profile within the cavity resonator. This versatile experimental platform offers a new avenue for controlling spin-photon interactions and as such also delivering a mechanism to readily control the exchange of information between hybrid systems.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Deep-learning-based clustering of OCT images for biomarker discovery in age-related macular degeneration (Pinnacle study report 4)
Authors:
Robbie Holland,
Rebecca Kaye,
Ahmed M. Hagag,
Oliver Leingang,
Thomas R. P. Taylor,
Hrvoje Bogunović,
Ursula Schmidt-Erfurth,
Hendrik P. N. Scholl,
Daniel Rueckert,
Andrew J. Lotery,
Sobha Sivaprasad,
Martin J. Menten
Abstract:
Diseases are currently managed by grading systems, where patients are stratified by grading systems into stages that indicate patient risk and guide clinical management. However, these broad categories typically lack prognostic value, and proposals for new biomarkers are currently limited to anecdotal observations. In this work, we introduce a deep-learning-based biomarker proposal system for the…
▽ More
Diseases are currently managed by grading systems, where patients are stratified by grading systems into stages that indicate patient risk and guide clinical management. However, these broad categories typically lack prognostic value, and proposals for new biomarkers are currently limited to anecdotal observations. In this work, we introduce a deep-learning-based biomarker proposal system for the purpose of accelerating biomarker discovery in age-related macular degeneration (AMD). It works by first training a neural network using self-supervised contrastive learning to discover, without any clinical annotations, features relating to both known and unknown AMD biomarkers present in 46,496 retinal optical coherence tomography (OCT) images. To interpret the discovered biomarkers, we partition the images into 30 subsets, termed clusters, that contain similar features. We then conduct two parallel 1.5-hour semi-structured interviews with two independent teams of retinal specialists that describe each cluster in clinical language. Overall, both teams independently identified clearly distinct characteristics in 27 of 30 clusters, of which 23 were related to AMD. Seven were recognised as known biomarkers already used in established grading systems and 16 depicted biomarker combinations or subtypes that are either not yet used in grading systems, were only recently proposed, or were unknown. Clusters separated incomplete from complete retinal atrophy, intraretinal from subretinal fluid and thick from thin choroids, and in simulation outperformed clinically-used grading systems in prognostic value. Overall, contrastive learning enabled the automatic proposal of AMD biomarkers that go beyond the set used by clinically established grading systems. Ultimately, we envision that equipping clinicians with discovery-oriented deep-learning tools can accelerate discovery of novel prognostic biomarkers.
△ Less
Submitted 12 March, 2024;
originally announced May 2024.
-
Spatiotemporal Representation Learning for Short and Long Medical Image Time Series
Authors:
Chengzhi Shen,
Martin J. Menten,
Hrvoje Bogunović,
Ursula Schmidt-Erfurth,
Hendrik Scholl,
Sobha Sivaprasad,
Andrew Lotery,
Daniel Rueckert,
Paul Hager,
Robbie Holland
Abstract:
Analyzing temporal developments is crucial for the accurate prognosis of many medical conditions. Temporal changes that occur over short time scales are key to assessing the health of physiological functions, such as the cardiac cycle. Moreover, tracking longer term developments that occur over months or years in evolving processes, such as age-related macular degeneration (AMD), is essential for…
▽ More
Analyzing temporal developments is crucial for the accurate prognosis of many medical conditions. Temporal changes that occur over short time scales are key to assessing the health of physiological functions, such as the cardiac cycle. Moreover, tracking longer term developments that occur over months or years in evolving processes, such as age-related macular degeneration (AMD), is essential for accurate prognosis. Despite the importance of both short and long term analysis to clinical decision making, they remain understudied in medical deep learning. State of the art methods for spatiotemporal representation learning, developed for short natural videos, prioritize the detection of temporal constants rather than temporal developments. Moreover, they do not account for varying time intervals between acquisitions, which are essential for contextualizing observed changes. To address these issues, we propose two approaches. First, we combine clip-level contrastive learning with a novel temporal embedding to adapt to irregular time series. Second, we propose masking and predicting latent frame representations of the temporal sequence. Our two approaches outperform all prior methods on temporally-dependent tasks including cardiac output estimation and three prognostic AMD tasks. Overall, this enables the automated analysis of temporal patterns which are typically overlooked in applications of deep learning to medicine.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
The Philosopher's Stone: Trojaning Plugins of Large Language Models
Authors:
Tian Dong,
Minhui Xue,
Guoxing Chen,
Rayne Holland,
Shaofeng Li,
Yan Meng,
Zhen Liu,
Haojin Zhu
Abstract:
Open-source Large Language Models (LLMs) have recently gained popularity because of their comparable performance to proprietary LLMs. To efficiently fulfill domain-specialized tasks, open-source LLMs can be refined, without expensive accelerators, using low-rank adapters. However, it is still unknown whether low-rank adapters can be exploited to control LLMs. To address this gap, we demonstrate th…
▽ More
Open-source Large Language Models (LLMs) have recently gained popularity because of their comparable performance to proprietary LLMs. To efficiently fulfill domain-specialized tasks, open-source LLMs can be refined, without expensive accelerators, using low-rank adapters. However, it is still unknown whether low-rank adapters can be exploited to control LLMs. To address this gap, we demonstrate that an infected adapter can induce, on specific triggers, an LLM to output content defined by an adversary and to even maliciously use tools. To train a Trojan adapter, we propose two novel attacks, POLISHED and FUSION, that improve over prior approaches. POLISHED uses LLM-enhanced paraphrasing to polish benchmark poisoned datasets. In contrast, in the absence of a dataset, FUSION leverages an over-poisoning procedure to transform a benign adaptor. In our experiments, we first conduct two case studies to demonstrate that a compromised LLM agent can execute malware to control system (e.g., LLM-driven robot) or launch a spear-phishing attack. Then, in terms of targeted misinformation, we show that our attacks provide higher attack effectiveness than the baseline and, for the purpose of attracting downloads, preserve or improve the adapter's utility. Finally, we design and evaluate three potential defenses, yet none proved entirely effective in safeguarding against our attacks.
△ Less
Submitted 13 March, 2024; v1 submitted 1 December, 2023;
originally announced December 2023.
-
Matrix-analytic methods for the evolution of species trees, gene trees, and their reconciliation
Authors:
Albert C. Soewongsono,
Jiahao Diao,
Tristan Stark,
Amanda E. Wilson,
David A. Liberles,
Barbara R. Holland,
Malgorzata M. O'Reilly
Abstract:
We consider the reconciliation problem, in which the task is to find a mapping of a gene tree into a species tree, so as to maximize the likelihood of such fitting, given the available data. We describe a model for the evolution of the species tree, a subfunctionalisation model for the evolution of the gene tree, and provide an algorithm to compute the likelihood of the reconciliation. We derive o…
▽ More
We consider the reconciliation problem, in which the task is to find a mapping of a gene tree into a species tree, so as to maximize the likelihood of such fitting, given the available data. We describe a model for the evolution of the species tree, a subfunctionalisation model for the evolution of the gene tree, and provide an algorithm to compute the likelihood of the reconciliation. We derive our results using the theory of matrix-analytic methods and describe efficient algorithms for the computation of a range of useful metrics. We illustrate the theory with examples and provide the physical interpretations of the discussed quantities, with a focus on the practical applications of the theory to incomplete data.
△ Less
Submitted 8 November, 2023; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Modelling gene content across a phylogeny to determine when genes become associated
Authors:
Jiahao Diao,
Malgorzata M. O'Reilly,
Barbara R. Holland
Abstract:
In this work, we develop a stochastic model of gene gain and loss with the aim of inferring when (if at all) in evolutionary history and association between two genes arises. The data we consider is a species tree along with information on the presence or absence of two genes in each of the species. The biological motivation for our model is that if two genes are involved in the same biochemical p…
▽ More
In this work, we develop a stochastic model of gene gain and loss with the aim of inferring when (if at all) in evolutionary history and association between two genes arises. The data we consider is a species tree along with information on the presence or absence of two genes in each of the species. The biological motivation for our model is that if two genes are involved in the same biochemical pathway, i.e. they are both required for some function, then the rate of gain or loss of one gene in the pathway should depend upon the presence or absence of the other gene in the pathway. However, if the two genes are not functionally linked, then the rate of gain or loss of one gene should be independent of the state of another gene.
We simulate data under this model to determine under what conditions a shift from the independent rates class to the dependent rates class can be detected. For example, how large a tree is required and how large a shift in the rates is needed before Akaike information criterion (AIC) supports a model with two rate classes over a simpler model with just one rate class? If a model with two rate classes is preferred, can it correctly detect where on the evolutionary tree the shift occurred?
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Stochastic niche-based models for the evolution of species
Authors:
Albert Ch. Soewongsono,
Barbara R. Holland,
Malgorzata M. O'Reilly
Abstract:
There have been many studies to examine whether one trait is correlated with another trait across a group of present-day species (for example, do species with larger brains tend to have longer gestation times. Since the introduction of the phylogenetic comparative method some authors have argued that it is necessary to have a biologically realistic model to generate evolutionary trees that incorpo…
▽ More
There have been many studies to examine whether one trait is correlated with another trait across a group of present-day species (for example, do species with larger brains tend to have longer gestation times. Since the introduction of the phylogenetic comparative method some authors have argued that it is necessary to have a biologically realistic model to generate evolutionary trees that incorporates information about the ecological niche occupied by species. Price presented a simple model along these lines in 1997. He defined a two-dimensional niche space formed by two continuous-valued traits, in which new niches arise with trait values drawn from a bivariate normal distribution. When a new niche arises, it is occupied by a descendant species of whichever current species is closest in ecological niche space. In sequence, more species are then evolved from already-existing species to which they are ecologically closest.
Here we explore ways of extending Price's adaptive radiation model. One extension is to increase the dimensionality of the niche space by considering more than two continuous traits. A second extension is to allow both extinction of species (which may leave unoccupied niches) and removal of niches (which causes species occupying them to go extinct). To model this problem, we consider a continuous-time stochastic process which implicitly defines a phylogeny. To explore if trees generated under such a model (or under different parametrizations of the model) are realistic we can compute a variety of summary statistics that can be compared to those of empirically observed phylogenies. For example, there are existing statistics that aim to measure: tree balance, the relative rate of diversification, and phylogenetic signal of traits.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
A skeletonization algorithm for gradient-based optimization
Authors:
Martin J. Menten,
Johannes C. Paetzold,
Veronika A. Zimmer,
Suprosanna Shit,
Ivan Ezhov,
Robbie Holland,
Monika Probst,
Julia A. Schnabel,
Daniel Rueckert
Abstract:
The skeleton of a digital image is a compact representation of its topology, geometry, and scale. It has utility in many computer vision applications, such as image description, segmentation, and registration. However, skeletonization has only seen limited use in contemporary deep learning solutions. Most existing skeletonization algorithms are not differentiable, making it impossible to integrate…
▽ More
The skeleton of a digital image is a compact representation of its topology, geometry, and scale. It has utility in many computer vision applications, such as image description, segmentation, and registration. However, skeletonization has only seen limited use in contemporary deep learning solutions. Most existing skeletonization algorithms are not differentiable, making it impossible to integrate them with gradient-based optimization. Compatible algorithms based on morphological operations and neural networks have been proposed, but their results often deviate from the geometry and topology of the true medial axis. This work introduces the first three-dimensional skeletonization algorithm that is both compatible with gradient-based optimization and preserves an object's topology. Our method is exclusively based on matrix additions and multiplications, convolutional operations, basic non-linear functions, and sampling from a uniform probability distribution, allowing it to be easily implemented in any major deep learning library. In benchmarking experiments, we prove the advantages of our skeletonization algorithm compared to non-differentiable, morphological, and neural-network-based baselines. Finally, we demonstrate the utility of our algorithm by integrating it with two medical image processing applications that use gradient-based optimization: deep-learning-based blood vessel segmentation, and multimodal registration of the mandible in computed tomography and magnetic resonance images.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
Clustering disease trajectories in contrastive feature space for biomarker discovery in age-related macular degeneration
Authors:
Robbie Holland,
Oliver Leingang,
Christopher Holmes,
Philipp Anders,
Rebecca Kaye,
Sophie Riedl,
Johannes C. Paetzold,
Ivan Ezhov,
Hrvoje Bogunović,
Ursula Schmidt-Erfurth,
Lars Fritsche,
Hendrik P. N. Scholl,
Sobha Sivaprasad,
Andrew J. Lotery,
Daniel Rueckert,
Martin J. Menten
Abstract:
Age-related macular degeneration (AMD) is the leading cause of blindness in the elderly. Current grading systems based on imaging biomarkers only coarsely group disease stages into broad categories and are unable to predict future disease progression. It is widely believed that this is due to their focus on a single point in time, disregarding the dynamic nature of the disease. In this work, we pr…
▽ More
Age-related macular degeneration (AMD) is the leading cause of blindness in the elderly. Current grading systems based on imaging biomarkers only coarsely group disease stages into broad categories and are unable to predict future disease progression. It is widely believed that this is due to their focus on a single point in time, disregarding the dynamic nature of the disease. In this work, we present the first method to automatically discover biomarkers that capture temporal dynamics of disease progression. Our method represents patient time series as trajectories in a latent feature space built with contrastive learning. Then, individual trajectories are partitioned into atomic sub-sequences that encode transitions between disease states. These are clustered using a newly introduced distance metric. In quantitative experiments we found our method yields temporal biomarkers that are predictive of conversion to late AMD. Furthermore, these clusters were highly interpretable to ophthalmologists who confirmed that many of the clusters represent dynamics that have previously been linked to the progression of AMD, even though they are currently not included in any clinical grading system.
△ Less
Submitted 20 March, 2023; v1 submitted 11 January, 2023;
originally announced January 2023.
-
Metadata-enhanced contrastive learning from retinal optical coherence tomography images
Authors:
Robbie Holland,
Oliver Leingang,
Hrvoje Bogunović,
Sophie Riedl,
Lars Fritsche,
Toby Prevost,
Hendrik P. N. Scholl,
Ursula Schmidt-Erfurth,
Sobha Sivaprasad,
Andrew J. Lotery,
Daniel Rueckert,
Martin J. Menten
Abstract:
Deep learning has potential to automate screening, monitoring and grading of disease in medical images. Pretraining with contrastive learning enables models to extract robust and generalisable features from natural image datasets, facilitating label-efficient downstream image analysis. However, the direct application of conventional contrastive methods to medical datasets introduces two domain-spe…
▽ More
Deep learning has potential to automate screening, monitoring and grading of disease in medical images. Pretraining with contrastive learning enables models to extract robust and generalisable features from natural image datasets, facilitating label-efficient downstream image analysis. However, the direct application of conventional contrastive methods to medical datasets introduces two domain-specific issues. Firstly, several image transformations which have been shown to be crucial for effective contrastive learning do not translate from the natural image to the medical image domain. Secondly, the assumption made by conventional methods, that any two images are dissimilar, is systematically misleading in medical datasets depicting the same anatomy and disease. This is exacerbated in longitudinal image datasets that repeatedly image the same patient cohort to monitor their disease progression over time. In this paper we tackle these issues by extending conventional contrastive frameworks with a novel metadata-enhanced strategy. Our approach employs widely available patient metadata to approximate the true set of inter-image contrastive relationships. To this end we employ records for patient identity, eye position (i.e. left or right) and time series information. In experiments using two large longitudinal datasets containing 170,427 retinal OCT images of 7,912 patients with age-related macular degeneration (AMD), we evaluate the utility of using metadata to incorporate the temporal dynamics of disease progression into pretraining. Our metadata-enhanced approach outperforms both standard contrastive methods and a retinal image foundation model in five out of six image-level downstream tasks related to AMD. Due to its modularity, our method can be quickly and cost-effectively tested to establish the potential benefits of including available metadata in contrastive pretraining.
△ Less
Submitted 14 December, 2023; v1 submitted 4 August, 2022;
originally announced August 2022.
-
Analysis of the first Genetic Engineering Attribution Challenge
Authors:
Oliver M. Crook,
Kelsey Lane Warmbrod,
Greg Lipstein,
Christine Chung,
Christopher W. Bakerlee,
T. Greg McKelvey Jr.,
Shelly R. Holland,
Jacob L. Swett,
Kevin M. Esvelt,
Ethan C. Alley,
William J. Bradshaw
Abstract:
The ability to identify the designer of engineered biological sequences -- termed genetic engineering attribution (GEA) -- would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Engineering Attribution Challenge, a public data-science competition to advance GEA. Top-scoring…
▽ More
The ability to identify the designer of engineered biological sequences -- termed genetic engineering attribution (GEA) -- would help ensure due credit for biotechnological innovation, while holding designers accountable to the communities they affect. Here, we present the results of the first Genetic Engineering Attribution Challenge, a public data-science competition to advance GEA. Top-scoring teams dramatically outperformed previous models at identifying the true lab-of-origin of engineered sequences, including an increase in top-1 and top-10 accuracy of 10 percentage points. A simple ensemble of prizewinning models further increased performance. New metrics, designed to assess a model's ability to confidently exclude candidate labs, also showed major improvements, especially for the ensemble. Most winning teams adopted CNN-based machine-learning approaches; however, one team achieved very high accuracy with an extremely fast neural-network-free approach. Future work, including future competitions, should further explore a wide diversity of approaches for bringing GEA technology into practical use.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
The Shape of Phylogenies Under Phase-Type Distributed Times to Speciation and Extinction
Authors:
Albert Ch. Soewongsono,
Barbara R. Holland,
Małgorzata M. O'Reilly
Abstract:
Phylogenetic trees are widely used to understand the evolutionary history of organisms. Tree shapes provide information about macroevolutionary processes. However, macroevolutionary models are unreliable for inferring the true processes underlying empirical trees. Here, we propose a flexible and biologically plausible macroevolutionary model for phylogenetic trees where times to speciation or exti…
▽ More
Phylogenetic trees are widely used to understand the evolutionary history of organisms. Tree shapes provide information about macroevolutionary processes. However, macroevolutionary models are unreliable for inferring the true processes underlying empirical trees. Here, we propose a flexible and biologically plausible macroevolutionary model for phylogenetic trees where times to speciation or extinction events are drawn from a Coxian phase-type (PH) distribution. First, we show that different choices of parameters in our model lead to a range of tree balances as measured by Aldous' $β$ statistic. In particular, we demonstrate that it is possible to find parameters that correspond well to empirical tree balance. Next, we provide a natural extension of the $β$ statistic to sets of trees. This extension produces less biased estimates of $β$ compared to using the median $β$ values from individual trees. Furthermore, we derive a likelihood expression for the probability of observing any tree with branch lengths under a model with speciation but no extinction. Finally, we illustrate the application of our model by performing both absolute and relative goodness-of-fit tests for two large empirical phylogenies (squamates and angiosperms) that compare models with Coxian PH distributed times to speciation with models that assume exponential or Weibull distributed waiting times. In our numerical analysis, we found that, in most cases, models assuming a Coxian PH distribution provided the best fit.
△ Less
Submitted 8 October, 2021;
originally announced October 2021.
-
Strong magnon-photon coupling with chip-integrated YIG in the zero-temperature limit
Authors:
Paul G. Baity,
Dmytro A. Bozhko,
Rair Macêdo,
William Smith,
Rory C. Holland,
Sergey Danilin,
Valentino Seferai,
João Barbosa,
Renju R. Peroor,
Sara Goldman,
Umberto Nasti,
Jharna Paul,
Robert H. Hadfield,
Stephen McVitie,
Martin Weides
Abstract:
The cross-integration of spin-wave and superconducting technologies is a promising method for creating novel hybrid devices for future information processing technologies to store, manipulate, or convert data in both classical and quantum regimes. Hybrid magnon-polariton systems have been widely studied using bulk Yttrium Iron Garnet (Y$_{3}$Fe$_{5}$O$_{12}$, YIG) and three-dimensional microwave p…
▽ More
The cross-integration of spin-wave and superconducting technologies is a promising method for creating novel hybrid devices for future information processing technologies to store, manipulate, or convert data in both classical and quantum regimes. Hybrid magnon-polariton systems have been widely studied using bulk Yttrium Iron Garnet (Y$_{3}$Fe$_{5}$O$_{12}$, YIG) and three-dimensional microwave photon cavities. However, limitations in YIG growth have thus far prevented its incorporation into CMOS compatible technology such as high quality factor superconducting quantum technology. To overcome this impediment, we have used Plasma Focused Ion Beam (PFIB) technology -- taking advantage of precision placement down to the micron-scale -- to integrate YIG with superconducting microwave devices. Ferromagnetic resonance has been measured at millikelvin temperatures on PFIB-processed YIG samples using planar microwave circuits. Furthermore, we demonstrate strong coupling between superconducting resonator and YIG ferromagnetic resonance modes by maintaining reasonably low loss while reducing the system down to the micron scale. This achievement of strong coupling on-chip is a crucial step toward fabrication of functional hybrid quantum devices that advantage from spin-wave and superconducting components.
△ Less
Submitted 14 June, 2021; v1 submitted 16 April, 2021;
originally announced April 2021.
-
An Electromagnetic Approach to Cavity Spintronics
Authors:
Rair Macêdo,
Rory C. Holland,
Paul G. Baity,
Luke J. McLellan,
Karen L. Livesey,
Robert L. Stamps,
Martin P. Weides,
Dmytro A. Bozhko
Abstract:
The fields of cavity quantum electrodynamics and magnetism have recently merged into \textit{`cavity spintronics'}, investigating a quasiparticle that emerges from the strong coupling between standing electromagnetic waves confined in a microwave cavity resonator and the quanta of spin waves, magnons. This phenomenon is now expected to be employed in a variety of devices for applications ranging f…
▽ More
The fields of cavity quantum electrodynamics and magnetism have recently merged into \textit{`cavity spintronics'}, investigating a quasiparticle that emerges from the strong coupling between standing electromagnetic waves confined in a microwave cavity resonator and the quanta of spin waves, magnons. This phenomenon is now expected to be employed in a variety of devices for applications ranging from quantum communication to dark matter detection. To be successful, most of these applications require a vast control of the coupling strength, resulting in intensive efforts to understanding coupling by a variety of different approaches. Here, the electromagnetic properties of both resonator and magnetic samples are investigated to provide a comprehensive understanding of the coupling between these two systems. Because the coupling is a consequence of the excitation vector fields, which directly interact with magnetisation dynamics, a highly-accurate electromagnetic perturbation theory is employed which allows for predicting the resonant hybrid mode frequencies for any field configuration within the cavity resonator, without any fitting parameters. The coupling is shown to be strongly dependent not only on the excitation vector fields and sample's magnetic properties but also on the sample's shape. These findings are illustrated by applying the theoretical framework to two distinct experiments: a magnetic sphere placed in a three-dimensional resonator, and a rectangular, magnetic prism placed on a two-dimensional resonator. The theory provides comprehensive understanding of the overall behaviour of strongly coupled systems and it can be easily modified for a variety of other systems.
△ Less
Submitted 25 October, 2020; v1 submitted 22 July, 2020;
originally announced July 2020.
-
Topography generation by melting and freezing in a turbulent shear flow
Authors:
Louis-Alexandre Couston,
Eric Hester,
Benjamin Favier,
John R. Taylor,
Paul R. Holland,
Adrian Jenkins
Abstract:
We report an idealized numerical study of a melting and freezing solid adjacent to a turbulent, buoyancy-affected shear flow, in order to improve our understanding of topography generation by phase changes in the environment. We use the phase-field method to dynamically couple the heat equation for the solid with the Navier-Stokes equations for the fluid. We investigate the evolution of an initial…
▽ More
We report an idealized numerical study of a melting and freezing solid adjacent to a turbulent, buoyancy-affected shear flow, in order to improve our understanding of topography generation by phase changes in the environment. We use the phase-field method to dynamically couple the heat equation for the solid with the Navier-Stokes equations for the fluid. We investigate the evolution of an initially-flat and horizontal solid boundary overlying a pressure-driven turbulent flow. We assume a linear equation of state for the fluid and change the sign of the thermal expansion coefficient, such that the background density stratification is either stable, neutral or unstable. We find that channels aligned with the direction of the mean flow are generated spontaneously by phase changes at the fluid-solid interface. Streamwise vortices in the fluid, the interface topography and the temperature field in the solid influence each other and adjust until a statistical steady state is obtained. The crest-to-trough amplitude of the channels are larger than about 10$δ_ν$ in all cases, with $δ_ν$ the viscous length scale, but are much larger and more persistent for an unstable stratification than for a neutral or stable stratification. This happens because a stable stratification makes the cool melt fluid buoyant such that it shields the channel from further melting, whereas an unstable stratification makes the cool melt fluid sink, inducing further melting by rising hot plumes. The statistics of flow velocities and melt rates are investigated, and we find that channels and keels emerging in our simulations do not significantly change the mean drag coefficient.
△ Less
Submitted 9 October, 2020; v1 submitted 21 April, 2020;
originally announced April 2020.
-
The ancient Operational Code is embedded in the amino acid substitution matrix and aaRS phylogenies
Authors:
Julia A. Shore,
Barbara R. Holland,
Jeremy G. Sumner,
Kay Nieselt,
Peter R. Wills
Abstract:
The underlying structure of the canonical amino acid substitution matrix (aaSM) is examined by considering stepwise improvements in the differential recognition of amino acids according to their chemical properties during the branching history of the two aminoacyl-tRNA synthetase (aaRS) superfamilies. The evolutionary expansion of the genetic code is described by a simple parameterization of the a…
▽ More
The underlying structure of the canonical amino acid substitution matrix (aaSM) is examined by considering stepwise improvements in the differential recognition of amino acids according to their chemical properties during the branching history of the two aminoacyl-tRNA synthetase (aaRS) superfamilies. The evolutionary expansion of the genetic code is described by a simple parameterization of the aaSM, in which (i) the number of distinguishable amino acid types, (ii) the matrix dimension, and (iii) the number of parameters, each increases by one for each bifurcation in an aaRS phylogeny. Parameterized matrices corresponding to trees in which the size of an amino acid sidechain is the only discernible property behind its categorization as a substrate, exclusively for a Class I or II aaRS, provide a significantly better fit to empirically determined aaSM than trees with random bifurcation patterns. A second split between polar and nonpolar amino acids in each Class effects a vastly greater further improvement. The earliest Class-separated epochs in the phylogenies of the aaRS reflect these enzymes' capability to distinguish tRNAs through the recognition of acceptor stem identity elements via the minor (Class I) and major (Class II) helical grooves, which is how the ancient Operational Code functioned. The advent of tRNA recognition using the anticodon loop supports the evolution of the optimal map of amino acid chemistry found in the later Genetic Code, an essentially digital categorization, in which polarity is the major functional property, compensating for the unrefined, haphazard differentiation of amino acids achieved by the Operational Code.
△ Less
Submitted 10 December, 2019;
originally announced December 2019.
-
Automatic Detection of Bowel Disease with Residual Networks
Authors:
Robert Holland,
Uday Patel,
Phillip Lung,
Elisa Chotzoglou,
Bernhard Kainz
Abstract:
Crohn's disease, one of two inflammatory bowel diseases (IBD), affects 200,000 people in the UK alone, or roughly one in every 500. We explore the feasibility of deep learning algorithms for identification of terminal ileal Crohn's disease in Magnetic Resonance Enterography images on a small dataset. We show that they provide comparable performance to the current clinical standard, the MaRIA score…
▽ More
Crohn's disease, one of two inflammatory bowel diseases (IBD), affects 200,000 people in the UK alone, or roughly one in every 500. We explore the feasibility of deep learning algorithms for identification of terminal ileal Crohn's disease in Magnetic Resonance Enterography images on a small dataset. We show that they provide comparable performance to the current clinical standard, the MaRIA score, while requiring only a fraction of the preparation and inference time. Moreover, bowels are subject to high variation between individuals due to the complex and free-moving anatomy. Thus we also explore the effect of difficulty of the classification at hand on performance. Finally, we employ soft attention mechanisms to amplify salient local features and add interpretability.
△ Less
Submitted 31 August, 2019;
originally announced September 2019.
-
The impracticalities of multiplicatively-closed codon models: a retreat to linear alternatives
Authors:
Julia A. Shore,
Jeremy G. Sumner,
Barbara R. Holland
Abstract:
A matrix Lie algebra is a linear space of matrices closed under the operation $ [A, B] = AB-BA $. The "Lie closure" of a set of matrices is the smallest matrix Lie algebra which contains the set. In the context of Markov chain theory, if a set of rate matrices form a Lie algebra, their corresponding Markov matrices are closed under matrix multiplication; this has been found to be a useful property…
▽ More
A matrix Lie algebra is a linear space of matrices closed under the operation $ [A, B] = AB-BA $. The "Lie closure" of a set of matrices is the smallest matrix Lie algebra which contains the set. In the context of Markov chain theory, if a set of rate matrices form a Lie algebra, their corresponding Markov matrices are closed under matrix multiplication; this has been found to be a useful property in phylogenetics. Inspired by previous research involving Lie closures of DNA models, it was hypothesised that finding the Lie closure of a codon model could help to solve the problem of mis-estimation of the non-synonymous/synonymous rate ratio, $ ω$. We propose two different methods of finding a linear space from a model: the first is the \emph{linear closure} which is the smallest linear space which contains the model, and the second is the \emph{linear version} which changes multiplicative constraints in the model to additive ones. For each of these linear spaces we then find the Lie closures of them. Under both methods, it was found that closed codon models would require thousands of parameters, and that any partial solution to this problem that was of a reasonable size violated stochasticity. Investigation of toy models indicated that finding the Lie closure of matrix linear spaces which deviated only slightly from a simple model resulted in a Lie closure that was close to having the maximum number of parameters possible. Given that Lie closures are not practical, we propose further consideration of the two variants of linearly closed models.
△ Less
Submitted 5 August, 2020; v1 submitted 25 April, 2018;
originally announced April 2018.
-
Exploring the consequences of lack of closure in codon models
Authors:
Michael D. Woodhams,
Jeremy G. Sumner,
David A. Liberles,
Michael A. Charleston,
Barbara R. Holland
Abstract:
Models of codon evolution are commonly used to identify positive selection. Positive selection is typically a heterogeneous process, i.e., it acts on some branches of the evolutionary tree and not others. Previous work on DNA models showed that when evolution occurs under a heterogeneous process it is important to consider the property of model closure, because non-closed models can give biased es…
▽ More
Models of codon evolution are commonly used to identify positive selection. Positive selection is typically a heterogeneous process, i.e., it acts on some branches of the evolutionary tree and not others. Previous work on DNA models showed that when evolution occurs under a heterogeneous process it is important to consider the property of model closure, because non-closed models can give biased estimates of evolutionary processes. The existing codon models that account for the genetic code are not closed; to establish this it is enough to show that they are not linear (meaning that the sum of two codon rate matrices in the model is not a matrix in the model). This raises the concern that a single codon model fit to a heterogeneous process might mis-estimate both the effect of selection and branch lengths.
Codon models are typically constructed by choosing an underlying DNA model (e.g., HKY) that acts identically and independently at each codon position, and then applying the genetic code via the parameter $ω$ to modify the rate of transitions between codons that code for different amino acids. Here we use simulation to investigate the accuracy of estimation of both the selection parameter $ω$ and branch lengths in cases where the underlying DNA process is heterogeneous but $ω$ is constant. We find that both $ω$ and branch lengths can be mis-estimated in these scenarios. Errors in $ω$ were usually less than 2% but could be as high as 17%. We also assessed if choosing different underlying DNA models had any affect on accuracy, in particular we assessed if using closed DNA models gave any advantage. However, a DNA model being closed does not imply that the codon model constructed from it is closed, and in general we found that using closed DNA models did not decrease errors in the estimation of $ω$.
△ Less
Submitted 15 September, 2017;
originally announced September 2017.
-
Distinguishing between convergent evolution and violation of the molecular clock
Authors:
Jonathan D. Mitchell,
Jeremy G. Sumner,
Barbara R. Holland
Abstract:
We give a non-technical introduction to convergence-divergence models, a new modeling approach for phylogenetic data that allows for the usual divergence of species post speciation but also allows for species to converge, i.e. become more similar over time. By examining the $3$-taxon case in some detail we illustrate that phylogeneticists have been "spoiled" in the sense of not having to think abo…
▽ More
We give a non-technical introduction to convergence-divergence models, a new modeling approach for phylogenetic data that allows for the usual divergence of species post speciation but also allows for species to converge, i.e. become more similar over time. By examining the $3$-taxon case in some detail we illustrate that phylogeneticists have been "spoiled" in the sense of not having to think about the structural parameters in their models by virtue of the strong assumption that evolution is treelike. We show that there are not always good statistical reasons to prefer the usual class of treelike models over more general convergence-divergence models. Specifically we show many $3$-taxon datasets can be equally well explained by supposing violation of the molecular clock due to change in the rate of evolution along different edges, or by keeping the assumption of a constant rate of evolution but instead assuming that evolution is not a purely divergent process. Given the abundance of evidence that evolution is not strictly treelike, our discussion is an illustration that as phylogeneticists we often need to think clearly about the structural form of the models we use.
△ Less
Submitted 13 September, 2017;
originally announced September 2017.
-
Developing a statistically powerful measure for quartet tree inference using phylogenetic identities and Markov invariants
Authors:
Jeremy G Sumner,
Amelia Taylor,
Barbara R Holland,
Peter D Jarvis
Abstract:
Recently there has been renewed interest in phylogenetic inference methods based on phylogenetic invariants, alongside the related Markov invariants. Broadly speaking, both these approaches give rise to polynomial functions of sequence site patterns that, in expectation value, either vanish for particular evolutionary trees (in the case of phylogenetic invariants) or have well understood transform…
▽ More
Recently there has been renewed interest in phylogenetic inference methods based on phylogenetic invariants, alongside the related Markov invariants. Broadly speaking, both these approaches give rise to polynomial functions of sequence site patterns that, in expectation value, either vanish for particular evolutionary trees (in the case of phylogenetic invariants) or have well understood transformation properties (in the case of Markov invariants). While both approaches have been valued for their intrinsic mathematical interest, it is not clear how they relate to each other, and to what extent they can be used as practical tools for inference of phylogenetic trees.
In this paper, by focusing on the special case of binary sequence data and quartets of taxa, we are able to view these two different polynomial-based approaches within a common framework. To motivate the discussion, we present three desirable statistical properties that we argue any phylogenetic method should satisfy: (1) sensible behaviour under reordering of input sequences; (2) stability as the taxa evolve independently according to a Markov process; and (3) ability to detect if the conditions of a continuous-time process are violated. Motivated by these statistical properties, we develop and explore several new phylogenetic inference methods. In particular, we develop a statistical bias-corrected version of the Markov invariants approach which satisfies all three properties. We also extend previous work by showing that the phylogenetic invariants can be implemented in such a way as to satisfy property (3). A simulation study shows that, in comparison to other methods, our new proposed approach based on bias-corrected Markov invariants is extremely powerful for phylogenetic inference.
△ Less
Submitted 29 March, 2017; v1 submitted 16 August, 2016;
originally announced August 2016.
-
Comparison of three Statistical Classification Techniques for Maser Identification
Authors:
Ellen M. Manning,
Barbara R. Holland,
Simon P. Ellingsen,
Shari L. Breen,
Xi Chen,
Melissa Humphries
Abstract:
We applied three statistical classification techniques - linear discriminant analysis (LDA), logistic regression and random forests - to three astronomical datasets associated with searches for interstellar masers. We compared the performance of these methods in identifying whether specific mid-infrared or millimetre continuum sources are likely to have associated interstellar masers. We also disc…
▽ More
We applied three statistical classification techniques - linear discriminant analysis (LDA), logistic regression and random forests - to three astronomical datasets associated with searches for interstellar masers. We compared the performance of these methods in identifying whether specific mid-infrared or millimetre continuum sources are likely to have associated interstellar masers. We also discuss the ease, or otherwise, with which the results of each classification technique can be interpreted. Non-parametric methods have the potential to make accurate predictions when there are complex relationships between critical parameters. We found that for the small datasets the parametric methods logistic regression and LDA performed best, for the largest dataset the non-parametric method of random forests performed with comparable accuracy to parametric techniques, rather than any significant improvement. This suggests that at least for the specific examples investigated here accuracy of the predictions obtained is not being limited by the use of parametric models. We also found that for LDA, transformation of the data to match a normal distribution in the input parameters led to big improvements in accuracy. The different classification techniques had significant overlap in their predictions, further astronomical observations will enable the accuracy of these predictions to be tested.
△ Less
Submitted 21 March, 2016;
originally announced March 2016.
-
Maximum likelihood estimates of pairwise rearrangement distances
Authors:
Stuart Serdoz,
Attila Egri-Nagy,
Jeremy Sumner,
Barbara R. Holland,
Peter D. Jarvis,
Mark M. Tanaka,
Andrew R. Francis
Abstract:
Accurate estimation of evolutionary distances between taxa is important for many phylogenetic reconstruction methods. In the case of bacteria, distances can be estimated using a range of different evolutionary models, from single nucleotide polymorphisms to large-scale genome rearrangements. In the case of sequence evolution models (such as the Jukes-Cantor model and associated metric) have been u…
▽ More
Accurate estimation of evolutionary distances between taxa is important for many phylogenetic reconstruction methods. In the case of bacteria, distances can be estimated using a range of different evolutionary models, from single nucleotide polymorphisms to large-scale genome rearrangements. In the case of sequence evolution models (such as the Jukes-Cantor model and associated metric) have been used to correct pairwise distances. Similar correction methods for genome rearrangement processes are required to improve inference. Current attempts at correction fall into 3 categories: Empirical computational studies, Bayesian/MCMC approaches, and combinatorial approaches. Here we introduce a maximum likelihood estimator for the inversion distance between a pair of genomes, using the group-theoretic approach to modelling inversions introduced recently. This MLE functions as a corrected distance: in particular, we show that because of the way sequences of inversions interact with each other, it is quite possible for minimal distance and MLE distance to differently order the distances of two genomes from a third. This has obvious implications for the use of minimal distance in phylogeny reconstruction. The work also tackles the above problem allowing free rotation of the genome. Generally a frame of reference is locked, and all computation made accordingly. This work incorporates the action of the dihedral group so that distance estimates are free from any a priori frame of reference.
△ Less
Submitted 14 April, 2017; v1 submitted 11 February, 2016;
originally announced February 2016.
-
Component masses of young, wide, non-magnetic white dwarf binaries in the SDSS DR7
Authors:
R. B. Baxter,
P. D. Dobbie,
Q. A. Parker,
S. L. Casewell,
N. Lodieu,
M. R. Burleigh,
K. A. Lawrie,
B. Kulebi,
D. Koester,
B. R. Holland
Abstract:
We present a spectroscopic component analysis of 18 candidate young, wide, non-magnetic, double-degenerate binaries identified from a search of the Sloan Digital Sky Survey Data Release 7 (DR7). All but two pairings are likely to be physical systems. We show SDSS J084952.47+471247.7 + SDSS J084952.87+471249.4 to be a wide DA+DB binary, only the second identified to date. Combining our measurements…
▽ More
We present a spectroscopic component analysis of 18 candidate young, wide, non-magnetic, double-degenerate binaries identified from a search of the Sloan Digital Sky Survey Data Release 7 (DR7). All but two pairings are likely to be physical systems. We show SDSS J084952.47+471247.7 + SDSS J084952.87+471249.4 to be a wide DA+DB binary, only the second identified to date. Combining our measurements for the components of 16 new binaries with results for three similar, previously known systems within the DR7, we have constructed a mass distribution for the largest sample to date (38) of white dwarfs in young, wide, non-magnetic, double-degenerate pairings. This is broadly similar in form to that of the isolated field population with a substantial peak around M~0.6 Msun. We identify an excess of ultra-massive white dwarfs and attribute this to the primordial separation distribution of their progenitor systems peaking at relatively larger values and the greater expansion of their binary orbits during the final stages of stellar evolution. We exploit this mass distribution to probe the origins of unusual types of degenerates, confirming a mild preference for the progenitor systems of high-field-magnetic white dwarfs, at least within these binaries, to be associated with early-type stars. Additionally, we consider the 19 systems in the context of the stellar initial mass-final mass relation. None appear to be strongly discordant with current understanding of this relationship.
△ Less
Submitted 17 March, 2014;
originally announced March 2014.
-
Ultrasonic Attenuation and Speed of Sound of Cornstarch Suspensions
Authors:
Benjamin L. Johnson,
Mark R. Holland,
James G. Miller,
Jonathan I. Katz
Abstract:
The goal of this study is to contribute to the physics underlying the material properties of suspensions that exhibit shear thickening through the ultrasonic characterization of suspensions of cornstarch in a density-matched solution. Ultrasonic measurements at frequencies in the range of 4 to 8 MHz of the speed of sound and the frequency-dependent attenuation properties are reported for concentra…
▽ More
The goal of this study is to contribute to the physics underlying the material properties of suspensions that exhibit shear thickening through the ultrasonic characterization of suspensions of cornstarch in a density-matched solution. Ultrasonic measurements at frequencies in the range of 4 to 8 MHz of the speed of sound and the frequency-dependent attenuation properties are reported for concentrations of cornstarch in a density-matched aqueous (cesium chloride brine) suspension, ranging up to 40% cornstarch. The speed of sound is found to range from 1483 +/- 10 m/s in pure brine to 1765 +/- 9 m/s in the 40% cornstarch suspension. The bulk modulus of a granule of cornstarch is inferred to be (1.2 +/- 0.1) X 10^{10} Pa. The attenuation coefficient at 5 MHz increases from essentially zero in brine to 12.0 +/- 1.2 dB/cm at 40% cornstarch.
△ Less
Submitted 20 March, 2013;
originally announced March 2013.
-
A tensorial approach to the inversion of group-based phylogenetic models
Authors:
Jeremy G. Sumner,
Peter D. Jarvis,
Barbara R. Holland
Abstract:
Using a tensorial approach, we show how to construct a one-one correspondence between pattern probabilities and edge parameters for any group-based model. This is a generalisation of the "Hadamard conjugation" and is equivalent to standard results that use Fourier analysis. In our derivation we focus on the connections to group representation theory and emphasize that the inversion is possible bec…
▽ More
Using a tensorial approach, we show how to construct a one-one correspondence between pattern probabilities and edge parameters for any group-based model. This is a generalisation of the "Hadamard conjugation" and is equivalent to standard results that use Fourier analysis. In our derivation we focus on the connections to group representation theory and emphasize that the inversion is possible because, under their usual definition, group-based models are defined for abelian groups only. We also argue that our approach is elementary in the sense that it can be understood as simple matrix multiplication where matrices are rectangular and indexed by ordered-partitions of varying sizes.
△ Less
Submitted 17 December, 2012;
originally announced December 2012.
-
Low-parameter phylogenetic estimation under the general Markov model
Authors:
Barbara R. Holland,
Peter D. Jarvis,
Jeremy G. Sumner
Abstract:
In their 2008 and 2009 papers, Sumner and colleagues introduced the "squangles" - a small set of Markov invariants for phylogenetic quartets. The squangles are consistent with the general Markov model (GM) and can be used to infer quartets without the need to explicitly estimate all parameters. As GM is inhomogeneous and hence non-stationary, the squangles are expected to perform well compared to…
▽ More
In their 2008 and 2009 papers, Sumner and colleagues introduced the "squangles" - a small set of Markov invariants for phylogenetic quartets. The squangles are consistent with the general Markov model (GM) and can be used to infer quartets without the need to explicitly estimate all parameters. As GM is inhomogeneous and hence non-stationary, the squangles are expected to perform well compared to standard approaches when there are changes in base-composition amongst species. However, GM includes the IID assumption, so the squangles should be confounded by data generated with invariant sites or with rate-variation across sites. Here we implement the squangles in a least-squares setting that returns quartets weighted by either confidence or internal edge lengths; and use these as input into a variety of quartet-based supertree methods. For the first time, we quantitatively investigate the robustness of the squangles to the breaking of IID assumptions on both simulated and real data sets; and we suggest a modification that improves the performance of the squangles in the presence of invariant sites. Our conclusion is that the squangles provide a novel tool for phylogenetic estimation that is complementary to methods that explicitly account for rate-variation across sites, but rely on homogeneous - and hence stationary - models.
△ Less
Submitted 20 April, 2012;
originally announced April 2012.
-
Novel Distances for Dollo Data
Authors:
Michael Woodhams,
Dorothy A. Steane,
Rebecca C. Jones,
Dean Nicolle,
Vincent Moulton,
Barbara R. Holland
Abstract:
We investigate distances on binary (presence/absence) data in the context of a Dollo process, where a trait can only arise once on a phylogenetic tree but may be lost many times. We introduce a novel distance, the Additive Dollo Distance (ADD), which is consistent for data generated under a Dollo model, and show that it has some useful theoretical properties including an intriguing link to the Log…
▽ More
We investigate distances on binary (presence/absence) data in the context of a Dollo process, where a trait can only arise once on a phylogenetic tree but may be lost many times. We introduce a novel distance, the Additive Dollo Distance (ADD), which is consistent for data generated under a Dollo model, and show that it has some useful theoretical properties including an intriguing link to the LogDet distance. Simulations of Dollo data are used to compare a number of binary distances including ADD, LogDet, Nei Li and some simple, but to our knowledge previously unstudied, variations on common binary distances. The simulations suggest that ADD outperforms other distances on Dollo data. Interestingly, we found that the LogDet distance performs poorly in the context of a Dollo process, which may have implications for its use in connection with conditioned genome reconstruction. We apply the ADD to two Diversity Arrays Technology (DArT) datasets, one that broadly covers Eucalyptus species and one that focuses on the Eucalyptus series Adnataria. We also reanalyse gene family presence/absence data on bacteria from the COG database and compare the results to previous phylogenies estimated using the conditioned genome reconstruction approach.
△ Less
Submitted 29 February, 2012;
originally announced March 2012.