Skip to main content

Showing 1–22 of 22 results for author: Rattray, M

  1. arXiv:2407.02476  [pdf, other

    cs.LG stat.ML

    Scalable Multi-Output Gaussian Processes with Stochastic Variational Inference

    Authors: Xiaoyu Jiang, Sokratia Georgaka, Magnus Rattray, Mauricio A. Alvarez

    Abstract: The Multi-Output Gaussian Process is is a popular tool for modelling data from multiple sources. A typical choice to build a covariance function for a MOGP is the Linear Model of Coregionalization (LMC) which parametrically models the covariance between outputs. The Latent Variable MOGP (LV-MOGP) generalises this idea by modelling the covariance between outputs using a kernel applied to latent var… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: none

  2. Bayesian detection of piecewise linear trends in replicated time-series with application to growth data modelling

    Authors: Panagiotis Papastamoulis, Takanori Furukawa, Norman van Rhijn, Michael Bromley, Elaine Bignell, Magnus Rattray

    Abstract: We consider the situation where a temporal process is composed of contiguous segments with differing slopes and replicated noise-corrupted time series measurements are observed. The unknown mean of the data generating process is modelled as a piecewise linear function of time with an unknown number of change-points. We develop a Bayesian approach to infer the joint posterior distribution of the nu… ▽ More

    Submitted 8 July, 2019; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: Accepted to International Journal of Biostatistics

    Journal ref: The International Journal Of Biostatistics, 2019

  3. arXiv:1701.03095  [pdf, other

    q-bio.GN stat.AP stat.CO

    Bayesian estimation of Differential Transcript Usage from RNA-seq data

    Authors: Panagiotis Papastamoulis, Magnus Rattray

    Abstract: Next generation sequencing allows the identification of genes consisting of differentially expressed transcripts, a term which usually refers to changes in the overall expression level. A specific type of differential expression is differential transcript usage (DTU) and targets changes in the relative within gene expression of a transcript. The contribution of this paper is to: (a) extend the use… ▽ More

    Submitted 28 September, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

    Comments: Revised version, accepted to Statistical Applications in Genetics and Molecular Biology

  4. arXiv:1609.06960  [pdf, other

    stat.CO

    BayesBinMix: an R Package for Model Based Clustering of Multivariate Binary Data

    Authors: Panagiotis Papastamoulis, Magnus Rattray

    Abstract: The BayesBinMix package offers a Bayesian framework for clustering binary data with or without missing values by fitting mixtures of multivariate Bernoulli distributions with an unknown number of components. It allows the joint estimation of the number of clusters and model parameters using Markov chain Monte Carlo sampling. Heated chains are run in parallel and accelerate the convergence to the t… ▽ More

    Submitted 3 April, 2017; v1 submitted 22 September, 2016; originally announced September 2016.

    Comments: Accepted to the R Journal. The package is available on CRAN: https://CRAN.R-project.org/package=BayesBinMix

    Journal ref: The R Journal (2017) 9:1, pages 403-420

  5. Identifying stochastic oscillations in single-cell live imaging time series using Gaussian processes

    Authors: Nick E. Phillips, Cerys Manning, Nancy Papalopulu, Magnus Rattray

    Abstract: Multiple biological processes are driven by oscillatory gene expression at different time scales. Pulsatile dynamics are thought to be widespread, and single-cell live imaging of gene expression has lead to a surge of dynamic, possibly oscillatory, data for different gene networks. However, the regulation of gene expression at the level of an individual cell involves reactions between finite numbe… ▽ More

    Submitted 25 May, 2017; v1 submitted 23 August, 2016; originally announced August 2016.

    Comments: 36 pages, 17 figures

  6. arXiv:1602.01743  [pdf, other

    q-bio.QM

    Inferring the perturbation time from biological time course data

    Authors: Jing Yang, Christopher A. Penfold, Murray R. Grant, Magnus Rattray

    Abstract: Time course data are often used to study the changes to a biological process after perturbation. Statistical methods have been developed to determine whether such a perturbation induces changes over time, e.g. comparing a perturbed and unperturbed time course dataset to uncover differences. However, existing methods do not provide a principled statistical approach to identify the specific time whe… ▽ More

    Submitted 4 February, 2016; originally announced February 2016.

    Comments: 63 pages, 20 figures, paper submitted to Bioinformatics

  7. arXiv:1503.01081  [pdf, other

    q-bio.GN q-bio.QM stat.AP

    Genome-wide modelling of transcription kinetics reveals patterns of RNA production delays

    Authors: Antti Honkela, Jaakko Peltonen, Hande Topa, Iryna Charapitsa, Filomena Matarese, Korbinian Grote, Hendrik G. Stunnenberg, George Reid, Neil D. Lawrence, Magnus Rattray

    Abstract: Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles due to differences in transcription time, degradation rate and RNA processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. We introduce a joint model of transcriptional activation and mRNA accumulation which can be used for inference of… ▽ More

    Submitted 16 July, 2015; v1 submitted 3 March, 2015; originally announced March 2015.

    Comments: 42 pages, 17 figures

    Journal ref: PNAS 112(42):13115-13120, 2015

  8. arXiv:1412.5995  [pdf, other

    q-bio.QM q-bio.GN

    Fast and accurate approximate inference of transcript expression from RNA-seq data

    Authors: James Hensman, Panagiotis Papastamoulis, Peter Glaus, Antti Honkela, Magnus Rattray

    Abstract: Motivation: Assigning RNA-seq reads to their transcript of origin is a fundamental task in transcript expression estimation. Where ambiguities in assignments exist due to transcripts sharing sequence, e.g. alternative isoforms or alleles, the problem can be solved through probabilistic inference. Bayesian methods have been shown to provide accurate transcript abundance estimates compared to compet… ▽ More

    Submitted 30 June, 2015; v1 submitted 18 December, 2014; originally announced December 2014.

    Comments: Main changes: (a) shuffling of reads simulated from spanki and repeat the analysis for sailfish and eXpress. Now both methods yield better point estimates. (b) including the Markov chain Monte Carlo sampler of rsem (RSEM-PME). (c) including the Kallisto method (d) adding alternative measures of transcript expression (TPM) and filtering out low expressed transcripts (supplementary material). arXiv admin note: substantial text overlap with arXiv:1308.5953

  9. arXiv:1412.3050  [pdf, other

    stat.ME q-bio.QM

    A Bayesian model selection approach for identifying differentially expressed transcripts from RNA-Seq data

    Authors: Panagiotis Papastamoulis, Magnus Rattray

    Abstract: Recent advances in molecular biology allow the quantification of the transcriptome and scoring transcripts as differentially or equally expressed between two biological conditions. Although these two tasks are closely linked, the available inference methods treat them separately: a primary model is used to estimate expression and its output is post-processed using a differential expression model.… ▽ More

    Submitted 26 September, 2016; v1 submitted 9 December, 2014; originally announced December 2014.

    Comments: Revised version of arXiv:1412.3050v3

    MSC Class: 62F15

    Journal ref: Journal of the Royal Statistical Society: Series C (Applied Statistics), 2017

  10. arXiv:1401.1605  [pdf, other

    cs.LG cs.CV stat.ML

    Fast nonparametric clustering of structured time-series

    Authors: James Hensman, Magnus Rattray, Neil D. Lawrence

    Abstract: In this publication, we combine two Bayesian non-parametric models: the Gaussian Process (GP) and the Dirichlet Process (DP). Our innovation in the GP model is to introduce a variation on the GP prior which enables us to model structured time-series data, i.e. data containing groups where we wish to model inter- and intra-group variability. Our innovation in the DP model is an implementation of a… ▽ More

    Submitted 14 April, 2014; v1 submitted 8 January, 2014; originally announced January 2014.

    Comments: Accepted for publication in special edition of TPAMI on Bayesian Nonparametrics

  11. arXiv:1308.5953   

    q-bio.GN stat.AP stat.CO

    Fast Approximate Inference of Transcript Expression Levels from RNA-seq Data

    Authors: James Hensman, Peter Glaus, Antti Honkela, Magnus Rattray

    Abstract: Motivation: The mapping of RNA-seq reads to their transcripts of origin is a fundamental task in transcript expression estimation and differential expression scoring. Where ambiguities in mapping exist due to transcripts sharing sequence, e.g. alternative isoforms or alleles, the problem becomes an instance of non-trivial probabilistic inference. Bayesian inference in such a problem is intractable… ▽ More

    Submitted 27 January, 2015; v1 submitted 27 August, 2013; originally announced August 2013.

    Comments: This paper has been withdrawn by the authors. Please see much revised edition arXiv:1412.5995

  12. arXiv:1303.7090  [pdf, other

    math.ST

    Gaussian process models for periodicity detection

    Authors: Nicolas Durrande, James Hensman, Magnus Rattray, Neil D. Lawrence

    Abstract: We consider the problem of detecting and quantifying the periodic component of a function given noise-corrupted observations of a limited number of input/output tuples. Our approach is based on Gaussian process regression which provides a flexible non-parametric framework for modelling periodic data. We introduce a novel decomposition of the covariance function as the sum of periodic and aperiodic… ▽ More

    Submitted 19 August, 2016; v1 submitted 28 March, 2013; originally announced March 2013.

    Comments: in PeerJ Computer Science, 2016

  13. arXiv:1303.4926  [pdf, other

    q-bio.QM q-bio.MN

    Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data

    Authors: Ciira wa Maina, Antti Honkela, Filomena Matarese, Korbinian Grote, Hendrik G. Stunnenberg, George Reid, Neil D. Lawrence, Magnus Rattray

    Abstract: Gene transcription mediated by RNA polymerase II (pol-II) is a key step in gene expression. The dynamics of pol-II moving along the transcribed region influence the rate and timing of gene expression. In this work we present a probabilistic model of transcription dynamics which is fitted to pol-II occupancy time course data measured using ChIP-Seq. The model can be used to estimate transcription s… ▽ More

    Submitted 5 March, 2014; v1 submitted 20 March, 2013; originally announced March 2013.

    Comments: 40 pages: 21 pages Main text, 19 pages supplementary material

  14. arXiv:1206.5162  [pdf, other

    cs.LG stat.ML

    Fast Variational Inference in the Conjugate Exponential Family

    Authors: James Hensman, Magnus Rattray, Neil D. Lawrence

    Abstract: We present a general method for deriving collapsed variational inference algo- rithms for probabilistic models in the conjugate exponential family. Our method unifies many existing approaches to collapsed variational inference. Our collapsed variational inference leads to a new lower bound on the marginal likelihood. We exploit the information geometry of the bound to derive much faster optimizati… ▽ More

    Submitted 4 December, 2012; v1 submitted 22 June, 2012; originally announced June 2012.

    Comments: Accepted at NIPS 2012

  15. Identifying differentially expressed transcripts from RNA-seq data with biological variation

    Authors: Peter Glaus, Antti Honkela, Magnus Rattray

    Abstract: Motivation: High-throughput sequencing enables expression analysis at the level of individual transcripts. The analysis of transcriptome expression levels and differential expression estimation requires a probabilistic approach to properly account for ambiguity caused by shared exons and finite read sampling as well as the intrinsic biological variance of transcript expression. Results: We prese… ▽ More

    Submitted 5 March, 2012; v1 submitted 5 September, 2011; originally announced September 2011.

    Comments: 12 pages, 6 figures in main text; 11 pages, 5 figures in supplementary information (included in the same file)

    Journal ref: Bioinformatics 28(13):1721-1728, 2012

  16. arXiv:q-bio/0404031  [pdf

    q-bio.PE

    RNA-based Phylogenetic Methods: Application to Mammalian Mitochondrial RNA Sequences

    Authors: Cendrine Hudelot, Vivek Gowri-Shankar, Howsun Jow, Magnus Rattray, Paul G. Higgs

    Abstract: The PHASE software package allows phylogenetic tree construction with a number of evolutionary models designed specifically for use with RNA sequences that have conserved secondary structure. Evolution in the paired regions of RNAs occurs via compensatory substitutions, hence changes on either side of a pair are correlated. Accounting for this correlation is important for phylogenetic inference… ▽ More

    Submitted 23 April, 2004; originally announced April 2004.

    Journal ref: Mol. Phyl. Evol. 28, 241-252. (2003)

  17. arXiv:q-bio/0310031  [pdf

    q-bio.PE

    The Evolution of tRNA-Leu Genes in Animal Mitochondrial Genomes

    Authors: P. G. Higgs, D. Jameson, H. Jow, M. Rattray

    Abstract: Animal mitochondrial genomes usually have two transfer RNAs for Leucine: one, with anticodon UAG, translates the four-codon family CUN, whilst the other, with anticodon UAA, translates the two-codon family UUR. These two genes must differ at the third anticodon position, but in some species the genes differ at many additional sites, indicating that these genes have been independent for a long ti… ▽ More

    Submitted 23 October, 2003; originally announced October 2003.

    Comments: 20 pages, 6 figures. J. Mol. Evol. (in press)

  18. arXiv:cond-mat/0309554  [pdf, ps, other

    cond-mat.dis-nn

    Statistical Dynamics of On-line Independent Component Analysis

    Authors: Gleb Basalyga, Magnus Rattray

    Abstract: The learning dynamics of on-line independent component analysis is analysed in the limit of large data dimension. We study a simple Hebbian learning algorithm that can be used to separate out a small number of non-Gaussian components from a high-dimensional data set. The de-mixing matrix parameters are confined to a Stiefel manifold of tall, orthogonal matrices and we introduce a natural gradien… ▽ More

    Submitted 24 September, 2003; originally announced September 2003.

    Comments: 18 pages, 13 figures, to appear in Journal of Machine Learning Research special issue on ICA

  19. arXiv:cond-mat/0105057  [pdf, ps, other

    cond-mat.dis-nn

    Stochastic trapping in a solvable model of on-line independent component analysis

    Authors: Magnus Rattray

    Abstract: Previous analytical studies of on-line Independent Component Analysis (ICA) learning rules have focussed on asymptotic stability and efficiency. In practice the transient stages of learning will often be more significant in determining the success of an algorithm. This is demonstrated here with an analysis of a Hebbian ICA algorithm which can find a small number of non-Gaussian components given… ▽ More

    Submitted 3 May, 2001; originally announced May 2001.

    Comments: 17 pages, 3 figures. To appear in Neural Computation

  20. arXiv:adap-org/9907009  [pdf, ps, other

    nlin.AO q-bio

    Cumulant Dynamics of a Population under Multiplicative Selection, Mutation and Drift

    Authors: Magnus Rattray, Jonathan L. Shapiro

    Abstract: We revisit the classical population genetics model of a population evolving under multiplicative selection, mutation and drift. The number of beneficial alleles in a multi-locus system can be considered a trait under exponential selection. Equations of motion are derived for the cumulants of the trait distribution in the diffusion limit and under the assumption of linkage equilibrium. Because of… ▽ More

    Submitted 5 June, 2001; v1 submitted 23 July, 1999; originally announced July 1999.

    Comments: Minor changes and authored appendix by Adam Prugel-Bennett. To appear in Theoretical Population Biology, 2001

  21. arXiv:cond-mat/9901212  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech

    Analysis of Natural Gradient Descent for Multilayer Neural Networks

    Authors: Magnus Rattray, David Saad

    Abstract: Natural gradient descent is a principled method for adapting the parameters of a statistical model on-line using an underlying Riemannian parameter space to redefine the direction of steepest descent. The algorithm is examined via methods of statistical physics which accurately characterize both transient and asymptotic behavior. A solution of the learning dynamics is obtained for the case of mu… ▽ More

    Submitted 21 January, 1999; originally announced January 1999.

    Comments: 14 pages including figures. To appear in Physical Review E

  22. The Dynamics of a Genetic Algorithm for a Simple Learning Problem

    Authors: Magnus Rattray, Jonathan Shapiro

    Abstract: A formalism for describing the dynamics of Genetic Algorithms (GAs) using methods from statistical mechanics is applied to the problem of generalization in a perceptron with binary weights. The dynamics are solved for the case where a new batch of training patterns is presented to each population member each generation, which considerably simplifies the calculation. The theory is shown to agree… ▽ More

    Submitted 12 September, 1996; originally announced September 1996.

    Comments: 28 pages, 4 Postscript figures. Latex using IOP macros ioplppt and iopl12 which are included. To appear in Journal of Physics A. Also available at ftp://ftp.cs.man.ac.uk/pub/ai/jls/GAlearn.ps.gz and http://www.cs.man.ac.uk/~jls