-
Brain Age Revisited: Investigating the State vs. Trait Hypotheses of EEG-derived Brain-Age Dynamics with Deep Learning
Authors:
Lukas AW Gemein,
Robin T Schirrmeister,
Joschka Boedecker,
Tonio Ball
Abstract:
The brain's biological age has been considered as a promising candidate for a neurologically significant biomarker. However, recent results based on longitudinal magnetic resonance imaging data have raised questions on its interpretation. A central question is whether an increased biological age of the brain is indicative of brain pathology and if changes in brain age correlate with diagnosed path…
▽ More
The brain's biological age has been considered as a promising candidate for a neurologically significant biomarker. However, recent results based on longitudinal magnetic resonance imaging data have raised questions on its interpretation. A central question is whether an increased biological age of the brain is indicative of brain pathology and if changes in brain age correlate with diagnosed pathology (state hypothesis). Alternatively, could the discrepancy in brain age be a stable characteristic unique to each individual (trait hypothesis)? To address this question, we present a comprehensive study on brain aging based on clinical EEG, which is complementary to previous MRI-based investigations. We apply a state-of-the-art Temporal Convolutional Network (TCN) to the task of age regression. We train on recordings of the Temple University Hospital EEG Corpus (TUEG) explicitly labeled as non-pathological and evaluate on recordings of subjects with non-pathological as well as pathological recordings, both with examinations at a single point in time and repeated examinations over time. Therefore, we created four novel subsets of TUEG that include subjects with multiple recordings: I) all labeled non-pathological; II) all labeled pathological; III) at least one recording labeled non-pathological followed by at least one recording labeled pathological; IV) similar to III) but with opposing transition (first pathological then non-pathological). The results show that our TCN reaches state-of-the-art performance in age decoding with a mean absolute error of 6.6 years. Our extensive analyses demonstrate that the model significantly underestimates the age of non-pathological and pathological subjects (-1 and -5 years, paired t-test, p <= 0.18 and p <= 0.0066). Furthermore, the brain age gap biomarker is not indicative of pathological EEG.
△ Less
Submitted 22 September, 2023;
originally announced October 2023.
-
Symbolic Automata: $ω$-Regularity Modulo Theories
Authors:
Margus Veanes,
Thomas Ball,
Gabriel Ebner,
Olli Saarikivi
Abstract:
Symbolic automata are finite state automata that support potentially infinite alphabets, such as the set of rational numbers, generally applied to regular expressions/languages over finite words. In symbolic automata (or automata modulo theories), an alphabet is represented by an effective Boolean algebra, supported by a decision procedure for satisfiability. Regular languages over infinite words…
▽ More
Symbolic automata are finite state automata that support potentially infinite alphabets, such as the set of rational numbers, generally applied to regular expressions/languages over finite words. In symbolic automata (or automata modulo theories), an alphabet is represented by an effective Boolean algebra, supported by a decision procedure for satisfiability. Regular languages over infinite words (so called $ω$-regular languages) have a rich history paralleling that of regular languages over finite words, with well known applications to model checking via Büchi automata and temporal logics.
We generalize symbolic automata to support $ω$-regular languages via symbolic transition terms and symbolic derivatives, bringing together a variety of classic automata and logics in a unified framework that provides all the necessary ingredients to support symbolic model checking modulo $A$, $NBW_A$. In particular, we define: (1) alternating Büchi automata modulo $A$, $ABW_A$ as well (non-alternating) non-deterministic Büchi automata modulo $A$, $NBW_A$; (2) an alternation elimination algorithm that incrementally constructs an $NBW_A$ from an $ABW_A$, and can also be used for constructing the product of two $NBW_A$'s; (3) a definition of linear temporal logic (LTL) modulo $A$ that generalizes Vardi's construction of alternating Büchi automata from LTL, using (2) to go from LTL modulo $A$ to $NBW_A$ via $ABW_A$.
Finally, we present a combination of LTL modulo $A$ with extended regular expressions modulo $A$ that generalizes the Property Specification Language (PSL). Our combination allows regex complement, that is not supported in PSL but can be supported naturally by using symbolic transition terms.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
Panel Data Nowcasting: The Case of Price-Earnings Ratios
Authors:
Andrii Babii,
Ryan T. Ball,
Eric Ghysels,
Jonas Striaukas
Abstract:
The paper uses structured machine learning regressions for nowcasting with panel data consisting of series sampled at different frequencies. Motivated by the problem of predicting corporate earnings for a large cross-section of firms with macroeconomic, financial, and news time series sampled at different frequencies, we focus on the sparse-group LASSO regularization which can take advantage of th…
▽ More
The paper uses structured machine learning regressions for nowcasting with panel data consisting of series sampled at different frequencies. Motivated by the problem of predicting corporate earnings for a large cross-section of firms with macroeconomic, financial, and news time series sampled at different frequencies, we focus on the sparse-group LASSO regularization which can take advantage of the mixed frequency time series panel data structures. Our empirical results show the superior performance of our machine learning panel data regression models over analysts' predictions, forecast combinations, firm-specific time series regression models, and standard machine learning methods.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Deep Riemannian Networks for EEG Decoding
Authors:
Daniel Wilson,
Robin Tibor Schirrmeister,
Lukas Alexander Wilhelm Gemein,
Tonio Ball
Abstract:
State-of-the-art performance in electroencephalography (EEG) decoding tasks is currently often achieved with either Deep-Learning (DL) or Riemannian-Geometry-based decoders (RBDs). Recently, there is growing interest in Deep Riemannian Networks (DRNs) possibly combining the advantages of both previous classes of methods. However, there are still a range of topics where additional insight is needed…
▽ More
State-of-the-art performance in electroencephalography (EEG) decoding tasks is currently often achieved with either Deep-Learning (DL) or Riemannian-Geometry-based decoders (RBDs). Recently, there is growing interest in Deep Riemannian Networks (DRNs) possibly combining the advantages of both previous classes of methods. However, there are still a range of topics where additional insight is needed to pave the way for a more widespread application of DRNs in EEG. These include architecture design questions such as network size and end-to-end ability.How these factors affect model performance has not been explored. Additionally, it is not clear how the data within these networks is transformed, and whether this would correlate with traditional EEG decoding. Our study aims to lay the groundwork in the area of these topics through the analysis of DRNs for EEG with a wide range of hyperparameters. Networks were tested on two public EEG datasets and compared with state-of-the-art ConvNets. Here we propose end-to-end EEG SPDNet (EE(G)-SPDNet), and we show that this wide, end-to-end DRN can outperform the ConvNets, and in doing so use physiologically plausible frequency regions. We also show that the end-to-end approach learns more complex filters than traditional band-pass filters targeting the classical alpha, beta, and gamma frequency bands of the EEG, and that performance can benefit from channel specific filtering approaches. Additionally, architectural analysis revealed areas for further improvement due to the possible loss of Riemannian specific information throughout the network. Our study thus shows how to design and train DRNs to infer task-related information from the raw EEG without the need of handcrafted filterbanks and highlights the potential of end-to-end DRNs such as EE(G)-SPDNet for high-performance EEG decoding.
△ Less
Submitted 1 August, 2023; v1 submitted 20 December, 2022;
originally announced December 2022.
-
A Framework for Preserving Privacy and Cybersecurity in Brain-Computer Interfacing Applications
Authors:
Maryna Kapitonova,
Philipp Kellmeyer,
Simon Vogt,
Tonio Ball
Abstract:
Brain-Computer Interfaces (BCIs) comprise a rapidly evolving field of technology with the potential of far-reaching impact in domains ranging from medical over industrial to artistic, gaming, and military. Today, these emerging BCI applications are typically still at early technology readiness levels, but because BCIs create novel, technical communication channels for the human brain, they have ra…
▽ More
Brain-Computer Interfaces (BCIs) comprise a rapidly evolving field of technology with the potential of far-reaching impact in domains ranging from medical over industrial to artistic, gaming, and military. Today, these emerging BCI applications are typically still at early technology readiness levels, but because BCIs create novel, technical communication channels for the human brain, they have raised privacy and security concerns. To mitigate such risks, a large body of countermeasures has been proposed in the literature, but a general framework is lacking which would describe how privacy and security of BCI applications can be protected by design, i.e., already as an integral part of the early BCI design process, in a systematic manner, and allowing suitable depth of analysis for different contexts such as commercial BCI product development vs. academic research and lab prototypes. Here we propose the adoption of recent systems-engineering methodologies for privacy threat modeling, risk assessment, and privacy engineering to the BCI field. These methodologies address privacy and security concerns in a more systematic and holistic way than previous approaches, and provide reusable patterns on how to move from principles to actions. We apply these methodologies to BCI and data flows and derive a generic, extensible, and actionable framework for brain-privacy-preserving cybersecurity in BCI applications. This framework is designed for flexible application to the wide range of current and future BCI applications. We also propose a range of novel privacy-by-design features for BCIs, with an emphasis on features promoting BCI transparency as a prerequisite for informational self-determination of BCI users, as well as design features for ensuring BCI user autonomy. We anticipate that our framework will contribute to the development of privacy-respecting, trustworthy BCI technologies.
△ Less
Submitted 19 September, 2022;
originally announced September 2022.
-
The spatial scale dimension of speech processing in the human brain
Authors:
Philipp Kellmeyer,
Roland Berkemeier,
Tonio Ball
Abstract:
In the past three decades, neuroimaging has provided important insights into structure-function relationships in the human brain. Recently, however, the methods for analyzing functional magnetic resonance imaging (fMRI) data have come under scrutiny, with studies questioning cross-software comparability, the validity of statistical inference and interpretation, and the influence of the spatial fil…
▽ More
In the past three decades, neuroimaging has provided important insights into structure-function relationships in the human brain. Recently, however, the methods for analyzing functional magnetic resonance imaging (fMRI) data have come under scrutiny, with studies questioning cross-software comparability, the validity of statistical inference and interpretation, and the influence of the spatial filter size on neuroimaging analyses. As most fMRI studies only use a single filter for analysis, much information on the size and shape of the BOLD signal in Gaussian scale space remains hidden and constrains the interpretation of fMRI studies. To investigate the influence of the spatial observation scale on fMRI analysis, we use a spatial multiscale analysis with a range of Gaussian filters from 1-20 mm (full width at half maximum) to analyze fMRI data from a speech repetition paradigm in 25 healthy subjects. We show that analyzing the fMRI data over a range of Gaussian filter kernels reveals substantial variability in the neuroanatomical localization and the average signal strength and size of suprathreshold clusters depending on the filter size. We also demonstrate how small spatial filters bias the results towards subcortical and cerebellar clusters. Furthermore, we describe substantially different scale-dependent cluster size dynamics between cortical and cerebellar clusters. We discuss how spatial multiscale analysis may substantially improve the interpretation of fMRI data. We propose to further develop a spatial multiscale analysis to fully explore the deep structure of the BOLD signal in Gaussian scale space.
△ Less
Submitted 19 January, 2022;
originally announced January 2022.
-
When less is more: Simplifying inputs aids neural network understanding
Authors:
Robin Tibor Schirrmeister,
Rosanne Liu,
Sara Hooker,
Tonio Ball
Abstract:
How do neural network image classifiers respond to simpler and simpler inputs? And what do such responses reveal about the learning process? To answer these questions, we need a clear measure of input simplicity (or inversely, complexity), an optimization objective that correlates with simplification, and a framework to incorporate such objective into training and inference. Lastly we need a varie…
▽ More
How do neural network image classifiers respond to simpler and simpler inputs? And what do such responses reveal about the learning process? To answer these questions, we need a clear measure of input simplicity (or inversely, complexity), an optimization objective that correlates with simplification, and a framework to incorporate such objective into training and inference. Lastly we need a variety of testbeds to experiment and evaluate the impact of such simplification on learning. In this work, we measure simplicity with the encoding bit size given by a pretrained generative model, and minimize the bit size to simplify inputs in training and inference. We investigate the effect of such simplification in several scenarios: conventional training, dataset condensation and post-hoc explanations. In all settings, inputs are simplified along with the original classification task, and we investigate the trade-off between input simplicity and task performance. For images with injected distractors, such simplification naturally removes superfluous information. For dataset condensation, we find that inputs can be simplified with almost no accuracy degradation. When used in post-hoc explanation, our learning-based simplification approach offers a valuable new tool to explore the basis of network decisions.
△ Less
Submitted 1 February, 2022; v1 submitted 14 January, 2022;
originally announced January 2022.
-
Towards a Governance Framework for Brain Data
Authors:
Marcello Ienca,
Joseph J. Fins,
Ralf J. Jox,
Fabrice Jotterand,
Silja Voeneky,
Roberto Andorno,
Tonio Ball,
Claude Castelluccia,
Ricardo Chavarriaga,
Hervé Chneiweiss,
Agata Ferretti,
Orsolya Friedrich,
Samia Hurst,
Grischa Merkel,
Fruzsina Molnar-Gabor,
Jean-Marc Rickli,
James Scheibner,
Effy Vayena,
Rafael Yuste,
Philipp Kellmeyer
Abstract:
The increasing availability of brain data within and outside the biomedical field, combined with the application of artificial intelligence (AI) to brain data analysis, poses a challenge for ethics and governance. We identify distinctive ethical implications of brain data acquisition and processing, and outline a multi-level governance framework. This framework is aimed at maximizing the benefits…
▽ More
The increasing availability of brain data within and outside the biomedical field, combined with the application of artificial intelligence (AI) to brain data analysis, poses a challenge for ethics and governance. We identify distinctive ethical implications of brain data acquisition and processing, and outline a multi-level governance framework. This framework is aimed at maximizing the benefits of facilitated brain data collection and further processing for science and medicine whilst minimizing risks and preventing harmful use. The framework consists of four primary areas of regulatory intervention: binding regulation, ethics and soft law, responsible innovation, and human rights.
△ Less
Submitted 28 September, 2021; v1 submitted 24 September, 2021;
originally announced September 2021.
-
Machine Learning Panel Data Regressions with Heavy-tailed Dependent Data: Theory and Application
Authors:
Andrii Babii,
Ryan T. Ball,
Eric Ghysels,
Jonas Striaukas
Abstract:
The paper introduces structured machine learning regressions for heavy-tailed dependent panel data potentially sampled at different frequencies. We focus on the sparse-group LASSO regularization. This type of regularization can take advantage of the mixed frequency time series panel data structures and improve the quality of the estimates. We obtain oracle inequalities for the pooled and fixed eff…
▽ More
The paper introduces structured machine learning regressions for heavy-tailed dependent panel data potentially sampled at different frequencies. We focus on the sparse-group LASSO regularization. This type of regularization can take advantage of the mixed frequency time series panel data structures and improve the quality of the estimates. We obtain oracle inequalities for the pooled and fixed effects sparse-group LASSO panel data estimators recognizing that financial and economic data can have fat tails. To that end, we leverage on a new Fuk-Nagaev concentration inequality for panel data consisting of heavy-tailed $τ$-mixing processes.
△ Less
Submitted 22 November, 2021; v1 submitted 8 August, 2020;
originally announced August 2020.
-
Coding theory package for Macaulay2
Authors:
Taylor Ball,
Eduardo Camps,
Henry Chimal-Dzul,
Delio Jaramillo-Velez,
Hiram H. López,
Nathan Nichols,
Matthew Perkins,
Ivan Soprunov,
German Vera-Martínez,
Gwyn Whieldon
Abstract:
In this Macaulay2 \cite{M2} package we define an object called {\it linear code}. We implement functions that compute basic parameters and objects associated with a linear code, such as generator and parity check matrices, the dual code, length, dimension, and minimum distance, among others. We define an object {\it evaluation code}, a construction which allows to study linear codes using tools of…
▽ More
In this Macaulay2 \cite{M2} package we define an object called {\it linear code}. We implement functions that compute basic parameters and objects associated with a linear code, such as generator and parity check matrices, the dual code, length, dimension, and minimum distance, among others. We define an object {\it evaluation code}, a construction which allows to study linear codes using tools of algebraic geometry and commutative algebra. We implement functions to generate important families of linear codes such as Hamming codes, cyclic codes, Reed--Solomon codes, Reed--Muller codes, Cartesian codes, monomial--Cartesian codes, and toric codes. In addition, we define functions for the syndrome decoding algorithm and locally recoverable code construction, which are important tools in applications of linear codes. The package \textit{CodingTheory.m2} is available at \url{https://github.com/Macaulay2/Workshop-2020-Cleveland/tree/CodingTheory/CodingTheory}
△ Less
Submitted 13 July, 2020;
originally announced July 2020.
-
Understanding Anomaly Detection with Deep Invertible Networks through Hierarchies of Distributions and Features
Authors:
Robin Tibor Schirrmeister,
Yuxuan Zhou,
Tonio Ball,
Dan Zhang
Abstract:
Deep generative networks trained via maximum likelihood on a natural image dataset like CIFAR10 often assign high likelihoods to images from datasets with different objects (e.g., SVHN). We refine previous investigations of this failure at anomaly detection for invertible generative networks and provide a clear explanation of it as a combination of model bias and domain prior: Convolutional networ…
▽ More
Deep generative networks trained via maximum likelihood on a natural image dataset like CIFAR10 often assign high likelihoods to images from datasets with different objects (e.g., SVHN). We refine previous investigations of this failure at anomaly detection for invertible generative networks and provide a clear explanation of it as a combination of model bias and domain prior: Convolutional networks learn similar low-level feature distributions when trained on any natural image dataset and these low-level features dominate the likelihood. Hence, when the discriminative features between inliers and outliers are on a high-level, e.g., object shapes, anomaly detection becomes particularly challenging. To remove the negative impact of model bias and domain prior on detecting high-level differences, we propose two methods, first, using the log likelihood ratios of two identical models, one trained on the in-distribution data (e.g., CIFAR10) and the other one on a more general distribution of images (e.g., 80 Million Tiny Images). We also derive a novel outlier loss for the in-distribution network on samples from the more general distribution to further improve the performance. Secondly, using a multi-scale model like Glow, we show that low-level features are mainly captured at early scales. Therefore, using only the likelihood contribution of the final scale performs remarkably well for detecting high-level feature differences of the out-of-distribution and the in-distribution. This method is especially useful if one does not have access to a suitable general distribution. Overall, our methods achieve strong anomaly detection performance in the unsupervised setting, and only slightly underperform state-of-the-art classifier-based methods in the supervised setting. Code can be found at https://github.com/boschresearch/hierarchical_anomaly_detection.
△ Less
Submitted 2 November, 2020; v1 submitted 18 June, 2020;
originally announced June 2020.
-
Machine-Learning-Based Diagnostics of EEG Pathology
Authors:
Lukas Alexander Wilhelm Gemein,
Robin Tibor Schirrmeister,
Patryk Chrabąszcz,
Daniel Wilson,
Joschka Boedecker,
Andreas Schulze-Bonhage,
Frank Hutter,
Tonio Ball
Abstract:
Machine learning (ML) methods have the potential to automate clinical EEG analysis. They can be categorized into feature-based (with handcrafted features), and end-to-end approaches (with learned features). Previous studies on EEG pathology decoding have typically analyzed a limited number of features, decoders, or both. For a I) more elaborate feature-based EEG analysis, and II) in-depth comparis…
▽ More
Machine learning (ML) methods have the potential to automate clinical EEG analysis. They can be categorized into feature-based (with handcrafted features), and end-to-end approaches (with learned features). Previous studies on EEG pathology decoding have typically analyzed a limited number of features, decoders, or both. For a I) more elaborate feature-based EEG analysis, and II) in-depth comparisons of both approaches, here we first develop a comprehensive feature-based framework, and then compare this framework to state-of-the-art end-to-end methods. To this aim, we apply the proposed feature-based framework and deep neural networks including an EEG-optimized temporal convolutional network (TCN) to the task of pathological versus non-pathological EEG classification. For a robust comparison, we chose the Temple University Hospital (TUH) Abnormal EEG Corpus (v2.0.0), which contains approximately 3000 EEG recordings. The results demonstrate that the proposed feature-based decoding framework can achieve accuracies on the same level as state-of-the-art deep neural networks. We find accuracies across both approaches in an astonishingly narrow range from 81--86\%. Moreover, visualizations and analyses indicated that both approaches used similar aspects of the data, e.g., delta and theta band power at temporal electrode locations. We argue that the accuracies of current binary EEG pathology decoders could saturate near 90\% due to the imperfect inter-rater agreement of the clinical labels, and that such decoders are already clinically useful, such as in areas where clinical EEG experts are rare. We make the proposed feature-based framework available open source and thus offer a new tool for EEG machine learning research.
△ Less
Submitted 11 February, 2020;
originally announced February 2020.
-
A Research Framework for Virtual Reality Neurosurgery Based on Open-Source Tools
Authors:
Lukas D. J. Fiederer,
Hisham Alwanni,
Martin Völker,
Oliver Schnell,
Jürgen Beck,
Tonio Ball
Abstract:
Fully immersive virtual reality (VR) has the potential to improve neurosurgical planning. For example, it may offer 3D visualizations of relevant anatomical structures with complex shapes, such as blood vessels and tumors. However, there is a lack of research tools specifically tailored for this area. We present a research framework for VR neurosurgery based on open-source tools and preliminary ev…
▽ More
Fully immersive virtual reality (VR) has the potential to improve neurosurgical planning. For example, it may offer 3D visualizations of relevant anatomical structures with complex shapes, such as blood vessels and tumors. However, there is a lack of research tools specifically tailored for this area. We present a research framework for VR neurosurgery based on open-source tools and preliminary evaluation results. We showcase the potential of such a framework using clinical data of two patients and research data of one subject. As a first step toward practical evaluations, two certified senior neurosurgeons positively assessed the usefulness of the VR visualizations using head-mounted displays. The methods and findings described in our study thus provide a foundation for research and development aiming at versatile and user-friendly VR tools for improving neurosurgical planning and training.
△ Less
Submitted 14 August, 2019;
originally announced August 2019.
-
Deep Invertible Networks for EEG-based brain-signal decoding
Authors:
Robin Tibor Schirrmeister,
Tonio Ball
Abstract:
In this manuscript, we investigate deep invertible networks for EEG-based brain signal decoding and find them to generate realistic EEG signals as well as classify novel signals above chance. Further ideas for their regularization towards better decoding accuracies are discussed.
In this manuscript, we investigate deep invertible networks for EEG-based brain signal decoding and find them to generate realistic EEG signals as well as classify novel signals above chance. Further ideas for their regularization towards better decoding accuracies are discussed.
△ Less
Submitted 17 July, 2019;
originally announced July 2019.
-
Cortical Mirror-System Activation During Real-Life Game Playing: An Intracranial Electroencephalography (EEG) Study
Authors:
Markus Kern,
Johanna Ruescher,
Andreas Schulze-Bonhage,
Tonio Ball
Abstract:
Analogous to the mirror neuron system repeatedly described in monkeys as a possible substrate for imitation learning and/or action understanding, a neuronal execution/observation matching system (OEMS) is assumed in humans, but little is known to what extent this system is activated in non-experimental, real-life conditions. In the present case study, we investigated brain activity of this system…
▽ More
Analogous to the mirror neuron system repeatedly described in monkeys as a possible substrate for imitation learning and/or action understanding, a neuronal execution/observation matching system (OEMS) is assumed in humans, but little is known to what extent this system is activated in non-experimental, real-life conditions. In the present case study, we investigated brain activity of this system during natural, non-experimental motor behavior as it occurred during playing of the board game "Malefiz". We compared spectral modulations of the high-gamma band related to ipsilateral reaching movement execution and observation of the same kind of movement using electrocorticography (ECoG) in one participant. Spatially coincident activity during both conditions execution and observation was recorded at electrode contacts over the premotor/primary motor cortex. The topography and amplitude of the high-gamma modulations related to both, movement observation and execution were clearly spatially correlated over several fronto-parietal brain areas. Thus, our findings indicate that a network of cortical areas contributes to the human OEMS, beyond primary/premotor cortex including Brocas area and the temporo-parieto-occipital junction area, in real-life conditions.
△ Less
Submitted 27 March, 2019; v1 submitted 25 February, 2019;
originally announced February 2019.
-
Independent set and matching permutations
Authors:
Taylor Ball,
David Galvin,
Catherine Hyry,
Kyle Weingartner
Abstract:
Let $G$ be a graph $G$ whose largest independent set has size $m$. A permutation $π$ of $\{1, \ldots, m\}$ is an {\em independent set permutation} of $G$ if $$ a_{π(1)}(G) \leq a_{π(2)}(G) \leq \cdots \leq a_{π(m)}(G) $$ where $a_k(G)$ is the number of independent sets of size $k$ in $G$. In 1987 Alavi, Malde, Schwenk and Erdős proved that every permutation of $\{1, \ldots, m\}$ is an independent…
▽ More
Let $G$ be a graph $G$ whose largest independent set has size $m$. A permutation $π$ of $\{1, \ldots, m\}$ is an {\em independent set permutation} of $G$ if $$ a_{π(1)}(G) \leq a_{π(2)}(G) \leq \cdots \leq a_{π(m)}(G) $$ where $a_k(G)$ is the number of independent sets of size $k$ in $G$. In 1987 Alavi, Malde, Schwenk and Erdős proved that every permutation of $\{1, \ldots, m\}$ is an independent set permutation of some graph with $α(G)=m$, i.e. with largest independent set having size $m$. They raised the question of determining, for each $m$, the smallest number $f(m)$ such that every permutation of $\{1, \ldots, m\}$ is an independent set permutation of some graph with $α(G)=m$ and with at most $f(m)$ vertices, and they gave an upper bound on $f(m)$ of roughly $m^{2m}$. Here we settle the question, determining $f(m)=m^m$, and make progress on a related question, that of determining the smallest order such that every permutation of $\{1, \ldots, m\}$ is the {\em unique} independent set permutation of some graph of at most that order. More generally we consider an extension of independent set permutations to weak orders, and extend Alavi et al.'s main result to show that every weak order on $\{1, \ldots, m\}$ can be realized by the independent set sequence of some graph with $α(G)=m$ and with at most $m^{m+2}$ vertices.
Alavi et al. also considered {\em matching permutations}, defined analogously to independent set permutations. They observed that not every permutation of $\{1,\ldots,m\}$ is a matching permutation of some graph with largest matching having size $m$, putting an upper bound of $2^{m-1}$ on the number of matching permutations of $\{1,\ldots,m\}$. Confirming their speculation that this upper bound is not tight, we improve it to $O(2^m/\sqrt{m})$.
△ Less
Submitted 13 July, 2021; v1 submitted 19 January, 2019;
originally announced January 2019.
-
Deep Learning for micro-Electrocorticographic (μECoG) Data
Authors:
Xi Wang,
C. Alexis Gkogkidis,
Robin T. Schirrmeister,
Felix A. Heilmeyer,
Mortimer Gierthmuehlen,
Fabian Kohler,
Martin Schuettler,
Thomas Stieglitz,
Tonio Ball
Abstract:
Machine learning can extract information from neural recordings, e.g., surface EEG, ECoG and μECoG, and therefore plays an important role in many research and clinical applications. Deep learning with artificial neural networks has recently seen increasing attention as a new approach in brain signal decoding. Here, we apply a deep learning approach using convolutional neural networks to μECoG data…
▽ More
Machine learning can extract information from neural recordings, e.g., surface EEG, ECoG and μECoG, and therefore plays an important role in many research and clinical applications. Deep learning with artificial neural networks has recently seen increasing attention as a new approach in brain signal decoding. Here, we apply a deep learning approach using convolutional neural networks to μECoG data obtained with a wireless, chronically implanted system in an ovine animal model. Regularized linear discriminant analysis (rLDA), a filter bank component spatial pattern (FBCSP) algorithm and convolutional neural networks (ConvNets) were applied to auditory evoked responses captured by μECoG. We show that compared with rLDA and FBCSP, significantly higher decoding accuracy can be obtained by ConvNets trained in an end-to-end manner, i.e., without any predefined signal features. Deep learning thus proves a promising technique for μECoG-based brain-machine interfacing applications.
△ Less
Submitted 5 October, 2018;
originally announced October 2018.
-
The role of robot design in decoding error-related information from EEG signals of a human observer
Authors:
Joos Behncke,
Robin Tibor Schirrmeister,
Wolfram Burgard,
Tonio Ball
Abstract:
For utilization of robotic assistive devices in everyday life, means for detection and processing of erroneous robot actions are a focal aspect in the development of collaborative systems, especially when controlled via brain signals. Though, the variety of possible scenarios and the diversity of used robotic systems pose a challenge for error decoding from recordings of brain signals such as via…
▽ More
For utilization of robotic assistive devices in everyday life, means for detection and processing of erroneous robot actions are a focal aspect in the development of collaborative systems, especially when controlled via brain signals. Though, the variety of possible scenarios and the diversity of used robotic systems pose a challenge for error decoding from recordings of brain signals such as via EEG. For example, it is unclear whether humanoid appearances of robotic assistants have an influence on the performance. In this paper, we designed a study in which two different robots executed the same task both in an erroneous and a correct manner. We find error-related EEG signals of human observers indicating that the performance of the error decoding was independent of robot design. However, we can show that it was possible to identify which robot performed the instructed task by means of the EEG signals. In this case, deep convolutional neural networks (deep ConvNets) could reach significantly higher accuracies than both regularized Linear Discriminanat Analysis (rLDA) and filter bank common spatial patterns (FB-CSP) combined with rLDA. Our findings indicate that decoding information about robot action success from the EEG, particularly when using deep neural networks, may be an applicable approach for a broad range of robot designs.
△ Less
Submitted 18 July, 2018; v1 submitted 4 July, 2018;
originally announced July 2018.
-
Cross-paradigm pretraining of convolutional networks improves intracranial EEG decoding
Authors:
Joos Behncke,
Robin Tibor Schirrmeister,
Martin Völker,
Jiří Hammer,
Petr Marusič,
Andreas Schulze-Bonhage,
Wolfram Burgard,
Tonio Ball
Abstract:
When it comes to the classification of brain signals in real-life applications, the training and the prediction data are often described by different distributions. Furthermore, diverse data sets, e.g., recorded from various subjects or tasks, can even exhibit distinct feature spaces. The fact that data that have to be classified are often only available in small amounts reinforces the need for te…
▽ More
When it comes to the classification of brain signals in real-life applications, the training and the prediction data are often described by different distributions. Furthermore, diverse data sets, e.g., recorded from various subjects or tasks, can even exhibit distinct feature spaces. The fact that data that have to be classified are often only available in small amounts reinforces the need for techniques to generalize learned information, as performances of brain-computer interfaces (BCIs) are enhanced by increasing quantity of available data. In this paper, we apply transfer learning to a framework based on deep convolutional neural networks (deep ConvNets) to prove the transferability of learned patterns in error-related brain signals across different tasks. The experiments described in this paper demonstrate the usefulness of transfer learning, especially improving performances when only little data can be used to distinguish between erroneous and correct realization of a task. This effect could be delimited from a transfer of merely general brain signal characteristics, underlining the transfer of error-specific information. Furthermore, we could extract similar patterns in time-frequency analyses in identical channels, leading to selective high signal correlations between the two different paradigms. Classification on the intracranial data yields in median accuracies up to $(81.50 \pm 9.49)\,\%$. Decoding on only $10\%$ of the data without pre-training reaches performances of $(54.76 \pm 3.56)\,\%$, compared to $(64.95 \pm 0.79)\,\%$ with pre-training.
△ Less
Submitted 20 July, 2018; v1 submitted 20 June, 2018;
originally announced June 2018.
-
A large-scale evaluation framework for EEG deep learning architectures
Authors:
Felix A. Heilmeyer,
Robin T. Schirrmeister,
Lukas D. J. Fiederer,
Martin Völker,
Joos Behncke,
Tonio Ball
Abstract:
EEG is the most common signal source for noninvasive BCI applications. For such applications, the EEG signal needs to be decoded and translated into appropriate actions. A recently emerging EEG decoding approach is deep learning with Convolutional or Recurrent Neural Networks (CNNs, RNNs) with many different architectures already published. Here we present a novel framework for the large-scale eva…
▽ More
EEG is the most common signal source for noninvasive BCI applications. For such applications, the EEG signal needs to be decoded and translated into appropriate actions. A recently emerging EEG decoding approach is deep learning with Convolutional or Recurrent Neural Networks (CNNs, RNNs) with many different architectures already published. Here we present a novel framework for the large-scale evaluation of different deep-learning architectures on different EEG datasets. This framework comprises (i) a collection of EEG datasets currently including 100 examples (recording sessions) from six different classification problems, (ii) a collection of different EEG decoding algorithms, and (iii) a wrapper linking the decoders to the data as well as handling structured documentation of all settings and (hyper-) parameters and statistics, designed to ensure transparency and reproducibility. As an applications example we used our framework by comparing three publicly available CNN architectures: the Braindecode Deep4 ConvNet, Braindecode Shallow ConvNet, and two versions of EEGNet. We also show how our framework can be used to study similarities and differences in the performance of different decoding methods across tasks. We argue that the deep learning EEG framework as described here could help to tap the full potential of deep learning for BCI applications.
△ Less
Submitted 25 July, 2018; v1 submitted 18 June, 2018;
originally announced June 2018.
-
EEG-GAN: Generative adversarial networks for electroencephalograhic (EEG) brain signals
Authors:
Kay Gregor Hartmann,
Robin Tibor Schirrmeister,
Tonio Ball
Abstract:
Generative adversarial networks (GANs) are recently highly successful in generative applications involving images and start being applied to time series data. Here we describe EEG-GAN as a framework to generate electroencephalographic (EEG) brain signals. We introduce a modification to the improved training of Wasserstein GANs to stabilize training and investigate a range of architectural choices…
▽ More
Generative adversarial networks (GANs) are recently highly successful in generative applications involving images and start being applied to time series data. Here we describe EEG-GAN as a framework to generate electroencephalographic (EEG) brain signals. We introduce a modification to the improved training of Wasserstein GANs to stabilize training and investigate a range of architectural choices critical for time series generation (most notably up- and down-sampling). For evaluation we consider and compare different metrics such as Inception score, Frechet inception distance and sliced Wasserstein distance, together showing that our EEG-GAN framework generated naturalistic EEG examples. It thus opens up a range of new generative application scenarios in the neuroscientific and neurological context, such as data augmentation in brain-computer interfacing tasks, EEG super-sampling, or restoration of corrupted data segments. The possibility to generate signals of a certain class and/or with specific properties may also open a new avenue for research into the underlying structure of brain signals.
△ Less
Submitted 5 June, 2018;
originally announced June 2018.
-
Training Generative Reversible Networks
Authors:
Robin Tibor Schirrmeister,
Patryk Chrabąszcz,
Frank Hutter,
Tonio Ball
Abstract:
Generative models with an encoding component such as autoencoders currently receive great interest. However, training of autoencoders is typically complicated by the need to train a separate encoder and decoder model that have to be enforced to be reciprocal to each other. To overcome this problem, by-design reversible neural networks (RevNets) had been previously used as generative models either…
▽ More
Generative models with an encoding component such as autoencoders currently receive great interest. However, training of autoencoders is typically complicated by the need to train a separate encoder and decoder model that have to be enforced to be reciprocal to each other. To overcome this problem, by-design reversible neural networks (RevNets) had been previously used as generative models either directly optimizing the likelihood of the data under the model or using an adversarial approach on the generated data. Here, we instead investigate their performance using an adversary on the latent space in the adversarial autoencoder framework. We investigate the generative performance of RevNets on the CelebA dataset, showing that generative RevNets can generate coherent faces with similar quality as Variational Autoencoders. This first attempt to use RevNets inside the adversarial autoencoder framework slightly underperformed relative to recent advanced generative models using an autoencoder component on CelebA, but this gap may diminish with further optimization of the training setup of generative RevNets. In addition to the experiments on CelebA, we show a proof-of-principle experiment on the MNIST dataset suggesting that adversary-free trained RevNets can discover meaningful latent dimensions without pre-specifying the number of dimensions of the latent sampling distribution. In summary, this study shows that RevNets can be employed in different generative training settings.
Source code for this study is at https://github.com/robintibor/generative-reversible
△ Less
Submitted 23 August, 2018; v1 submitted 5 June, 2018;
originally announced June 2018.
-
Intracranial Error Detection via Deep Learning
Authors:
Martin Völker,
Jiří Hammer,
Robin T. Schirrmeister,
Joos Behncke,
Lukas D. J. Fiederer,
Andreas Schulze-Bonhage,
Petr Marusič,
Wolfram Burgard,
Tonio Ball
Abstract:
Deep learning techniques have revolutionized the field of machine learning and were recently successfully applied to various classification problems in noninvasive electroencephalography (EEG). However, these methods were so far only rarely evaluated for use in intracranial EEG. We employed convolutional neural networks (CNNs) to classify and characterize the error-related brain response as measur…
▽ More
Deep learning techniques have revolutionized the field of machine learning and were recently successfully applied to various classification problems in noninvasive electroencephalography (EEG). However, these methods were so far only rarely evaluated for use in intracranial EEG. We employed convolutional neural networks (CNNs) to classify and characterize the error-related brain response as measured in 24 intracranial EEG recordings. Decoding accuracies of CNNs were significantly higher than those of a regularized linear discriminant analysis. Using time-resolved deep decoding, it was possible to classify errors in various regions in the human brain, and further to decode errors over 200 ms before the actual erroneous button press, e.g., in the precentral gyrus. Moreover, deeper networks performed better than shallower networks in distinguishing correct from error trials in all-channel decoding. In single recordings, up to 100 % decoding accuracy was achieved. Visualization of the networks' learned features indicated that multivariate decoding on an ensemble of channels yields related, albeit non-redundant information compared to single-channel decoding. In summary, here we show the usefulness of deep learning for both intracranial error decoding and mapping of the spatio-temporal structure of the human error processing network.
△ Less
Submitted 2 November, 2018; v1 submitted 4 May, 2018;
originally announced May 2018.
-
Hierarchical internal representation of spectral features in deep convolutional networks trained for EEG decoding
Authors:
Kay Gregor Hartmann,
Robin Tibor Schirrmeister,
Tonio Ball
Abstract:
Recently, there is increasing interest and research on the interpretability of machine learning models, for example how they transform and internally represent EEG signals in Brain-Computer Interface (BCI) applications. This can help to understand the limits of the model and how it may be improved, in addition to possibly provide insight about the data itself. Schirrmeister et al. (2017) have rece…
▽ More
Recently, there is increasing interest and research on the interpretability of machine learning models, for example how they transform and internally represent EEG signals in Brain-Computer Interface (BCI) applications. This can help to understand the limits of the model and how it may be improved, in addition to possibly provide insight about the data itself. Schirrmeister et al. (2017) have recently reported promising results for EEG decoding with deep convolutional neural networks (ConvNets) trained in an end-to-end manner and, with a causal visualization approach, showed that they learn to use spectral amplitude changes in the input. In this study, we investigate how ConvNets represent spectral features through the sequence of intermediate stages of the network. We show higher sensitivity to EEG phase features at earlier stages and higher sensitivity to EEG amplitude features at later stages. Intriguingly, we observed a specialization of individual stages of the network to the classical EEG frequency bands alpha, beta, and high gamma. Furthermore, we find first evidence that particularly in the last convolutional layer, the network learns to detect more complex oscillatory patterns beyond spectral phase and amplitude, reminiscent of the representation of complex visual features in later layers of ConvNets in computer vision tasks. Our findings thus provide insights into how ConvNets hierarchically represent spectral EEG features in their intermediate layers and suggest that ConvNets can exploit and might help to better understand the compositional structure of EEG time series.
△ Less
Submitted 15 December, 2017; v1 submitted 21 November, 2017;
originally announced November 2017.
-
The signature of robot action success in EEG signals of a human observer: Decoding and visualization using deep convolutional neural networks
Authors:
Joos Behncke,
Robin Tibor Schirrmeister,
Wolfram Burgard,
Tonio Ball
Abstract:
The importance of robotic assistive devices grows in our work and everyday life. Cooperative scenarios involving both robots and humans require safe human-robot interaction. One important aspect here is the management of robot errors, including fast and accurate online robot-error detection and correction. Analysis of brain signals from a human interacting with a robot may help identifying robot e…
▽ More
The importance of robotic assistive devices grows in our work and everyday life. Cooperative scenarios involving both robots and humans require safe human-robot interaction. One important aspect here is the management of robot errors, including fast and accurate online robot-error detection and correction. Analysis of brain signals from a human interacting with a robot may help identifying robot errors, but accuracies of such analyses have still substantial space for improvement. In this paper we evaluate whether a novel framework based on deep convolutional neural networks (deep ConvNets) could improve the accuracy of decoding robot errors from the EEG of a human observer, both during an object grasping and a pouring task. We show that deep ConvNets reached significantly higher accuracies than both regularized Linear Discriminant Analysis (rLDA) and filter bank common spatial patterns (FB-CSP) combined with rLDA, both widely used EEG classifiers. Deep ConvNets reached mean accuracies of 75% +/- 9 %, rLDA 65% +/- 10% and FB-CSP + rLDA 63% +/- 6% for decoding of erroneous vs. correct trials. Visualization of the time-domain EEG features learned by the ConvNets to decode errors revealed spatiotemporal patterns that reflected differences between the two experimental paradigms. Across subjects, ConvNet decoding accuracies were significantly correlated with those obtained with rLDA, but not CSP, indicating that in the present context ConvNets behaved more 'rLDA-like' (but consistently better), while in a previous decoding study with another task but the same ConvNet architecture, it was found to behave more 'CSP-like'. Our findings thus provide further support for the assumption that deep ConvNets are a versatile addition to the existing toolbox of EEG decoding techniques, and we discuss steps how ConvNet EEG decoding performance could be further optimized.
△ Less
Submitted 16 November, 2017;
originally announced November 2017.
-
Deep Transfer Learning for Error Decoding from Non-Invasive EEG
Authors:
Martin Völker,
Robin T. Schirrmeister,
Lukas D. J. Fiederer,
Wolfram Burgard,
Tonio Ball
Abstract:
We recorded high-density EEG in a flanker task experiment (31 subjects) and an online BCI control paradigm (4 subjects). On these datasets, we evaluated the use of transfer learning for error decoding with deep convolutional neural networks (deep ConvNets). In comparison with a regularized linear discriminant analysis (rLDA) classifier, ConvNets were significantly better in both intra- and inter-s…
▽ More
We recorded high-density EEG in a flanker task experiment (31 subjects) and an online BCI control paradigm (4 subjects). On these datasets, we evaluated the use of transfer learning for error decoding with deep convolutional neural networks (deep ConvNets). In comparison with a regularized linear discriminant analysis (rLDA) classifier, ConvNets were significantly better in both intra- and inter-subject decoding, achieving an average accuracy of 84.1 % within subject and 81.7 % on unknown subjects (flanker task). Neither method was, however, able to generalize reliably between paradigms. Visualization of features the ConvNets learned from the data showed plausible patterns of brain activity, revealing both similarities and differences between the different kinds of errors. Our findings indicate that deep learning techniques are useful to infer information about the correctness of action in BCI applications, particularly for the transfer of pre-trained classifiers to new recording sessions or subjects.
△ Less
Submitted 10 January, 2018; v1 submitted 25 October, 2017;
originally announced October 2017.
-
Monotonically controlled integrals
Authors:
Thomas Ball,
David Preiss
Abstract:
The monotonically controlled integral defined by Bendová and Malý, which is equivalent to the Denjoy-Perron integral, admits a natural parameter $α>0$ thereby leading to the whole scale of integrals called $α$-monotonically controlled integrals. While the power of these integrals is easily seen to increase with increasing $α$, our main results show that their exact dependence on $α$ is rather curi…
▽ More
The monotonically controlled integral defined by Bendová and Malý, which is equivalent to the Denjoy-Perron integral, admits a natural parameter $α>0$ thereby leading to the whole scale of integrals called $α$-monotonically controlled integrals. While the power of these integrals is easily seen to increase with increasing $α$, our main results show that their exact dependence on $α$ is rather curious. For $α<1$ they do not even contain the Lebesgue integral, for $1\leα\le 2$ they coincide with the Denjoy-Perron integral, and for $α>2$ they are mutually different and not even contained in the Denjoy-Khintchine integral.
△ Less
Submitted 13 September, 2017;
originally announced September 2017.
-
Deep learning with convolutional neural networks for decoding and visualization of EEG pathology
Authors:
Robin Tibor Schirrmeister,
Lukas Gemein,
Katharina Eggensperger,
Frank Hutter,
Tonio Ball
Abstract:
We apply convolutional neural networks (ConvNets) to the task of distinguishing pathological from normal EEG recordings in the Temple University Hospital EEG Abnormal Corpus. We use two basic, shallow and deep ConvNet architectures recently shown to decode task-related information from EEG at least as well as established algorithms designed for this purpose. In decoding EEG pathology, both ConvNet…
▽ More
We apply convolutional neural networks (ConvNets) to the task of distinguishing pathological from normal EEG recordings in the Temple University Hospital EEG Abnormal Corpus. We use two basic, shallow and deep ConvNet architectures recently shown to decode task-related information from EEG at least as well as established algorithms designed for this purpose. In decoding EEG pathology, both ConvNets reached substantially better accuracies (about 6% better, ~85% vs. ~79%) than the only published result for this dataset, and were still better when using only 1 minute of each recording for training and only six seconds of each recording for testing. We used automated methods to optimize architectural hyperparameters and found intriguingly different ConvNet architectures, e.g., with max pooling as the only nonlinearity. Visualizations of the ConvNet decoding behavior showed that they used spectral power changes in the delta (0-4 Hz) and theta (4-8 Hz) frequency range, possibly alongside other features, consistent with expectations derived from spectral analysis of the EEG data and from the textual medical reports. Analysis of the textual medical reports also highlighted the potential for accuracy increases by integrating contextual information, such as the age of subjects. In summary, the ConvNets and visualization techniques used in this study constitute a next step towards clinically useful automated EEG diagnosis and establish a new baseline for future work on this topic.
△ Less
Submitted 11 January, 2018; v1 submitted 26 August, 2017;
originally announced August 2017.
-
Brain Responses During Robot-Error Observation
Authors:
Dominik Welke,
Joos Behncke,
Marina Hader,
Robin Tibor Schirrmeister,
Andreas Schönau,
Boris Eßmann,
Oliver Müller,
Wolfram Burgard,
Tonio Ball
Abstract:
Brain-controlled robots are a promising new type of assistive device for severely impaired persons. Little is however known about how to optimize the interaction of humans and brain-controlled robots. Information about the human's perceived correctness of robot performance might provide a useful teaching signal for adaptive control algorithms and thus help enhancing robot control. Here, we studied…
▽ More
Brain-controlled robots are a promising new type of assistive device for severely impaired persons. Little is however known about how to optimize the interaction of humans and brain-controlled robots. Information about the human's perceived correctness of robot performance might provide a useful teaching signal for adaptive control algorithms and thus help enhancing robot control. Here, we studied whether watching robots perform erroneous vs. correct action elicits differential brain responses that can be decoded from single trials of electroencephalographic (EEG) recordings, and whether brain activity during human-robot interaction is modulated by the robot's visual similarity to a human. To address these topics, we designed two experiments. In experiment I, participants watched a robot arm pour liquid into a cup. The robot performed the action either erroneously or correctly, i.e. it either spilled some liquid or not. In experiment II, participants observed two different types of robots, humanoid and non-humanoid, grabbing a ball. The robots either managed to grab the ball or not. We recorded high-resolution EEG during the observation tasks in both experiments to train a Filter Bank Common Spatial Pattern (FBCSP) pipeline on the multivariate EEG signal and decode for the correctness of the observed action, and for the type of the observed robot. Our findings show that it was possible to decode both correctness and robot type for the majority of participants significantly, although often just slightly, above chance level. Our findings suggest that non-invasive recordings of brain responses elicited when observing robots indeed contain decodable information about the correctness of the robot's action and the type of observed robot.
△ Less
Submitted 16 August, 2017; v1 submitted 4 August, 2017;
originally announced August 2017.
-
Acting Thoughts: Towards a Mobile Robotic Service Assistant for Users with Limited Communication Skills
Authors:
Felix Burget,
Lukas Dominique Josef Fiederer,
Daniel Kuhner,
Martin Völker,
Johannes Aldinger,
Robin Tibor Schirrmeister,
Chau Do,
Joschka Boedecker,
Bernhard Nebel,
Tonio Ball,
Wolfram Burgard
Abstract:
As autonomous service robots become more affordable and thus available also for the general public, there is a growing need for user friendly interfaces to control the robotic system. Currently available control modalities typically expect users to be able to express their desire through either touch, speech or gesture commands. While this requirement is fulfilled for the majority of users, paraly…
▽ More
As autonomous service robots become more affordable and thus available also for the general public, there is a growing need for user friendly interfaces to control the robotic system. Currently available control modalities typically expect users to be able to express their desire through either touch, speech or gesture commands. While this requirement is fulfilled for the majority of users, paralyzed users may not be able to use such systems. In this paper, we present a novel framework, that allows these users to interact with a robotic service assistant in a closed-loop fashion, using only thoughts. The brain-computer interface (BCI) system is composed of several interacting components, i.e., non-invasive neuronal signal recording and decoding, high-level task planning, motion and manipulation planning as well as environment perception. In various experiments, we demonstrate its applicability and robustness in real world scenarios, considering fetch-and-carry tasks and tasks involving human-robot interaction. As our results demonstrate, our system is capable of adapting to frequent changes in the environment and reliably completing given tasks within a reasonable amount of time. Combined with high-level planning and autonomous robotic systems, interesting new perspectives open up for non-invasive BCI-based human-robot interactions.
△ Less
Submitted 12 June, 2018; v1 submitted 20 July, 2017;
originally announced July 2017.
-
Deep learning with convolutional neural networks for EEG decoding and visualization
Authors:
Robin Tibor Schirrmeister,
Jost Tobias Springenberg,
Lukas Dominique Josef Fiederer,
Martin Glasstetter,
Katharina Eggensperger,
Michael Tangermann,
Frank Hutter,
Wolfram Burgard,
Tonio Ball
Abstract:
PLEASE READ AND CITE THE REVISED VERSION at Human Brain Mapping: http://onlinelibrary.wiley.com/doi/10.1002/hbm.23730/full
Code available here: https://github.com/robintibor/braindecode
PLEASE READ AND CITE THE REVISED VERSION at Human Brain Mapping: http://onlinelibrary.wiley.com/doi/10.1002/hbm.23730/full
Code available here: https://github.com/robintibor/braindecode
△ Less
Submitted 8 June, 2018; v1 submitted 15 March, 2017;
originally announced March 2017.
-
High solar cycle spectral variations inconsistent with stratospheric ozone observations
Authors:
W. T. Ball,
J. D. Haigh,
E. V. Rozanov,
A. Kuchar,
T. Sukhodolov,
F. Tummon,
A. V. Shapiro,
W. Schmutz
Abstract:
Some of the natural variability in climate is understood to come from changes in the Sun. A key route whereby the Sun may influence surface climate is initiated in the tropical stratosphere by the absorption of solar ultraviolet (UV) radiation by ozone, leading to a modification of the temperature and wind structures and consequently to the surface through changes in wave propagation and circulati…
▽ More
Some of the natural variability in climate is understood to come from changes in the Sun. A key route whereby the Sun may influence surface climate is initiated in the tropical stratosphere by the absorption of solar ultraviolet (UV) radiation by ozone, leading to a modification of the temperature and wind structures and consequently to the surface through changes in wave propagation and circulation. While changes in total, spectrally-integrated, solar irradiance lead to small variations in global mean surface temperature, the `top-down' UV effect preferentially influences on regional scales at mid-to-high latitudes with, in particular, a solar signal noted in the North Atlantic Oscillation (NAO). The amplitude of the UV variability is fundamental in determining the magnitude of the climate response but understanding of the UV variations has been challenged recently by measurements from the SOlar Radiation and Climate Experiment (SORCE) satellite, which show UV solar cycle changes up to 10 times larger than previously thought. Indeed, climate models using these larger UV variations show a much greater response, similar to NAO observations. Here we present estimates of the ozone solar cycle response using a chemistry-climate model (CCM) in which the effects of transport are constrained by observations. Thus the photolytic response to different spectral solar irradiance (SSI) datasets can be isolated. Comparison of the results with the solar signal in ozone extracted from observational datasets yields significantly discriminable responses. According to our evaluation the SORCE UV dataset is not consistent with the observed ozone response whereas the smaller variations suggested by earlier satellite datasets, and by UV data from empirical solar models, are in closer agreement with the measured stratospheric variations. Determining the most appropriate SSI variability to apply in models...
△ Less
Submitted 20 February, 2016;
originally announced February 2016.
-
Causal and anti-causal learning in pattern recognition for neuroimaging
Authors:
Sebastian Weichwald,
Bernhard Schölkopf,
Tonio Ball,
Moritz Grosse-Wentrup
Abstract:
Pattern recognition in neuroimaging distinguishes between two types of models: encoding- and decoding models. This distinction is based on the insight that brain state features, that are found to be relevant in an experimental paradigm, carry a different meaning in encoding- than in decoding models. In this paper, we argue that this distinction is not sufficient: Relevant features in encoding- and…
▽ More
Pattern recognition in neuroimaging distinguishes between two types of models: encoding- and decoding models. This distinction is based on the insight that brain state features, that are found to be relevant in an experimental paradigm, carry a different meaning in encoding- than in decoding models. In this paper, we argue that this distinction is not sufficient: Relevant features in encoding- and decoding models carry a different meaning depending on whether they represent causal- or anti-causal relations. We provide a theoretical justification for this argument and conclude that causal inference is essential for interpretation in neuroimaging.
△ Less
Submitted 15 December, 2015;
originally announced December 2015.
-
Decoding index finger position from EEG using random forests
Authors:
Sebastian Weichwald,
Timm Meyer,
Bernhard Schölkopf,
Tonio Ball,
Moritz Grosse-Wentrup
Abstract:
While invasively recorded brain activity is known to provide detailed information on motor commands, it is an open question at what level of detail information about positions of body parts can be decoded from non-invasively acquired signals. In this work it is shown that index finger positions can be differentiated from non-invasive electroencephalographic (EEG) recordings in healthy human subjec…
▽ More
While invasively recorded brain activity is known to provide detailed information on motor commands, it is an open question at what level of detail information about positions of body parts can be decoded from non-invasively acquired signals. In this work it is shown that index finger positions can be differentiated from non-invasive electroencephalographic (EEG) recordings in healthy human subjects. Using a leave-one-subject-out cross-validation procedure, a random forest distinguished different index finger positions on a numerical keyboard above chance-level accuracy. Among the different spectral features investigated, high $β$-power (20-30 Hz) over contralateral sensorimotor cortex carried most information about finger position. Thus, these findings indicate that finger position is in principle decodable from non-invasive features of brain activity that generalize across individuals.
△ Less
Submitted 14 December, 2015;
originally announced December 2015.
-
Causal interpretation rules for encoding and decoding models in neuroimaging
Authors:
Sebastian Weichwald,
Timm Meyer,
Ozan Özdenizci,
Bernhard Schölkopf,
Tonio Ball,
Moritz Grosse-Wentrup
Abstract:
Causal terminology is often introduced in the interpretation of encoding and decoding models trained on neuroimaging data. In this article, we investigate which causal statements are warranted and which ones are not supported by empirical evidence. We argue that the distinction between encoding and decoding models is not sufficient for this purpose: relevant features in encoding and decoding model…
▽ More
Causal terminology is often introduced in the interpretation of encoding and decoding models trained on neuroimaging data. In this article, we investigate which causal statements are warranted and which ones are not supported by empirical evidence. We argue that the distinction between encoding and decoding models is not sufficient for this purpose: relevant features in encoding and decoding models carry a different meaning in stimulus- and in response-based experimental paradigms. We show that only encoding models in the stimulus-based setting support unambiguous causal interpretations. By combining encoding and decoding models trained on the same data, however, we obtain insights into causal relations beyond those that are implied by each individual model type. We illustrate the empirical relevance of our theoretical findings on EEG data recorded during a visuo-motor learning task.
△ Less
Submitted 15 November, 2015;
originally announced November 2015.
-
On the cop number of generalized Petersen graphs
Authors:
Taylor Ball,
Robert W. Bell,
Jonathan Guzman,
Madeleine Hanson-Colvin,
Nikolas Schonscheck
Abstract:
We show that the cop number of every generalized Petersen graph is at most 4. The strategy is to play a modified game of cops and robbers on an infinite cyclic covering space where the objective is to capture the robber or force the robber towards an end of the infinite graph. We prove that finite isometric subtrees are 1-guardable and apply this to determine the exact cop number of some families…
▽ More
We show that the cop number of every generalized Petersen graph is at most 4. The strategy is to play a modified game of cops and robbers on an infinite cyclic covering space where the objective is to capture the robber or force the robber towards an end of the infinite graph. We prove that finite isometric subtrees are 1-guardable and apply this to determine the exact cop number of some families of generalized Petersen graphs. We also extend these ideas to prove that the cop number of any connected I-graph is at most 5.
△ Less
Submitted 15 September, 2015;
originally announced September 2015.
-
UV solar irradiance in observations and the NRLSSI and SATIRE-S models
Authors:
K. L. Yeo,
W. T. Ball,
N. A. Krivova,
S. K. Solanki,
Y. C. Unruh,
J. Morrill
Abstract:
Total solar irradiance and UV spectral solar irradiance have been monitored since 1978 through a succession of space missions. This is accompanied by the development of models aimed at replicating solar irradiance by relating the variability to solar magnetic activity. The NRLSSI and SATIRE-S models provide the most comprehensive reconstructions of total and spectral solar irradiance over the peri…
▽ More
Total solar irradiance and UV spectral solar irradiance have been monitored since 1978 through a succession of space missions. This is accompanied by the development of models aimed at replicating solar irradiance by relating the variability to solar magnetic activity. The NRLSSI and SATIRE-S models provide the most comprehensive reconstructions of total and spectral solar irradiance over the period of satellite observation currently available. There is persistent controversy between the various measurements and models in terms of the wavelength dependence of the variation over the solar cycle, with repercussions on our understanding of the influence of UV solar irradiance variability on the stratosphere. We review the measurement and modelling of UV solar irradiance variability over the period of satellite observation. The SATIRE-S reconstruction is consistent with spectral solar irradiance observations where they are reliable. It is also supported by an independent, empirical reconstruction of UV spectral solar irradiance based on UARS/SUSIM measurements from an earlier study. The weaker solar cycle variability produced by NRLSSI between 300 and 400 nm is not evident in any available record. We show that although the method employed to construct NRLSSI is principally sound, reconstructed solar cycle variability is detrimentally affected by the uncertainty in the SSI observations it draws upon in the derivation. Based on our findings, we recommend, when choosing between the two models, the use of SATIRE-S for climate studies.
△ Less
Submitted 5 July, 2015;
originally announced July 2015.
-
Assessing the relationship between spectral solar irradiance and stratospheric ozone using Bayesian inference
Authors:
William T. Ball,
Daniel J. Mortlock,
Jack S. Egerton,
Joanna D. Haigh
Abstract:
We investigate the relationship between spectral solar irradiance (SSI) and ozone in the tropical upper stratosphere. We find that solar cycle (SC) changes in ozone can be well approximated by considering the ozone response to SSI changes in a small number individual wavelength bands between 176 and 310 nm, operating independently of each other. Additionally, we find that the ozone varies approxim…
▽ More
We investigate the relationship between spectral solar irradiance (SSI) and ozone in the tropical upper stratosphere. We find that solar cycle (SC) changes in ozone can be well approximated by considering the ozone response to SSI changes in a small number individual wavelength bands between 176 and 310 nm, operating independently of each other. Additionally, we find that the ozone varies approximately linearly with changes in the SSI. Using these facts, we present a Bayesian formalism for inferring SC SSI changes and uncertainties from measured SC ozone profiles. Bayesian inference is a powerful, mathematically self-consistent method of considering both the uncertainties of the data and additional external information to provide the best estimate of parameters being estimated. Using this method, we show that, given measurement uncertainties in both ozone and SSI datasets, it is not currently possible to distinguish between observed or modelled SSI datasets using available estimates of ozone change profiles, although this might be possible by the inclusion of other external constraints. Our methodology has the potential, using wider datasets, to provide better understanding of both variations in SSI and the atmospheric response.
△ Less
Submitted 14 August, 2014;
originally announced August 2014.
-
A new SATIRE-S spectral solar irradiance reconstruction for solar cycles 21--23 and its implications for stratospheric ozone
Authors:
William T. Ball,
Natalie A. Krivova,
Yvonne C. Unruh,
Joanna D. Haigh,
Sami K. Solanki
Abstract:
We present a revised and extended total and spectral solar irradiance (SSI) reconstruction, which includes a wavelength-dependent uncertainty estimate, spanning the last three solar cycles using the SATIRE-S model. The SSI reconstruction covers wavelengths between 115 and 160,000 nm and all dates between August 1974 and October 2009. This represents the first full-wavelength SATIRE-S reconstructio…
▽ More
We present a revised and extended total and spectral solar irradiance (SSI) reconstruction, which includes a wavelength-dependent uncertainty estimate, spanning the last three solar cycles using the SATIRE-S model. The SSI reconstruction covers wavelengths between 115 and 160,000 nm and all dates between August 1974 and October 2009. This represents the first full-wavelength SATIRE-S reconstruction to cover the last three solar cycles without data gaps and with an uncertainty estimate. SATIRE-S is compared with the NRLSSI model and SORCE/SOLSTICE ultraviolet (UV) observations. SATIRE-S displays similar cycle behaviour to NRLSSI for wavelengths below 242 nm and almost twice the variability between 242 and 310 nm. During the decline of last solar cycle, between 2003 and 2008, SSI from SORCE/SOLSTICE version 12 and 10 typically displays more than three times the variability of SATIRE-S between 200 and 300 nm. All three datasets are used to model changes in stratospheric ozone within a 2D atmospheric model for a decline from high solar activity to solar minimum. The different flux changes result in different modelled ozone trends. Using NRLSSI leads to a decline in mesospheric ozone, while SATIRE-S and SORCE/SOLSTICE result in an increase. Recent publications have highlighted increases in mesospheric ozone when considering version 10 SORCE/SOLSTICE irradiances. The recalibrated SORCE/SOLSTICE version 12 irradiances result in a much smaller mesospheric ozone response than when using version 10 and now similar in magnitude to SATIRE-S. This shows that current knowledge of variations in spectral irradiance is not sufficient to warrant robust conclusions concerning the impact of solar variability on the atmosphere and climate.
△ Less
Submitted 2 August, 2014;
originally announced August 2014.
-
The variability of Sun-like stars: reproducing observed photometric trends
Authors:
A. I. Shapiro,
S. K. Solanki,
N. A. Krivova,
W. K. Schmutz,
W. T. Ball,
R. Knaack,
E. V. Rozanov,
Y. C. Unruh
Abstract:
The Sun and stars with low magnetic activity levels, become photometrically brighter when their activity increases. Magnetically more active stars display the opposite behaviour and get fainter when their activity increases.
We reproduce the observed photometric trends in stellar variations with a model that treats stars as hypothetical Suns with coverage by magnetic features different from that…
▽ More
The Sun and stars with low magnetic activity levels, become photometrically brighter when their activity increases. Magnetically more active stars display the opposite behaviour and get fainter when their activity increases.
We reproduce the observed photometric trends in stellar variations with a model that treats stars as hypothetical Suns with coverage by magnetic features different from that of the Sun.
The presented model attributes the variability of stellar spectra to the imbalance between the contributions from different components of the solar atmosphere, such as dark starspots and bright faculae. A stellar spectrum is calculated from spectra of the individual components, by weighting them with corresponding disc area coverages. The latter are obtained by extrapolating the solar dependences of spot and facular disc area coverages on chromospheric activity to stars with different levels of mean chromospheric activity.
We have found that the contribution by starspots to the variability increases faster with chromospheric activity than the facular contribution. This causes the transition from faculae-dominated variability and direct activity--brightness correlation to spot-dominated variability and inverse activity--brightness correlation with increasing chromospheric activity level. We have shown that the regime of the variability also depends on the angle between the stellar rotation axis and the line-of-sight and on the latitudinal distribution of active regions on the stellar surface. Our model can be used as a tool to extrapolate the observed photometric variability of the Sun to Sun-like stars at different activity levels, which makes possible the direct comparison between solar and stellar irradiance data.
△ Less
Submitted 9 June, 2014;
originally announced June 2014.
-
Reconstruction of total solar irradiance 1974-2009
Authors:
W. T. Ball,
Y. C. Unruh,
N. A. Krivova,
S. Solanki,
T. Wenzler,
D. J. Mortlock,
A. H. Jaffe
Abstract:
Context: The study of variations in total solar irradiance (TSI) is important for understanding how the Sun affects the Earth's climate.
Aims: Full-disk continuum images and magnetograms are now available for three full solar cycles. We investigate how modelled TSI compares with direct observations by building a consistent modelled TSI dataset. The model, based only on changes in the photospheri…
▽ More
Context: The study of variations in total solar irradiance (TSI) is important for understanding how the Sun affects the Earth's climate.
Aims: Full-disk continuum images and magnetograms are now available for three full solar cycles. We investigate how modelled TSI compares with direct observations by building a consistent modelled TSI dataset. The model, based only on changes in the photospheric magnetic flux can then be tested on rotational, cyclical and secular timescales.
Methods: We use Kitt Peak and SoHO/MDI continuum images and magnetograms in the SATIRE-S model to reconstruct TSI over cycles 21-23. To maximise independence from TSI composites, SORCE/TIM TSI data are used to fix the one free parameter of the model. We compare and combine the separate data sources for the model to estimate an uncertainty on the reconstruction and prevent any additional free parameters entering the model.
Results: The reconstruction supports the PMOD composite as being the best historical record of TSI observations, although on timescales of the solar rotation the IRMB composite provides somewhat better agreement. Further to this, the model is able to account for 92% of TSI variations from 1978 to 2009 in the PMOD composite and over 96% during cycle 23. The reconstruction also displays an inter-cycle, secular decline of 0.20 (+0.12 / -0.09) Wm-2 between cycle 23 minima, in agreement with the PMOD composite.
Conclusions: SATIRE-S is able to recreate TSI observations on all timescales of a day and longer over 31 years from 1978. This is strong evidence that changes in photospheric magnetic flux alone are responsible for almost all solar irradiance variations over the last three solar cycles.
△ Less
Submitted 16 February, 2012;
originally announced February 2012.
-
Solar irradiance models and measurements: a comparison in the 220 nm to 240 nm wavelength band
Authors:
Yvonne C. Unruh,
Will T. Ball,
Natalie A. Krivova
Abstract:
Solar irradiance models that assume solar irradiance variations to be due to changes in the solar surface magnetic flux have been successfully used to reconstruct total solar irradiance on rotational as well as cyclical and secular time scales. Modelling spectral solar irradiance is not yet as advanced, and also suffers from a lack of comparison data, in particular on solar-cycle time scales. Here…
▽ More
Solar irradiance models that assume solar irradiance variations to be due to changes in the solar surface magnetic flux have been successfully used to reconstruct total solar irradiance on rotational as well as cyclical and secular time scales. Modelling spectral solar irradiance is not yet as advanced, and also suffers from a lack of comparison data, in particular on solar-cycle time scales. Here we compare solar irradiance in the 220 nm to 240 nm band as modelled with SATIRE-S and measured by different instruments on the UARS and SORCE satellites.
We find good agreement between the model and measurements on rotational time scales. The long-term trends, however, show significant differences. Both SORCE instruments, in particular, show a much steeper gradient over the decaying part of cycle 23 than the modelled irradiance or that measured by UARS/SUSIM.
△ Less
Submitted 8 November, 2011;
originally announced November 2011.
-
Solar irradiance variability: A six-year comparison between SORCE observations and the SATIRE model
Authors:
Will T. Ball,
Yvonne C. Unruh,
Natalie A. Krivova,
Sami Solanki,
Jerald W. Harder
Abstract:
Aims: We investigate how well modeled solar irradiances agree with measurements from the SORCE satellite, both for total solar irradiance and broken down into spectral regions on timescales of several years. Methods: We use the SATIRE model and compare modeled total solar irradiance (TSI) with TSI measurements between 2003 and 2009. Spectral solar irradiance over 200-1630nm is compared with the SI…
▽ More
Aims: We investigate how well modeled solar irradiances agree with measurements from the SORCE satellite, both for total solar irradiance and broken down into spectral regions on timescales of several years. Methods: We use the SATIRE model and compare modeled total solar irradiance (TSI) with TSI measurements between 2003 and 2009. Spectral solar irradiance over 200-1630nm is compared with the SIM instrument on SORCE between 2004 and 2009 during a period of decline from moderate activity to the recent solar minimum in 10 nm bands and for three spectral regions of significant interest: the UV integrated over 200-300nm, the visible over 400-691nm and the IR between 972-1630 nm. Results: The model captures 97% of observed TSI variation. In the spectral comparison, rotational variability is well reproduced, especially between 400 and 1200 nm. The magnitude of change in the long-term trends is many times larger in SIM at almost all wavelengths while trends in SIM oppose SATIRE in the visible between 500 and 700nm and between 1000 and 1200nm. We discuss the remaining issues with both SIM data and the identified limits of the model, particularly with the way facular contributions are dealt with, the limit of flux identification in MDI magnetograms during solar minimum and the model atmospheres in the IR employed by SATIRE. It is unlikely that improvements in these areas will significantly enhance the agreement in the long-term trends. This disagreement implies that some mechanism other than surface magnetism is causing SSI variations, in particular between 2004 and 2006, if the SIM data are correct. Since SATIRE was able to reproduce UV irradiance between 1991 and 2002 from UARS, either the solar mechanism for SSI variation fundamentally changed around the peak of cycle 23, or there is an inconsistency between UARS and SORCE UV measurements. We favour the second explanation.
△ Less
Submitted 5 April, 2011;
originally announced April 2011.
-
Predicate Abstraction via Symbolic Decision Procedures
Authors:
Shuvendu K. Lahiri,
Thomas Ball,
Byron Cook
Abstract:
We present a new approach for performing predicate abstraction based on symbolic decision procedures. Intuitively, a symbolic decision procedure for a theory takes a set of predicates in the theory and symbolically executes a decision procedure on all the subsets over the set of predicates. The result of the symbolic decision procedure is a shared expression (represented by a directed acyclic gr…
▽ More
We present a new approach for performing predicate abstraction based on symbolic decision procedures. Intuitively, a symbolic decision procedure for a theory takes a set of predicates in the theory and symbolically executes a decision procedure on all the subsets over the set of predicates. The result of the symbolic decision procedure is a shared expression (represented by a directed acyclic graph) that implicitly represents the answer to a predicate abstraction query.
We present symbolic decision procedures for the logic of Equality and Uninterpreted Functions (EUF) and Difference logic (DIFF) and show that these procedures run in pseudo-polynomial (rather than exponential) time. We then provide a method to construct symbolic decision procedures for simple mixed theories (including the two theories mentioned above) using an extension of the Nelson-Oppen combination method. We present preliminary evaluation of our Procedure on predicate abstraction benchmarks from device driver verification in SLAM.
△ Less
Submitted 24 April, 2007; v1 submitted 1 December, 2006;
originally announced December 2006.
-
Simulated Radio Images and Light Curves of SN 1993J
Authors:
Vikram V. Dwarkadas,
Amy J. Mioduszewski,
Lewis T. Ball
Abstract:
We present calculations of the radio images and light curves from supernovae, based on high-resolution numerical simulations of the hydrodynamics and radiation transfer in a spherically symmetric medium. As a specific example we model the emission from SN1993J. This supernova does not appear to be expanding in a self-similar fashion, and cannot be adequately fitted with the often-used analytic m…
▽ More
We present calculations of the radio images and light curves from supernovae, based on high-resolution numerical simulations of the hydrodynamics and radiation transfer in a spherically symmetric medium. As a specific example we model the emission from SN1993J. This supernova does not appear to be expanding in a self-similar fashion, and cannot be adequately fitted with the often-used analytic mini-shell model. We present a good fit to the radio evolution at a single frequency. Both free-free absorption and synchrotron self-absorption are needed to fit the light curve at early times, and a circumstellar density profile of $ρ\sim r ^{-1.7}$ provides the best fit to the later data. Comparisons of VLBI images of SN1993J with synthetic model images suggest that internal free-free absorption completely obscures emission at 8.4 GHz passing through the center of the supernova for the first few tens of years after explosion.
△ Less
Submitted 6 January, 2004;
originally announced January 2004.