-
Heliophysics Discovery Tools for the 21st Century: Data Science and Machine Learning Structures and Recommendations for 2020-2050
Authors:
R. M. McGranaghan,
B. Thompson,
E. Camporeale,
J. Bortnik,
M. Bobra,
G. Lapenta,
S. Wing,
B. Poduval,
S. Lotz,
S. Murray,
M. Kirk,
T. Y. Chen,
H. M. Bain,
P. Riley,
B. Tremblay,
M. Cheung,
V. Delouille
Abstract:
Three main points: 1. Data Science (DS) will be increasingly important to heliophysics; 2. Methods of heliophysics science discovery will continually evolve, requiring the use of learning technologies [e.g., machine learning (ML)] that are applied rigorously and that are capable of supporting discovery; and 3. To grow with the pace of data, technology, and workforce changes, heliophysics requires…
▽ More
Three main points: 1. Data Science (DS) will be increasingly important to heliophysics; 2. Methods of heliophysics science discovery will continually evolve, requiring the use of learning technologies [e.g., machine learning (ML)] that are applied rigorously and that are capable of supporting discovery; and 3. To grow with the pace of data, technology, and workforce changes, heliophysics requires a new approach to the representation of knowledge.
△ Less
Submitted 26 December, 2022;
originally announced December 2022.
-
Nonparametric monitoring of sunspot number observations: a case study
Authors:
Sophie Mathieu,
Laure Lefèvre,
Rainer von Sachs,
Véronique Delouille,
Christian Ritter,
Frédéric Clette
Abstract:
Solar activity is an important driver of long-term climate trends and must be accounted for in climate models. Unfortunately, direct measurements of this quantity over long periods do not exist. The only observation related to solar activity whose records reach back to the seventeenth century are sunspots. Surprisingly, determining the number of sunspots consistently over time has remained until t…
▽ More
Solar activity is an important driver of long-term climate trends and must be accounted for in climate models. Unfortunately, direct measurements of this quantity over long periods do not exist. The only observation related to solar activity whose records reach back to the seventeenth century are sunspots. Surprisingly, determining the number of sunspots consistently over time has remained until today a challenging statistical problem. It arises from the need of consolidating data from multiple observing stations around the world in a context of low signal-to-noise ratios, non-stationarity, missing data, non-standard distributions and many kinds of errors. The data from some stations experience therefore severe and various deviations over time. In this paper, we propose the first systematic and thorough statistical approach for monitoring these complex and important series. It consists of three steps essential for successful treatment of the data: smoothing on multiple timescales, monitoring using block bootstrap calibrated CUSUM charts and classifying of out-of-control situations by support vector techniques. This approach allows us to detect a wide range of anomalies (such as sudden jumps or more progressive drifts), unseen in previous analyses. It helps us to identify the causes of major deviations, which are often observer or equipment related. Their detection and identification will contribute to improve future observations. Their elimination or correction in past data will lead to a more precise reconstruction of the world reference index for solar activity: the International Sunspot Number.
△ Less
Submitted 25 June, 2021;
originally announced June 2021.
-
The Observational Uncertainty of Coronal Hole Boundaries in Automated Detection Schemes
Authors:
Martin A. Reiss,
Karin Muglach,
Christian Möstl,
Charles N. Arge,
Rachel Bailey,
Veronique Delouille,
Tadhg M. Garton,
Amr Hamada,
Stefan Hofmeister,
Egor Illarionov,
Robert Jarolim,
Michael S. F. Kirk,
Alexander Kosovichev,
Larisza Krista,
Sangwoo Lee,
Chris Lowder,
Peter J. MacNeice,
Astrid Veronig,
ISWAT Coronal Hole Boundary Working Team
Abstract:
Coronal holes are the observational manifestation of the solar magnetic field open to the heliosphere and are of pivotal importance for our understanding of the origin and acceleration of the solar wind. Observations from space missions such as the Solar Dynamics Observatory now allow us to study coronal holes in unprecedented detail. Instrumental effects and other factors, however, pose a challen…
▽ More
Coronal holes are the observational manifestation of the solar magnetic field open to the heliosphere and are of pivotal importance for our understanding of the origin and acceleration of the solar wind. Observations from space missions such as the Solar Dynamics Observatory now allow us to study coronal holes in unprecedented detail. Instrumental effects and other factors, however, pose a challenge to automatically detect coronal holes in solar imagery. The science community addresses these challenges with different detection schemes. Until now, little attention has been paid to assessing the disagreement between these schemes. In this COSPAR ISWAT initiative, we present a comparison of nine automated detection schemes widely-applied in solar and space science. We study, specifically, a prevailing coronal hole observed by the Atmospheric Imaging Assembly instrument on 2018 May 30. Our results indicate that the choice of detection scheme has a significant effect on the location of the coronal hole boundary. Physical properties in coronal holes such as the area, mean intensity, and mean magnetic field strength vary by a factor of up to 4.5 between the maximum and minimum values. We conclude that our findings are relevant for coronal hole research from the past decade, and are therefore of interest to the solar and space research community.
△ Less
Submitted 26 March, 2021;
originally announced March 2021.
-
Coronal Hole Detection and Open Magnetic Flux
Authors:
J. A. Linker,
S. G. Heinemann,
M. Temmer,
M. J. Owens,
R. M. Caplan,
C. N. Arge,
E. Asvestari,
V. Delouille,
C. Downs,
S. J. Hofmeister,
I. C. Jebaraj,
M. Madjarska,
R. Pinto,
J. Pomoell,
E. Samara,
C. Scolini,
B. Vrsnak
Abstract:
Many scientists use coronal hole (CH) detections to infer open magnetic flux. Detection techniques differ in the areas that they assign as open, and may obtain different values for the open magnetic flux. We characterize the uncertainties of these methods, by applying six different detection methods to deduce the area and open flux of a near-disk center CH observed on 9/19/2010, and applying a sin…
▽ More
Many scientists use coronal hole (CH) detections to infer open magnetic flux. Detection techniques differ in the areas that they assign as open, and may obtain different values for the open magnetic flux. We characterize the uncertainties of these methods, by applying six different detection methods to deduce the area and open flux of a near-disk center CH observed on 9/19/2010, and applying a single method to five different EUV filtergrams for this CH. Open flux was calculated using five different magnetic maps. The standard deviation (interpreted as the uncertainty) in the open flux estimate for this CH was about 26%. However, including the variability of different magnetic data sources, this uncertainty almost doubles to 45%. We use two of the methods to characterize the area and open flux for all CHs in this time period. We find that the open flux is greatly underestimated compared to values inferred from in-situ measurements (by 2.2-4 times). We also test our detection techniques on simulated emission images from a thermodynamic MHD model of the solar corona. We find that the methods overestimate the area and open flux in the simulated CH, but the average error in the flux is only about 7%. The full-Sun detections on the simulated corona underestimate the model open flux, but by factors well below what is needed to account for the missing flux in the observations. Under-detection of open flux in coronal holes likely contributes to the recognized deficit in solar open flux, but is unlikely to resolve it.
△ Less
Submitted 9 March, 2021;
originally announced March 2021.
-
Uncertainty quantification in sunspot counts
Authors:
Sophie Mathieu,
Véronique Delouille,
Laure Lefèvre,
Christian Ritter,
Rainer von Sachs
Abstract:
Observing and counting sunspots constitutes one of the longest-running scientific experiment, with first observations dating back to Galileo and the invention of the telescope around 1610. Today the sunspot number (SN) time series acts as a benchmark of solar activity in a large range of physical models. An appropriate statistical modelling, adapted to the time series' complex nature, is however s…
▽ More
Observing and counting sunspots constitutes one of the longest-running scientific experiment, with first observations dating back to Galileo and the invention of the telescope around 1610. Today the sunspot number (SN) time series acts as a benchmark of solar activity in a large range of physical models. An appropriate statistical modelling, adapted to the time series' complex nature, is however still lacking. In this work, we provide the first comprehensive uncertainty quantification analysis of sunspot counts. Our interest lies in the following three components: the number of spots ($N_s$), the number of sunspot groups ($N_g$), and the composite $N_c$, defined as $N_c:=N_s+10N_g$. Those are reported by a network of observatories around the world, and are corrupted by errors of various types. We use a multiplicative framework to provide, for each of the three components, an estimation of their error distribution in various regimes (short-term, long-term, minima of solar activity). We also propose a robust estimator for the underlying solar signal and fit a density distribution that takes into account intrinsic characteristics such as over-dispersion, excess of zeros, and multiple modes. The estimation of the solar signal underlying the composite $N_c$ may be seen as a robust version of the International Sunspot Number (ISN), a quantity widely used as a proxy of solar activity. Therefore our results on $N_c$ may serve to characterize the uncertainty on ISN as well. Our results paves the way for a future monitoring of the observatories in quasi-real time, with the aim to alert the observers when they start deviating from the network and prevent large drifts from occurring in the network.
△ Less
Submitted 21 September, 2020;
originally announced September 2020.
-
A Comparison of Flare Forecasting Methods. IV. Evaluating Consecutive-Day Forecasting Patterns
Authors:
Sung-Hong Park,
K. D. Leka,
Kanya Kusano,
Jesse Andries,
Graham Barnes,
Suzy Bingham,
D. Shaun Bloomfield,
Aoife E. McCloskey,
Veronique Delouille,
David Falconer,
Peter T. Gallagher,
Manolis K. Georgoulis,
Yuki Kubo,
Kangjin Lee,
Sangwoo Lee,
Vasily Lobzin,
JunChul Mun,
Sophie A. Murray,
Tarek A. M. Hamad Nageem,
Rami Qahwaji,
Michael Sharpe,
Rob A. Steenburgh,
Graham Steward,
Michael Terkildsen
Abstract:
A crucial challenge to successful flare prediction is forecasting periods that transition between "flare-quiet" and "flare-active". Building on earlier studies in this series (Barnes et al. 2016; Leka et al. 2019a,b) in which we describe methodology, details, and results of flare forecasting comparison efforts, we focus here on patterns of forecast outcomes (success and failure) over multi-day per…
▽ More
A crucial challenge to successful flare prediction is forecasting periods that transition between "flare-quiet" and "flare-active". Building on earlier studies in this series (Barnes et al. 2016; Leka et al. 2019a,b) in which we describe methodology, details, and results of flare forecasting comparison efforts, we focus here on patterns of forecast outcomes (success and failure) over multi-day periods. A novel analysis is developed to evaluate forecasting success in the context of catching the first event of flare-active periods, and conversely, of correctly predicting declining flare activity. We demonstrate these evaluation methods graphically and quantitatively as they provide both quick comparative evaluations and options for detailed analysis. For the testing interval 2016-2017, we determine the relative frequency distribution of two-day dichotomous forecast outcomes for three different event histories (i.e., event/event, no-event/event and event/no-event), and use it to highlight performance differences between forecasting methods. A trend is identified across all forecasting methods that a high/low forecast probability on day-1 remains high/low on day-2 even though flaring activity is transitioning. For M-class and larger flares, we find that explicitly including persistence or prior flare history in computing forecasts helps to improve overall forecast performance. It is also found that using magnetic/modern data leads to improvement in catching the first-event/first-no-event transitions. Finally, 15% of major (i.e., M-class or above) flare days over the testing interval were effectively missed due to a lack of observations from instruments away from the Earth-Sun line.
△ Less
Submitted 21 January, 2020; v1 submitted 8 January, 2020;
originally announced January 2020.
-
A Comparison of Flare Forecasting Methods. III. Systematic Behaviors of Operational Solar Flare Forecasting Systems
Authors:
K. D. Leka,
Sung-Hong Park,
Kanya Kusano,
Jesse Andries,
Graham Barnes,
Suzy Bingham,
D. Shaun Bloomfield,
Aoife E. McCloskey,
Veronique Delouille,
David Falconer,
Peter T. Gallagher,
Manolis K. Georgoulis,
Yuki Kubo,
Kangjin Lee,
Sangwoo Lee,
Vasily Lobzin,
JunChul Mun,
Sophie A. Murray,
Tarek A. M. Hamad Nageem,
Rami Qahwaji,
Michael Sharpe,
Rob Steenburgh,
Graham Steward,
Michael Terkildsen
Abstract:
A workshop was recently held at Nagoya University (31 October - 02 November 2017), sponsored by the Center for International Collaborative Research, at the Institute for Space-Earth Environmental Research, Nagoya University, Japan, to quantitatively compare the performance of today's operational solar flare forecasting facilities. Building upon Paper I of this series (Barnes et al. 2016), in Paper…
▽ More
A workshop was recently held at Nagoya University (31 October - 02 November 2017), sponsored by the Center for International Collaborative Research, at the Institute for Space-Earth Environmental Research, Nagoya University, Japan, to quantitatively compare the performance of today's operational solar flare forecasting facilities. Building upon Paper I of this series (Barnes et al. 2016), in Paper II (Leka et al. 2019) we described the participating methods for this latest comparison effort, the evaluation methodology, and presented quantitative comparisons. In this paper we focus on the behavior and performance of the methods when evaluated in the context of broad implementation differences. Acknowledging the short testing interval available and the small number of methods available, we do find that forecast performance: 1) appears to improve by including persistence or prior flare activity, region evolution, and a human "forecaster in the loop"; 2) is hurt by restricting data to disk-center observations; 3) may benefit from long-term statistics, but mostly when then combined with modern data sources and statistical approaches. These trends are arguably weak and must be viewed with numerous caveats, as discussed both here and in Paper II. Following this present work, we present in Paper IV a novel analysis method to evaluate temporal patterns of forecasting errors of both types (i.e., misses and false alarms; Park et al. 2019). Hence, most importantly, with this series of papers we demonstrate the techniques for facilitating comparisons in the interest of establishing performance-positive methodologies.
△ Less
Submitted 5 July, 2019;
originally announced July 2019.
-
A Comparison of Flare Forecasting Methods. II. Benchmarks, Metrics and Performance Results for Operational Solar Flare Forecasting Systems
Authors:
K. D. Leka,
Sung-Hong Park,
Kanya Kusano,
Jesse Andries,
Graham Barnes,
Suzy Bingham,
D. Shaun Bloomfield,
Aoife E. McCloskey,
Veronique Delouille,
David Falconer,
Peter T. Gallagher,
Manolis K. Georgoulis,
Yuki Kubo,
Kangjin Lee,
Sangwoo Lee,
Vasily Lobzin,
JunChul Mun,
Sophie A. Murray,
Tarek A. M. Hamad Nageem,
Rami Qahwaji,
Michael Sharpe,
Rob Steenburgh,
Graham Steward,
Michael Terkildsen
Abstract:
Solar flares are extremely energetic phenomena in our Solar System. Their impulsive, often drastic radiative increases, in particular at short wavelengths, bring immediate impacts that motivate solar physics and space weather research to understand solar flares to the point of being able to forecast them. As data and algorithms improve dramatically, questions must be asked concerning how well the…
▽ More
Solar flares are extremely energetic phenomena in our Solar System. Their impulsive, often drastic radiative increases, in particular at short wavelengths, bring immediate impacts that motivate solar physics and space weather research to understand solar flares to the point of being able to forecast them. As data and algorithms improve dramatically, questions must be asked concerning how well the forecasting performs; crucially, we must ask how to rigorously measure performance in order to critically gauge any improvements. Building upon earlier-developed methodology (Barnes et al, 2016, Paper I), international representatives of regional warning centers and research facilities assembled in 2017 at the Institute for Space-Earth Environmental Research, Nagoya University, Japan to - for the first time - directly compare the performance of operational solar flare forecasting methods. Multiple quantitative evaluation metrics are employed, with focus and discussion on evaluation methodologies given the restrictions of operational forecasting. Numerous methods performed consistently above the "no skill" level, although which method scored top marks is decisively a function of flare event definition and the metric used; there was no single winner. Following in this paper series we ask why the performances differ by examining implementation details (Leka et al. 2019, Paper III), and then we present a novel analysis method to evaluate temporal patterns of forecasting errors in (Park et al. 2019, Paper IV). With these works, this team presents a well-defined and robust methodology for evaluating solar flare forecasting methods in both research and operational frameworks, and today's performance benchmarks against which improvements and new methods may be compared.
△ Less
Submitted 5 July, 2019;
originally announced July 2019.
-
Improvements on coronal hole detection in SDO/AIA images using supervised classification
Authors:
Martin A. Reiss,
Stefan J. Hofmeister,
Ruben De Visscher,
Manuela Temmer,
Astrid M. Veronig,
Véronique Delouille,
Benjamin Mampaey,
Helmut Ahammer
Abstract:
We demonstrate the use of machine learning algorithms in combination with segmentation techniques in order to distinguish coronal holes and filaments in SDO/AIA EUV images of the Sun. Based on two coronal hole detection techniques (intensity-based thresholding, SPoCA), we prepared data sets of manually labeled coronal hole and filament channel regions present on the Sun during the time range 2011…
▽ More
We demonstrate the use of machine learning algorithms in combination with segmentation techniques in order to distinguish coronal holes and filaments in SDO/AIA EUV images of the Sun. Based on two coronal hole detection techniques (intensity-based thresholding, SPoCA), we prepared data sets of manually labeled coronal hole and filament channel regions present on the Sun during the time range 2011 - 2013. By mapping the extracted regions from EUV observations onto HMI line-of-sight magnetograms we also include their magnetic characteristics. We computed shape measures from the segmented binary maps as well as first order and second order texture statistics from the segmented regions in the EUV images and magnetograms. These attributes were used for data mining investigations to identify the most performant rule to differentiate between coronal holes and filament channels. We applied several classifiers, namely Support Vector Machine, Linear Support Vector Machine, Decision Tree, and Random Forest and found that all classification rules achieve good results in general, with linear SVM providing the best performances (with a true skill statistic of ~0.90). Additional information from magnetic field data systematically improves the performance across all four classifiers for the SPoCA detection. Since the calculation is inexpensive in computing time, this approach is well suited for applications on real-time data. This study demonstrates how a machine learning approach may help improve upon an unsupervised feature extraction method.
△ Less
Submitted 22 June, 2015;
originally announced June 2015.
-
Meta learning of bounds on the Bayes classifier error
Authors:
Kevin R. Moon,
Veronique Delouille,
Alfred O. Hero III
Abstract:
Meta learning uses information from base learners (e.g. classifiers or estimators) as well as information about the learning problem to improve upon the performance of a single base learner. For example, the Bayes error rate of a given feature space, if known, can be used to aid in choosing a classifier, as well as in feature selection and model selection for the base classifiers and the meta clas…
▽ More
Meta learning uses information from base learners (e.g. classifiers or estimators) as well as information about the learning problem to improve upon the performance of a single base learner. For example, the Bayes error rate of a given feature space, if known, can be used to aid in choosing a classifier, as well as in feature selection and model selection for the base classifiers and the meta classifier. Recent work in the field of f-divergence functional estimation has led to the development of simple and rapidly converging estimators that can be used to estimate various bounds on the Bayes error. We estimate multiple bounds on the Bayes error using an estimator that applies meta learning to slowly converging plug-in estimators to obtain the parametric convergence rate. We compare the estimated bounds empirically on simulated data and then estimate the tighter bounds on features extracted from an image patch analysis of sunspot continuum and magnetogram images.
△ Less
Submitted 3 July, 2015; v1 submitted 27 April, 2015;
originally announced April 2015.
-
Image patch analysis of sunspots and active regions. II. Clustering via matrix factorization
Authors:
Kevin R. Moon,
Veronique Delouille,
Jimmy J. Li,
Ruben De Visscher,
Fraser Watson,
Alfred O. Hero III
Abstract:
Separating active regions that are quiet from potentially eruptive ones is a key issue in Space Weather applications. Traditional classification schemes such as Mount Wilson and McIntosh have been effective in relating an active region large scale magnetic configuration to its ability to produce eruptive events. However, their qualitative nature prevents systematic studies of an active region's ev…
▽ More
Separating active regions that are quiet from potentially eruptive ones is a key issue in Space Weather applications. Traditional classification schemes such as Mount Wilson and McIntosh have been effective in relating an active region large scale magnetic configuration to its ability to produce eruptive events. However, their qualitative nature prevents systematic studies of an active region's evolution for example. We introduce a new clustering of active regions that is based on the local geometry observed in Line of Sight magnetogram and continuum images. We use a reduced-dimension representation of an active region that is obtained by factoring the corresponding data matrix comprised of local image patches. Two factorizations can be compared via the definition of appropriate metrics on the resulting factors. The distances obtained from these metrics are then used to cluster the active regions. We find that these metrics result in natural clusterings of active regions. The clusterings are related to large scale descriptors of an active region such as its size, its local magnetic field distribution, and its complexity as measured by the Mount Wilson classification scheme. We also find that including data focused on the neutral line of an active region can result in an increased correspondence between our clustering results and other active region descriptors such as the Mount Wilson classifications and the $R$ value. We provide some recommendations for which metrics, matrix factorization techniques, and regions of interest to use to study active regions.
△ Less
Submitted 10 December, 2015; v1 submitted 10 April, 2015;
originally announced April 2015.
-
Image patch analysis of sunspots and active regions. I. Intrinsic dimension and correlation analysis
Authors:
Kevin R. Moon,
Jimmy J. Li,
Veronique Delouille,
Ruben De Visscher,
Fraser Watson,
Alfred O. Hero III
Abstract:
The flare-productivity of an active region is observed to be related to its spatial complexity. Mount Wilson or McIntosh sunspot classifications measure such complexity but in a categorical way, and may therefore not use all the information present in the observations. Moreover, such categorical schemes hinder a systematic study of an active region's evolution for example. We propose fine-scale qu…
▽ More
The flare-productivity of an active region is observed to be related to its spatial complexity. Mount Wilson or McIntosh sunspot classifications measure such complexity but in a categorical way, and may therefore not use all the information present in the observations. Moreover, such categorical schemes hinder a systematic study of an active region's evolution for example. We propose fine-scale quantitative descriptors for an active region's complexity and relate them to the Mount Wilson classification. We analyze the local correlation structure within continuum and magnetogram data, as well as the cross-correlation between continuum and magnetogram data. We compute the intrinsic dimension, partial correlation, and canonical correlation analysis (CCA) of image patches of continuum and magnetogram active region images taken from the SOHO-MDI instrument. We use masks of sunspots derived from continuum as well as larger masks of magnetic active regions derived from the magnetogram to analyze separately the core part of an active region from its surrounding part. We find the relationship between complexity of an active region as measured by Mount Wilson and the intrinsic dimension of its image patches. Partial correlation patterns exhibit approximately a third-order Markov structure. CCA reveals different patterns of correlation between continuum and magnetogram within the sunspots and in the region surrounding the sunspots. These results also pave the way for patch-based dictionary learning with a view towards automatic clustering of active regions.
△ Less
Submitted 14 December, 2015; v1 submitted 13 March, 2015;
originally announced March 2015.
-
Non-parametric PSF estimation from celestial transit solar images using blind deconvolution
Authors:
Adriana Gonzalez,
Véronique Delouille,
Laurent Jacques
Abstract:
Context: Characterization of instrumental effects in astronomical imaging is important in order to extract accurate physical information from the observations. The measured image in a real optical instrument is usually represented by the convolution of an ideal image with a Point Spread Function (PSF). Additionally, the image acquisition process is also contaminated by other sources of noise (read…
▽ More
Context: Characterization of instrumental effects in astronomical imaging is important in order to extract accurate physical information from the observations. The measured image in a real optical instrument is usually represented by the convolution of an ideal image with a Point Spread Function (PSF). Additionally, the image acquisition process is also contaminated by other sources of noise (read-out, photon-counting). The problem of estimating both the PSF and a denoised image is called blind deconvolution and is ill-posed.
Aims: We propose a blind deconvolution scheme that relies on image regularization. Contrarily to most methods presented in the literature, our method does not assume a parametric model of the PSF and can thus be applied to any telescope.
Methods: Our scheme uses a wavelet analysis prior model on the image and weak assumptions on the PSF. We use observations from a celestial transit, where the occulting body can be assumed to be a black disk. These constraints allow us to retain meaningful solutions for the filter and the image, eliminating trivial, translated and interchanged solutions. Under an additive Gaussian noise assumption, they also enforce noise canceling and avoid reconstruction artifacts by promoting the whiteness of the residual between the blurred observations and the cleaned data.
Results: Our method is applied to synthetic and experimental data. The PSF is estimated for the SECCHI/EUVI instrument using the 2007 Lunar transit, and for SDO/AIA using the 2012 Venus transit. Results show that the proposed non-parametric blind deconvolution method is able to estimate the core of the PSF with a similar quality to parametric methods proposed in the literature. We also show that, if these parametric estimations are incorporated in the acquisition model, the resulting PSF outperforms both the parametric and non-parametric methods.
△ Less
Submitted 29 September, 2015; v1 submitted 19 December, 2014;
originally announced December 2014.
-
Image patch analysis and clustering of sunspots: a dimensionality reduction approach
Authors:
Kevin R. Moon,
Jimmy J. Li,
Veronique Delouille,
Fraser Watson,
Alfred O. Hero III
Abstract:
Sunspots, as seen in white light or continuum images, are associated with regions of high magnetic activity on the Sun, visible on magnetogram images. Their complexity is correlated with explosive solar activity and so classifying these active regions is useful for predicting future solar activity. Current classification of sunspot groups is visually based and suffers from bias. Supervised learnin…
▽ More
Sunspots, as seen in white light or continuum images, are associated with regions of high magnetic activity on the Sun, visible on magnetogram images. Their complexity is correlated with explosive solar activity and so classifying these active regions is useful for predicting future solar activity. Current classification of sunspot groups is visually based and suffers from bias. Supervised learning methods can reduce human bias but fail to optimally capitalize on the information present in sunspot images. This paper uses two image modalities (continuum and magnetogram) to characterize the spatial and modal interactions of sunspot and magnetic active region images and presents a new approach to cluster the images. Specifically, in the framework of image patch analysis, we estimate the number of intrinsic parameters required to describe the spatial and modal dependencies, the correlation between the two modalities and the corresponding spatial patterns, and examine the phenomena at different scales within the images. To do this, we use linear and nonlinear intrinsic dimension estimators, canonical correlation analysis, and multiresolution analysis of intrinsic dimension.
△ Less
Submitted 24 June, 2014;
originally announced June 2014.
-
The SPOCA-suite: a software for extraction and tracking of Active Regions and Coronal Holes on EUV images
Authors:
Véronique Delouille,
Benjamin Mampaey,
Cis Verbeeck,
Ruben de Visscher
Abstract:
Precise localisation and characterization of active regions and coronal holes as observed by EUV imagers are crucial for a wide range of solar and helio-physics studies. We describe a segmentation procedure, the SPOCA-suite, that produces catalogs of Active Regions (AR) and Coronal Holes (CH) on SDO-AIA images. The method builds upon our previous work on 'Spatial Possibilistic Clustering Algorithm…
▽ More
Precise localisation and characterization of active regions and coronal holes as observed by EUV imagers are crucial for a wide range of solar and helio-physics studies. We describe a segmentation procedure, the SPOCA-suite, that produces catalogs of Active Regions (AR) and Coronal Holes (CH) on SDO-AIA images. The method builds upon our previous work on 'Spatial Possibilistic Clustering Algorithm' (SPOCA) and substantially improve it in several ways. The SPOCA-suite is applied in near real time on AIA archive and produces entries into the AR and CH catalogs of the Heliophysics Event Knowledgebase (HEK) every four hours. We give an illustration of the use of SPOCA for determination of the CH filling factors. This reports is intended as a reference guide for the users of SPoCA output.
△ Less
Submitted 7 August, 2012;
originally announced August 2012.
-
A Multi-Wavelength Analysis of Active Regions and Sunspots by Comparison of Automated Detection Algorithms
Authors:
Cis Verbeeck,
Paul A. Higgins,
Tufan Colak,
Fraser T. Watson,
Veronique Delouille,
Benjamin Mampaey,
Rami Qahwaji
Abstract:
Since the Solar Dynamics Observatory (SDO) began recording ~ 1 TB of data per day, there has been an increased need to automatically extract features and events for further analysis. Here we compare the overall detection performance, correlations between extracted properties, and usability for feature tracking of four solar feature-detection algorithms: the Solar Monitor Active Region Tracker (SMA…
▽ More
Since the Solar Dynamics Observatory (SDO) began recording ~ 1 TB of data per day, there has been an increased need to automatically extract features and events for further analysis. Here we compare the overall detection performance, correlations between extracted properties, and usability for feature tracking of four solar feature-detection algorithms: the Solar Monitor Active Region Tracker (SMART) detects active regions in line-of-sight magnetograms; the Automated Solar Activity Prediction code (ASAP) detects sunspots and pores in white-light continuum images; the Sunspot Tracking And Recognition Algorithm (STARA) detects sunspots in white-light continuum images; the Spatial Possibilistic Clustering Algorithm (SPoCA) automatically segments solar EUV images into active regions (AR), coronal holes (CH) and quiet Sun (QS). One month of data from the SOHO/MDI and SOHO/EIT instruments during 12 May - 23 June 2003 is analysed. The overall detection performance of each algorithm is benchmarked against National Oceanic and Atmospheric Administration (NOAA) and Solar Influences Data Analysis Centre (SIDC) catalogues using various feature properties such as total sunspot area, which shows good agreement, and the number of features detected, which shows poor agreement. Principal Component Analysis indicates a clear distinction between photospheric properties, which are highly correlated to the first component and account for 52.86% of variability in the data set, and coronal properties, which are moderately correlated to both the first and second principal components. Finally, case studies of NOAA 10377 and 10365 are conducted to determine algorithm stability for tracking the evolution of individual features. We find that magnetic flux and total sunspot area are the best indicators of active-region emergence.
△ Less
Submitted 2 September, 2011;
originally announced September 2011.
-
Quantifying and containing the curse of high resolution coronal imaging
Authors:
Véronique Delouille,
Pierre Chainais,
Jean-François Hochedez
Abstract:
Future missions such as Solar Orbiter (SO), InterHelioprobe, or Solar Probe aim at approaching the Sun closer than ever before, with on board some high resolution imagers (HRI) having a subsecond cadence and a pixel area of about $(80km)^2$ at the Sun during perihelion. In order to guarantee their scientific success, it is necessary to evaluate if the photon counts available at these resolution…
▽ More
Future missions such as Solar Orbiter (SO), InterHelioprobe, or Solar Probe aim at approaching the Sun closer than ever before, with on board some high resolution imagers (HRI) having a subsecond cadence and a pixel area of about $(80km)^2$ at the Sun during perihelion. In order to guarantee their scientific success, it is necessary to evaluate if the photon counts available at these resolution and cadence will provide a sufficient signal-to-noise ratio (SNR).
We perform a first step in this direction by analyzing and characterizing the spatial intermittency of Quiet Sun images thanks to a multifractal analysis.
We identify the parameters that specify the scale-invariance behavior. This identification allows next to select a family of multifractal processes, namely the Compound Poisson Cascades, that can synthesize artificial images having some of the scale-invariance properties observed on the recorded images.
The prevalence of self-similarity in Quiet Sun coronal images makes it relevant to study the ratio between the SNR present at SoHO/EIT images and in coarsened images. SoHO/EIT images thus play the role of 'high resolution' images, whereas the 'low-resolution' coarsened images are rebinned so as to simulate a smaller angular resolution and/or a larger distance to the Sun. For a fixed difference in angular resolution and in Spacecraft-Sun distance, we determine the proportion of pixels having a SNR preserved at high resolution given a particular increase in effective area. If scale-invariance continues to prevail at smaller scales, the conclusion reached with SoHO/EIT images can be transposed to the situation where the resolution is increased from SoHO/EIT to SO/HRI resolution at perihelion.
△ Less
Submitted 22 August, 2008;
originally announced August 2008.