Principal component analysis of hybrid functional and vector data
- PMID: 34160848
- PMCID: PMC9084921
- DOI: 10.1002/sim.9117
Principal component analysis of hybrid functional and vector data
Abstract
We propose a practical principal component analysis (PCA) framework that provides a nonparametric means of simultaneously reducing the dimensions of and modeling functional and vector (multivariate) data. We first introduce a Hilbert space that combines functional and vector objects as a single hybrid object. The framework, termed a PCA of hybrid functional and vector data (HFV-PCA), is then based on the eigen-decomposition of a covariance operator that captures simultaneous variations of functional and vector data in the new space. This approach leads to interpretable principal components that have the same structure as each observation and a single set of scores that serves well as a low-dimensional proxy for hybrid functional and vector data. To support practical application of HFV-PCA, the explicit relationship between the hybrid PC decomposition and the functional and vector PC decompositions is established, leading to a simple and robust estimation scheme where components of HFV-PCA are calculated using the components estimated from the existing functional and classical PCA methods. This estimation strategy allows flexible incorporation of sparse and irregular functional data as well as multivariate functional data. We derive the consistency results and asymptotic convergence rates for the proposed estimators. We demonstrate the efficacy of the method through simulations and analysis of renal imaging data.
Keywords: dimension reduction; functional data analysis; multiple data modalities; multivariate data analysis; multivariate functional data; principal component analysis.
© 2021 John Wiley & Sons Ltd.
Conflict of interest statement
CONFLICT OF INTEREST
The authors declare no potential conflict of interests.
Figures
![FIGURE 1](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/9084921/bin/nihms-1797417-f0001.gif)
![FIGURE 2](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/9084921/bin/nihms-1797417-f0002.gif)
![FIGURE 3](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/9084921/bin/nihms-1797417-f0003.gif)
![FIGURE 4](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/9084921/bin/nihms-1797417-f0004.gif)
Similar articles
-
Hybrid principal components analysis for region-referenced longitudinal functional EEG data.Biostatistics. 2020 Jan 1;21(1):139-157. doi: 10.1093/biostatistics/kxy034. Biostatistics. 2020. PMID: 30084925 Free PMC article.
-
Incorporating biological information in sparse principal component analysis with application to genomic data.BMC Bioinformatics. 2017 Jul 11;18(1):332. doi: 10.1186/s12859-017-1740-7. BMC Bioinformatics. 2017. PMID: 28697740 Free PMC article.
-
Applying stability selection to consistently estimate sparse principal components in high-dimensional molecular data.Bioinformatics. 2015 Aug 15;31(16):2683-90. doi: 10.1093/bioinformatics/btv197. Epub 2015 Apr 10. Bioinformatics. 2015. PMID: 25861969 Free PMC article.
-
Beyond principal components: a critical comparison of factor analysis methods for subspace modelling in chemistry.Anal Methods. 2021 Sep 30;13(37):4188-4219. doi: 10.1039/d1ay01124c. Anal Methods. 2021. PMID: 34473142 Review.
-
On the Uses of PCA to Characterise Molecular Dynamics Simulations of Biological Macromolecules: Basics and Tips for an Effective Use.Chemphyschem. 2023 Jan 17;24(2):e202200491. doi: 10.1002/cphc.202200491. Epub 2022 Oct 26. Chemphyschem. 2023. PMID: 36285677 Review.
Cited by
-
Paternal Prenatal and Lactation Exposure to a High-Calorie Diet Shapes Transgenerational Brain Macro- and Microstructure Defects, Impacting Anxiety-Like Behavior in Male Offspring Rats.eNeuro. 2024 Feb 12;11(2):ENEURO.0194-23.2023. doi: 10.1523/ENEURO.0194-23.2023. Print 2024 Feb. eNeuro. 2024. PMID: 38212114 Free PMC article.
-
A novel prognostic signature for lung adenocarcinoma based on cuproptosis-related lncRNAs: A Review.Medicine (Baltimore). 2022 Dec 9;101(49):e31924. doi: 10.1097/MD.0000000000031924. Medicine (Baltimore). 2022. PMID: 36626411 Free PMC article. Review.
References
-
- Besse P, Ramsay JO. Principal component analysis of sampled functions. Psychometrika. 1986;51:285–311.
-
- Castro P, Lawton W, Sylvestre E. Principal modes of variation for processes with continuous sample curves. Technometrics. 1986;28:329–337.
-
- Rice JA, Silverman BW. Estimating the mean and covariance structure nonparametrically when the data are curves. J R Stat Soc Ser B Methodol. 1991;53:233–243.
-
- Silverman BW. Smoothed functional principal component analysis by choice of norm. Ann Stat. 1996;24:1–24.
-
- Shi M, Weiss RE, Taylor JMG. An analysis of paediatric CD4 counts for acquired immune deficiency syndrome using flexible random curves. J R Stat Soc Ser C Appl Stat. 1996;45:151–163.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources