Bayesian cluster analysis
- PMID: 36970819
- PMCID: PMC10041359
- DOI: 10.1098/rsta.2022.0149
Bayesian cluster analysis
Abstract
Bayesian cluster analysis offers substantial benefits over algorithmic approaches by providing not only point estimates but also uncertainty in the clustering structure and patterns within each cluster. An overview of Bayesian cluster analysis is provided, including both model-based and loss-based approaches, along with a discussion on the importance of the kernel or loss selected and prior specification. Advantages are demonstrated in an application to cluster cells and discover latent cell types in single-cell RNA sequencing data to study embryonic cellular development. Lastly, we focus on the ongoing debate between finite and infinite mixtures in a model-based approach and robustness to model misspecification. While much of the debate and asymptotic theory focuses on the marginal posterior of the number of clusters, we empirically show that quite a different behaviour is obtained when estimating the full clustering structure. This article is part of the theme issue 'Bayesian inference: challenges, perspectives, and prospects'.
Keywords: Bayesian analysis; clustering; ensembles; mixture models; model misspecification.
Conflict of interest statement
We declare we have no competing interests.
Figures
![Figure 1.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10041359/bin/rsta20220149f01.gif)
![Figure 2.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10041359/bin/rsta20220149f02.gif)
![Figure 3.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10041359/bin/rsta20220149f03.gif)
![Figure 4.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10041359/bin/rsta20220149f04.gif)
Similar articles
-
Consensus clustering for Bayesian mixture models.BMC Bioinformatics. 2022 Jul 21;23(1):290. doi: 10.1186/s12859-022-04830-8. BMC Bioinformatics. 2022. PMID: 35864476 Free PMC article.
-
Bayesian clustering of multiple zero-inflated outcomes.Philos Trans A Math Phys Eng Sci. 2023 May 15;381(2247):20220145. doi: 10.1098/rsta.2022.0145. Epub 2023 Mar 27. Philos Trans A Math Phys Eng Sci. 2023. PMID: 36970823 Free PMC article.
-
Bayesian infinite mixture model based clustering of gene expression profiles.Bioinformatics. 2002 Sep;18(9):1194-206. doi: 10.1093/bioinformatics/18.9.1194. Bioinformatics. 2002. PMID: 12217911
-
Bayesian approaches to include real-world data in clinical studies.Philos Trans A Math Phys Eng Sci. 2023 May 15;381(2247):20220158. doi: 10.1098/rsta.2022.0158. Epub 2023 Mar 27. Philos Trans A Math Phys Eng Sci. 2023. PMID: 36970825 Review.
-
Prediction-based uncertainty quantification for exchangeable sequences.Philos Trans A Math Phys Eng Sci. 2023 May 15;381(2247):20220142. doi: 10.1098/rsta.2022.0142. Epub 2023 Mar 27. Philos Trans A Math Phys Eng Sci. 2023. PMID: 36970827 Review.
Cited by
-
Identification of cuproptosis-related gene clusters and immune cell infiltration in major burns based on machine learning models and experimental validation.Front Immunol. 2024 Feb 12;15:1335675. doi: 10.3389/fimmu.2024.1335675. eCollection 2024. Front Immunol. 2024. PMID: 38410514 Free PMC article.
-
A special issue on Bayesian inference: challenges, perspectives and prospects.Philos Trans A Math Phys Eng Sci. 2023 May 15;381(2247):20220155. doi: 10.1098/rsta.2022.0155. Epub 2023 Mar 27. Philos Trans A Math Phys Eng Sci. 2023. PMID: 36970829 Free PMC article. No abstract available.
References
-
- Cheeseman P, Kelly J, Self M, Stutz J, Taylor W, Freeman D. 1988. Autoclass: a Bayesian classification system. In Machine learning proceedings 1988 (ed. J Laird), pp. 54–64. San Francisco, CA: Elsevier.
-
- Kuhn MA, Feigelson ED. 2019. Applications in astronomy. In Handbook of mixture analysis (eds S Fruhwirth-Schnatter, G Celeux, CP Robert), pp. 463–489. New York, NY: Chapman and Hall/CRC.
-
- Dasgupta A, Raftery AE. 1998. Detecting features in spatial point processes with clutter via model-based clustering. J. Am. Stat. Assoc. 93, 294-302. (10.1080/01621459.1998.10474110) - DOI
-
- Blei DM, Ng AY, Jordan MI. 2003. Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993-1022.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources