GhostKnockoff inference empowers identification of putative causal variants in genome-wide association studies
- PMID: 36418338
- PMCID: PMC9684164
- DOI: 10.1038/s41467-022-34932-z
Abstract
Recent advances in genome sequencing and imputation technologies provide an exciting opportunity to comprehensively study the contribution of genetic variants to complex phenotypes. However, our ability to translate genetic discoveries into mechanistic insights remains limited at this point. In this paper, we propose an efficient knockoff-based method, GhostKnockoff, for genome-wide association studies (GWAS) that leads to improved power and ability to prioritize putative causal variants relative to conventional GWAS approaches. The method requires only Z-scores from conventional GWAS and hence can be easily applied to enhance existing and future studies. The method can also be applied to meta-analysis of multiple GWAS allowing for arbitrary sample overlap. We demonstrate its performance using empirical simulations and two applications: (1) a meta-analysis for Alzheimer's disease comprising nine overlapping large-scale GWAS, whole-exome and whole-genome sequencing studies and (2) analysis of 1403 binary phenotypes from the UK Biobank data in 408,961 samples of European ancestry. Our results demonstrate that GhostKnockoff can identify putatively functional variants with weaker statistical effects that are missed by conventional association tests.
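As an illustration of how a knockoff-based selection can operate on Z-scores alone, the sketch below applies the generic knockoff+ filter to a vector of GWAS Z-scores and a matching vector of knockoff Z-scores. It is a minimal sketch, not the GhostKnockoff implementation: the function name `knockoff_plus_select`, the single-knockoff setup, the squared-Z-score importance statistic, and the assumption that knockoff Z-scores have already been generated are all illustrative choices that the abstract does not specify.

```python
import numpy as np

def knockoff_plus_select(z, z_knock, q=0.1):
    """Select variants with the generic knockoff+ filter (hypothetical helper).

    z       : Z-scores of the original variants
    z_knock : Z-scores of their knockoff copies (assumed already available)
    q       : target false discovery rate
    """
    z = np.asarray(z, dtype=float)
    z_knock = np.asarray(z_knock, dtype=float)

    # Assumed importance statistic: difference of squared Z-scores.
    # A large positive value means the original outperforms its knockoff.
    w = z**2 - z_knock**2

    # Candidate thresholds are the distinct nonzero magnitudes of w.
    thresholds = np.unique(np.abs(w[w != 0]))

    for t in thresholds:  # np.unique returns them in ascending order
        n_neg = np.sum(w <= -t)   # knockoffs that beat their originals
        n_pos = np.sum(w >= t)    # originals that beat their knockoffs
        fdp_hat = (1 + n_neg) / max(1, n_pos)
        if fdp_hat <= q:
            # The smallest threshold meeting the bound gives the selection.
            return np.flatnonzero(w >= t)

    # No threshold satisfies the bound: select nothing.
    return np.array([], dtype=int)
```

Calling `knockoff_plus_select(z, z_knock, q=0.1)` returns the indices of the selected variants. Negative values of `w` serve as an estimate of how many false positives sit among the positive values, which is what lets the filter bound the false discovery rate without p-values.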
© 2022. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Similar articles
- Identification of putative causal loci in whole-genome sequencing data via knockoff statistics. Nat Commun. 2021 May 25;12(1):3152. doi: 10.1038/s41467-021-22889-4. PMID: 34035245.
- KnockoffTrio: A knockoff framework for the identification of putative causal variants in genome-wide association studies with trio design. Am J Hum Genet. 2022 Oct 6;109(10):1761-1776. doi: 10.1016/j.ajhg.2022.08.013. Epub 2022 Sep 22. PMID: 36150388.
- Quantifying the mapping precision of genome-wide association studies using whole-genome sequencing data. Genome Biol. 2017 May 16;18(1):86. doi: 10.1186/s13059-017-1216-0. PMID: 28506277.
- Molecular genetic studies of complex phenotypes. Transl Res. 2012 Feb;159(2):64-79. doi: 10.1016/j.trsl.2011.08.001. Epub 2011 Aug 31. PMID: 22243791. Review.
- Accurate Imputation of Untyped Variants from Deep Sequencing Data. Methods Mol Biol. 2021;2243:271-281. doi: 10.1007/978-1-0716-1103-6_13. PMID: 33606262. Review.
Cited by
- Controlled Variable Selection from Summary Statistics Only? A Solution via GhostKnockoffs and Penalized Regression. ArXiv [Preprint]. 2024 Feb 20:arXiv:2402.12724v1. PMID: 38463500. Preprint.
- Improving fine-mapping by modeling infinitesimal effects. Nat Genet. 2024 Jan;56(1):162-169. doi: 10.1038/s41588-023-01597-3. Epub 2023 Nov 30. PMID: 38036779.