Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct;586(7831):763-768.
doi: 10.1038/s41586-020-2819-2. Epub 2020 Oct 14.

Inherited causes of clonal haematopoiesis in 97,691 whole genomes

Alexander G Bick #  1   2   3   4 Joshua S Weinstock #  5 Satish K Nandakumar  2   6 Charles P Fulco  2   7 Erik L Bao  2   6   8 Seyedeh M Zekavat  2   9 Mindy D Szeto  10   11 Xiaotian Liao  2   6 Matthew J Leventhal  2 Joseph Nasser  2 Kyle Chang  12 Cecelia Laurie  13 Bala Bharathi Burugula  14 Christopher J Gibson  15 Amy E Lin  16 Margaret A Taub  17 Francois Aguet  2 Kristin Ardlie  2 Braxton D Mitchell  18   19 Kathleen C Barnes  10   20 Arden Moscati  21 Myriam Fornage  22   23 Susan Redline  3   24   25 Bruce M Psaty  26   27   28   29 Edwin K Silverman  3   30 Scott T Weiss  3   30 Nicholette D Palmer  31 Ramachandran S Vasan  32 Esteban G Burchard  33   34 Sharon L R Kardia  35 Jiang He  36   37 Robert C Kaplan  38   39 Nicholas L Smith  27   29   40 Donna K Arnett  41 David A Schwartz  42 Adolfo Correa  43 Mariza de Andrade  44 Xiuqing Guo  45 Barbara A Konkle  46   47 Brian Custer  48   49 Juan M Peralta  50 Hongsheng Gui  51 Deborah A Meyers  52 Stephen T McGarvey  53 Ida Yii-Der Chen  54 M Benjamin Shoemaker  55 Patricia A Peyser  35 Jai G Broome  13 Stephanie M Gogarten  13 Fei Fei Wang  13 Quenna Wong  13 May E Montasser  18 Michelle Daya  10 Eimear E Kenny  56 Kari E North  57 Lenore J Launer  58 Brian E Cade  24   59 Joshua C Bis  26 Michael H Cho  3   30 Jessica Lasky-Su  3   30 Donald W Bowden  31 L Adrienne Cupples  60 Angel C Y Mak  33 Lewis C Becker  61 Jennifer A Smith  35   62 Tanika N Kelly  36   37 Stella Aslibekyan  63 Susan R Heckbert  27   29 Hemant K Tiwari  64 Ivana V Yang  42 John A Heit  65 Steven A Lubitz  2   3   66 Jill M Johnsen  46   47 Joanne E Curran  50 Sally E Wenzel  67 Daniel E Weeks  68 Dabeeru C Rao  69 Dawood Darbar  70 Jee-Young Moon  38 Russell P Tracy  71 Erin J Buth  13 Nicholas Rafaels  20 Ruth J F Loos  21   72 Peter Durda  71 Yongmei Liu  73 Lifang Hou  74 Jiwon Lee  24 Priyadarshini Kachroo  3   30 Barry I Freedman  75 Daniel Levy  76   77 Lawrence F Bielak  35 James E Hixson  78 James S Floyd  26   27   47 Eric A Whitsel  79   80 Patrick T Ellinor  2   3   66 Marguerite R Irvin  63 Tasha E Fingerlin  81 Laura M Raffield  82 Sebastian M Armasu  44 Marsha M Wheeler  83 Ester C Sabino  84 John Blangero  50 L Keoki Williams  51 Bruce D Levy  3   85 Wayne Huey-Herng Sheu  86 Dan M Roden  87   88   89 Eric Boerwinkle  89   90 JoAnn E Manson  3   91   92 Rasika A Mathias  61 Pinkal Desai  93 Kent D Taylor  94   95 Andrew D Johnson  76   77 NHLBI Trans-Omics for Precision Medicine ConsortiumPaul L Auer  96 Charles Kooperberg  97 Cathy C Laurie  13 Thomas W Blackwell  5 Albert V Smith  5 Hongyu Zhao  98   99 Ethan Lange  10 Leslie Lange  10 Stephen S Rich  100 Jerome I Rotter  94   95 James G Wilson  101   102 Paul Scheet  12 Jacob O Kitzman  14   103 Eric S Lander  2   7   104 Jesse M Engreitz  2   105 Benjamin L Ebert  2   3   15   106 Alexander P Reiner  27   97 Siddhartha Jaiswal  107 Gonçalo Abecasis  5   108 Vijay G Sankaran  2   3   6 Sekar Kathiresan  109   110   111   112 Pradeep Natarajan  113   114   115
Collaborators, Affiliations

Inherited causes of clonal haematopoiesis in 97,691 whole genomes

Alexander G Bick et al. Nature. 2020 Oct.

Erratum in

  • Author Correction: Inherited causes of clonal haematopoiesis in 97,691 whole genomes.
    Bick AG, Weinstock JS, Nandakumar SK, Fulco CP, Bao EL, Zekavat SM, Szeto MD, Liao X, Leventhal MJ, Nasser J, Chang K, Laurie C, Burugula BB, Gibson CJ, Niroula A, Lin AE, Taub MA, Aguet F, Ardlie K, Mitchell BD, Barnes KC, Moscati A, Fornage M, Redline S, Psaty BM, Silverman EK, Weiss ST, Palmer ND, Vasan RS, Burchard EG, Kardia SLR, He J, Kaplan RC, Smith NL, Arnett DK, Schwartz DA, Correa A, de Andrade M, Guo X, Konkle BA, Custer B, Peralta JM, Gui H, Meyers DA, McGarvey ST, Chen IY, Shoemaker MB, Peyser PA, Broome JG, Gogarten SM, Wang FF, Wong Q, Montasser ME, Daya M, Kenny EE, North KE, Launer LJ, Cade BE, Bis JC, Cho MH, Lasky-Su J, Bowden DW, Cupples LA, Mak ACY, Becker LC, Smith JA, Kelly TN, Aslibekyan S, Heckbert SR, Tiwari HK, Yang IV, Heit JA, Lubitz SA, Johnsen JM, Curran JE, Wenzel SE, Weeks DE, Rao DC, Darbar D, Moon JY, Tracy RP, Buth EJ, Rafaels N, Loos RJF, Durda P, Liu Y, Hou L, Lee J, Kachroo P, Freedman BI, Levy D, Bielak LF, Hixson JE, Floyd JS, Whitsel EA, Ellinor PT, Irvin MR, Fingerlin TE, Raffield LM, Armasu SM, Wheeler MM, Sabino EC, Blangero J, Williams LK, Levy BD, Sheu WH, Roden DM, Boerwinkle E, Manson JE, Mathias RA, Desai P, Taylor KD, Johnson AD;… See abstract for full author list ➔ Bick AG, et al. Nature. 2021 Mar;591(7851):E27. doi: 10.1038/s41586-021-03280-1. Nature. 2021. PMID: 33707633 No abstract available.

Abstract

Age is the dominant risk factor for most chronic human diseases, but the mechanisms through which ageing confers this risk are largely unknown1. The age-related acquisition of somatic mutations that lead to clonal expansion in regenerating haematopoietic stem cell populations has recently been associated with both haematological cancer2-4 and coronary heart disease5-this phenomenon is termed clonal haematopoiesis of indeterminate potential (CHIP)6. Simultaneous analyses of germline and somatic whole-genome sequences provide the opportunity to identify root causes of CHIP. Here we analyse high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine (TOPMed) programme, and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid and inflammatory traits that are specific to different CHIP driver genes. Association of a genome-wide set of germline genetic variants enabled the identification of three genetic loci associated with CHIP status, including one locus at TET2 that was specific to individuals of African ancestry. In silico-informed in vitro evaluation of the TET2 germline locus enabled the identification of a causal variant that disrupts a TET2 distal enhancer, resulting in increased self-renewal of haematopoietic stem cells. Overall, we observe that germline genetic variation shapes haematopoietic stem cell function, leading to CHIP through mechanisms that are specific to clonal haematopoiesis as well as shared mechanisms that lead to somatic mutations across tissues.

PubMed Disclaimer

Figures

Extended Data Fig. 1|
Extended Data Fig. 1|. Characterizing TOPMed CHIP.
a, There was marked heterogeneity of CHIP clone size as measured by variant allele fraction by CHIP driver gene. Violin plot spanning minimum and maximum values calculated on full dataset (Supplementary Table 3). Sample size for each element in violin plot displayed in Fig. 1, b, 90% of individuals with CHIP had only one CHIP driver mutation identified c, CHIP prevalence with age was highly concordant across sequenced cohorts. CHIP prevalence was estimated from a logistic mixed model with spline-transformed age, sex, and cohort included as predictors. The cohort was included as a random intercept. Sample size for each cohort listed in Supplementary Table 1. d, CHIP prevalence with age in this study (blue triangles, N=82,807) was highly consistent with previously observed CHIP prevalence (dots represent mean point prevalence with shaded area represents 95% confidence interval; NGenovese=12,380; NJaiswal = 17,182; NXie = 2,728).
Extended Data Fig. 2|
Extended Data Fig. 2|. CHIP age association by mutational mechanism, gene and overlap with somatic chromosomal mosaicism.
a, cumulative density plot of CHIP incidence with age stratified by single nucleotide variant (SNV) vs frameshift mutations. SNVs were observed in younger individuals than Frameshift mutations (N=4,939; two-sided wilcox rank sum test p=0.01). b, cumulative density plot of CHIP incidence with age stratified by driver gene. c, 855 elderly WHI individuals (mean age: 70) with both whole genome and the array genotyping data available were interrogated for large-scale mosaic chromosomal rearrangements. The two somatic events did not co-occur more than would be expected by chance (hypergeometric p=0.25).
Extended Data Fig. 3|
Extended Data Fig. 3|. CHIP associates with Blood, Lipid, and Inflammatory traits.
a, CHIP consistently associated with increased Red Cell Distribution Width (RDW). JAK2, SF3B1 and SRSF2 showed driver gene specific effects on blood traits (see Supplementary Table S5) b, CHIP status was not consistently associated with lipid traits, other than JAK2 CHIP which was associated with decreased total cholesterol and a trend towards decreased LDL (see Supplementary Table S6) c, CHIP status is associated with inflammatory markers, however notable heterogeneity existed across CHIP mutations (see Supplementary Table S7). Associations utilized a two-sided t-test from a multivariate general linear model including age, smoking, race and gender and study center and were not adjusted for multiple comparisons. Sample sizes and exact p-values for each phenotype are listed in Supplementary Tables 5–7.
Extended Data Fig. 4|
Extended Data Fig. 4|. CHIP passenger somatic mutation spectrum.
a, Singleton mutation counts by nucleotide context in CHIP Cases and Controls. b, Signature contribution in CHIP cases and controls identified differential enrichment
Extended Data Fig. 5|
Extended Data Fig. 5|. CHIP Single variant association regional association plots.
a, TERT locus b, TRIM59/KPNA4 locus c, TET2 locus. Two-sided association testing performed using SAIGE (N=65,405 individuals, see methods)
Extended Data Fig. 6|
Extended Data Fig. 6|. CHIP transcriptome-wide association study (TWAS) results across 48 tissues identified 7 significant loci.
UTMOST algorithm applied to CHIP genome wide association study results from n=65,405 individuals (see methods). Genomic coordinates listed on x-axis. P-value from generalized Berk-Jones test on Y axis. Multiple hypothesis corrected threshold, p<2.9 × 10−6 displayed as dotted red line.
Extended Data Fig. 7|
Extended Data Fig. 7|. Tissue-specific results from the top 9 overall UTMOST-significant genes.
UTMOST algorithm applied to CHIP genome wide association study results from n=65,405 individuals. P-value from generalized Berk-Jones test. eQTL z-scores for associations with P<0.05 are displayed in each bar. GTEX eQTL tissue listed on Y-axis.
Extended Data Fig. 8 |
Extended Data Fig. 8 |. CRISPR/Cas9 editing efficiency of TET2 Enhancer deletion in primary CD34+ HSPCs.
a, Schematic showing the position of the two sgRNAs used to delete the TET2 enhancer (512bp) containing rs79901204. B, Gel electrophoresis image of PCR products from genomic DNA of edited HSPCs indicating unedited (WT) and deletion bands at sgRNA target site. Percentages of deletion alleles determined by band intensity and is shown below each lane. The experiment contains 3 biological replicates and was performed once.
Extended Data Fig. 9 |
Extended Data Fig. 9 |. rs79901204 associated with genome wide differential methylation signal,
Methylation Quantitative Trait association results of rs79901204 variant with cpg methylation probes identify an altered peripheral leukocyte methylation profile genome wide in N = 1747 individuals. The strongest signal is at the chr4 TET2 locus. P-values on Y-axis derived from two-sided linear mixed effects model (see methods). To account for multiple hypothesis testing, a Bonferroni threshold of p < 5.8 × 10−8 was used to establish statistical significance.
Extended Data Fig. 10 |
Extended Data Fig. 10 |. Sensitivity of CHIP detection at various VAFs across sequencing depths.
A set of 30 samples from a previously published CHIP cohort (Gibbons et al, 2017) were computationally down sampled to 30x, 40x, 50x, 100x and 400x sequencing depth. TOPMed WGS data was typically in the 40x depth range across CHIP genes. WGS data has excellent sensitivity to detect CHIP clones with VAF >10%, and ~50% sensitivity to detect CHIP VAF 5–10%, with minimal ability to detect CHIP clones <5%.
Fig. 1|
Fig. 1|. Identifying CHIP in TOPMed Genomes.
CHIP was identified in 97,631 whole genome sequenced peripheral blood samples through the curation of somatic driver mutations. Counts for 8 most common driver genes plotted. inset, CHIP prevalence increased with age. Center line represents general additive model spline, 95% confidence interval is shaded (n=82,807 individuals; two-sided t-test: p<10−300).
Fig. 2|
Fig. 2|. Genetic Determinants of CHIP.
Single variant genetic association analyses of CHIP identified three genome wide significant loci. Two-sided association testing performed using SAIGE (n=65,405 individuals, see methods)
Fig. 3|
Fig. 3|. African ancestry specific TET2 locus risk variant disrupts hematopoietic stem cell TET2 enhancer decreasing TET2 expression and increasing self-renewal.
a, the TET2 locus with fine-mapped risk variants, Activity-by-Contact (ABC) hematopoietic stem and progenitor cell (HSPC) enhancers, DNase-Seq CD34+ HSPC and RefSeq genes. ABC model predicts that rs79901204 disrupts a TET2 enhancer resulting in decreased TET2 expression (see methods). b, expanded view of TET2 enhancer element. c, rs79901204 disrupts a GATA motif/E-Box motif. d, rs79901204 is associated with decreased TET2 expression in human peripheral blood RNA-seq (NA/A=230, NA/T=16, NT/T=1, two-sided linear mixed model p=0.012). TPM, transcripts per million. Boxplot displays median, 25th and 75th percentiles, mean (diamond symbol) and outlier observations (black dots) e, luciferase assay in CD34+ primary cells demonstrates four-fold attenuation of enhancer activity by the rs79901204 T risk allele relative to the A reference allele (N=3, two-sided t-test p=0.007). f, deleting the TET2 enhancer (ENH) in CD34+ primary cells results in decreased TET2 expression relative to deletion of control locus AAVS1 (N=3, two-sided t-test, p=0.04). g, Human HSPCs were electroporated with Cas9 targeting a coding region of TET2 and AAVS1 (a control locus) and plated for primary and secondary colony-forming assays. h, two TET2 guides had differential editing efficiency. i, TET2 coding disruption leads to expanded secondary colony formation compared to AAVS1 controls (N=3, two-sided t-test p=0.01, p=0.002 for g1 and g2 respectively, with greater expansion identified in the TET2 guide with greater editing efficiency (two-sided t-test p=0.04). Mean and standard deviation of number of each colony type plotted. CFU-M, colony forming unit-macrophage; CFU-GM, granulocyte macrophage; CFU-GEMM, granulocyte erythrocyte macrophage megakaryocyte; CFU-G, granulocyte; BFU-E, burst forming unit-erythroid. In e, f, h, points represent independent replicates, mean values and error bars represent standard error are plotted.

Similar articles

Cited by

References

    1. Kennedy BK et al. Geroscience: linking aging to chronic disease. Cell 159, 709–13 (2014). - PMC - PubMed
    1. Jaiswal S. et al. Age-Related Clonal Hematopoiesis Associated with Adverse Outcomes. New England Journal of Medicine 371, 2488–2498 (2014). - PMC - PubMed
    1. Genovese G. et al. Clonal Hematopoiesis and Blood-Cancer Risk Inferred from Blood DNA Sequence. New England Journal of Medicine 371, 2477–2487 (2014). - PMC - PubMed
    1. Xie M. et al. Age-related mutations associated with clonal hematopoietic expansion and malignancies. Nature Medicine 20, 1472–1478 (2014). - PMC - PubMed
    1. Jaiswal S. et al. Clonal Hematopoiesis and Risk of Atherosclerotic Cardiovascular Disease. New England Journal of Medicine 377, 111–121 (2017). - PMC - PubMed

Publication types

MeSH terms

Grants and funding

LinkOut - more resources