Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 29;2(9):1133-44.
doi: 10.1016/j.ebiom.2015.07.017. eCollection 2015 Sep.

Integration of copy number and transcriptomics provides risk stratification in prostate cancer: A discovery and validation cohort study

Affiliations

Integration of copy number and transcriptomics provides risk stratification in prostate cancer: A discovery and validation cohort study

H Ross-Adams et al. EBioMedicine. .

Erratum in

Abstract

Background: Understanding the heterogeneous genotypes and phenotypes of prostate cancer is fundamental to improving the way we treat this disease. As yet, there are no validated descriptions of prostate cancer subgroups derived from integrated genomics linked with clinical outcome.

Methods: In a study of 482 tumour, benign and germline samples from 259 men with primary prostate cancer, we used integrative analysis of copy number alterations (CNA) and array transcriptomics to identify genomic loci that affect expression levels of mRNA in an expression quantitative trait loci (eQTL) approach, to stratify patients into subgroups that we then associated with future clinical behaviour, and compared with either CNA or transcriptomics alone.

Findings: We identified five separate patient subgroups with distinct genomic alterations and expression profiles based on 100 discriminating genes in our separate discovery and validation sets of 125 and 103 men. These subgroups were able to consistently predict biochemical relapse (p = 0.0017 and p = 0.016 respectively) and were further validated in a third cohort with long-term follow-up (p = 0.027). We show the relative contributions of gene expression and copy number data on phenotype, and demonstrate the improved power gained from integrative analyses. We confirm alterations in six genes previously associated with prostate cancer (MAP3K7, MELK, RCBTB2, ELAC2, TPD52, ZBTB4), and also identify 94 genes not previously linked to prostate cancer progression that would not have been detected using either transcript or copy number data alone. We confirm a number of previously published molecular changes associated with high risk disease, including MYC amplification, and NKX3-1, RB1 and PTEN deletions, as well as over-expression of PCA3 and AMACR, and loss of MSMB in tumour tissue. A subset of the 100 genes outperforms established clinical predictors of poor prognosis (PSA, Gleason score), as well as previously published gene signatures (p = 0.0001). We further show how our molecular profiles can be used for the early detection of aggressive cases in a clinical setting, and inform treatment decisions.

Interpretation: For the first time in prostate cancer this study demonstrates the importance of integrated genomic analyses incorporating both benign and tumour tissue data in identifying molecular alterations leading to the generation of robust gene sets that are predictive of clinical outcome in independent patient cohorts.

Keywords: Biochemical relapse; Gene signature; Genomics; Personalised medicine; Prognosis; Prostate cancer; Risk stratification.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
A copy number profile of the prostate cancer genome. The percentage of samples containing copy number aberrations (CNA) at each locus is shown by gain/loss (red/blue); left hand y-axis. Established prostate cancer risk genes commonly disrupted by CNAs (from Williams et al. (2014) meta-analysis) are indicated in grey (gene name and frequency altered in this cohort are shown, see also Suppl. Table 3); only those affected in > 10% samples are annotated. Novel CN changes identified in this cohort (> 10% samples) also in our 100-gene set are indicated in black type. MAP3K7 is highlighted in purple as the only previously known CN-altered risk gene included in our 100-gene signature. Data were generated on high-density Illumina OMNI2.5 M arrays and analysed using OncoSNP (Yau et al., 2010); only highly stringent calls are shown (see Methods). Chromosome ends are delineated by grey, vertical stripes. Representative genes with large average fold changes (tumours versus matched benign) are shown by red (up-regulation) and green spots (down-regulation); right-hand y-axis. With the exception of OLFM4 (19%, chr13q14.3), these do not coincide with CN alterations.
Fig. 2
Fig. 2
Integrative subgroups have characteristic molecular profiles. Genome-wide frequencies of somatic copy number alterations (CNAs) presented as a percentage of samples (left y-axis) in each integrated Cluster (iCluster). Regions of copy number gain are indicated in red and regions of loss in blue. Subgroups were identified by integrated hierarchical clustering (as described in Methods) of the discovery cohort (n = 125). For the validation cohort (n = 103), men were allocated to these same clusters as described (see Suppl. Fig. 6). Differentially expressed genes (DEG) are superimposed for each cluster; only genes with log2 fold change > 1.5 or < − 1.5 are shown (tumour versus matched benign; right y-axis). The top ten strongest DEGs in each cluster are annotated (see Suppl. Table 8 for full list).
Fig. 3
Fig. 3
Copy number and expression levels for 100 clustering genes in each integrated cluster. Mean mRNA expression levels are shown as a heatmap for each of the 100 genes used to differentiate the integrated clusters. Copy number is displayed as the number of men with a gain or loss in copies of that gene in that cluster. Chromosome location is also given (see Fig. 2). Scaling as shown.
Fig. 4
Fig. 4
Integrative subgroups have distinct clinical outcomes and are powerful predictors of relapse. A. Kaplan–Meier plot of relapse-free survival over 60 months for the five molecular subtypes in the Cambridge discovery cohort (p = 0.0017 for the two highest versus two lowest risk groups). For each cluster, the total number of samples is indicated (total relapses in brackets). B. Kaplan–Meier plot of relapse-free survival over 96 months in the Stockholm validation cohort (p = 0.016). Further validation was undertaken in a third dataset (Taylor et al. (2010); Suppl. Fig. 9). C. Distribution of Gleason grade across subtypes (Cambridge discovery cohort); no Gleason score predominates in any one subtype (Kruskal–Wallis p = 0.6194). D. Cox proportional hazard ratios with 95% confidence intervals for high vs low Gleason score (≥ 4 + 3 = 7 vs ≤ 3 + 4 = 7), and every other integrative cluster vs best prognosis cluster4. Cambridge and Stockholm datasets were combined to ensure sufficient events per variable (biochemical relapses per cluster) for robust statistical testing (Peduzzi et al., 1995). Confidence intervals shown are 0.9, 0.95 and 0.99. E&F. Refined 100-gene set tested for power to predict relapse in the Stockholm validation set against 1000 random signatures (p < 0.001) and 189 oncological signatures (Subramanian et al., 2005; p < 0.001). Comparison was also made with other prostate cancer signatures (Suppl. Table 11).

Similar articles

Cited by

References

    1. Armisén R., Marcelain K., Simon F. TRPM4 enhances cell proliferation through up-regulation of the β-catenin signaling pathway. J. Cell. Physiol. 2011;226(1):103–109. - PubMed
    1. Attard G., Clark J., Ambroisine L. Duplication of the fusion of TMPRSS2 to ERG sequences identifies fatal human prostate cancer. Oncogene. 2008;27(3):253–263. - PMC - PubMed
    1. Barbosa-Morais N.L., Dunning M.J., Samarajiwa S.A. A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data. Nucleic Acids Res. 2010;38(3):e17. - PMC - PubMed
    1. Catalona W.J., Richie J.P., Ahmann F.R. Comparison of digital rectal examination and serum prostate specific antigen in the early detection of prostate cancer: results of a multicenter clinical trial of 6,630 men. J. Urol. 1994;151(5):1283–1290. - PubMed
    1. Chalise P., Koestler D.C., Bimali M., Yu Q., Fridley B.L. Integrative clustering methods for high-dimensional molecular data. Transl. Cancer Res. 2014;3(3):202–216. - PMC - PubMed

Publication types