Review
Cancer Res. 2023 Apr 14;83(8):1183-1190.
doi: 10.1158/0008-5472.CAN-22-1277.

Case Studies for Overcoming Challenges in Using Big Data in Cancer

Shawn M Sweeney et al. Cancer Res.

Abstract

The analysis of big healthcare data has enormous potential as a tool for advancing oncology drug development and patient treatment, particularly in the context of precision medicine. However, there are challenges in organizing, sharing, integrating, and making these data readily accessible to the research community. This review presents five case studies illustrating various successful approaches to addressing such challenges. These efforts are CancerLinQ, the American Association for Cancer Research Project GENIE, Project Data Sphere, the National Cancer Institute Genomic Data Commons, and the Veterans Health Administration Clinical Data Initiative. Critical factors in the development of these systems include attention to the use of robust pipelines for data aggregation, common data models, data deidentification to enable multiple uses, integration of data collection into physician workflows, terminology standardization and attention to interoperability, extensive quality assurance and quality control activity, incorporation of multiple data types, and understanding how data resources can be best applied. By describing some of the emerging resources, we hope to inspire consideration of the secondary use of such data at the earliest possible step to ensure the proper sharing of data in order to generate insights that advance the understanding and the treatment of cancer.
