Who is pregnant? Defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C)
- PMID: 37600074
- PMCID: PMC10432357
- DOI: 10.1093/jamiaopen/ooad067
Who is pregnant? Defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C)
Abstract
Objectives: To define pregnancy episodes and estimate gestational age within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C).
Materials and methods: We developed a comprehensive approach, named Hierarchy and rule-based pregnancy episode Inference integrated with Pregnancy Progression Signatures (HIPPS), and applied it to EHR data in the N3C (January 1, 2018-April 7, 2022). HIPPS combines: (1) an extension of a previously published pregnancy episode algorithm, (2) a novel algorithm to detect gestational age-specific signatures of a progressing pregnancy for further episode support, and (3) pregnancy start date inference. Clinicians performed validation of HIPPS on a subset of episodes. We then generated pregnancy cohorts based on gestational age precision and pregnancy outcomes for assessment of accuracy and comparison of COVID-19 and other characteristics.
Results: We identified 628 165 pregnant persons with 816 471 pregnancy episodes, of which 52.3% were live births, 24.4% were other outcomes (stillbirth, ectopic pregnancy, abortions), and 23.3% had unknown outcomes. Clinician validation agreed 98.8% with HIPPS-identified episodes. We were able to estimate start dates within 1 week of precision for 475 433 (58.2%) episodes. 62 540 (7.7%) episodes had incident COVID-19 during pregnancy.
Discussion: HIPPS provides measures of support for pregnancy-related variables such as gestational age and pregnancy outcomes based on N3C data. Gestational age precision allows researchers to find time to events with reasonable confidence.
Conclusion: We have developed a novel and robust approach for inferring pregnancy episodes and gestational age that addresses data inconsistency and missingness in EHR data.
Keywords: COVID-19; algorithms; electronic health records; gestational age; pregnancy.
© The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association.
Conflict of interest statement
K.R.B. and S.L. are employees of Palantir Technologies. Y.K. and L.L. are employees of Sema4. M.N.L. is Managing Director of IPQ Analytics, LLC.
Figures
![Figure 1.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10432357/bin/ooad067f1.gif)
![Figure 2.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10432357/bin/ooad067f2.gif)
![Figure 3.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10432357/bin/ooad067f3.gif)
![Figure 4.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10432357/bin/ooad067f4.gif)
![Figure 5.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10432357/bin/ooad067f5.gif)
![Figure 6.](https://cdn.statically.io/img/www.ncbi.nlm.nih.gov/pmc/articles/instance/10432357/bin/ooad067f6.gif)
Update of
-
Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C).medRxiv [Preprint]. 2022 Aug 6:2022.08.04.22278439. doi: 10.1101/2022.08.04.22278439. medRxiv. 2022. Update in: JAMIA Open. 2023 Aug 16;6(3):ooad067. doi: 10.1093/jamiaopen/ooad067. PMID: 35982668 Free PMC article. Updated. Preprint.
Similar articles
-
Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C).medRxiv [Preprint]. 2022 Aug 6:2022.08.04.22278439. doi: 10.1101/2022.08.04.22278439. medRxiv. 2022. Update in: JAMIA Open. 2023 Aug 16;6(3):ooad067. doi: 10.1093/jamiaopen/ooad067. PMID: 35982668 Free PMC article. Updated. Preprint.
-
Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19.PLoS One. 2022 Oct 31;17(10):e0276923. doi: 10.1371/journal.pone.0276923. eCollection 2022. PLoS One. 2022. PMID: 36315520 Free PMC article.
-
Inferring pregnancy episodes and outcomes within a network of observational databases.PLoS One. 2018 Feb 1;13(2):e0192033. doi: 10.1371/journal.pone.0192033. eCollection 2018. PLoS One. 2018. PMID: 29389968 Free PMC article.
-
Mefloquine for preventing malaria in pregnant women.Cochrane Database Syst Rev. 2018 Mar 21;3(3):CD011444. doi: 10.1002/14651858.CD011444.pub2. Cochrane Database Syst Rev. 2018. Update in: Cochrane Database Syst Rev. 2018 Nov 14;11:CD011444. doi: 10.1002/14651858.CD011444.pub3. PMID: 29561063 Free PMC article. Updated. Review.
-
Dietary Patterns before and during Pregnancy and Gestational Age- and Sex-Specific Birth Weight: A Systematic Review [Internet].Alexandria (VA): USDA Nutrition Evidence Systematic Review; 2019 Apr. Alexandria (VA): USDA Nutrition Evidence Systematic Review; 2019 Apr. PMID: 35544663 Free Books & Documents. Review.
Cited by
-
Harnessing Consumer Wearable Digital Biomarkers for Individualized Recognition of Postpartum Depression Using the All of Us Research Program Data Set: Cross-Sectional Study.JMIR Mhealth Uhealth. 2024 May 2;12:e54622. doi: 10.2196/54622. JMIR Mhealth Uhealth. 2024. PMID: 38696234 Free PMC article.
-
Pregnancy Outcomes among Pregnant Persons after COVID-19 Vaccination: Assessing Vaccine Safety in Retrospective Cohort Analysis of U.S. National COVID Cohort Collaborative (N3C).Vaccines (Basel). 2024 Mar 11;12(3):289. doi: 10.3390/vaccines12030289. Vaccines (Basel). 2024. PMID: 38543923 Free PMC article.
-
Harnessing consumer wearable digital biomarkers for individualized recognition of postpartum depression using the All of Us Research Program dataset.medRxiv [Preprint]. 2023 Oct 14:2023.10.13.23296965. doi: 10.1101/2023.10.13.23296965. medRxiv. 2023. Update in: JMIR Mhealth Uhealth. 2024 May 2;12:e54622. doi: 10.2196/54622. PMID: 37873471 Free PMC article. Updated. Preprint.
References
-
- Allotey J, Stallings E, Bonet M, et al.Clinical manifestations, risk factors, and maternal and perinatal outcomes of coronavirus disease 2019 in pregnancy: Living systematic review and meta-analysis. Obstetric Anesthesia Digest. 2021;41(2):81-82. 10.1097/01.aoa.0000744128.44930.48 - DOI - PMC - PubMed
-
- CDC. Pregnant and Recently Pregnant People. Centers for Disease Control and Prevention. 2022. Accessed June 21, 2022. https://www.cdc.gov/coronavirus/2019-ncov/need-extra-precautions/pregnan...
Grants and funding
- UM1 TR004404/TR/NCATS NIH HHS/United States
- UL1 TR002649/TR/NCATS NIH HHS/United States
- UL1 TR001433/TR/NCATS NIH HHS/United States
- UL1 TR001422/TR/NCATS NIH HHS/United States
- UL1 TR001860/TR/NCATS NIH HHS/United States
- UL1 TR001427/TR/NCATS NIH HHS/United States
- U54 GM104942/GM/NIGMS NIH HHS/United States
- UL1 TR001420/TR/NCATS NIH HHS/United States
- UL1 TR001439/TR/NCATS NIH HHS/United States
- UL1 TR002243/TR/NCATS NIH HHS/United States
- UL1 TR001445/TR/NCATS NIH HHS/United States
- UL1 TR003096/TR/NCATS NIH HHS/United States
- UM1 TR004556/TR/NCATS NIH HHS/United States
- U54 GM104938/GM/NIGMS NIH HHS/United States
- UL1 TR002537/TR/NCATS NIH HHS/United States
- UL1 TR001412/TR/NCATS NIH HHS/United States
- UL1 TR001872/TR/NCATS NIH HHS/United States
- UL1 TR001878/TR/NCATS NIH HHS/United States
- UL1 TR002529/TR/NCATS NIH HHS/United States
- UL1 TR001863/TR/NCATS NIH HHS/United States
- UL1 TR002494/TR/NCATS NIH HHS/United States
- UL1 TR002736/TR/NCATS NIH HHS/United States
- U54 GM115516/GM/NIGMS NIH HHS/United States
- UL1 TR002369/TR/NCATS NIH HHS/United States
- UL1 TR002541/TR/NCATS NIH HHS/United States
- U54 GM115371/GM/NIGMS NIH HHS/United States
- UL1 TR002001/TR/NCATS NIH HHS/United States
- UL1 TR002538/TR/NCATS NIH HHS/United States
- U54 GM115458/GM/NIGMS NIH HHS/United States
- R01 MH131542/MH/NIMH NIH HHS/United States
- UL1 TR001442/TR/NCATS NIH HHS/United States
- UL1 TR002535/TR/NCATS NIH HHS/United States
- UL1 TR001866/TR/NCATS NIH HHS/United States
- UL1 TR003167/TR/NCATS NIH HHS/United States
- UL1 TR001409/TR/NCATS NIH HHS/United States
- UL1 TR001449/TR/NCATS NIH HHS/United States
- UL1 TR001453/TR/NCATS NIH HHS/United States
- UL1 TR002489/TR/NCATS NIH HHS/United States
- U54 GM104940/GM/NIGMS NIH HHS/United States
- UL1 TR003107/TR/NCATS NIH HHS/United States
- INV-018455/GATES/Bill & Melinda Gates Foundation/United States
- UL1 TR003015/TR/NCATS NIH HHS/United States
- UL1 TR002733/TR/NCATS NIH HHS/United States
- U24 TR002306/TR/NCATS NIH HHS/United States
- UL1 TR002003/TR/NCATS NIH HHS/United States
- UL1 TR001876/TR/NCATS NIH HHS/United States
- UL1 TR001436/TR/NCATS NIH HHS/United States
- UL1 TR002378/TR/NCATS NIH HHS/United States
- UL1 TR002384/TR/NCATS NIH HHS/United States
- UL1 TR002553/TR/NCATS NIH HHS/United States
- UL1 TR002389/TR/NCATS NIH HHS/United States
- UL1 TR001414/TR/NCATS NIH HHS/United States
- U54 GM104941/GM/NIGMS NIH HHS/United States
- UL1 TR002014/TR/NCATS NIH HHS/United States
- UL1 TR002550/TR/NCATS NIH HHS/United States
- UL1 TR002319/TR/NCATS NIH HHS/United States
- UL1 TR001855/TR/NCATS NIH HHS/United States
- UL1 TR001425/TR/NCATS NIH HHS/United States
- UL1 TR002373/TR/NCATS NIH HHS/United States
- UL1 TR002240/TR/NCATS NIH HHS/United States
- UL1 TR002556/TR/NCATS NIH HHS/United States
- UL1 TR003017/TR/NCATS NIH HHS/United States
- UL1 TR001998/TR/NCATS NIH HHS/United States
- UL1 TR001873/TR/NCATS NIH HHS/United States
- UL1 TR001881/TR/NCATS NIH HHS/United States
- UL1 TR002645/TR/NCATS NIH HHS/United States
- UL1 TR001450/TR/NCATS NIH HHS/United States
- UL1 TR002366/TR/NCATS NIH HHS/United States
- U54 GM115428/GM/NIGMS NIH HHS/United States
- UL1 TR002345/TR/NCATS NIH HHS/United States
- UL1 TR002377/TR/NCATS NIH HHS/United States
- U54 GM115677/GM/NIGMS NIH HHS/United States
- UL1 TR002544/TR/NCATS NIH HHS/United States
- UL1 TR003098/TR/NCATS NIH HHS/United States
- UL1 TR001430/TR/NCATS NIH HHS/United States
- UL1 TR003142/TR/NCATS NIH HHS/United States
LinkOut - more resources
Full Text Sources
Miscellaneous