Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep 30;15(9):e0239934.
doi: 10.1371/journal.pone.0239934. eCollection 2020.

Comparing a novel machine learning method to the Friedewald formula and Martin-Hopkins equation for low-density lipoprotein estimation

Affiliations

Comparing a novel machine learning method to the Friedewald formula and Martin-Hopkins equation for low-density lipoprotein estimation

Gurpreet Singh et al. PLoS One. .

Abstract

Background: Low-density lipoprotein cholesterol (LDL-C) is a target for cardiovascular prevention. Contemporary equations for LDL-C estimation have limited accuracy in certain scenarios (high triglycerides [TG], very low LDL-C).

Objectives: We derived a novel method for LDL-C estimation from the standard lipid profile using a machine learning (ML) approach utilizing random forests (the Weill Cornell model). We compared its correlation to direct LDL-C with the Friedewald and Martin-Hopkins equations for LDL-C estimation.

Methods: The study cohort comprised a convenience sample of standard lipid profile measurements (with the directly measured components of total cholesterol [TC], high-density lipoprotein cholesterol [HDL-C], and TG) as well as chemical-based direct LDL-C performed on the same day at the New York-Presbyterian Hospital/Weill Cornell Medicine (NYP-WCM). Subsequently, an ML algorithm was used to construct a model for LDL-C estimation. Results are reported on the held-out test set, with correlation coefficients and absolute residuals used to assess model performance.

Results: Between 2005 and 2019, there were 17,500 lipid profiles performed on 10,936 unique individuals (4,456 females; 40.8%) aged 1 to 103. Correlation coefficients between estimated and measured LDL-C values were 0.982 for the Weill Cornell model, compared to 0.950 for Friedewald and 0.962 for the Martin-Hopkins method. The Weill Cornell model was consistently better across subgroups stratified by LDL-C and TG values, including TG >500 and LDL-C <70.

Conclusions: An ML model was found to have a better correlation with direct LDL-C than either the Friedewald formula or Martin-Hopkins equation, including in the setting of elevated TG and very low LDL-C.

PubMed Disclaimer

Conflict of interest statement

Gurpreet Singh became affiliated with GlaxoSmithKline after working on this project. Benjamin C. Lee receives consulting fees from Cleerly Inc, but has not receive that consulting fee since 2019. Leslee J. Shaw reports having an equity interest in Cleerly Inc. There are no patents, products in development or marketed products associated with this research to declare This does not alter our adherence to PLOS ONE policies on sharing data and materials.

Figures

Fig 1
Fig 1
Scatter plot showing the correlation between estimated and directly measured low-density lipoprotein cholesterol (LDL-C) for the (a) overall cohort, and (b) for each of the LDL estimation models.
Fig 2
Fig 2
(A) Scatter plot showing the correlation between the ground truth LDL-C value (direct LDL-C) and estimated LDL-C value, across LDL-C subgroups, using the Weill Cornell model, Friedewald formula and Martin-Hopkins equation. (B) Correlation coefficients for each model for LDL-C subgroups.
Fig 3
Fig 3
Scatter plot showing the correlation between the ground truth LDL-C value (direct LDL) and estimated LDL-C value, across triglyceride subgroups, using the Cornell model, Friedewald formula and Martin-Hopkins method. (B) Correlation coefficients for each model for TGL subgroups. Abbreviations: TGL: triglycerides.

Similar articles

Cited by

References

    1. Ference Brian A., Ginsberg Henry N., Graham Ian, Ray Kausik K., Packard EB Chris J. et al. Low-density lipoproteins cause atherosclerotic cardiovascular disease. 1. Evidence from genetic, epidemiologic, and clinical studies. A consensus statement from the European Atherosclerosis Society Consensus Panel | European Heart Journal | Oxford Academi. Eur Heart J [Internet]. 2017. [cited 2020 Mar 12];38(32):2459–72. Available from: 10.1093/eurheartj/ehx144 - DOI - PMC - PubMed
    1. Benjamin EJ, Blaha MJ, Chiuve SE, Cushman M, Das SR, Deo R, et al. Heart Disease and Stroke Statistics—2017 Update: A Report From the American Heart Association. Circulation [Internet]. 2017. March 7 [cited 2020 Mar 9];135(10):e146–603. Available from: https://www.ahajournals.org/doi/10.1161/CIR.0000000000000485 - DOI - PMC - PubMed
    1. Smith S, Benjamin E, … RB-J of the, 2011 undefined. AHA/ACCF secondary prevention and risk reduction therapy for patients with coronary and other atherosclerotic vascular disease: 2011 update: a guideline from the. onlinejacc.org [Internet]. [cited 2020 Mar 12]; Available from: http://www.onlinejacc.org/content/58/23/2432.abstract
    1. Vallejo-Vaz AJ, Robertson M, Catapano AL, Watts GF, Kastelein JJ, Packard CJ, et al. LDL-Cholesterol Lowering for the Primary Prevention of Cardiovascular Disease Among Men with Primary Elevations of LDL-Cholesterol Levels of 190 mg/dL or Above: Analyses from the WOSCOPS 5-year Randomised Trial and 20-year Observational Follow-Up. Circulation [Internet]. 2017. [cited 2020 Mar 12];136(20):CIRCULATIONAHA.117.027966. Available from: http://circ.ahajournals.org/lookup/doi/10.1161/CIRCULATIONAHA.117.027966 - DOI - PubMed
    1. Anderson TJ, Grégoire J, Pearson GJ, Barry AR, Couture P, Dawes M, et al. 2016. Canadian Cardiovascular Society Guidelines for the Management of Dyslipidemia for the Prevention of Cardiovascular Disease in the Adult. Can J Cardiol. 2016. November 1;32(11):1263–82. 10.1016/j.cjca.2016.07.510 - DOI - PubMed

Publication types

Grants and funding

The research reported in this manuscript was supported by the Dalio Institute of Cardiovascular Imaging (New York, NY, USA). No funding was provided for this study.The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.