Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations
- PMID: 33330936
- PMCID: PMC8128477
- DOI: 10.1093/ije/dyaa213
Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations
Abstract
Background: Directed acyclic graphs (DAGs) are an increasingly popular approach for identifying confounding variables that require conditioning when estimating causal effects. This review examined the use of DAGs in applied health research to inform recommendations for improving their transparency and utility in future research.
Methods: Original health research articles published during 1999-2017 mentioning 'directed acyclic graphs' (or similar) or citing DAGitty were identified from Scopus, Web of Science, Medline and Embase. Data were extracted on the reporting of: estimands, DAGs and adjustment sets, alongside the characteristics of each article's largest DAG.
Results: A total of 234 articles were identified that reported using DAGs. A fifth (n = 48, 21%) reported their target estimand(s) and half (n = 115, 48%) reported the adjustment set(s) implied by their DAG(s). Two-thirds of the articles (n = 144, 62%) made at least one DAG available. DAGs varied in size but averaged 12 nodes [interquartile range (IQR): 9-16, range: 3-28] and 29 arcs (IQR: 19-42, range: 3-99). The median saturation (i.e. percentage of total possible arcs) was 46% (IQR: 31-67, range: 12-100). 37% (n = 53) of the DAGs included unobserved variables, 17% (n = 25) included 'super-nodes' (i.e. nodes containing more than one variable) and 34% (n = 49) were visually arranged so that the constituent arcs flowed in the same direction (e.g. top-to-bottom).
Conclusion: There is substantial variation in the use and reporting of DAGs in applied health research. Although this partly reflects their flexibility, it also highlights some potential areas for improvement. This review hence offers several recommendations to improve the reporting and use of DAGs in future research.
Keywords: Directed acyclic graphs; causal diagrams; causal inference; confounding; covariate adjustment; graphical model theory; observational studies; reporting practices.
© The Author(s) 2020. Published by Oxford University Press on behalf of the International Epidemiological Association.
Figures
Similar articles
-
Software Application Profile: The daggle app-a tool to support learning and teaching the graphical rules of selecting adjustment variables using directed acyclic graphs.Int J Epidemiol. 2023 Oct 5;52(5):1659-1664. doi: 10.1093/ije/dyad038. Int J Epidemiol. 2023. PMID: 36952629 Free PMC article.
-
Evidence synthesis for constructing directed acyclic graphs (ESC-DAGs): a novel and systematic method for building directed acyclic graphs.Int J Epidemiol. 2020 Feb 1;49(1):322-329. doi: 10.1093/ije/dyz150. Int J Epidemiol. 2020. PMID: 31325312 Free PMC article.
-
Robust causal inference using directed acyclic graphs: the R package 'dagitty'.Int J Epidemiol. 2016 Dec 1;45(6):1887-1894. doi: 10.1093/ije/dyw341. Int J Epidemiol. 2016. PMID: 28089956
-
Directed Acyclic Graphs for Oral Disease Research.J Dent Res. 2016 Jul;95(8):853-9. doi: 10.1177/0022034516639920. Epub 2016 Mar 21. J Dent Res. 2016. PMID: 27000052 Free PMC article. Review.
-
Using directed acyclic graphs to guide analyses of neighbourhood health effects: an introduction.J Epidemiol Community Health. 2008 Sep;62(9):842-6. doi: 10.1136/jech.2007.067371. J Epidemiol Community Health. 2008. PMID: 18701738 Review.
Cited by
-
Using routinely collected clinical data for circadian medicine: A review of opportunities and challenges.PLOS Digit Health. 2024 May 23;3(5):e0000511. doi: 10.1371/journal.pdig.0000511. eCollection 2024 May. PLOS Digit Health. 2024. PMID: 38781189 Free PMC article. Review.
-
Treat and release: an observational study of non-conveyed high-acuity dispatches in a Danish emergency medical system.Intern Emerg Med. 2024 May 15. doi: 10.1007/s11739-024-03618-3. Online ahead of print. Intern Emerg Med. 2024. PMID: 38748389
-
Health-related quality of life and impact of socioeconomic status among primary and secondary school students after the third COVID-19 wave in Berlin, Germany.PLoS One. 2024 May 9;19(5):e0302995. doi: 10.1371/journal.pone.0302995. eCollection 2024. PLoS One. 2024. PMID: 38722991 Free PMC article.
-
Association between COVID-19 severity and tobacco smoking status: a retrospective cohort study using propensity score matching weights analysis.BMJ Open Respir Res. 2024 May 7;11(1):e001976. doi: 10.1136/bmjresp-2023-001976. BMJ Open Respir Res. 2024. PMID: 38719502 Free PMC article.
-
Effect of homemade peanut oil consumption during pregnancy on low birth weight and preterm birth outcomes: a cohort study in Southwestern China.Glob Health Action. 2024 Dec 31;17(1):2336312. doi: 10.1080/16549716.2024.2336312. Epub 2024 Apr 17. Glob Health Action. 2024. PMID: 38629142 Free PMC article.
References
-
- Hernán MA, Hsu J, Healy B.. A second chance to get causal inference right: a classification of data science tasks. Chance 2019; 32:42–49.
-
- Morgan SL, Winship C.. Counterfactuals and Causal Inference. 2nd edn. Cambridge, UK: Cambridge University Press, 2015.