Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 May 17;50(2):620-632.
doi: 10.1093/ije/dyaa213.

Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations

Affiliations
Review

Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations

Peter W G Tennant et al. Int J Epidemiol. .

Abstract

Background: Directed acyclic graphs (DAGs) are an increasingly popular approach for identifying confounding variables that require conditioning when estimating causal effects. This review examined the use of DAGs in applied health research to inform recommendations for improving their transparency and utility in future research.

Methods: Original health research articles published during 1999-2017 mentioning 'directed acyclic graphs' (or similar) or citing DAGitty were identified from Scopus, Web of Science, Medline and Embase. Data were extracted on the reporting of: estimands, DAGs and adjustment sets, alongside the characteristics of each article's largest DAG.

Results: A total of 234 articles were identified that reported using DAGs. A fifth (n = 48, 21%) reported their target estimand(s) and half (n = 115, 48%) reported the adjustment set(s) implied by their DAG(s). Two-thirds of the articles (n = 144, 62%) made at least one DAG available. DAGs varied in size but averaged 12 nodes [interquartile range (IQR): 9-16, range: 3-28] and 29 arcs (IQR: 19-42, range: 3-99). The median saturation (i.e. percentage of total possible arcs) was 46% (IQR: 31-67, range: 12-100). 37% (n = 53) of the DAGs included unobserved variables, 17% (n = 25) included 'super-nodes' (i.e. nodes containing more than one variable) and 34% (n = 49) were visually arranged so that the constituent arcs flowed in the same direction (e.g. top-to-bottom).

Conclusion: There is substantial variation in the use and reporting of DAGs in applied health research. Although this partly reflects their flexibility, it also highlights some potential areas for improvement. This review hence offers several recommendations to improve the reporting and use of DAGs in future research.

Keywords: Directed acyclic graphs; causal diagrams; causal inference; confounding; covariate adjustment; graphical model theory; observational studies; reporting practices.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Illustration of the main components of a DAG, the most common types of contextual variables and the most common types of paths. The DAG has been visually arranged so that all constituent arcs flow from top-to-bottom.
Figure 2
Figure 2
Flow of bibliographic records into the final sample of 234 articles.
Figure 3.
Figure 3.
Distribution of the 234 articles included in the review sample, by year of publication, country of first author’s primary affiliation and journal citation category.

Similar articles

Cited by

References

    1. Hernán MA, Hsu J, Healy B.. A second chance to get causal inference right: a classification of data science tasks. Chance 2019; 32:42–49.
    1. Deaton A, Cartwright N.. Understanding and misunderstanding randomized controlled trials. Soc Sci Med 2018;210:2–21. - PMC - PubMed
    1. Morgan SL, Winship C.. Counterfactuals and Causal Inference. 2nd edn. Cambridge, UK: Cambridge University Press, 2015.
    1. Hernán MA. The C-word: scientific euphemisms do not improve causal inference from observational data. Am J Public Health 2018;108:616–19. - PMC - PubMed
    1. Heinze G, Wallisch C, Dunkler D.. Variable selection - a review and recommendations for the practicing statistician. Biom J 2018;60:431–49. - PMC - PubMed

Publication types