Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 May;40(4):304-14.
doi: 10.1002/gepi.21965. Epub 2016 Apr 7.

Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator

Affiliations

Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator

Jack Bowden et al. Genet Epidemiol. 2016 May.

Abstract

Developments in genome-wide association studies and the increasing availability of summary genetic association data have made application of Mendelian randomization relatively straightforward. However, obtaining reliable results from a Mendelian randomization investigation remains problematic, as the conventional inverse-variance weighted method only gives consistent estimates if all of the genetic variants in the analysis are valid instrumental variables. We present a novel weighted median estimator for combining data on multiple genetic variants into a single causal estimate. This estimator is consistent even when up to 50% of the information comes from invalid instrumental variables. In a simulation analysis, it is shown to have better finite-sample Type 1 error rates than the inverse-variance weighted method, and is complementary to the recently proposed MR-Egger (Mendelian randomization-Egger) regression method. In analyses of the causal effects of low-density lipoprotein cholesterol and high-density lipoprotein cholesterol on coronary artery disease risk, the inverse-variance weighted method suggests a causal effect of both lipid fractions, whereas the weighted median and MR-Egger regression methods suggest a null effect of high-density lipoprotein cholesterol that corresponds with the experimental evidence. Both median-based and MR-Egger regression methods should be considered as sensitivity analyses for Mendelian randomization investigations with multiple genetic variants.

Keywords: Egger regression; Mendelian randomization; instrumental variables; pleiotropy; robust statistics.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Illustrative diagram representing the hypothesized relationships between genetic variant Gj, exposure X, disease Y, and confounders U when Gj is a valid instrumental variable (IV). Crosses indicate violations of assumptions IV2 and IV3 that potentially lead to invalid inferences from conventional methods.
Figure 2
Figure 2
Fictional example of a Mendelian randomization analysis with 10 genetic variants–six valid instrumental variables (hollow circles) and four invalid instrumental variables (solid circles) for finite sample size (left) and infinite sample size (right) showing IVW (solid line) and simple median (dashed line) estimates compared with the true causal effect (dotted line). The ratio estimate for each genetic variant is the gradient of the line connecting the relevant datapoint for that variant to the origin; the simple median estimate is the median of these ratio estimates.
Figure 3
Figure 3
Empirical distribution functions of ordered ratio instrumental variable estimates (β^j) used for calculation of the simple median estimate (black) and two weighted median estimates (shown in red and blue) using the weights given in Table 1.
Figure 4
Figure 4
Scatter plots of genetic associations with the outcome (coronary artery disease risk, CAD) against genetic associations with the exposure (low‐density lipoprotein cholesterol, LDL‐c; high‐density lipoprotein cholesterol, HDL‐c; triglycerides). Left side: all genetic variants, right side: genetic variants having primary association with the target exposure. Solid line represents IVW estimate, dashed line represents weighted median estimate, and dotted line represents MR‐Egger estimate.

Similar articles

Cited by

References

    1. Bochud M, Chiolero A, Elston R, Paccaud F. 2008. A cautionary note on the use of Mendelian randomization to infer causation in observational epidemiology. Int J Epidemiol 37(2):414–416. - PubMed
    1. Bowden J, Thompson J, Burton P. 2006. Using pseudo‐data to correct for publication bias in meta‐analysis. Stat Med 25: 3798–3813. - PubMed
    1. Bowden J, Davey Smith G, Burgess S. 2015. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol 44(2):512–525. - PMC - PubMed
    1. Burgess S, Thompson SG. 2015. Mendelian Randomization: Methods for Using Genetic Variants in Causal Estimation. Chapman & Hall, Boca Raton, FL, USA.
    1. Burgess S, Thompson S, CRP CHD Genetics Collaboration . 2011. Avoiding bias from weak instruments in Mendelian randomization studies. Int J Epidemiol 40(3):755–764. - PubMed

Publication types

MeSH terms