Abstract

Instrumental variables have been widely used for estimating the causal effect between exposure and outcome. Conventional estimation methods require complete knowledge about all the instruments’ validity; a valid instrument must not have a direct effect on the outcome and not be related to unmeasured confounders. Often, this is impractical as highlighted by Mendelian randomization studies where genetic markers are used as instruments and complete knowledge about instruments’ validity is equivalent to complete knowledge about the involved genes’ functions. In this article, we propose a method for estimation of causal effects when this complete knowledge is absent. It is shown that causal effects are identified and can be estimated as long as less than 50% of instruments are invalid, without knowing which of the instruments are invalid. We also introduce conditions for identification when the 50% threshold is violated. A fast penalized ℓ 1 estimation method, called sisVIVE, is introduced for estimating the causal effect without knowing which instruments are valid, with theoretical guarantees on its performance. The proposed method is demonstrated on simulated data and a real Mendelian randomization study concerning the effect of body mass index(BMI) on health-related quality of life (HRQL) index. An R package sisVIVE is available on CRAN. Supplementary materials for this article are available online.

Keywords

Mendelian randomizationInstrumental variableIdentification (biology)ConfoundingEstimationComputer scienceOutcome (game theory)Causal inferenceEconometricsStatisticsData miningMathematicsMachine learningGenetic variants

Related Publications

Publication Info

Year
2015
Type
article
Volume
111
Issue
513
Pages
132-144
Citations
281
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

281
OpenAlex
40
Influential

Cite This

Hyunseung Kang, Anru R. Zhang, Tommaso Cai et al. (2015). Instrumental Variables Estimation With Some Invalid Instruments and its Application to Mendelian Randomization. Journal of the American Statistical Association , 111 (513) , 132-144. https://doi.org/10.1080/01621459.2014.994705

Identifiers

DOI
10.1080/01621459.2014.994705
arXiv
1401.5755

Data Quality

Data completeness: 84%