class: center, middle, inverse, title-slide .title[ # RD and Instrumental Variables ] .subtitle[ ## EDLD 650: Week 5 ] .author[ ### David D. Liebowitz ] --- <style type="text/css"> .inverse { background-color : #2293bf; } </style> # Agenda ### 1. Roadmap (9:00-9:05) ### 2. Holden paper and DARE #2 (9:05-10:20) - Discussion questions - DARE debrief ### 3. Break (10:20-10:35) ### 4. Instrumental variables (10:35-11:40) ### 5. Wrap-up (11:40-11:50) --- # Roadmap <img src="causal_id.jpg" width="1707" style="display: block; margin: auto;" /> --- # Goals ### 1. Conduct and interpret RD analysis in simplified data ### 2. Assess the basic assumptions of the RD design ### 3. Describe the conceptual and simple mathematical approach for identifying causal effects using the instrumental variables approach --- class: middle, inverse # Class 5 Discussion Questions --- class: middle, inverse # You DARE-devils! --- class: middle, inverse # Break --- class: middle, inverse # Instrumental variables --- # The set-up .blue[**Consider the following relationship we would like to estimate:**] **The effect of a treatment `\((D_{i})\)` on an outcome of interest `\((Y_{i})\)`:** `$$Y_{i} = \beta_{0} + \beta_{1}D_{i} + \varepsilon_{i}$$` -- **Now, let's apply this to a particular context:** `$$INCOME_{i} = \beta_{0} + \beta_{1}COLLEGE_{i} + \varepsilon_{i}$$` -- .red[**Don't be so crass!**] -- `$$VOTE_{i} = \beta_{0} + \beta_{1}COLLEGE_{i} + \textbf{X}_{i}\theta + \varepsilon_{i}$$` -- > Describe to your neighbor using the language of causal inference (*omitted variable bias, endogenous, causality, selection bias*) what is wrong with fitting this last regression in a nationally representative sample of adults for which we have records of their voting participation, highest level of education and rich demographic covariates `\((\textbf{X}_{i})\)`. --- # A mysterious solution .blue[**Can we fix this without a sudden change across time and geography that we might term a "natural" experiment?**] -- - What if we have data on another mysterious variable...? - Let's call this variable an "instrument" and assign it the letter `\(Z_{i}\)` for each individual `\(i\)` - Let's suppose that it predicts treatment `\((D_{i})\)` and is itself exogenously determined .large[.blue[**What would this mean???**]] --- # A mysterious "instrument" <img src="iv1.jpg" width="1707" style="display: block; margin: auto;" /> -- .pull-left[ **OLS estimate**: ratio of the area of *overlap of `\(Y\)` and `\(D\)`* to the *total area of `\(D\)`*: ] .pull-right[ `$$\hat{\beta}_{1}^{OLS} = \frac{S_{YD}}{S_{D}^{2}}$$` ] --- # A mysterious "instrument" <img src="iv2.jpg" width="1707" style="display: block; margin: auto;" /> -- .pull-left[ **IV estimate**: ratio of area of *overlap of `\(Y\)` and `\(Z\)`* to area of *overlap of `\(D\)` and `\(Z\)`*. Depends entirely on variation in `\(Z\)` that predicts variation in `\(Y\)` and `\(D\)`: ] .pull-right[ `$$\hat{\beta}_{1}^{IVE} = \frac{S_{YD}}{S_{DZ}}$$` a .red-pink[**Local Average Treatment Effect**] ] --- # An instrument? **But what serves as a helpful and valid instrument?** -- ### Valid instruments: 1. Instrument `\((Z_{i})\)` must be correlated with treatment `\((D_{i})\)`, .red[*but*] 2. Instrument `\((Z_{i})\)` must be orthogonal `\((\perp)\)` to all other determinants of the outcome `\((Y_{i})\)` - Another way of saying it must be uncorrelated with the residuals `\((\varepsilon_{i})\)` 3. Instrument must be related to the outcome .red[*only*] through the treatment - This is known as the .red-pink[**exclusion restriction**] (we'll come back to this) -- .blue[Can you think of things that might serve as good instruments in the example of college attendance ("treatment") and voting ("outcome")?] --- ## Two-stage least squares (2SLS) IV ### 1<sup>st</sup> stage: Regress endogenous treatment `\((D_{i})\)` on instrumental variable `\((Z_{i})\)` using OLS: `$$D_{i} = \alpha_{0} + \alpha_{1}Z_{i} + \nu_{i}$$` Obtain the *predicted values* of the treatment `\((\hat{D_{i}})\)` from this fit.<sup>[1]</sup> .footnote[[1] This doesn't get the standard errors correct, have to adjust *post-hoc*, but this is automated in all statistical software.] -- ### 2<sup>nd</sup> stage: Regress outcome `\((Y_{i})\)` on predicted values of treatment `\((\hat{D_{i}})\)` using OLS: `$$Y_{i} = \beta_{0} + \beta_{1}\hat{D_{i}} + \varepsilon_{i}$$` --- # IV assumptions (re-stated) Stage 1: `\(D_{i} = \alpha_{0} + \alpha_{1}Z_{i} + \nu_{i}\)` Stage 2: `\(Y_{i} = \beta_{0} + \beta_{1}\hat{D_{i}} + \varepsilon_{i}\)` ### Assumptions: 1. .red-pink[Instrument must be correlated with the endogenous predictor (i.e., cannot be a **"weak" instrument**)] 2. Instrument must *not* be correlated with the residuals in the first stage equation `\((\sigma_{Z\nu} = 0)\)` 3. Instrument must *not* be correlated with the residuals in the second stage equation `\((\sigma_{Z\varepsilon} = 0)\)` --- # IV assumptions ### Assumptions: 1. .red-pink[Instrument must be correlated with the endogenous predictor (i.e., cannot be a **"weak" instrument**)] 2. Instrument must *not* be correlated with the residuals in the first stage equation `\((\sigma_{Z\nu} = 0)\)` 3. Instrument must *not* be correlated with the residuals in the second stage equation `\((\sigma_{Z\varepsilon} = 0)\)` -- .purple[**Problems with #1:**] - If `\(Z\)` does not predict `\(D\)`, it would be a .red-pink[**"weak instrument"**] - There would be no (minimal) variation in the obtained predicted values of the question predictor in the second stage - The estimated regression slope would be *indeterminate* (or close to it) --- # IV assumptions ### Assumptions: 1. Instrument must be correlated with the endogenous predictor (i.e., cannot be a **"weak" instrument**) 2. .red-pink[Instrument must *not* be correlated with the residuals in the first stage equation] `\(\color{red}{(\sigma_{Z\nu} = 0)}\)` 3. Instrument must *not* be correlated with the residuals in the second stage equation `\((\sigma_{Z\varepsilon} = 0)\)` -- .purple[**Problems with #2:**] - If `\(Z\)` is correlated with `\(\nu_{i}\)`, then `\(Z\)` would be endogenous in the first stage equation - The values of the question predictor would be replaced by biased predicted values, and the estimated regression coefficient would be biased in ways similar to biased multi-variate regression models --- # IV assumptions ### Assumptions: 1. Instrument must be correlated with the endogenous predictor (i.e., cannot be a **"weak" instrument**) 2. Instrument must *not* be correlated with the residuals in the first stage equation `\((\sigma_{Z\nu} = 0)\)` 3. .red-pink[Instrument must *not* be correlated with the residuals in the second stage equation] `\(\color{red}{(\sigma_{Z\varepsilon} = 0)}\)` -- .purple[**Problems with #3:**] The statistical basis for IV of a potentially endogenous relationship is that: $$ `\begin{aligned} \beta_{1} = \left( \frac{\sigma_{YZ}}{\sigma_{DZ}} \right) - \left( \frac{\sigma_{\varepsilon Z}}{\sigma_{DZ}} \right) \end{aligned}` $$ where `\(\sigma_{YZ}\)` is the population covariance of outcome `\(Y\)` and instrument `\(Z\)`, `\(\sigma_{\varepsilon Z}\)` is the population covariation of residual `\(\varepsilon\)` and instrument `\(Z\)`, and `\(\sigma_{DZ}\)` is the population covariance of treatment `\(D\)` and instrument `\(Z\)`. --- # IV assumptions ### Assumptions: 1. Instrument must be correlated with the endogenous predictor (i.e., cannot be a **"weak" instrument**) 2. Instrument must *not* be correlated with the residuals in the first stage equation `\((\sigma_{Z\nu} = 0)\)` 3. .red-pink[Instrument must *not* be correlated with the residuals in the second stage equation] `\(\color{red}{(\sigma_{Z\varepsilon} = 0)}\)` .purple[**Problems with #3:**] The statistical basis for IV of a potentially endogenous relationship is that: $$ `\begin{aligned} \beta_{1} = \left( \frac{\sigma_{YZ}}{\sigma_{DZ}} \right) - \left( \frac{\sigma_{\varepsilon Z}}{\sigma_{DZ}} \right) \end{aligned}` $$ .red[**as long as**]: (1) `\(\sigma_{\varepsilon Z} = 0\)`; and (2) `\(\sigma_{DZ} \neq 0\)`. -- If `\(Z\)` correlated with `\(\varepsilon\)`, then `\(\sigma_{\varepsilon Z} \neq 0\)` and `\(\beta_{1}\)` will be biased. --- ## Exclusion restriction (visually) **Those DAG-gone things come back...** <img src="exclusion.jpg" width="1503" style="display: block; margin: auto;" /> -- The .red-pink[**exclusion restriction**] states that the path by which the instrument influences the outcome goes **exclusively** through the endogenous predictor. --- # Instrument examples | Outcome | Treatment | Omitted variables | Instrument |------------------------------------------------------------------------------------------------- | Health | Smoking cigarettes | Other neg health behaviors| [Tobacco taxes](https://www.sciencedirect.com/science/article/pii/S0047272798000905) | Labor market success| Assimilation | Ability; motivation | [Scrabble score of name](https://www.journals.uchicago.edu/doi/10.1086/692531) | Crime rate | Patrol hours | # of criminals | [Election cycles](https://www.jstor.org/stable/2951346) | Female labor market | Number of children | Family preferences; religiosity | [First two children same-sex](https://www.jstor.org/stable/116844); [twin births](https://www.jstor.org/stable/2117765) | Conflicts | Economic growth | Simultaneous causality | Rainfall 😄 -- .small[.blue[Can you think of a good instrument in the example of college and voting (**turn and talk**)?] *Hint: try to find something that exogenously predicts college attendance but is unrelated to voting.*] -- .small[*"A necessary but not a sufficient condition for having an instrument that can satisfy the exclusion restriction is if people are confused when you tell them about the instrument’s relationship to the outcome.” (Cunningham, 2021, p. 321)*] --- # Very special instruments | Outcome | Treatment | Omitted variables | Instrument |----------------------------------------------------------------------------------------------------------------- | Test scores | Voucher-based private school attendance| Non-compliance | Original NYSP lottery | Reading achievement | Class size | Incomplete compliance to Maimonides' Rule| Rule-based assignment | Reading achievement | Randomly assigned reading intervention | Incomplete compliance; attrition | Assignment to intervention -- All instances in which assignment to treatment is *as-good-as random* but there is .red-pink[**imperfect compliance**]. --- # NY vouchers (Ch. 4 *MM*) **Recall the NY Voucher experiment from Week 1** - Low-income families *randomized* by lottery to **treatment** and **control** groups + Treatment families *received a voucher* to cover partial tuition costs at private schools; + Control families *received no voucher* - Subsequent academic achievement measured for participating children: -- `$$READ_{i} = \beta_{0} + \beta_{1}VOUCHER_{i} + \textbf{X}_{i}\gamma + \varepsilon_{i}$$` -- - `\(\beta_{1}\)` represents the causal effect of **voucher receipt** on reading achievement - Because children were randomly assigned to the experimental conditions, predictor `\(VOUCHER_{i}\)` is exogenous and children in the “Voucher” and “No Voucher” conditions are equal in expectation, prior to treatment. - We can obtain an unbiased estimate of `\(\beta_{1}\)` straightforwardly, using OLS regression analysis. --- # But wait a minute... While families were randomly assigned to "Voucher" and "No Voucher" conditions, **actual attendance** at private versus public schools was *not* randomly assigned. -- .pull-left[ In first year of experiment, **5% of kids** whose families *did not receive vouchers* went to private school anyway - Families wanted their kids taught in a religious setting - Families wanted their kids out of public schools - Families had greater financial and social resources ] -- .pull-right[ In first year of experiment, **18% of kids** whose families *received vouchers* still went to public school - Families lived too far from school and couldn't transport - Families couldn't make up difference of private school fees - Families didn't feel welcome in private school setting ] -- .small[ As a result of these unobserved choices: 1. **Attendance** at public and private school was *not* assigned exogenously 2. Children who **attended** each kind of school were **not .red-pink[equal in expectation]** beforehand ] --- ## Same ole' endogeneity problem Want to estimate: `$$READ_{i} = \beta_{0} + \beta_{1}PRIVATE_{i} + \textbf{X}_{i}\gamma + \varepsilon_{i}$$` but, unobserved characteristics (such as school accessibility, family resources, motivation, etc.) may determine whether the child goes to private school and also determine her outcomes. -- Because these **unobserved characteristics** are omitted as explicit predictors but affect the outcome, their effects are present in the residual `\((\varepsilon_{i})\)`. -- Consequently, `\(PRIVATE_{i}\)` will be correlated with the residuals and an OLS estimate of `\(\beta_{1}\)` will be biased! --- # An IV solution! - Offer of voucher *was* .red-pink[**randomized and exogenous**] - Offer of voucher likely to be correlated with **attendance** at a private or a public school because many families who got the voucher did in fact use it, and many families who didn’t sent their kids to public school - Being randomized to voucher receipt is unlikely to predict the child’s ultimate achievement, *except through its impact on private school attendance* + No .red-pink[**third path**]! -- We can use instrumental variable estimation, with attendance at private school `\((PRIVATE_{i})\)` as the endogenous question predictor and lottery-based receipt of a voucher `\((VOUCHER_{i})\)` as the instrument! --- # An IV solution! We can use instrumental variable estimation, with attendance at private school `\((PRIVATE_{i})\)` as the endogenous question predictor and lottery-based receipt of a voucher `\((VOUCHER_{i})\)` as the instrument! .blue[**How would you write this?**] -- **1<sup>st</sup> stage:** `$$PRIVATE_{i} = \alpha_{0} + \alpha_{1}VOUCHER_{i} + \alpha_{2}READ^{pre}_{i} + \delta_{i}$$` -- **2<sup>nd</sup> stage:** `$$READ^{post}_{i} = \beta_{0} + \beta_{1} \hat{PRIVATE}_{i} + \beta_{2}READ^{pre}_{i} + \varepsilon_{i}$$` -- Note the inclusion of baseline reading scores `\((READ^{pre}_{i})\)` to improve precision and the inclusion of all covariates from Stage 1 in Stage 2! -- `\(\beta_{1}\)` is our causal parameter of interest and represents the estimated .red-pink[**Local Average Treatment Effect (LATE)**] of attending private school on lagged-score adjusted reading scores. .small[.blue[*But to whom is it "local"? To whom do these estimates pertain?*]] --- ## On whom does IV depend? .small[This is a helpful tool for any causal analysis that relies on an original randomized offer, but is followed by endogenous “take-up.” It also provides considerable insight into what an IV estimator is actually estimating:] -- .small[ | | Never takers | Always takers | Compliers |----------------------------------------------------------------------------------------------------------------- | | *(never accept treatment)* | *(always seek out and obtain treatment)* | *(accept treatment if assigned; accept control if assigned)* | **Voucher=1** | "Not treated" | "Treated" | "Treated" | **Voucher=0** | "Not treated" | "Treated" | "Not treated" ] .small[An IV estimate is often referred to as the effect of the .blue[**Treatement on the Treated (TOT)**].] -- .small[Covariance algebra tells us that only the .red-pink[**compliers**] actually contribute to the IV estimate. IV is a LATE estimator, so only those participants whose actions respond to the instrument will participate in the estimate. .red-pink[**IV is the treatment effect for the compliers.**]<sup>[1]</sup>] .footnote[.small[[1] Another term for the LATE you may encounter is the Causal Average Complier Effect (CACE).]] -- .small[This approach assumes that there are no .red-pink[**defiers**]. These are people who seek out private schools *only when they don't receive the voucher* (otherwise would have gone to public) and vice-versa.] --- class: middle, inverse # Wrap-up --- # Goals ### 1. Conduct and interpret RD analysis in simplified data ### 2. Assess the basic assumptions of the RD design ### 3. Describe the conceptual and simple mathematical approach for identifying causal effects using the instrumental variables approach --- # To-Dos ### Week 6: Instrumental variables ### Readings: - Murnane & Willett (2011), *MM* Chapters 10-11 - Dee (2004) Are there civic returns to education? - Angrist et al. (2016) Effects of Boston charter schools - Further, *MHE*: Ch. 4; *'Metrics*: Ch. 3, *Mixtape*: Ch. 7 ### Assignments Due: - DARE #3: 2/18, 11:59pm --- # Feedback ## Plus/Deltas Front side of index card ## Clear/Murky On back