class: center, middle, inverse, title-slide .title[ # DD and Regression Discontinuity ] .subtitle[ ## EDLD 650: Week 3 ] .author[ ### David D. Liebowitz ] --- <style type="text/css"> .inverse { background-color : #2293bf; } </style> # Agenda .large[ 1. Roadmap and Goals (9:00-9:10) 2. Analyzing Liebowitz, Porter & Bragg (9:10-10:10) 3. How DARE you! (10:10-10:45) - More questions? - Preview debrief 4. Break (10:45-11:00) 5. Regression discontinuity (11:00-11:40) 6. Wrap-up (11:40-11:50) - Research project proposal - Plus/Deltas & Clear/Murky ] --- # Roadmap <img src="causal_id.jpg" width="1707" style="display: block; margin: auto;" /> --- # Goals ### 1. Describe threats to validity in difference-in-differences (DD) identification strategy and approaches to address these threats. ### 2. Conduct applied difference-in-difference analysis and interpret these results. ### 3. Describe conceptual approach to regression discontinuity analysis --- class: middle, inverse # So random... --- class: middle, inverse # How DARE you?!?! --- # DARE 1 review process .blue[*...we'll do this next class...*] .large[ 1. Form groups of 2-3 (not your DARE partner) 2. Independently review DARE #1 model response (10 min) - Take notes on where answers differ from model - Think about conceptual questions (modeling decisions, robustness checks, etc.) - Think about presentation and language (what to leave in, what to leave out? how to describe results?) 3. Discuss in group conceptual questions (15 min) 4. Independently review code posted on course website (5 min) 5. Join partner (for solo authors join another author) 6. Discuss coding choices and questions (10 min) 7. Whole-group debrief (10 min) - Lingering questions - Process ] --- class: middle, inverse # Break --- class: middle, inverse # Discontinuities everywhere! --- # Recall the basic set up of DD <img src="EDLD_650_3_RD_1_files/figure-html/unnamed-chunk-3-1.png" style="display: block; margin: auto;" /> -- Time (literally the passage of years) can be considered a .blue[**forcing variable**]--a measure, movement along which in one direction or another "forces" participants into treatment or control status. --- # DD's cousin: C-ITS **The Comparative-Interrupted Time Series approach:** <img src="EDLD_650_3_RD_1_files/figure-html/unnamed-chunk-4-1.png" style="display: block; margin: auto;" /> -- .small[In this way, DD may just be a specific application of a causal identification framework known as the **Regression Discontinuity (RD)**] design.<sup>1</sup> .footnote[[1] In fact, many believe that repeated measures over time (a discrete ordered variable) are not divisible in same way as true continuous forcing variables are (e.g., test score, height, draft number). .red[*This is crucial to determine the credibility of identification.*]] --- # C-ITS considered .purple[**Strengths**] - Takes advantage of full range of data - Compared to mean-effect-only DD, allows differentiation of discontinuous jump vs. post-trend - Permits modeling of fully flexible functional form (can include quadratic, cubic, quartic relationships, interactions and more!) - Data-responsive approach -- .purple[**Weaknesses**] - Units on each side of discontinuity not .blue[*equal-in-expectation*] w/o other assumptions - Encourages over-fitting; risks generating unstable models - Estimates are functional-form dependent -- .small[**Note that a fully-saturated C-ITS model (i.e., a model that estimates a coefficient on an indicator for each time period) is identical to an event study.**] --- ## Failing (passing) graduation test .large[Massachusetts, like somee other states (including Oregon until very recently) requires students to pass a test in order to graduate. **How does just passing (or failing affect students)?**]<sup>1</sup> <img src="PapayMurnaneWillett2016.jpg" width="45%" style="display: block; margin: auto;" /> .footnote[[1] Papay, J.P., Murnane, R.J., & Willett, J.B. (2016). [The impact of test-score labels on human capital investment decisions.](http://jhr.uwpress.org/content/51/2/357) *Journal of Human Resources, 51*(2), 357-388.] --- # Failing a graduation test <img src="EDLD_650_3_RD_1_files/figure-html/unnamed-chunk-6-1.png" style="display: block; margin: auto;" /> --- # Failing a graduation test <img src="EDLD_650_3_RD_1_files/figure-html/unnamed-chunk-7-1.png" style="display: block; margin: auto;" /> --- # Failing a graduation test <img src="EDLD_650_3_RD_1_files/figure-html/unnamed-chunk-8-1.png" style="display: block; margin: auto;" /> --- # Failing a graduation test <img src="EDLD_650_3_RD_1_files/figure-html/unnamed-chunk-9-1.png" style="display: block; margin: auto;" /> --- # A different effect? <img src="EDLD_650_3_RD_1_files/figure-html/unnamed-chunk-10-1.png" style="display: block; margin: auto;" /> --- # Education FTW!<sup>1</sup> <img src="original_rdd.jpg" width="70%" style="display: block; margin: auto;" /> .footnote[[1] Thistlethwaite, D. L., & Campbell, D. T. (1960). [Regression-discontinuity analysis: An alternative to the ex post facto experiment](https://psycnet.apa.org/record/1962-00061-001). Journal of Educational Psychology, 51(6), 309–317] --- ## The benefits of being recognized <img src="origin_rdd_2.jpg" width="80%" style="display: block; margin: auto;" /> --- ## The basic set up in regression Given a continuous forcing variable `\(S_{i}\)` such that individuals receive a treatment `\((D_{i})\)` when `\(S_{i} \geq\)` a cutoff `\((C)\)`: `$$Y_i=\beta_{0} + \beta_{1} S_{i} + \mathbb{1}(S_{i} \geq C)\beta_{2} + \varepsilon_{i}$$` -- .blue[**Can you explain what is happening in this regression?**] -- .blue[**What about applied in a specific context?**] `$$p(COLL_{i}=1)= \beta_{0} + \beta_{1} TESTSCORE_{i} + 1(TESTSCORE_{i} \geq 60)\beta_{2} + \varepsilon_{i}$$` -- > This equation estimates a linear probability model, in which whether individuals attend college or not (expressed as a dichomotous indicator taking on the values of 0 or 1), is regressed on a linear measure of individual *i*'s test score `\((TESTSCORE_{i})\)` and a indicator variable that takes the value of 1 if individual *i* scored 60 or higher on the test. `\(\beta_{2}\)` is the causal parameter of interests and represents the discontinuous jump in the probability (p.p.) of attending college (adjusting for test score) of scoring just above the pass score. --- class: middle, inverse # Wrap-up --- # Goals ### 1. Describe threats to validity in difference-in-differences (DD) identification strategy and approaches to address these threats. ### 2. Conduct applied difference-in-difference analysis and interpret these results. ### 3. Describe conceptual approach to regression discontinuity analysis --- # To-Dos ## Week 4: Regression Discontinuity **Readings:** - Murnane & Willet, Ch. 9 - Angrist & Lavy (1999) - Dee & Penner (2017) - *Further*: MHE: Ch. 6, 'Metrics: Ch. 4, Mixtape: Ch. 6 **Research Project Proposal due 11:59pm, 1/28** - Talk to me! **DARE #2 due 11:59pm, 2/5** --- # Feedback ## Plus/Deltas Front side of index card ## Clear/Murky On back