06:00
Data Analytics and Visualization with R
Session 12
University of Mannheim
Spring 2023
06:00
You need to explore if women are more likely to quit politics after losing their first race than men. Discuss the following research design choices.
What candidates should be included in the sample? (e.g., only those who lost the election once? only those who decided to quit politics?)
What would be the unit of analysis?
What model would you use? Why?
Model-based vs. Design-based Identification
\[ \widehat{\text{Running for Office}} = \hat\beta_0 + \hat\beta_1 \text{Losing Elections} + \hat\beta_2 \text{Female} +\\ \hat\beta_3 (\text{Losing Elections} \times \text{Female}) \]
Pre mean | Post mean | ∆ (post − pre) | |
---|---|---|---|
Control | A (never treated) |
B (never treated) |
B − A |
Treatment | C (not yet treated) |
D (treated) |
D − C |
∆ (treatment − control) |
C − A | D − B | (B − A) − (D − C) or (B − D) − (A − C) |
Pre mean | Post mean | ∆ (post − pre) | |
---|---|---|---|
Control | A (never treated) |
B (never treated) |
B − A |
Treatment | C (not yet treated) |
D (treated) |
D − C |
∆ (treatment − control) |
A − C | B − D | (B − A) − (D − C) or (B − D) − (A − C) |
Pre mean | Post mean | ∆ (post − pre) | |
---|---|---|---|
Control | A (never treated) |
B (never treated) |
B − A |
Treatment | C (not yet treated) |
D (treated) |
D − C |
∆ (treatment − control) |
C − A | D − B | (B − A) − (D − C) or (B − D) − (A − C) |
Pre mean | Post mean | ∆ (post − pre) | |
---|---|---|---|
Control | A (never treated) |
B (never treated) |
B − A |
Treatment | C (not yet treated) |
D (treated) |
D − C |
∆ (treatment − control) |
C − A | D − B | (D − C) − (B − A) or (D − B) − (C − A) |
Pre mean | Post mean | ∆ (post − pre) | |
---|---|---|---|
Pennsylvania | 23.33 A |
21.17 B |
-2.16 B − A |
New Jersey | 20.44 C |
21.03 D |
0.59 D − C |
∆ (NJ − PA) |
-2.89 C − A |
-0.14 D − B |
(0.59) − (−2.16) = 2.75 |
\[\begin{aligned} \color{#440154FF}{Y_{it}}\ =\ &\alpha + \beta\ \color{#35608DFF}{\text{Group}_i} + \gamma\ \color{#22A884FF}{\text{Time}_t} + \delta\ \color{#800010}{(\text{Group}_i \times \text{Time}_t)} + \varepsilon_{it} \end{aligned}\]
α = Mean of control, pre-treatment
β = Increase in outcome across groups
γ = Increase in outcome over time within units
δ = Difference in differences!
Pre mean | Post mean | ∆ (post − pre) | |
---|---|---|---|
Control | α | α + γ | γ |
Treatment | α + β | α + β + γ + δ | γ + δ |
∆ (trtmt − ctrl) | β | β + δ | δ |