Session 11
MATH 80667A: Experimental Design and Statistical Methods
HEC Montréal
Basics of causal inference
Basics of causal inferenceDirected acyclic graphs
Basics of causal inferenceDirected acyclic graphs Causal mediation
For individual i, we postulate the existence of a potential outcomes
Both are possible, but only one will be realized.
Observe outcome for a single treatment
With binary treatment Xi, I observe either Yi∣do(Xi=1) or Yi∣do(Xi=0).
i | Xi | Yi(0) | Yi(1) | Yi(1)−Yi(0) |
---|---|---|---|---|
1 | 1 | ? | 4 | ? |
2 | 0 | 3 | ? | ? |
3 | 1 | ? | 6 | ? |
4 | 0 | 1 | ? | ? |
5 | 0 | 5 | ? | ? |
6 | 1 | ? | 7 | ? |
Since we can't estimate individual treatment, we consider the average treatment effect (average over population) E{Y(1)−Y(0)}.
The latter can be estimated as
ATE=E(Y∣X=1)expected response amongtreatment group−E(Y∣X=0)expected response amongcontrol group
When is this a valid causal effect?
For the ATE to be equivalent to E{Y(1)−Y(0)}, the following are sufficient:
Experimental
You have control over which units get treatment
Experimental
You have control over which units get treatment
Observational
You don't have control over which units get treatment
Directed acyclic graphs (DAGs)
Directed: Each node has an arrow that points to another node
Acyclic: You can't cycle back to a node (and arrows only have one direction)
Graph: A set of nodes (variables) and vertices (arrows indicating interdependence)
Directed acyclic graphs (DAGs)
Graphical model of the process that generates the data
Maps your logical model
Confounding
Common cause
Causation
Mediation
Collision
Selection /
endogeneity
X causes Y
But Z causes both X and Y
Z confounds the X → Y association
What are the paths
between money and win margin?
Money → Margin
Money ← Quality → Margin
Quality is a confounder
Since we randomize assignment to treatment X, all arrows incoming in X are removed.
With observational data, we need to explicitly model the relationship and strip out the effect of X on Y.
X causes Y
X causes
Z which causes Y
Z is a mediator
X causes Z
Y causes Z
Should you control for Z?
Colliders can create
fake causal effects
Colliders can hide
real causal effects
Height is unrelated to basketball skill… among NBA players
A new collider bias teaching example. Sample selects on marriage (not divorced) so: satisfaction ––> [not divorced] <–– children (Richard McElreath, Apr 26, 2021 on Twitter)
Example of confounder: https://doi.org/10.1177/109467051454314
Confounding
Common cause Causal forks X ← Z → Y
Causation Mediation Causal chain X → Z → Y
Collision
Selection /
endogeneity
inverted fork X → Z ← Y
Postulated DAG for the effect of smoking on fetal alcohol spectrum disorders (FASD)
Source: Andrew Heiss (?), likely from
McQuire, C., Daniel, R., Hurt, L. et al. The causal web of foetal alcohol spectrum disorders: a review and causal diagram. Eur Child Adolesc Psychiatry 29, 575–594 (2020). https://doi.org/10.1007/s00787-018-1264-3
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
Session 11
MATH 80667A: Experimental Design and Statistical Methods
HEC Montréal
Basics of causal inference
Basics of causal inferenceDirected acyclic graphs
Basics of causal inferenceDirected acyclic graphs Causal mediation
For individual i, we postulate the existence of a potential outcomes
Both are possible, but only one will be realized.
Observe outcome for a single treatment
With binary treatment Xi, I observe either Yi∣do(Xi=1) or Yi∣do(Xi=0).
i | Xi | Yi(0) | Yi(1) | Yi(1)−Yi(0) |
---|---|---|---|---|
1 | 1 | ? | 4 | ? |
2 | 0 | 3 | ? | ? |
3 | 1 | ? | 6 | ? |
4 | 0 | 1 | ? | ? |
5 | 0 | 5 | ? | ? |
6 | 1 | ? | 7 | ? |
Since we can't estimate individual treatment, we consider the average treatment effect (average over population) E{Y(1)−Y(0)}.
The latter can be estimated as
ATE=E(Y∣X=1)expected response amongtreatment group−E(Y∣X=0)expected response amongcontrol group
When is this a valid causal effect?
For the ATE to be equivalent to E{Y(1)−Y(0)}, the following are sufficient:
Experimental
You have control over which units get treatment
Experimental
You have control over which units get treatment
Observational
You don't have control over which units get treatment
Directed acyclic graphs (DAGs)
Directed: Each node has an arrow that points to another node
Acyclic: You can't cycle back to a node (and arrows only have one direction)
Graph: A set of nodes (variables) and vertices (arrows indicating interdependence)
Directed acyclic graphs (DAGs)
Graphical model of the process that generates the data
Maps your logical model
Confounding
Common cause
Causation
Mediation
Collision
Selection /
endogeneity
X causes Y
But Z causes both X and Y
Z confounds the X → Y association
What are the paths
between money and win margin?
Money → Margin
Money ← Quality → Margin
Quality is a confounder
Since we randomize assignment to treatment X, all arrows incoming in X are removed.
With observational data, we need to explicitly model the relationship and strip out the effect of X on Y.
X causes Y
X causes
Z which causes Y
Z is a mediator
X causes Z
Y causes Z
Should you control for Z?
Colliders can create
fake causal effects
Colliders can hide
real causal effects
Height is unrelated to basketball skill… among NBA players
A new collider bias teaching example. Sample selects on marriage (not divorced) so: satisfaction ––> [not divorced] <–– children (Richard McElreath, Apr 26, 2021 on Twitter)
Example of confounder: https://doi.org/10.1177/109467051454314
Confounding
Common cause Causal forks X ← Z → Y
Causation Mediation Causal chain X → Z → Y
Collision
Selection /
endogeneity
inverted fork X → Z ← Y
Postulated DAG for the effect of smoking on fetal alcohol spectrum disorders (FASD)
Source: Andrew Heiss (?), likely from
McQuire, C., Daniel, R., Hurt, L. et al. The causal web of foetal alcohol spectrum disorders: a review and causal diagram. Eur Child Adolesc Psychiatry 29, 575–594 (2020). https://doi.org/10.1007/s00787-018-1264-3