Counterfactual Fairness Assessment

Description

Counterfactual Fairness Assessment evaluates whether a model's predictions would remain unchanged if an individual's protected attributes (race, gender, age) were different, whilst keeping all other causally legitimate factors constant. The technique requires constructing a causal graph that maps relationships between variables, then using do-calculus or structural causal models to simulate counterfactual scenarios. For example, it asks: 'Would this loan application still be approved if the applicant were a different race, holding constant their actual qualifications and economic circumstances?' This individual-level fairness criterion helps identify when decisions depend improperly on protected characteristics.

Example Use Cases

Fairness

Evaluating a hiring algorithm by testing whether qualified candidates would receive the same evaluation scores if their gender were different, whilst controlling for actual skills, experience, and education, revealing whether gender bias affects recruitment decisions.

Assessing a criminal sentencing model by examining whether defendants with identical criminal histories and case circumstances would receive the same sentence recommendations regardless of their race, identifying potential discriminatory patterns in judicial AI systems.

Limitations

  • Requires explicit specification of causal relationships between variables, which involves subjective assumptions about what constitutes legitimate versus illegitimate causal pathways.
  • May be mathematically impossible to satisfy simultaneously with other fairness criteria (like statistical parity), forcing practitioners to choose between competing fairness definitions.
  • Implementation complexity is high, requiring sophisticated causal inference techniques and structural causal models that are difficult to construct and validate.
  • Depends heavily on the quality and completeness of the causal graph, which may be incorrect or missing important confounding variables.

Resources

Counterfactual Fairness
Research PaperMatt J. Kusner et al.Dec 4, 2017
fairlearn/fairlearn
Software Package
Counterfactual Fairness in Text Classification through Robustness
Research PaperSahaj Garg et al.Sep 27, 2018

Tags

Applicable Models:
Data Requirements:
Data Type:
Evidence Type:
Explanatory Scope:
Fairness Approach:
Lifecycle Stage:
Technique Type: