Relabelling
Description
A preprocessing fairness technique that modifies class labels in training data to achieve equal positive outcome rates across protected groups. Also known as 'data massaging', this method identifies instances that contribute to discriminatory patterns and flips their labels (from positive to negative or vice versa) to balance the proportion of positive outcomes between demographic groups. The technique aims to remove historical bias from training data whilst preserving the overall class distribution, enabling standard classifiers to learn from discrimination-free datasets.
Example Use Cases
Fairness
Preprocessing historical hiring datasets by relabelling borderline cases to ensure equal hiring rates across gender and ethnicity groups, correcting for past discriminatory practices whilst maintaining overall qualification standards for fair recruitment model training.
Transparency
Creating transparent credit scoring datasets by documenting which loan applications had labels modified to address historical lending discrimination, providing clear audit trails showing how training data bias was systematically corrected before model development.
Reliability
Improving reliability of medical diagnosis training data by relabelling cases where demographic bias may have influenced historical diagnoses, ensuring models learn from corrected labels that reflect true medical conditions rather than biased historical treatment patterns.
Limitations
- Altering training labels risks introducing new biases or artificial patterns that may not reflect genuine relationships in the data.
- Deciding which instances to relabel requires careful selection criteria and domain expertise to avoid inappropriate label changes.
- May reduce prediction accuracy if too many labels are changed, particularly when the modifications conflict with genuine patterns in the data.
- Requires access to ground truth or expert knowledge to determine whether original labels reflect genuine outcomes or discriminatory bias.
- Effectiveness depends on accurate identification of discriminatory instances, which can be challenging when bias patterns are subtle or complex.