Model Development Biases

../_images/m.png

Fig. 3 A simplified schematic of a project worklow.

Aggregation Bias

Deliberative Prompts

  • Are there any evaluative methods (e.g. model comparison) that can help you identify aggregation bias?

Related biases: Representation Bias, Label Bias

Evaluation Bias

Deliberative Prompts

  • How will you divide your dataset into separate training and testing datasets?

  • Will you validate the model against an external benchmark population? If not, have you taken steps to report these limitations?

Related biases: Representation Bias

Confounding

Deliberative Prompts

  • Are there methods you can use (e.g. propensity score matching, causal diagrams) that could help reduce bias that results from confounding (e.g. in the estimation of the average treatment effect)?

  • Is the sample size sufficient (i.e. large enough) to minimise the impact of confounders?

Related biases: Collider Bias

Collider Bias

Deliberative Prompts

  • Is it possible that the exposure and outcome of interest under study could be independently driving inclusion within the sample?

  • Are there methods or techniques (e.g. causal diagrams) that could help you identify the effects of colliders?

Related biases: Confounding, Admission Rate Bias

Training-Serving Skew

Deliberative Prompts

  • What steps have you taken to measure and evaluate the performance of your model within the intended domain (e.g. use of synthetic data, external validation on similar datasets)?

  • Have you engaged relevant stakeholder groups to ensure these steps are adequate (e.g. sufficiently representative of the impacted users)?

Related biases: Representation Bias