Generalized Additive Models

Description

An intrinsically interpretable modelling technique that extends linear models by allowing flexible, nonlinear relationships between individual features and the target whilst maintaining the additive structure that preserves transparency. Each feature's effect is modelled separately as a smooth function, visualised as a curve showing how the feature influences predictions across its range. GAMs achieve this through spline functions or other smoothing techniques that capture complex patterns in individual variables without interactions, making them particularly valuable for domains requiring both predictive accuracy and model interpretability.

Example Use Cases

Transparency

Predicting hospital readmission risk with a GAM that provides transparent, auditable risk assessments by showing how readmission probability varies nonlinearly with patient age, blood pressure, and medication adherence, enabling clinicians to understand and trust the model's reasoning for regulatory compliance.

Explainability

Building a credit scoring model that explains loan decisions to applicants by visualising how income, credit history, and debt-to-income ratio individually affect approval likelihood, providing clear feature attributions that satisfy fair lending requirements and regulatory explainability mandates.

Reliability

Developing an environmental monitoring system that reliably predicts air quality using GAMs to model the smooth, nonlinear relationships between weather variables, ensuring stable predictions across seasonal variations whilst maintaining interpretable relationships that environmental scientists can validate.

Limitations

  • Cannot capture complex interactions between features unless explicitly modelled, limiting their ability to represent relationships where variables influence each other.
  • Setup requires domain expertise to decide which features need nonlinear treatment and appropriate smoothing parameters, making model specification more challenging than linear models.
  • Fitting process is computationally more expensive than linear models, particularly for large datasets with many features requiring smoothing.
  • Risk of overfitting individual feature relationships if smoothing parameters are not properly regularised, potentially reducing generalisation performance.
  • Interpretation complexity increases with the number of nonlinear features, as understanding multiple smooth curves simultaneously becomes cognitively demanding.

Resources

Generalized Additive Models
Research PaperTrevor Hastie and Robert TibshiraniJan 1, 1986
pyGAM: Generalized Additive Models in Python
Software Package
mgcv: Mixed GAM Computation Vehicle with Automatic Smoothness Estimation
Software Package
A Tour of pyGAM — pyGAM documentation
Tutorial

Tags

Assurance Goal Category:
Data Requirements:
Data Type:
Expertise Needed:
Explanatory Scope:
Technique Type: