Generalized Additive Models

Description

An intrinsically interpretable modelling technique that extends linear models by allowing flexible, nonlinear relationships between individual features and the target whilst maintaining the additive structure that preserves transparency. Each feature's effect is modelled separately as a smooth function, visualised as a curve showing how the feature influences predictions across its range. GAMs achieve this through spline functions or other smoothing techniques that capture complex patterns in individual variables without interactions, making them particularly valuable for domains requiring both predictive accuracy and model interpretability.

Example Use Cases

Transparency

Predicting hospital readmission risk with a GAM that provides transparent, auditable risk assessments by showing how readmission probability varies nonlinearly with patient age, blood pressure, and medication adherence, enabling clinicians to understand and trust the model's reasoning for regulatory compliance.

Explainability

Building a credit scoring model that explains loan decisions to applicants by visualising how income, credit history, and debt-to-income ratio individually affect approval likelihood, providing clear feature attributions that satisfy fair lending requirements and regulatory explainability mandates.

Reliability

Developing an environmental monitoring system that reliably predicts air quality using GAMs to model the smooth, nonlinear relationships between weather variables, ensuring stable predictions across seasonal variations whilst maintaining interpretable relationships that environmental scientists can validate.

Limitations

  • Cannot capture complex interactions between features unless explicitly modelled, limiting their ability to represent relationships where variables influence each other.
  • Setup requires domain expertise to decide which features need nonlinear treatment and appropriate smoothing parameters, making model specification more challenging than linear models.
  • Fitting process is computationally more expensive than linear models, particularly for large datasets with many features requiring smoothing.
  • Risk of overfitting individual feature relationships if smoothing parameters are not properly regularised, potentially reducing generalisation performance.
  • Interpretation complexity increases with the number of nonlinear features, as understanding multiple smooth curves simultaneously becomes cognitively demanding.

Resources

Research Papers

Generalized Additive Models
Trevor Hastie and Robert TibshiraniJan 1, 1986

Software Packages

pyGAM
Jan 19, 2017

[CONTRIBUTORS WELCOME] Generalized Additive Models in Python

mgcv: Mixed GAM Computation Vehicle with Automatic Smoothness Estimation
Jan 1, 2024

Documentations

A Tour of pyGAM — pyGAM documentation
Pygam DevelopersJan 1, 2018

Tags

Explainability Dimensions

Surrogate Models:
Visualization Methods:
Explanatory Scope:

Other Categories

Data Requirements:
Data Type:
Expertise Needed:
Technique Type: