Description

Sobol Indices quantify how much each input feature contributes to the total variance in a model's predictions through global sensitivity analysis. The technique calculates first-order indices (individual feature contributions) and total-order indices (including all interaction effects involving that feature). By systematically sampling the input space and decomposing output variance, Sobol Indices reveal which features drive model uncertainty and which interactions between features are most important for predictions.

Example Use Cases

Explainability

Analysing a climate prediction model to determine which atmospheric parameters (temperature, humidity, pressure) contribute most to rainfall forecast uncertainty, helping meteorologists understand which measurements need the highest precision.

Evaluating a financial risk model to identify which economic indicators (interest rates, inflation, GDP growth) drive the most variability in portfolio value predictions, enabling better risk management strategies.

Fairness

Analysing a credit scoring model to quantify how much prediction variance stems from zip code (a potential proxy for race), helping identify features that may cause disparate impact across demographic groups.

Limitations

  • Computationally expensive, requiring thousands of model evaluations to achieve stable variance estimates, making it impractical for very slow models.
  • Assumes input features are independently distributed, which can lead to misleading results when features are correlated in real data.
  • Curse of dimensionality makes the technique increasingly difficult and expensive to apply as the number of input features grows beyond 10-20.
  • Requires defining appropriate probability distributions for input features, which may not accurately reflect real-world feature distributions.

Resources

Sobol Tensor Trains for Global Sensitivity Analysis
Research PaperRafael Ballester-Ripoll, Enrique G. Paredes, and Renato PajarolaDec 1, 2017
Sobol indices — UQpy v4.2.0 documentation
Documentation
Sobol Indices to Measure Feature Importance | Towards Data Science
Tutorial
Basics — SALib's documentation
Documentation
UQpy (Uncertainty Quantification with python)
Software Package
SALib/SALib
Software Package

Tags

Applicable Models:
Data Requirements:
Data Type:
Evidence Type:
Expertise Needed:
Explanatory Scope:
Lifecycle Stage:
Technique Type: