Explanatory scope: local

Provides explanations for individual predictions

23 techniques
Technique | Goals | Models | Data Types | Description
SHapley Additive exPlanations | Algorithmic | Architecture/model Agnostic; Requirements/black Box | Any | SHAP explains model predictions by quantifying how much each input feature contributes to the outcome. It assigns an...
Integrated Gradients | Algorithmic | Architecture/neural Networks; Paradigm/parametric; +3 more | Any | Integrated Gradients is an attribution technique that explains a model's prediction by quantifying the contribution of...
DeepLIFT | Algorithmic | Architecture/neural Networks; Requirements/white Box; +1 more | Any | DeepLIFT (Deep Learning Important FeaTures) explains neural network predictions by decomposing the difference between...
Layer-wise Relevance Propagation | Algorithmic | Architecture/neural Networks; Paradigm/parametric; +2 more | Any | Layer-wise Relevance Propagation (LRP) explains neural network predictions by working backwards through the network to...
Contextual Decomposition | Algorithmic | Architecture/neural Networks/recurrent; Requirements/white Box; +1 more | Text | Contextual Decomposition explains LSTM and RNN predictions by decomposing the final hidden state into contributions from...
Taylor Decomposition | Algorithmic | Architecture/neural Networks; Requirements/gradient Access; +2 more | Any | Taylor Decomposition is a mathematical technique that explains neural network predictions by computing first-order and...
Local Interpretable Model-Agnostic Explanations | Algorithmic | Architecture/model Agnostic; Requirements/black Box | Any | LIME (Local Interpretable Model-agnostic Explanations) explains individual predictions by approximating the complex...
Individual Conditional Expectation Plots | Visualization | Architecture/model Agnostic; Requirements/black Box | Any | Individual Conditional Expectation (ICE) plots display the predicted output for individual instances as a function of a...
Saliency Maps | Algorithmic | Architecture/neural Networks; Requirements/differentiable; +1 more | Image | Saliency maps are visual explanations for image classification models that highlight which pixels in an image most...
Gradient-weighted Class Activation Mapping | Algorithmic | Architecture/neural Networks/convolutional; Requirements/architecture Specific; +2 more | Image | Grad-CAM creates visual heatmaps showing which regions of an image a convolutional neural network focuses on when making...
Occlusion Sensitivity | Algorithmic | Architecture/model Agnostic; Requirements/black Box | Image | Occlusion sensitivity tests which parts of the input are important by occluding (masking or removing) them and seeing...
Classical Attention Analysis in Neural Networks | Algorithmic | Architecture/neural Networks/recurrent; Requirements/architecture Specific; +1 more | Any | Classical attention mechanisms in RNNs and CNNs create alignment matrices and temporal attention patterns that show how...
Influence Functions | Algorithmic | Architecture/linear Models; Architecture/neural Networks; +6 more | Any | Influence functions quantify how much each training example influenced a model's predictions by computing the change in...
Contrastive Explanation Method | Algorithmic | Architecture/neural Networks; Paradigm/discriminative; +4 more | Any | The Contrastive Explanation Method (CEM) explains model decisions by generating contrastive examples that reveal what...
ANCHOR | Algorithmic | Architecture/model Agnostic; Requirements/black Box | Any | ANCHOR generates high-precision if-then rules that explain individual predictions by identifying the minimal set of...
Counterfactual Fairness Assessment | Algorithmic | Architecture/model Agnostic; Paradigm/supervised; +1 more | Any | Counterfactual Fairness Assessment evaluates whether a model's predictions would remain unchanged if an individual's...
Sensitivity Analysis for Fairness | Algorithmic | Architecture/model Agnostic; Paradigm/supervised; +2 more | Any | Sensitivity Analysis for Fairness systematically evaluates how model predictions change when sensitive attributes or...
Neuron Activation Analysis | Algorithmic | Architecture/neural Networks; Requirements/model Internals; +1 more | Text | Neuron activation analysis examines the firing patterns of individual neurons in neural networks by probing them with...
Causal Mediation Analysis in Language Models | Mechanistic Interpretability | Architecture/neural Networks/transformer; Architecture/neural Networks/transformer/llm; +3 more | Text | Causal mediation analysis in language models is a mechanistic interpretability technique that systematically...
Feature Attribution with Integrated Gradients in NLP | Algorithmic | Architecture/neural Networks/transformer; Architecture/neural Networks/transformer/llm; +4 more | Text | Applies Integrated Gradients to natural language processing models to attribute prediction importance to individual...
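
To make the "local" scope concrete, the sketch below explains a single prediction in two of the black-box styles listed above: exact Shapley attributions (the idea behind the SHapley Additive exPlanations entry) and a simple occlusion score (a tabular analogue of the Occlusion Sensitivity entry). It is an illustration under stated assumptions, not the SHAP library or any catalogued implementation: `toy_model`, `shapley_values`, `occlusion_scores`, and the all-zero `baseline` are hypothetical names and choices, and "removing" a feature is assumed to mean replacing it with its baseline value.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley attributions for one input x relative to a baseline.

    Assumption: a feature outside the coalition takes its baseline value.
    Cost grows as 2^n model calls per feature, so this suits tiny inputs only.
    """
    n = len(x)

    def value(coalition):
        # Model output when only the features in `coalition` keep their real value.
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Standard Shapley weight |S|! (n - |S| - 1)! / n!
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def occlusion_scores(predict, x, baseline):
    """Drop in the prediction when one feature at a time is replaced by its baseline."""
    full = predict(x)
    scores = []
    for i in range(len(x)):
        z = list(x)
        z[i] = baseline[i]  # occlude feature i only
        scores.append(full - predict(z))
    return scores


def toy_model(z):
    # Hypothetical black box: one additive term and one interaction term.
    return 2.0 * z[0] + z[1] * z[2]


x = [1.0, 2.0, 3.0]
baseline = [0.0, 0.0, 0.0]

phi = shapley_values(toy_model, x, baseline)
occ = occlusion_scores(toy_model, x, baseline)
print("Shapley:", phi)
print("Occlusion:", occ)
```

On this toy model the Shapley attributions come out at approximately 2, 3, and 3, which sum to the gap between the prediction at `x` (8) and at the baseline (0). The occlusion scores are 2, 6, and 6, because the interaction between the second and third features is charged in full to each of them. This is one reason the two entries can rank features differently for the same prediction.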
Page 1 of 2