explanatory scope
local
Provides explanations for individual predictions
23 techniques
| Goals | Models | Data Types | Description | |||
|---|---|---|---|---|---|---|
| SHapley Additive exPlanations | Algorithmic | Architecture/model Agnostic Requirements/black Box | Any | SHAP explains model predictions by quantifying how much each input feature contributes to the outcome. It assigns an... | ||
| Integrated Gradients | Algorithmic | Architecture/neural Networks Paradigm/parametric +3 | Any | Integrated Gradients is an attribution technique that explains a model's prediction by quantifying the contribution of... | ||
| DeepLIFT | Algorithmic | Architecture/neural Networks Requirements/white Box +1 | Any | DeepLIFT (Deep Learning Important FeaTures) explains neural network predictions by decomposing the difference between... | ||
| Layer-wise Relevance Propagation | Algorithmic | Architecture/neural Networks Paradigm/parametric +2 | Any | Layer-wise Relevance Propagation (LRP) explains neural network predictions by working backwards through the network to... | ||
| Contextual Decomposition | Algorithmic | Architecture/neural Networks/recurrent Requirements/white Box +1 | Text | Contextual Decomposition explains LSTM and RNN predictions by decomposing the final hidden state into contributions from... | ||
| Taylor Decomposition | Algorithmic | Architecture/neural Networks Requirements/gradient Access +2 | Any | Taylor Decomposition is a mathematical technique that explains neural network predictions by computing first-order and... | ||
| Local Interpretable Model-Agnostic Explanations | Algorithmic | Architecture/model Agnostic Requirements/black Box | Any | LIME (Local Interpretable Model-agnostic Explanations) explains individual predictions by approximating the complex... | ||
| Individual Conditional Expectation Plots | Visualization | Architecture/model Agnostic Requirements/black Box | Any | Individual Conditional Expectation (ICE) plots display the predicted output for individual instances as a function of a... | ||
| Saliency Maps | Algorithmic | Architecture/neural Networks Requirements/differentiable +1 | Image | Saliency maps are visual explanations for image classification models that highlight which pixels in an image most... | ||
| Gradient-weighted Class Activation Mapping | Algorithmic | Architecture/neural Networks/convolutional Requirements/architecture Specific +2 | Image | Grad-CAM creates visual heatmaps showing which regions of an image a convolutional neural network focuses on when making... | ||
| Occlusion Sensitivity | Algorithmic | Architecture/model Agnostic Requirements/black Box | Image | Occlusion sensitivity tests which parts of the input are important by occluding (masking or removing) them and seeing... | ||
| Classical Attention Analysis in Neural Networks | Algorithmic | Architecture/neural Networks/recurrent Requirements/architecture Specific +1 | Any | Classical attention mechanisms in RNNs and CNNs create alignment matrices and temporal attention patterns that show how... | ||
| Influence Functions | Algorithmic | Architecture/linear Models Architecture/neural Networks +6 | Any | Influence functions quantify how much each training example influenced a model's predictions by computing the change in... | ||
| Contrastive Explanation Method | Algorithmic | Architecture/neural Networks Paradigm/discriminative +4 | Any | The Contrastive Explanation Method (CEM) explains model decisions by generating contrastive examples that reveal what... | ||
| ANCHOR | Algorithmic | Architecture/model Agnostic Requirements/black Box | Any | ANCHOR generates high-precision if-then rules that explain individual predictions by identifying the minimal set of... | ||
| Counterfactual Fairness Assessment | Algorithmic | Architecture/model Agnostic Paradigm/supervised +1 | Any | Counterfactual Fairness Assessment evaluates whether a model's predictions would remain unchanged if an individual's... | ||
| Sensitivity Analysis for Fairness | Algorithmic | Architecture/model Agnostic Paradigm/supervised +2 | Any | Sensitivity Analysis for Fairness systematically evaluates how model predictions change when sensitive attributes or... | ||
| Neuron Activation Analysis | Algorithmic | Architecture/neural Networks Requirements/model Internals +1 | Text | Neuron activation analysis examines the firing patterns of individual neurons in neural networks by probing them with... | ||
| Causal Mediation Analysis in Language Models | Mechanistic Interpretability | Architecture/neural Networks/transformer Architecture/neural Networks/transformer/llm +3 | Text | Causal mediation analysis in language models is a mechanistic interpretability technique that systematically... | ||
| Feature Attribution with Integrated Gradients in NLP | Algorithmic | Architecture/neural Networks/transformer Architecture/neural Networks/transformer/llm +4 | Text | Applies Integrated Gradients to natural language processing models to attribute prediction importance to individual... |
Rows per page
Page 1 of 2