Explainability

Comprehensibility

Produces human-understandable formats

18 techniques in this subcategory

18 techniques
GoalsModelsData TypesDescription
Coefficient Magnitudes (in Linear Models)
Metric
Architecture/linear Models
Paradigm/parametric
+2
Tabular
Coefficient Magnitudes assess feature influence in linear models by examining the absolute values of their coefficients....
Ridge Regression Surrogates
Algorithmic
Architecture/model Agnostic
Requirements/black Box
Any
This technique approximates a complex model by training a ridge regression (a linear model with L2 regularisation) on...
Partial Dependence Plots
Algorithmic
Architecture/model Agnostic
Requirements/black Box
Any
Partial Dependence Plots show how changing one or two features affects a model's predictions on average. The technique...
Saliency Maps
Algorithmic
Architecture/neural Networks
Requirements/differentiable
+1
Image
Saliency maps are visual explanations for image classification models that highlight which pixels in an image most...
Gradient-weighted Class Activation Mapping
Algorithmic
Architecture/neural Networks/convolutional
Requirements/architecture Specific
+2
Image
Grad-CAM creates visual heatmaps showing which regions of an image a convolutional neural network focuses on when making...
Factor Analysis
Algorithmic
Architecture/model Agnostic
Paradigm/unsupervised
+1
Tabular
Factor analysis is a statistical technique that identifies latent variables (hidden factors) underlying observed...
Principal Component Analysis
Algorithmic
Architecture/model Agnostic
Paradigm/unsupervised
+1
Any
Principal Component Analysis transforms high-dimensional data into a lower-dimensional representation by finding the...
t-SNE
Visualization
Architecture/model Agnostic
Requirements/black Box
Any
t-SNE (t-Distributed Stochastic Neighbour Embedding) is a non-linear dimensionality reduction technique that creates 2D...
Prototype and Criticism Models
Algorithmic
Architecture/model Agnostic
Paradigm/supervised
+3
Any
Prototype and Criticism Models provide data understanding by identifying two complementary sets of examples: prototypes...
Contrastive Explanation Method
Algorithmic
Architecture/neural Networks
Paradigm/discriminative
+4
Any
The Contrastive Explanation Method (CEM) explains model decisions by generating contrastive examples that reveal what...
ANCHOR
Algorithmic
Architecture/model Agnostic
Requirements/black Box
Any
ANCHOR generates high-precision if-then rules that explain individual predictions by identifying the minimal set of...
RuleFit
Algorithmic
Architecture/model Agnostic
Paradigm/supervised
+1
Any
RuleFit creates interpretable surrogate models that can explain complex black-box models or serve as interpretable...
Model Distillation
Algorithmic
Architecture/neural Networks
Paradigm/parametric
+3
Any
Model distillation transfers knowledge from a large, complex model (teacher) to a smaller, more efficient model...
Generalized Additive Models
Algorithmic
Architecture/linear Models/gam
Paradigm/parametric
+2
Tabular
An intrinsically interpretable modelling technique that extends linear models by allowing flexible, nonlinear...
Model Pruning
Algorithmic
Architecture/neural Networks
Paradigm/parametric
+4
Any
Model pruning systematically removes less important weights, neurons, or entire layers from neural networks to create...
Neuron Activation Analysis
Algorithmic
Architecture/neural Networks
Requirements/model Internals
+1
Text
Neuron activation analysis examines the firing patterns of individual neurons in neural networks by probing them with...
Concept Activation Vectors
Algorithmic
Architecture/neural Networks
Requirements/gradient Access
+2
Any
Concept Activation Vectors (CAVs), also known as Testing with Concept Activation Vectors (TCAV), identify mathematical...
Attention Visualisation in Transformers
Algorithmic
Architecture/neural Networks/transformer
Requirements/architecture Specific
+1
Image
Text
Attention Visualisation in Transformers analyses the multi-head self-attention mechanisms that enable transformers to...
Rows per page
Page 1 of 1