Explainability

Comprehensibility

Produces human-understandable formats

18 techniques in this subcategory

18 techniques

	Goals		Models	Data Types	Description
Coefficient Magnitudes (in Linear Models)		Metric	Architecture/linear Models Paradigm/parametric +2	Tabular	Coefficient Magnitudes assess feature influence in linear models by examining the absolute values of their coefficients....
Ridge Regression Surrogates		Algorithmic	Architecture/model Agnostic Requirements/black Box	Any	This technique approximates a complex model by training a ridge regression (a linear model with L2 regularisation) on...
Partial Dependence Plots		Algorithmic	Architecture/model Agnostic Requirements/black Box	Any	Partial Dependence Plots show how changing one or two features affects a model's predictions on average. The technique...
Gradient Saliency		Algorithmic	Architecture/neural Networks Requirements/differentiable +1	Image	Gradient saliency (also known as vanilla gradient saliency) produces visual explanations for image classification models...
Gradient-weighted Class Activation Mapping		Algorithmic	Architecture/neural Networks/convolutional Requirements/architecture Specific +2	Image	Grad-CAM creates visual heatmaps showing which regions of an image a convolutional neural network focuses on when making...
Factor Analysis		Algorithmic	Architecture/model Agnostic Paradigm/unsupervised +1	Tabular	Factor analysis is a statistical technique that identifies latent variables (hidden factors) underlying observed...
Principal Component Analysis		Algorithmic	Architecture/model Agnostic Paradigm/unsupervised +1	Any	Principal Component Analysis transforms high-dimensional data into a lower-dimensional representation by finding the...
t-SNE		Visualization	Architecture/model Agnostic Requirements/black Box	Any	t-SNE (t-Distributed Stochastic Neighbour Embedding) is a non-linear dimensionality reduction technique that creates 2D...
Prototype and Criticism Models		Algorithmic	Architecture/model Agnostic Paradigm/supervised +3	Any	Prototype and Criticism Models provide data understanding by identifying two complementary sets of examples: prototypes...
Contrastive Explanation Method		Algorithmic	Architecture/neural Networks Paradigm/discriminative +4	Any	The Contrastive Explanation Method (CEM) explains model decisions by generating contrastive examples that reveal what...
ANCHOR		Algorithmic	Architecture/model Agnostic Requirements/black Box	Any	ANCHOR generates high-precision if-then rules that explain individual predictions by identifying the minimal set of...
RuleFit		Algorithmic	Architecture/model Agnostic Paradigm/supervised +2	Any	RuleFit creates interpretable surrogate models that can explain complex black-box models or serve as interpretable...
Model Distillation		Algorithmic	Architecture/neural Networks Paradigm/parametric +3	Any	Model distillation transfers knowledge from a large, complex model (teacher) to a smaller, more efficient model...
Generalized Additive Models		Algorithmic	Architecture/linear Models/gam Paradigm/parametric +2	Tabular	An intrinsically interpretable modelling technique that extends linear models by allowing flexible, nonlinear...
Model Pruning		Algorithmic	Architecture/neural Networks Paradigm/parametric +4	Any	Model pruning systematically removes less important weights, neurons, or entire layers from neural networks to create...
Neuron Activation Analysis		Algorithmic	Architecture/neural Networks Requirements/model Internals +1	Text	Neuron activation analysis examines the firing patterns of individual neurons in neural networks by probing them with...
Concept Activation Vectors		Algorithmic	Architecture/neural Networks Requirements/gradient Access +2	Any	Concept Activation Vectors (CAVs), also known as Testing with Concept Activation Vectors (TCAV), identify mathematical...
Attention Visualisation in Transformers		Algorithmic	Architecture/neural Networks/transformer Requirements/architecture Specific +1	Image Text	Attention Visualisation in Transformers analyses the multi-head self-attention mechanisms that enable transformers to...

Rows per page

Page 1 of 1

← Back to Explainability|All categories