applicable models

transformer

Techniques for transformer-based architectures

5 techniques
GoalsModelsData TypesDescription
Neuron Activation Analysis
Algorithmic
Neural Network
LLM
+1
Text
Neuron activation analysis examines the firing patterns of individual neurons in neural networks by probing them with...
Causal Mediation Analysis in Language Models
Mechanistic Interpretability
LLM
Transformer
Text
Causal mediation analysis in language models is a mechanistic interpretability technique that systematically...
Feature Attribution with Integrated Gradients in NLP
Gradient Based
Transformer
LLM
Text
Applies Integrated Gradients to natural language processing models to attribute prediction importance to individual...
Concept Activation Vectors
Algorithmic
Neural Network
Transformer
+1
Any
Concept Activation Vectors (CAVs), also known as Testing with Concept Activation Vectors (TCAV), identify mathematical...
Attention Visualisation in Transformers
Algorithmic
Transformer
Text
Image
Attention Visualisation in Transformers analyses the multi-head self-attention mechanisms that enable transformers to...
Rows per page
Page 1 of 1