applicable models
transformer
Techniques for transformer-based architectures
5 techniques
Goals | Models | Data Types | Description | |||
---|---|---|---|---|---|---|
Neuron Activation Analysis | Algorithmic | Neural Network LLM +1 | Text | Neuron activation analysis examines the firing patterns of individual neurons in neural networks by probing them with... | ||
Causal Mediation Analysis in Language Models | Mechanistic Interpretability | LLM Transformer | Text | Causal mediation analysis in language models is a mechanistic interpretability technique that systematically... | ||
Feature Attribution with Integrated Gradients in NLP | Gradient Based | Transformer LLM | Text | Applies Integrated Gradients to natural language processing models to attribute prediction importance to individual... | ||
Concept Activation Vectors | Algorithmic | Neural Network Transformer +1 | Any | Concept Activation Vectors (CAVs), also known as Testing with Concept Activation Vectors (TCAV), identify mathematical... | ||
Attention Visualisation in Transformers | Algorithmic | Transformer | Text Image | Attention Visualisation in Transformers analyses the multi-head self-attention mechanisms that enable transformers to... |
Rows per page
Page 1 of 1