Techniques for transformer-based architectures

6 techniques
Goals | Models | Data Types | Description
Causal Mediation Analysis in Language Models
  Goals: Mechanistic Interpretability
  Models: Architecture/neural Networks/transformer; Architecture/neural Networks/transformer/llm; +3 more
  Data Types: Text
  Description: Causal mediation analysis in language models is a mechanistic interpretability technique that systematically...
Feature Attribution with Integrated Gradients in NLP
  Goals: Algorithmic
  Models: Architecture/neural Networks/transformer; Architecture/neural Networks/transformer/llm; +4 more
  Data Types: Text
  Description: Applies Integrated Gradients to natural language processing models to attribute prediction importance to individual...
Attention Visualisation in Transformers
  Goals: Algorithmic
  Models: Architecture/neural Networks/transformer; Requirements/architecture Specific; +1 more
  Data Types: Image, Text
  Description: Attention Visualisation in Transformers analyses the multi-head self-attention mechanisms that enable transformers to...
Embedding Bias Analysis
  Goals: Algorithmic
  Models: Architecture/neural Networks; Architecture/neural Networks/transformer; +3 more
  Data Types: Text, Image
  Description: Embedding bias analysis examines learned representations to identify biases, spurious correlations, and problematic...
Hallucination Detection
  Goals: Testing
  Models: Architecture/neural Networks/transformer; Architecture/neural Networks/transformer/llm; +2 more
  Data Types: Text
  Description: Hallucination detection identifies when generative models produce factually incorrect, fabricated, or ungrounded...
Multimodal Alignment Evaluation
  Goals: Testing
  Models: Architecture/neural Networks; Architecture/neural Networks/transformer; +1 more
  Data Types: Image, Text
  Description: Multimodal alignment evaluation assesses whether different modalities (vision, language, audio) are synchronised and...