applicable models
transformer
Techniques for transformer-based architectures
6 techniques
| Goals | Models | Data Types | Description | |||
|---|---|---|---|---|---|---|
| Causal Mediation Analysis in Language Models | Mechanistic Interpretability | Architecture/neural Networks/transformer Architecture/neural Networks/transformer/llm +3 | Text | Causal mediation analysis in language models is a mechanistic interpretability technique that systematically... | ||
| Feature Attribution with Integrated Gradients in NLP | Algorithmic | Architecture/neural Networks/transformer Architecture/neural Networks/transformer/llm +4 | Text | Applies Integrated Gradients to natural language processing models to attribute prediction importance to individual... | ||
| Attention Visualisation in Transformers | Algorithmic | Architecture/neural Networks/transformer Requirements/architecture Specific +1 | Image Text | Attention Visualisation in Transformers analyses the multi-head self-attention mechanisms that enable transformers to... | ||
| Embedding Bias Analysis | Algorithmic | Architecture/neural Networks Architecture/neural Networks/transformer +3 | Text Image | Embedding bias analysis examines learned representations to identify biases, spurious correlations, and problematic... | ||
| Hallucination Detection | Testing | Architecture/neural Networks/transformer Architecture/neural Networks/transformer/llm +2 | Text | Hallucination detection identifies when generative models produce factually incorrect, fabricated, or ungrounded... | ||
| Multimodal Alignment Evaluation | Testing | Architecture/neural Networks Architecture/neural Networks/transformer +1 | Image Text | Multimodal alignment evaluation assesses whether different modalities (vision, language, audio) are synchronised and... |
Rows per page
Page 1 of 1