Safety

42 techniques

Ensuring AI systems operate safely and do not cause harm.

42 techniques
GoalsModelsData TypesDescription
Out-of-Distribution Detector for Neural Networks
Algorithmic
Architecture/neural Networks
Paradigm/discriminative
+3
Any
ODIN (Out-of-Distribution Detector for Neural Networks) identifies when a neural network encounters inputs significantly...
Synthetic Data Generation
Algorithmic
Architecture/neural Networks/generative/gan
Architecture/neural Networks/generative/vae
+5
Any
Synthetic data generation creates artificial datasets that aim to preserve the statistical properties, distributions,...
Federated Learning
Algorithmic
Architecture/linear Models
Architecture/neural Networks
+4
Any
Federated learning enables collaborative model training across multiple distributed parties (devices, organisations, or...
Homomorphic Encryption
Algorithmic
Architecture/linear Models
Architecture/neural Networks/feedforward
+4
Any
Homomorphic encryption allows computation on encrypted data without decrypting it first, producing encrypted results...
Deep Ensembles
Algorithmic
Architecture/neural Networks
Paradigm/parametric
+2
Any
Deep ensembles combine predictions from multiple neural networks trained independently with different random...
Cross-validation
Algorithmic
Architecture/model Agnostic
Paradigm/supervised
+2
Any
Cross-validation evaluates model performance and robustness by systematically partitioning data into multiple subsets...
Safety Envelope Testing
Testing
Architecture/model Agnostic
Requirements/black Box
Any
Safety envelope testing systematically evaluates AI system performance at the boundaries of its intended operational...
Internal Review Boards
Process
Architecture/model Agnostic
Requirements/black Box
Any
Internal Review Boards (IRBs) provide independent, systematic evaluation of AI/ML projects throughout their lifecycle to...
Red Teaming
Procedural
Architecture/model Agnostic
Requirements/black Box
Any
Red teaming involves systematic adversarial testing of AI/ML systems by dedicated specialists who attempt to identify...
Anomaly Detection
Algorithmic
Architecture/model Agnostic
Requirements/black Box
+1
Any
Anomaly detection identifies unusual behaviours, inputs, or outputs that deviate significantly from established normal...
Human-in-the-Loop Safeguards
Process
Architecture/model Agnostic
Requirements/black Box
Any
Human-in-the-loop safeguards establish systematic checkpoints where human experts review, validate, or override AI/ML...
Confidence Thresholding
Algorithmic
Architecture/model Agnostic
Requirements/black Box
+1
Any
Confidence thresholding creates decision boundaries based on model uncertainty scores, routing predictions into...
Runtime Monitoring and Circuit Breakers
Algorithmic
Architecture/model Agnostic
Requirements/black Box
Any
Runtime monitoring and circuit breakers establish continuous surveillance of AI/ML systems in production, tracking...
Model Cards
Documentation
Architecture/model Agnostic
Requirements/black Box
Any
Model cards are standardised documentation frameworks that systematically document machine learning models through...
Datasheets for Datasets
Documentation
Architecture/model Agnostic
Requirements/black Box
Any
Datasheets for datasets establish comprehensive documentation standards for datasets, systematically recording creation...
MLflow Experiment Tracking
Process
Architecture/model Agnostic
Requirements/black Box
Any
MLflow is an open-source platform that tracks machine learning experiments by automatically logging parameters, metrics,...
Data Version Control
Process
Architecture/model Agnostic
Requirements/black Box
Any
Data Version Control (DVC) is a Git-like version control system specifically designed for machine learning data, models,...
Automated Documentation Generation
Algorithmic
Architecture/model Agnostic
Requirements/black Box
Any
Automated documentation generation creates and maintains up-to-date documentation using various methods including...
Model Distillation
Algorithmic
Architecture/neural Networks
Paradigm/parametric
+3
Any
Model distillation transfers knowledge from a large, complex model (teacher) to a smaller, more efficient model...
Model Pruning
Algorithmic
Architecture/neural Networks
Paradigm/parametric
+4
Any
Model pruning systematically removes less important weights, neurons, or entire layers from neural networks to create...
Rows per page
Page 1 of 3