Quickstart

Quickstart#

AutoEmulate’s goal is to make it easy to create an emulator for your simulation.

This tutorial’s purpose is to walk you through the the basic functionality of the Python API using simple toy simulation as example.

We’ll demonstrate following steps:

Getting input and output tensor data from our example simulation
Creating, comparing and evaluating Emulators with AutoEmulate
Using an Emulator model to predict outputs for new inputs
Saving Emulator models (and associated metadata) to disk

# General imports for the notebook
import warnings
warnings.filterwarnings("ignore")

Toy simulation#

Before we build an emulator with AutoEmulate, we need to get a set of input/output pairs from our simulation to use as training data.

Below is a toy simulation for a projectile’s motion with drag (see here for details). The simulation includes:

Inputs: drag coefficient (log scale), velocity
Outputs: distance the projectile travelled

from autoemulate.simulations.projectile import Projectile

projectile = Projectile(log_level="error")
n_samples = 500
x = projectile.sample_inputs(n_samples).float()
y, _ = projectile.forward_batch(x)
y = y.float()

x.shape, y.shape

(torch.Size([500, 2]), torch.Size([500, 1]))

Data#

As you can see, our simulator inputs (x) and outputs (y) are PyTorch tensors. PyTorch tensors are a common data structure used in machine learning, and AutoEmulate is built to work with them.

We can also visualize the simulation data before training emulators where the output of the simulator is depicted as the colour of each scatter point.

import matplotlib.pyplot as plt

plt.scatter(x[:, 0], x[:, 1], c=y[:, 0], cmap='viridis')
plt.xlabel(projectile.param_names[0])
plt.ylabel(projectile.param_names[1])
plt.colorbar(label=projectile.output_names[0])
plt.show()

../../_images/4508f9678521348297e61ebd9fdd7ba2591932cf6703a7b7df0338acd0f8ceca.png

Build and compare Emulators#

With our simulator inputs and outputs, we can run a full machine learning pipeline, including data processing, model fitting, model selection and hyperparameter optimisation in just a few lines of code.

First, let’s import AutoEmulate and check the names of the available Emulator models. The columns indicate whether the emulator has a PyTorch backend, supports multioutput data and provides predictive uncertainty quantification. The list shows only the default set of emulators, but you can also see all available emulators by passing default_only=False to the function.

from autoemulate import AutoEmulate

AutoEmulate.list_emulators()

	Emulator	PyTorch	Multioutput	Uncertainty_Quantification	Automatic_Differentiation
0	GaussianProcessMatern32	True	True	True	True
1	GaussianProcessRBF	True	True	True	True
2	RadialBasisFunctions	True	True	False	True
3	PolynomialRegression	True	True	False	True
4	MLP	True	True	False	True
5	EnsembleMLP	True	True	True	True

We’re now ready run AutoEmulate to build and compare emulators.

This will fit (including hyperparameter tuning) emulator models to the simulation input and output to the data, evaluating performance on witheld test data.

# Run AutoEmulate with default settings
ae = AutoEmulate(x, y, log_level="error")

For more information about the configuration options available, see the AutoEmulate API docs. Here’s a brief overview of some important options:

Model selection

By default, AutoEmulate will fit all the above listed emulator models, but you can also specify a subset or additional models to use if you already know which kinds of models are most suitable for your data.

Specify models used by AutoEmulate with the models argument, for example:

models = ["GaussianProcessRBF", "GaussianProcessCorrelatedRBF", "RadialBasisFunctions"]
ae = AutoEmulate(x, y, models=models)

The user can also directly restrict the selection to just probabilistic models by using the only_probabilistic argument without having to list all the models individually:

ae = AutoEmulate(x, y, only_probabilistic=True)

Logging

When running AutoEmulate, you may also wish to enable logging to track the progress and performance of the emulator comparison. You can do this by setting the log_level argument when creating the AutoEmulate instance:

ae = AutoEmulate(x, y, models=models, log_level="info")

Try setting various log levels to see the difference. The options are “progress_bar”, “debug”, “info”, “warning”, “error”, or “critical”.

Metrics

The user can specify what metrics to be used for both the tuning and evaluation. For tuning, only one metric is accepted. This is the metric used to determine which hyperparameter set is the best. For evaluation, multiple metrics can be accepted. These are the metrics reported baack to measure performance on the train and test datasets.

ae = AutoEmulate(x, y, models=models, tuning_metric='r2',  evaluation_metrics=['mse', 'r2'])

Available metrics can be seen by:

from autoemulate.core.metrics import AVAILABLE_METRICS

print(AVAILABLE_METRICS.keys())

Now that we have run AutoEmulate, let’s look at the summary for a comparison of emulator performance (r-squared and RMSE) on both the train and test data.

ae.summarise()

	model_name	x_transforms	y_transforms	params	r2_test	r2_test_std	rmse_test	rmse_test_std	r2_train	r2_train_std	rmse_train	rmse_train_std
1	GaussianProcessRBF	[StandardizeTransform()]	[StandardizeTransform()]	{'epochs': 100, 'lr': 0.5, 'likelihood_cls': <...	0.999961	0.000011	46.204273	4.876751	0.999984	0.000003	32.840511	2.535756
0	GaussianProcessMatern32	[StandardizeTransform()]	[StandardizeTransform()]	{'epochs': 200, 'lr': 0.1, 'likelihood_cls': <...	0.999942	0.000026	55.354351	13.860095	0.999994	0.000002	19.563028	3.564464
2	RadialBasisFunctions	[StandardizeTransform()]	[StandardizeTransform()]	{'kernel': 'quintic', 'degree': 2, 'smoothing'...	0.998484	0.000498	298.605927	74.271767	0.999685	0.000078	142.159790	14.877204
4	MLP	[StandardizeTransform()]	[StandardizeTransform()]	{'epochs': 100, 'layer_dims': [16, 8], 'lr': 0...	0.989933	0.002713	739.437317	77.577332	0.995389	0.000871	545.615417	30.770531
5	EnsembleMLP	[StandardizeTransform()]	[StandardizeTransform()]	{'n_emulators': 8, 'epochs': 200, 'layer_dims'...	0.988177	0.003898	813.687195	200.500992	0.983159	0.002671	1058.740356	157.320663
3	PolynomialRegression	[StandardizeTransform()]	[StandardizeTransform()]	{'lr': 0.001, 'epochs': 200, 'batch_size': 8, ...	0.766453	0.058621	3645.852783	352.358032	0.814364	0.018911	3478.452393	198.888321

Choosing an Emulator#

From this list, we can choose an emulator based on the index from the summary dataframe, or quickly get the best performing one using the best_result function, which picks based on the r2_test metric by default.

Choosing a metric for determining the best model

metric_name can be set in the best_result method to choose what metric is used to determine the best model:

ae.best_result(metric_name='rmse')

best = ae.best_result()
print("Model with id: ", best.id, " performed best: ", best.model_name)

Model with id:  1  performed best:  GaussianProcessRBF

best.model.untransformed_model_name

'GaussianProcessRBF'

Let’s take a look at the configuration of the best model. These are the values of the model’s hyperparameters.

print(best.params)

{'epochs': 100, 'lr': 0.5, 'likelihood_cls': <class 'gpytorch.likelihoods.multitask_gaussian_likelihood.MultitaskGaussianLikelihood'>, 'scheduler_cls': None, 'scheduler_params': {}}

We can quickly visualise the performance of this Emulator with a plot of its predictions against the simulator outputs for the heldout test data. We also save the plot to a file.

ae.plot_preds(best, output_names=projectile.output_names)

../../_images/0fe44425ec5f4be03c52dc194602e167340cb379d80229b232c481a66c492ba5.png

We can also visualise the predictions against each input feature.

ae.plot(best, output_names=projectile.output_names, input_names=projectile.param_names)

../../_images/8458e2cdd86cb74182f6a7eb3c26189dbcde94b42c0a55abc0fd14deb88222f2.png

We can subset the data included in the feature plots by providing input and output ranges.

ae.plot(best, input_ranges={0: (0, 4), 1: (200, 500)}, output_ranges={0: (0, 10)})

../../_images/8ccd5e479b5c594f7dd7061e4f99c5e6d33fdcda19db7c6100e00d603624f951.png

As well as plotting the data, we can directly plot the predicted mean and variance of the emulator for a pair of variables while holding the other variables constant at a given quantile. API to support plotting for a subset of the parameter and output range is also supported.

The emulator predicted mean captures the simulated data plotted at the top of the tutorial well. The predicted variance is low where we have data, and increases away from the data.

ae.plot_surface(best.model, projectile.parameters_range, quantile=0.5)

../../_images/c0c220a9fb29ed26dac2769c58665d945a2a5f513f7d7c9cf71942d499b61d88.png

We can also visualise the calibration of the emulator’s predicted uncertainty on the held out test data. The closer the line is to the diagonal, the better calibrated the uncertainty is. Line above the diagonal overestimates the uncertainty while line below the diagonal underestimates it.

ae.plot_calibration(best.model)

../../_images/db625ee7c817432a381055a2aa6e2b7fdab85a3b9119c7156fe9e4554aa3b62d.png

Predictions#

We can use the emulator to make predictions using the predict method.

best.model.predict(x[:10])

Independent(Normal(loc: torch.Size([10, 1]), scale: torch.Size([10, 1])), 1)

Saving and loading emulators#

Emulators and their metadata (hyperparameter config and performance metrics) can be saved to disk and loaded again later.

# Make a directory to save Emulator models
import os
path = "my_emulators"
if not os.path.exists(path):
    os.makedirs(path)

Let’s save the best result, the best performing emulator plus metadata, to disk.

# The use_timestamp paramater ensures a new result is saved each time the save method is called
best_result_filepath = ae.save(best, path, use_timestamp=True)
print("Model and metadata saved to: ", best_result_filepath)

Model and metadata saved to:  my_emulators/GaussianProcessRBF_1_20251127_093157

You should now have a two files saved to disk, one with the emulator model and one with the metadata that has the same name and a .csv extension.

You can later pass this filepath to the load_model method to use the model again.

model = AutoEmulate.load_model(best_result_filepath)

model.predict(x[:10])

Independent(Normal(loc: torch.Size([10, 1]), scale: torch.Size([10, 1])), 1)