Prediction

Contents

Prediction#

DeepSensor provides a convenient high-level interface to predict directly to xarray or pandas objects in the original units and coordinate system of your data. This is achieved using the model.predict method. We’ll use our trained model from the Training page to demonstrate DeepSensor’s prediction functionality.

The two key arguments of model.predict are 1) a Task (or list of Tasks) containing context data, and 2) a set of target prediction locations, X_t. This page will demonstrate how we can predict on-grid or off-grid based on the form of X_t. We will also see how we can use optional extra arguments in model.predict for more advanced usage.

task_loader = TaskLoader(
    context=[era5_ds, land_mask_ds, lowres_aux_ds],
    target=era5_ds,
)
task_loader.load_dask()
print(task_loader)

TaskLoader(3 context sets, 1 target sets)
Context variable IDs: (('2m_temperature',), ('GLDAS_mask',), ('elevation', 'cos_D', 'sin_D'))
Target variable IDs: (('2m_temperature',),)

set_gpu_default_device()

# Set up model
model = ConvNP(data_processor, task_loader, deepsensor_folder)

Predict off-grid to pandas#

Predicting at off-grid locations with model.predict is very similar to the on-grid case above. If X_t is 1) a shape \((2, N)\) numpy array, or 2) a pandas object containing spatial indexes, the values of the Prediction returned by model.predict will be pandas.DataFrames whose columns are the prediction parameters.

Let’s see an example where we pass a list of Tasks to model.predict, with context sets spanning the second half of 2019. Check out the indexes of the resulting pandas.DataFrame!

# Predict at two off-grid locations over six months of 2019 with 200 random context points (fixed across time)
test_tasks = task_loader(pd.date_range("2019-06-01", "2019-12-31"), [200, "all", "all"], seed_override=42)
X_t = np.array([[50, -80],
                [40, -110]]).T
pred = model.predict(test_tasks, X_t=X_t)

# plot the target locations and the context locations on a map
fig, ax = plt.subplots(figsize=(5, 5), subplot_kw={"projection": ccrs.PlateCarree()})
deepsensor.plot.offgrid_context(ax, test_tasks[0], data_processor, task_loader)
ax.scatter(X_t[1], X_t[0], c="r", s=50)
ax.coastlines()
ax.set_title("Target locations (red)")
ax.add_feature(ccrs.cartopy.feature.STATES)

No artists with labels found to put in legend.  Note that artists whose label start with an underscore are ignored when legend() is called with no argument.

<cartopy.mpl.feature_artist.FeatureArtist at 0x7fd4927df650>

../_images/69155f2d6771a4e04a41e8895f57ec7eff76ff407340f19b947c9d2d64603883.png

pred["2m_temperature"]

			mean	std
time	lat	lon
2019-06-01	50	-80	280.406281	1.834735
2019-06-01	40	-110	290.741547	2.129284
2019-06-02	50	-80	282.370087	1.753442
2019-06-02	40	-110	292.015839	1.990584
2019-06-03	50	-80	281.934479	1.701114
...	...	...	...	...
2019-12-29	40	-110	261.715698	3.435812
2019-12-30	50	-80	263.644775	2.347633
2019-12-30	40	-110	261.7883	3.48374
2019-12-31	50	-80	267.722748	1.995918
2019-12-31	40	-110	262.06546	3.490994

428 rows × 2 columns

fig = deepsensor.plot.prediction(pred)

../_images/1cc96366fcab69b189d48e20b148a3dfc0adad2b2cdfe0007d37cbcf4842cbe7.png