autocast.data.encoded_dataset#
- class EncodedBatchMixin[source]#
Bases:
objectA mixin class to provide EncodedBatch conversion functionality.
- class EncodedDataset[source]#
Bases:
Dataset,EncodedBatchMixinA base class for encoded datasets.
- class MiniWellDataset(file, steps=1, stride=1)[source]#
Bases:
DatasetCreates a mini-Well dataset.
- From LOLA:
- class MiniWellInputOutput(file_name, n_steps_input, n_steps_output, steps=1, stride=1)[source]#
Bases:
EncodedDataset,EncodedBatchMixinA wrapper around The Well’s MiniwellDataset to provide Batch objects.
- miniwell_dataset: MiniWellDataset#
- class CachedLatentDataset(cache_dir, n_steps_input=1, n_steps_output=1, stride=1, in_memory=True, **kwargs)[source]#
Bases:
EncodedDatasetDataset that reads pre-encoded latent trajectories from a cache directory.
Each
.ptfile contains a dict with keyencoded_fieldsholding the full encoded trajectory and optionallyglobal_cond. Windowing (n_steps_input,n_steps_output,stride) is applied at load time, allowing runtime configuration without re-encoding.These files are produced by
autocast.scripts.cache_latents.cache_latents().
- class EncodedDataModule(data_path=None, n_steps_input=1, n_steps_output=1, stride=1, batch_size=16, num_workers=0, dataset_cls=None, in_memory=True, **dataset_kwargs)[source]#
Bases:
LightningDataModuleDataModule for encoded datasets that produce EncodedBatch objects.
This datamodule wraps datasets that produce EncodedSample objects (like MiniWellInputOutput) and provides train/val/test dataloaders that collate samples into EncodedBatch objects.
- Parameters:
- setup(stage=None)[source]#
Set up datasets for the given stage.
- Parameters:
stage (str | None)
- Return type:
None
- rollout_test_dataloader(batch_size=None)[source]#
DataLoader for rollout evaluation on test data.
For cached latent datasets, creates a full-trajectory dataset so that ground truth is available for the entire rollout horizon (analogous to
full_trajectory_mode=Truein the non-cached datamodules).- Parameters:
batch_size (int | None)
- Return type:
- class MiniWellDataModule(data_path=None, n_steps_input=1, n_steps_output=1, stride=1, batch_size=16, num_workers=0)[source]#
Bases:
LightningDataModuleDataModule for MiniWell datasets.
This datamodule wraps MiniWellInputOutput datasets and provides train/val/test dataloaders that collate samples into EncodedBatch objects. Accepts a data_path with train/valid/test subdirectories containing data.h5.
- Parameters:
- setup(stage=None)[source]#
Set up datasets for the given stage.
- Parameters:
stage (str | None)
- Return type:
None