dataeval.utils.data

Utility functions for dataset splitting and data and metadata manipulation.

Classes

DatasetSplits

Output class containing test indices and a list of TrainValSplits.

TrainValSplit

Dataclass containing train and validation indices.

Functions

flatten_metadata(…)

Flattens a nested metadata dictionary and converts values to numeric values when possible.

merge_metadata(…)

Merge a collection of metadata dictionaries into a single flattened dictionary.

split_dataset(dataset[, num_folds, stratify, ...])

Dataset splitting function. Returns a dataclass containing a list of train and validation indices.

unzip_dataset(dataset, per_target)

Unzips a dataset into separate generators for images and targets.