API Reference#
DataEval’s API is split into several submodules which support specific goals and are detailed below. The base module
is empty except for __version__ by design.
Submodules#
Detectors can determine if a dataset or individual images in a dataset are indicative of a specific issue. |
|
Drift detectors identify if the statistical properties of the data has changed. |
|
Linters help identify potential issues in training and test data and are an important aspect of data cleaning. |
|
Out-of-distribution (OOD)` detectors identify data that is different from the data used to train a particular model. |
|
Metrics are a way to measure the performance of your models or datasets that can then be analyzed in the context of a given problem. |
|
Bias metrics check for skewed or imbalanced datasets and incomplete feature representation which may impact model performance. |
|
Estimators calculate performance bounds and the statistical distance between datasets. |
|
Statistics metrics calculate a variety of image properties and pixel statistics and label statistics against the images and labels of a dataset. |
|
The utility classes and functions are provided by DataEval to assist users in setting up architectures that are guaranteed to work with applicable DataEval metrics. |
|
PyTorch is the primary backend for metrics that require neural networks. |
|
Workflows perform a sequence of actions to analyze the dataset and make predictions. |