dataeval.quality

Identify potential issues in training and test data.

Classes

Duplicates

Finds duplicate images using non-cryptographic and perceptual hashing.
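
As a rough illustration of the two hashing ideas named above, the following sketch groups images by an exact non-cryptographic hash and by a toy perceptual hash. It does not use the Duplicates API: the helper names (exact_hash, average_hash, group_duplicates) are hypothetical, CRC32 stands in for whichever non-cryptographic hash the class uses, and the average hash is a simplified stand-in for a real perceptual hash.

```python
import zlib


def exact_hash(pixels):
    """Non-cryptographic hash (CRC32) of the raw pixel bytes.

    Only byte-identical images share this value. CRC32 is an assumption
    chosen because it ships with the standard library.
    """
    flat = bytes(v for row in pixels for v in row)
    return zlib.crc32(flat)


def average_hash(pixels):
    """Toy perceptual hash: one bit per pixel, set when the pixel is
    brighter than the image mean. Small perturbations keep the same bits."""
    flat = [v for row in pixels for v in row]
    mean = sum(flat) / len(flat)
    return tuple(v > mean for v in flat)


def group_duplicates(images, hash_fn):
    """Group image indices that share a hash value; keep groups of 2+."""
    groups = {}
    for idx, img in enumerate(images):
        groups.setdefault(hash_fn(img), []).append(idx)
    return [g for g in groups.values() if len(g) > 1]


# Tiny 2x2 grayscale "images" with pixel values in 0..255 (made-up data).
images = [
    [[10, 200], [220, 30]],  # 0: original
    [[10, 200], [220, 30]],  # 1: byte-identical copy of 0
    [[12, 198], [221, 33]],  # 2: slightly perturbed copy of 0
    [[200, 10], [30, 220]],  # 3: a different image
]

exact = group_duplicates(images, exact_hash)   # exact duplicates
near = group_duplicates(images, average_hash)  # exact and near duplicates
```

Here the exact hash groups only images 0 and 1, while the perceptual hash also catches the perturbed image 2.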

Outliers

Identifies statistical outliers in a dataset by applying various statistical tests to each image.
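
One way to picture a per-image statistical test is sketched below: compute a single statistic for each image (mean brightness here) and flag images whose modified z-score exceeds a threshold. This is an assumption-laden sketch, not the Outliers API. The function names, the choice of statistic, the modified z-score test, and the 3.5 threshold are all illustrative.

```python
from statistics import mean, median


def modified_zscores(values):
    """Modified z-score: 0.6745 * (x - median) / MAD.

    More robust than the ordinary z-score on small samples, because the
    median and MAD are barely affected by the outlier itself.
    """
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        # All values (nearly) identical: nothing can be flagged.
        return [0.0 for _ in values]
    return [0.6745 * (v - med) / mad for v in values]


def find_outliers(images, threshold=3.5):
    """Return indices of images whose mean brightness is a statistical outlier.

    `images` is a list of flat pixel lists; `threshold` is an assumed default.
    """
    brightness = [mean(img) for img in images]
    scores = modified_zscores(brightness)
    return [i for i, s in enumerate(scores) if abs(s) > threshold]


# Five similar images and one much brighter image (made-up data).
images = [[100] * 4, [102] * 4, [98] * 4, [101] * 4, [99] * 4, [250] * 4]
outliers = find_outliers(images)
```

Only the last image (index 5) is flagged; the others sit within a few units of the median brightness.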

Prioritize

Prioritizes dataset samples based on their position in the embedding space.
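
To make "position in the embedding space" concrete, the sketch below ranks samples by their Euclidean distance from the centroid of all embeddings. This is only one possible ranking criterion and is assumed for illustration. The function name and the hardest_first flag are hypothetical and are not taken from the Prioritize class.

```python
import math


def prioritize_by_centroid_distance(embeddings, hardest_first=False):
    """Return sample indices ordered by distance from the embedding centroid.

    By default the most typical samples (closest to the centroid) come first;
    hardest_first=True puts the most atypical samples first.
    """
    n = len(embeddings)
    dim = len(embeddings[0])
    centroid = [sum(e[d] for e in embeddings) / n for d in range(dim)]
    distances = [math.dist(e, centroid) for e in embeddings]
    return sorted(range(n), key=lambda i: distances[i], reverse=hardest_first)


# Four 2-D embeddings (made-up data); the last one lies far from the rest.
embeddings = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (5.0, 5.0)]

easy_first = prioritize_by_centroid_distance(embeddings)
hard_first = prioritize_by_centroid_distance(embeddings, hardest_first=True)
```

With this data, the distant sample (index 3) is ranked last in the default ordering and first when hardest_first is set.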

Output Classes

DuplicatesOutput

Output class for Duplicates detector.

OutliersOutput

Output class for Outliers detector.

PrioritizeOutput

Output class for Prioritize quality evaluator.