dataeval.quality.DuplicatesOutput

class dataeval.quality.DuplicatesOutput

Output class for Duplicates detector.

Provides separate duplicate detection results for item-level (full images/videos) and target-level (bounding boxes/detections) duplicates.

items

Duplicate groups for full items (images, videos, etc.). Indices are simple integers referring to the item index in the dataset for single-dataset detection. For cross-dataset detection, indices are DatasetItemTuple objects containing dataset id and item id.

Type:

DuplicateDetectionResult[int] | DuplicateDetectionResult[DatasetItemTuple]

targets

Duplicate groups for individual targets/detections within items. Indices are SourceIndex objects containing (item, target, channel) information for single-dataset. For cross-dataset detection, indices are DatasetItemTuple objects where the id field contains a SourceIndex.

Type:

DuplicateDetectionResult[SourceIndex] | DuplicateDetectionResult[DatasetItemTuple]

Notes

  • Item indices are simple integers (e.g., [0, 5, 7]) for single-dataset

  • Target indices are SourceIndex objects with item, target, and channel info

  • For cross-dataset detection, indices are DatasetItemTuple objects

data()

The output data as a dictionary.

Return type:

dict[str, Any]

meta()

Metadata about the execution of the function or method for the Output class.

Return type:

ExecutionMetadata