dataeval.metrics.stats.hashstats
dataeval.metrics.stats.hashstats(dataset, *, per_box=False)
Calculates hashes for each image.
This function computes two kinds of hash for each image: an exact hash and a perception-based hash. Comparing these hash values shows whether images are exact duplicates or near duplicates.
- Parameters:
- Returns:
A dictionary-like object containing the computed hashes for each image.
- Return type:
See also
Duplicates
Examples
Calculate the hashes for a dataset of images, each with shape (C, H, W):
>>> results = hashstats(dataset)
>>> print(results.xxhash[:5])
['66a93f556577c086', 'd8b686fb405c4105', '7ffdb4990ad44ac6', '42cd4c34c80f6006', 'c5519e36ac1f8839']
>>> print(results.pchash[:5])
['e666999999266666', 'e666999999266666', 'e666999966666299', 'e666999999266666', '96e91656e91616e9']
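
The hash lists can be compared directly to find candidate duplicates. The following is a minimal sketch, not the library's own duplicate detection (see Duplicates for that). It assumes that identical hash strings indicate matching images, and it uses a hypothetical helper, `group_by_hash`, which is not part of dataeval. The hash values are copied from the example output shown in this section, so the sketch runs without a dataset.

```python
from collections import defaultdict


def group_by_hash(hashes):
    """Hypothetical helper: return groups of indices that share a hash string."""
    groups = defaultdict(list)
    for index, value in enumerate(hashes):
        groups[value].append(index)
    # Keep only hashes shared by two or more images, in order of first appearance.
    return [indices for indices in groups.values() if len(indices) > 1]


# Values copied from the example output (first five images only).
xxhash = [
    "66a93f556577c086",
    "d8b686fb405c4105",
    "7ffdb4990ad44ac6",
    "42cd4c34c80f6006",
    "c5519e36ac1f8839",
]
pchash = [
    "e666999999266666",
    "e666999999266666",
    "e666999966666299",
    "e666999999266666",
    "96e91656e91616e9",
]

exact_groups = group_by_hash(xxhash)  # identical exact hashes
near_groups = group_by_hash(pchash)   # identical perception-based hashes

print(exact_groups)  # []
print(near_groups)   # [[0, 1, 3]]
```

In this sample, no two images share an exact hash, while images 0, 1, and 3 share a perception-based hash, which makes them candidates for near duplicates under the assumption stated above.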