dataeval.core.feature_distance

dataeval.core.feature_distance(continuous_data_1, continuous_data_2)

Measures the feature-wise distance between two continuous distributions and computes a p-value to evaluate its significance.

Uses the Earth Mover’s Distance and the Kolmogorov-Smirnov two-sample test, each computed feature-wise.

Parameters:
continuous_data_1 : NDArray[np.float64]

Array of values to be used as reference.

continuous_data_2 : NDArray[np.float64]

Array of values to be compared with the reference.

Returns:

A sequence of KSTestResult tuples, one per feature, as defined by scipy.stats.ks_2samp.

Return type:

Sequence[tuple[float, float, float, float]]

See also

Earth Mover’s Distance, Kolmogorov-Smirnov test
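
The feature-wise comparison described above can be illustrated with SciPy alone. The following is a minimal sketch, not the dataeval implementation: the helper name `featurewise_distance_sketch` is hypothetical, the inputs are assumed to be 2-D arrays of shape (samples, features), and the three-value tuple it returns does not reproduce the four-float layout documented under Return type.

```python
import numpy as np
from scipy.stats import ks_2samp, wasserstein_distance


def featurewise_distance_sketch(reference, comparison):
    """Hypothetical helper: per-feature Earth Mover's Distance and two-sample KS test.

    Assumes both inputs are 2-D arrays shaped (samples, features).
    """
    reference = np.asarray(reference, dtype=np.float64)
    comparison = np.asarray(comparison, dtype=np.float64)
    if reference.ndim != 2 or comparison.ndim != 2:
        raise ValueError("both inputs must be 2-D (samples, features)")
    if reference.shape[1] != comparison.shape[1]:
        raise ValueError("both inputs must have the same number of features")

    results = []
    for j in range(reference.shape[1]):
        ref_col = reference[:, j]
        cmp_col = comparison[:, j]
        # Earth Mover's (1-Wasserstein) distance between the two 1-D samples
        emd = wasserstein_distance(ref_col, cmp_col)
        # Two-sample Kolmogorov-Smirnov test on the same feature
        ks = ks_2samp(ref_col, cmp_col)
        results.append((float(emd), float(ks.statistic), float(ks.pvalue)))
    return results


rng = np.random.default_rng(0)
reference = rng.normal(size=(500, 2))
comparison = reference.copy()
comparison[:, 1] += 3.0  # shift only the second feature

results = featurewise_distance_sketch(reference, comparison)
for j, (emd, ks_stat, p_value) in enumerate(results):
    print(f"feature {j}: emd={emd:.3f}, ks={ks_stat:.3f}, p={p_value:.3g}")
```

In this sketch the first feature is identical in both arrays, so its distance is zero and the test finds no difference, while the shifted second feature yields a large distance and a very small p-value.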