dataeval.performance.SufficiencyOutput¶
- class dataeval.performance.SufficiencyOutput¶
Output class for
Sufficiencyworkflow.- measures¶
3D array [runs, substep, classes] of values for all runs observed for each sample size step for each measure
- averaged_measures¶
Average of values for all runs observed for each sample size step for each measure
- unit_interval¶
Constrains the power law to the interval [0, 1]. Set True (default) for metrics such as accuracy, precision, and recall which are defined to take values on [0,1]. Set False for metrics not on the unit interval.
-
inv_project(targets, n_iter=
1000)¶ Compute training samples needed to achieve target metric values.
- Parameters:¶
- targets : Mapping[str, ArrayLike] | ArrayLike¶
Mapping of target metric scores (from 0.0 to 1.0) that we want to achieve, where the key is the name of the metric. If an ArrayLike is provided, the same targets are applied to all metrics.
- n_iter : int, default 1000¶
Number of iterations to perform in the basin-hopping curve-fit process
- Returns:¶
- DataFrame with columns:
target: float - Target metric value <metric>: int - One column per metric (multi-class as metric_0, metric_1, …) Values of -1 indicate the target is unachievable.
- Return type:¶
pl.DataFrame
-
project(projection, n_iter=
1000)¶ Project metric values to new sample sizes using the fitted power law.
- Parameters:¶
- Returns:¶
- DataFrame with columns:
step: int - The projected sample size <metric>: float - One column per metric (multi-class as metric_0, metric_1, …)
- Return type:¶
pl.DataFrame
- Raises:¶
ValueError – If projection is not numerical
- to_dataframe()¶
Convert averaged measures to a DataFrame.