dataeval.performance.SufficiencyOutput¶
- class dataeval.performance.SufficiencyOutput¶
Output class for
Sufficiencyworkflow.- measures¶
3D array [runs, substep, classes] of values for all runs observed for each sample size step for each measure
- Type:¶
dict[str, NDArray]
- averaged_measures¶
Average of values for all runs observed for each sample size step for each measure
- Type:¶
dict[str, NDArray]
- unit_interval¶
Constrains the power law to the interval [0, 1]. Set True (default) for metrics such as accuracy, precision, and recall which are defined to take values on [0,1]. Set False for metrics not on the unit interval.
- Type:¶
bool, default True
-
inv_project(targets, n_iter=
1000)¶ Calculate training samples needed to achieve target metric values.
- Parameters:¶
- targets : Mapping[str, ArrayLike] | ArrayLike¶
Mapping of target metric scores (from 0.0 to 1.0) that we want to achieve, where the key is the name of the metric. If an ArrayLike is provided, the same targets are applied to all metrics.
- n_iter : int, default 1000¶
Number of iterations to perform in the basin-hopping curve-fit process
- Returns:¶
- DataFrame with columns:
target: float - Target metric value <metric>: int - One column per metric (multi-class as metric_0, metric_1, …) Values of -1 indicate the target is unachievable.
- Return type:¶
pl.DataFrame
-
project(projection, n_iter=
1000)¶ Project metric values to new sample sizes using the fitted power law.
- to_dataframe()¶
Convert averaged measures to a DataFrame.