dataeval.quality.Prioritize.Config

class dataeval.quality.Prioritize.Config

Configuration for Prioritize evaluator.

encoder

Encoder to use for extracting embeddings from data.

Type:

EmbeddingEncoder or None

method

Ranking method to use.

Type:

{“knn”, “kmeans_distance”, “kmeans_complexity”}, default “knn”

k

Number of nearest neighbors for “knn” method.

Type:

int or None, default None

c

Number of clusters for kmeans methods.

Type:

int or None, default None

n_init

Number of K-means initializations.

Type:

int or “auto”, default “auto”

policy

Selection policy to apply after ranking.

Type:

{“hard_first”, “easy_first”, “stratified”, “class_balance”}, default “hard_first”

num_bins

Number of bins for “stratified” policy.

Type:

int, default 50

class_labels

Class labels for “class_balance” policy.

Type:

NDArray[np.integer] or None, default None