dataeval.extractors

Feature extractors that transform input data into arrays.

All extractors implement the FeatureExtractor protocol (__call__(data) -> Array) and can be used standalone or passed to Embeddings for batching, caching, and memory-mapped storage.

Classes

BoVWExtractor

Computes Bag of Visual Words histograms using SIFT keypoints.

ClassifierUncertaintyExtractor

Computes prediction entropy from a classification model for drift detection.

FlattenExtractor

Simple NumPy-based feature extractor that flattens images to 1D vectors.

OnnxExtractor

Extracts embeddings via ONNX Runtime with lazy model loading.

TorchExtractor

Extracts embeddings from a PyTorch model, with optional intermediate layer hooking.