DataEval¶
Our Mission¶
DataEval is an effective, powerful, and reliable set of tools for any T&E engineer. Throughout all stages of the machine learning lifecycle, DataEval supports model development, data analysis, and monitoring with state-of-the-art algorithms to help you solve difficult problems. With a focus on computer vision tasks, DataEval provides simple, but effective metrics for performance estimation, bias detection, and dataset linting.
DataEval is easy to install, supports a wide range of Python versions, and is compatible with many of the most popular packages in the scientific and T&E communities. DataEval also has native interopability between JATIC’s suite of tools when using MAITE-compliant datasets and models.
Key Features¶
DataEval provides many powerful tools to assist in the following T&E tasks:
Model-agnostic metrics that bound real-world performance
relevance/completeness/coverage
metafeatures (data complexity)
Model-specific metrics that guide model selection and training
dataset sufficiency
data/model complexity mismatch
Metrics for post-deployment monitoring of data with bounds on model performance to guide retraining
dataset-shift metrics
model performance bounds under covariate shift
guidance on sampling to assess model error and model retraining
Acknowledgement¶
CDAO Funding Acknowledgement¶
This material is based upon work supported by the Chief Digital and Artificial Intelligence Office under Contract No. W519TC-23-9-2033. The views and conclusions contained herein are those of the author(s) and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the U.S. Government.