DataEval Change Logยถ
v0.91.0ยถ
๐ง Deprecations and Removals
a37019ff- Remove MAITE dataset helpers
๐ ๏ธ Improvements and Enhancements
e6a3f2d3- added resolve_indices() and related unit testse8a32098- Refactor stats to core module and remove dimensionstats from imagestats
๐พ Fixes
dd8f1cf5- Connect BER to MST implementation in _fast_mst.py
๐ Miscellaneous
6609e929- Refactor core functionality to dataeval.core submodule
v0.90.1ยถ
๐ ๏ธ Improvements and Enhancements
789fb41a- Speedup for MST option in Divergence
๐พ Fixes
a4802a52- Issue #922 Indexing Bug Fix
๐ Miscellaneous
2a94dfa4- Refactor NullModelMetrics for typing and encapsulation
v0.90.0ยถ
๐ Feature Release
7581907a- Integrate null model metrics
๐ Miscellaneous
512a6385- Refactor and move functions to newfunctionalsubmodule to prepare for new API0cd25665- Minor changes to notebooks
v0.89.1ยถ
๐ ๏ธ Improvements and Enhancements
9e0b441b- Additional clarification to Embeddings Concept page1567f818- Sufficiency: Improve Global Minimization
๐ Miscellaneous
ecbf11bb- Update Duplicates docstring59f56fd1- Address security vulnerabilities found in lock file
v0.89.0ยถ
๐ Feature Release
08984b1a- Embeddings Concept page
๐ ๏ธ Improvements and Enhancements
356052b6- Sufficiency: explicitly implement for operational metrics3ca1b01a- pruning concept page
๐ Miscellaneous
7ed52bd9- Drift docstring update
v0.88.1ยถ
๐ ๏ธ Improvements and Enhancements
b3e258d3- Sufficiency: Adjust measure plotsca707ba6- Blended metadata exploration tools into Metadata concept page.
v0.88.0ยถ
๐ง Deprecations and Removals
1e86ffb8- Remove dataset builders and update min python target for the project6d11a131- Remove Python 3.9 support
๐ ๏ธ Improvements and Enhancements
fdc9b015- 947 Sufficiency: Return data from all runs
๐ Miscellaneous
95446290- Update docstrings to be imperative and include all methods, attributes, and properties
v0.87.0ยถ
๐ Feature Release
28e238ed- New OOD_KNN detector, works with Embeddings class
๐ง Deprecations and Removals
808bf568- Remove utility datasets from the dataeval package
๐ ๏ธ Improvements and Enhancements
75a22785- Implement dataset validation functione83b4c31- Integrate prioritization re-ranking policies14eeb978- Switch to using is_continuous function for metadata binning
๐พ Fixes
75995b42- 412 Sufficiency inverse projection does not handle unachievable targetef7da431- Fix datasets datum metadata to be MAITE protocol compliant06a5727d- Fix balance to treat arrays of discrete distinct values as continuous
๐ Miscellaneous
bf8c7ea6- Improved Selection docstrings
v0.86.9ยถ
๐ ๏ธ Improvements and Enhancements
52218096- Expanded sufficiency docstring with descriptions of parameter default valuesโฆ
v0.86.8ยถ
v0.86.7ยถ
๐พ Fixes
a93ccfa2- Hotfix: Address metadata issues for datasets with empty targets
v0.86.6ยถ
v0.86.5ยถ
๐ ๏ธ Improvements and Enhancements
a5154294- Add dataframe format to StatsOutputs
๐พ Fixes
980eee6f- Hotfix: Allow for nan values in outliers and account for them correctly2a484972- Hotfix: Labelstats do not correctly account for 0 targets on an image
v0.86.4ยถ
๐พ Fixes
1a7a6ca1- Fix metadata and labelstats regressions
v0.86.3ยถ
๐ ๏ธ Improvements and Enhancements
3f301870- Remove targets and switch to DataFrames for label stats calculations
v0.86.2ยถ
๐ ๏ธ Improvements and Enhancements
5caee32e- Update Metadata class to use Polars DataFrame
๐พ Fixes
f04da307- MILCO has trouble reading box coordinates delimited by a variable number of spaces.
v0.86.1ยถ
๐ ๏ธ Improvements and Enhancements
28d186c5- Adjusting dataset loaders for uniformity and deterministic behaviorb0873930- Improvements to DimensionStats and plotting8ea38cfe- Add array types for inputs to dataset helpers5fe8e4de- Fix boxratiostats calculations and add missing docstrings20e8fab6- add target size and tests96fd2e23- include empty factors
๐พ Fixes
897e4e0f- Translated MILCO box corners for MAITE compliance
v0.86.0ยถ
๐ Feature Release
3a6c1e8b- Port multi-variate domain classifier from NannyML
๐ ๏ธ Improvements and Enhancements
6615745b- Add functionality for image level factors on object detection target metadata485bc051- Make pandas a required dependency
๐พ Fixes
27758e18- Fix indexing error in subselection
v0.85.0ยถ
๐ Feature Release
019d011d- Enable ClassFilter for ObjectDetectionDataset9d15be15- Add concept page for Completeness
๐ ๏ธ Improvements and Enhancements
140cec9d- Refactor data classes (Embeddings, Images, Metadata, Targets) and function (split_dataset) to be in a first level submodule9eec7e96- Add save and load functionality to embeddings
v0.84.1ยถ
๐ ๏ธ Improvements and Enhancements
82329adc- Move to new MAITE/NumPy ArrayLike and loosen type restriction on Datasetsc31915dc- Split ClassBalance from ClassFilter48bc2426- Update drift classes to use DataEval data structures and simplify torch utility functions
v0.84.0ยถ
๐ Feature Release
0c41ca26- Add factory class method to Embeddings to create from array3d585e81- Implement completeness77bba6d7- Add caching (in-memory) to embeddings
๐ ๏ธ Improvements and Enhancements
5a2dc69e- Change split_dataset to take in Datasets and use Metadata internallye07b8297- Add plot function to Images and change coverage plot to take Images656c8166- Add user sections to welcome page
v0.83.0ยถ
๐ Feature Release
22206752- Add transforms to Select dataset classfca7e907- Add metadata_ood_mi function as find_ood_predictors
๐ ๏ธ Improvements and Enhancements
54c28111- Adjust tagline/purpose statement to mention user effort
๐ Miscellaneous
bf415bf8- Update datatsets docstrings for completeness and move Transform type to typing module
v0.82.1ยถ
๐ ๏ธ Improvements and Enhancements
4b6bbab8- Intrinsic metadata using image statistics
๐พ Fixes
9bae7f1a- Perform dict override hack before setting other parametersa9e3fde1- HOTFIX: remove cyclical call to getattr in Select wrapper class
๐ Miscellaneous
94b31bf1- Create a package wide configuration for random seeds36746ef9- Switch to DeviceLike for typing of torch device13ff9fd0- Better genericize output classes and add Sequence based output collection class
v0.82.0ยถ
๐ Feature Release
7ad22a7c- Switch all stats classes to use dataset inputs and changedatasetstatstoimagestatse1da1768- Integrate meta_distribution_compare as metadata_distance
๐ ๏ธ Improvements and Enhancements
fc8b68c2- Spike: Define which metrics comprise completeness metrics
๐ Miscellaneous
70a8e1e2- Create lightweight dataset wrapper factory functionsfef938d5- Simplify typing and add more docstrings to dataeval.typing module
v0.81.0ยถ
๐ Feature Release
101fea34- Add selection feature for datasets
๐ Miscellaneous
efc90630- Add index based selection helper666e2866- Add collate helper functions for Dataloaders4e92ae90- Restructure typing for Datasets to allow better extensibility for other data classes
v0.80.0ยถ
๐ Feature Release
94a10b03- Refactor Images, Embeddings and Metadata as a stateful classes using Dataset inputs353ac6bd- Add DataProcessor class to handle extraction of images, embeddings, targets and metadata from datasets
๐ ๏ธ Improvements and Enhancements
529e5595- Merging current WIP state of OODdetector, based on universal embeddings from sigma-optimal-VAE, along with OOD_VAE_minimal notebook.
๐พ Fixes
a2a7057e- Adjust merge to handle numpy arraysAdds in a numpy array check and if true returns the array as a list prior to normal processing
๐ Miscellaneous
e55cb3f2- Use cpu as default torch device
v0.79.0ยถ
๐ Feature Release
841425ff- Release Metadata OOD function most_deviated_factorsAdds a new explanatory function using Metadata and an OODOutput
v0.78.0ยถ
๐ Feature Release
bff82522- Add collate function and convert packaged datasets to MAITE protocolsChanges all dataset utility classes to use
MAITEprotocol formats (MNIST,CIFAR10, andVOCDetection)Addes
collateto aggregate (and encode)MAITEdatasets into images/embeddings, targets, and metadata
๐ ๏ธ Improvements and Enhancements
d9e0f8b0- Enforce embeddings on functions/methods that take embedding inputs
v0.77.1ยถ
๐ ๏ธ Improvements and Enhancements
9a420f7d- Update Assess the data space tutorial to fit JATIC DR-2.33ab63f3e- Integrate clusterer speed improvements with numba
v0.77.0ยถ
๐ Feature Release
a1974e41- Add global config module to control default device and max processes
๐พ Fixes
c5ca814d- Enforce unit interval in OOD detector and coverage metric41c4437b- CoverageOutput attributes renamed for clarityAttributes renamed:
indices->uncovered_indicesradii->critical_value_radiicritical_value->coverage_radius
99631a94- Fix ax.hist on small ranges in NumPy 2.1+
v0.76.1ยถ
๐ ๏ธ Improvements and Enhancements
a8a4cd4f- Remove merge from preprocess and address metadata array length inconsistenciesf8061eca- Add option to return dropped keys from metadata utility functionsa4ddbed1- Add pandas dependency toallextras option5b05981e- Expose dropped keys from nested lists and inconsistent keys in metadata merge and preprocess
๐ Miscellaneous
a20766ec- Updates to documentation961ad923- Miscellaneous docs changes
v0.76.0ยถ
๐ Feature Release
4647edca- Expose flatten metadata function and update docstring
๐ ๏ธ Improvements and Enhancements
27d34a0c- Incorporating NAWCAD feedback to improve the documentation for the stat functions, outliers class and coverage class
๐ Miscellaneous
c9998971- Switch themes to sphinx-immaterial, enable graphviz and restructure documentation3dedde8f- Adding templates for auto generation of docs8aaa89f3- add deep dive prototyped9d902f3- Allow for float type bounding boxes9bc0f910- Add additional code coverage tests15f1ae84- Add logging to output metadata decorator01cef92b- Split conftest for tests and doctestsadc8e293- Publish MR docs and code coverage to deployment environmentsa3fc1f6c- Moves document link to body to match other header titles69892fd3- Visibility enhancements to BalanceOutput.plot() heatmapb6ab03a6- Simplify docker build script for docs
v0.75.0ยถ
๐ Feature Release
3aa12cb3- Refactor bias metadata helpersMetadata preprocessing functions have been moved from
dataeval.metrics.bias.metadata_preprocessingtodataeval.utils.metadata.
๐ ๏ธ Improvements and Enhancements
ed98b6b1- Return empty string for hashes on too small imagespchashnow returns empty string when attempting to perform perception hashing against images or chips that are too small to meaningfully hash.Duplicatesalso ignore empty perception hashes to avoid false positive detections.b144fa1c- Change torch to be required dependencyPyTorch is now a required dependency and the
torchextra is no longer required for full functionality
๐ Miscellaneous
6e4474b2- Refactor utils and fix associated docstrings, documentation and notebooksff87cee6- Update documentation and CI pipelines to comply with SDP DR-3aa7d9205- Updated README.md format, added tagline and cdao funding acknowledgment82559846- Replace manual markdown files with autoapi generated rst files
v0.74.2ยถ
๐ Miscellaneous
e7a284de- Update dataset split unit testsf8731a44- Add initial logging framework and unit test771dc1d1- Add conda tests to pipeline2d9fd55a- Update RTD yaml to use uv for installation0ab99a7f- Initial prototyping of underspecification tests
v0.74.1ยถ
๐ Miscellaneous
102664de- Remove tensorflow from projecte782dad1- Refactor OutputMetadata and clean up set_metadata decorator80aae3a6- Just use KSOutput as a MappingOutput instance instead of extracting the dict attribute it no longer has.b738e01f- Allow docker cmds within dev container16839b46- Add MappingOutput classe2cfda94- Made metadata_tools/ks_compare compatible with new KSOutput class.
v0.74.0ยถ
๐ Feature Release
73c1e1be- Implement PyTorch AutoEncoder based OOD detectorAdds initial PyTorch based Autoencoder OOD detector available when installed with the
torchextra.
๐ ๏ธ Improvements and Enhancements
70794b5f- Moved discretization of metadata out of bias functions
๐ Miscellaneous
4d94e602- Added test assertions for how_to notebooks7723e242- Introduce Pytorch OOD detector, with its new training procedure, into OOD howto notebook.f5ac4bdd- Added new KSOutput class and adapted tests and other functions accordingly3a01a81a- Introduce new Pytorch OOD detection into prototype metadata demo notebooks.dc155554- Fix torch gmm functions and enable testsa715c1ef- Adjust docs to incorporate new metadata function0719bad0- Update dependencies to remove hdbscan
v0.73.1ยถ
๐พ Fixes
cac3e2b8- Fixes drift with pre-processing and shuffles MNIST by default
๐ Miscellaneous
bacbd0e7- Use build script specifically for docs0a87e912- docker build for docs only671b60a5- Prototype function to infer whether a 1D sample is continuous or discreted0b8004a- Use explicit re namespace for compile, search, sub, and MULTILINE502ca2df- Change to nox for automation test scripts5b46ebea- Add new bias functional tests and set groundwork for rediscretization
v0.73.0ยถ
๐ Feature Release
e055acf0- Metadata utility function to merge, extend and flatten metadata95b28ae1- Adjust bias plotting functions to return figure
๐ Miscellaneous
532f92a2- Minimum spanning tree and Clusterer are rewritten using numba for large code speed up7377e012- Switch jobs to use uv and tox natively7af75016- Add lazyloading for tensorflow modules
v0.72.2ยถ
๐ ๏ธ Improvements and Enhancements
ba52ef2e- Refactor away _internal module
๐ Miscellaneous
6e55451c- Integration of distribution compare and OOD MI metadata tools (continued)e4f82173- Streamlined testsac8fe3ee- Fix type mismatch on training AEGMM6289c7d0- Add plotting helper functions to diversity and balance14d0cfd4- Integration of low-level metadata drift/OOD exploration functions
v0.72.1ยถ
๐ Miscellaneous
32ba1f29- Data split tests76f73770- Updated glossary and other files to use new style of links20efd27e- Add support for Python 3.12
v0.72.0ยถ
๐ Feature Release
14ef382c- Update dependencies for conda compatibility
v0.71.1ยถ
๐ ๏ธ Improvements and Enhancements
97849b01- Update support for tensorflow >=2.16 with explicit keras v2
๐ Miscellaneous
85bafa30- Swap brightness and darkness96a30ad0- Make optional checks more granular55ca81d6- Use native int for dict keys for Outliers639e140b- silence warnings for docs and doctest
v0.71.0ยถ
๐ Feature Release
cdae8a17- Parallelize existing stats metric functions and introduce dedicated channelstats functionRunning statistical analysis functions take significant time against large datasets. Due to the natural parallelism of analyzing individual images, we introduced parallel processing leveraging the
multiprocessinglibrary to accelerate processing times.Affected functions:
datasetstatsdimensionstatshashstatspixelstatsvisualstats
Additionally,
channelstatswas added which performs the functionality ofdatasetstatsbut only for the functions that support per-channel stat calculation,pixelstatsandvisualstats.
๐ Miscellaneous
552668a0- Update EDA part 1 tutorial with miscellaneous changes
v0.70.1ยถ
๐ ๏ธ Improvements and Enhancements
d1cdcda5- API changes with supporting documentation updates
๐ Miscellaneous
5ecd4d3a- expose datasets API6c19bba7- Make sufficiency args more permissive1bc2d067- Improving MNIST classd23b3461- Extract small-scope reusable functions from tools made for prototype Associate[Drift|OOD]withMetadataTutorial notebooks.5bea9512- remove tf-iogcs-fs
v0.70.0ยถ
๐ Feature Release
71e7ff06- Integrate labelstats functionf40bf0e4- Redesign stats functions for expansion to per-box, per-channel, and boxratiostats
๐ ๏ธ Improvements and Enhancements
72390edc- Change input format of balance and diversity to be the same as parity
๐พ Fixes
f598c46a- Update pytorch to 2.2.0+
๐ Miscellaneous
b8f0d502- Create copy onto_numpyby default04a71337- Fix CI docs job to load on build9286f5e8- Skip or rework MNIST based unit tests704f44e3- Investigate the use of metadata to help explain observed dataset drifts and OOD examplese25f84f3- Expose SufficiencyOutput and move class methods to output class742a084c- Adding algorithm compatability/requirements table7ce85be7- Misc concept documentation
v0.69.4ยถ
๐ Miscellaneous
7bca6ed4- Unified all MNIST and MNIST corrupt datasets to a single internal MNIST class66ad1c92- new drift detector: multivariate domain classifier
v0.69.3ยถ
๐ Miscellaneous
6745e39d- Document: Class Label Statistical Independence and Coverage Documentation1f7689ac- Adding bias tutorial (parity-balance-diversity)
v0.69.2ยถ
๐ Miscellaneous
f7d5bac3- Adds stats for bounding boxes18be58a3- Adding label stats809d1d7a- Always produce p-val and distance metrics for drift5cd7c205- Improving imagestats and channelstats functionsb379d44c- Add dataset splitting features80b68a73- Use regex to replace markdown links1d99455a- Tag LKG at the correct commit SHAad0e368b- Always run tasks
v0.69.1ยถ
๐ Miscellaneous
d9068a2c- Fix release and changelog script
v0.69.0ยถ
๐ Miscellaneous
63ab70d7- Remove automatic update of documentation notebooks
v0.68.0ยถ
๐ Feature Release
47b48e14- Allow Duplicates and Outliers detectors to take in multiple StatsOutput objects
๐ Miscellaneous
65d8f3de- Combine classwise bias metric outputs with non-classwiseccfd72ef- Adding clustering/coverage tutorial6d09d710- Add CONTRIBUTING.md72387d9c- Updated version replacement script to include cache files5285f01b- Prototype Performance Estimation3ae16116- concept pages for balance and diversity, rescale Simpson diversity3e16a905- Switching documentation themes to the pydata theme
v0.67.0ยถ
๐ Feature Release
a0b04800- Refactor DataEval functions and classes and update documentationChanges DataEval functions and classes to be more hierarchical in modules:
detectors
drift (DriftCVS, DriftKS, DriftMMD, DriftUncertainty)
linters (Clusterer, Duplicates, Outliers)
ood (OOD_AE, OOD_AEGMM, OOD_LLR, OOD_VAE, OOD_VAEGMM)
flags (ImageStat)
metrics
bias (balance, coverage, diversity, parity)
estimators (ber, divergence, uap)
stats (imagestats, channelstats)
workflows (Sufficiency)
Backends have been moved from
modelstotensorflowandtorchRenamed following classes:
Linter->Outliersparity->label_parityparity_metadata->parityDriftOutput->DriftBaseOutputDriftUnivariateOutput->DriftOutput
Miscellaneous fixes:
Documentation updated
Streamlined optional import checks in the
__init__.pytreeFixed misspelling in glossary
๐พ Fixes
84aae760- balance test cleanup
๐ Miscellaneous
6d09d710- Add CONTRIBUTING.md72387d9c- Updated version replacement script to include cache files5285f01b- Prototype Performance Estimation3ae16116- concept pages for balance and diversity, rescale Simpson diversity3e16a905- Switching documentation themes to the pydata themed50d9cd1- Update Landing Page2fd7fa59- Author drift detection tutorial49b5af42- Use uv instead of pyenv for python deployment0f6eb6b0- Pin notebooks on release to specific version4f101a4e- Adjust imagestats and channelstats reference guides to new format0ee82ede- Only build data image in main pipeline7b84ceb5- Improve test coveraged3c5258a- Add StatsOutput as input type for linter and duplicatescf73393a- Updates drift reference guides and concept page4ce5cdf7- Adjust model reference guides to new format17195a2b- Adjust parity reference guides to new formate9761b4d- Adjust out of distribution reference guides to new formateaf707a7- Adjust uap reference guide to new format335ac3be- Adjust sufficiency reference guide to new format3a866f01- Change Optional[Type] to Type | None per 3.10+ standards
v0.66.0ยถ
๐ Feature Release
a0b04800- Refactor DataEval functions and classes and update documentationChanges DataEval functions and classes to be more hierarchical in modules:
detectors
drift (DriftCVS, DriftKS, DriftMMD, DriftUncertainty)
linters (Clusterer, Duplicates, Outliers)
ood (OOD_AE, OOD_AEGMM, OOD_LLR, OOD_VAE, OOD_VAEGMM)
flags (ImageStat)
metrics
bias (balance, coverage, diversity, parity)
estimators (ber, divergence, uap)
stats (imagestats, channelstats)
workflows (Sufficiency)
Backends have been moved from
modelstotensorflowandtorchRenamed following classes:
Linter->Outliersparity->label_parityparity_metadata->parityDriftOutput->DriftBaseOutputDriftUnivariateOutput->DriftOutput
Miscellaneous fixes:
Documentation updated
Streamlined optional import checks in the
__init__.pytreeFixed misspelling in glossary
๐ ๏ธ Improvements and Enhancements
5f730baa- Refactor ImageStats and ChannelStats as metric functions
๐พ Fixes
84aae760- balance test cleanup3ebd278c- handle float-type categorical variables in balance metric066b7153- Fixes modzscore to account for division by 0
๐ Miscellaneous
d50d9cd1- Update Landing Page2fd7fa59- Author drift detection tutorial49b5af42- Use uv instead of pyenv for python deployment0f6eb6b0- Pin notebooks on release to specific version4f101a4e- Adjust imagestats and channelstats reference guides to new format0ee82ede- Only build data image in main pipeline7b84ceb5- Improve test coveraged3c5258a- Add StatsOutput as input type for linter and duplicatescf73393a- Updates drift reference guides and concept page4ce5cdf7- Adjust model reference guides to new format17195a2b- Adjust parity reference guides to new formate9761b4d- Adjust out of distribution reference guides to new formateaf707a7- Adjust uap reference guide to new format335ac3be- Adjust sufficiency reference guide to new format3a866f01- Change Optional[Type] to Type | None per 3.10+ standardsfe1e292d- Use output dataclass with metadatab3f6a027- Unify handling of image reshaping
v0.65.0ยถ
๐ ๏ธ Improvements and Enhancements
5f730baa- Refactor ImageStats and ChannelStats as metric functions
๐พ Fixes
3ebd278c- handle float-type categorical variables in balance metric066b7153- Fixes modzscore to account for division by 0
๐ Miscellaneous
fe1e292d- Use output dataclass with metadatab3f6a027- Unify handling of image reshaping
v0.64.0ยถ
๐ Feature Release
bea0446c- Torch Dataset Reader
๐ ๏ธ Improvements and Enhancements
eda88822- Refactor metrics
๐ Miscellaneous
a4b8e919- Created new documentation issue templates1028d082- Remove is_arraylike functiondbcecec6- Refactored read_dataset to handle common dataset returns61b1f854- Updated Workflow Landing Pagecf96c7f2- Run doctest in CI pipelineecfcf89b- Adjusted notebooks to work on google colab and added environment requirements5f863782- Update remaining metric output to NamedTuplee58f4dba- Add metadata parity documentation6319a1d4- Adding Duplicates concept787545f5- Adding ImageStats and ChannelStats concept document7826405c- Update Data Cleaning concept50047116- Change to Semantic Versioning9e43399c- Bayes Error Rate - explanation documentation266ad738- Updated BER docstrings with NDArray, shapes, and examples
v0.63.0ยถ
๐ ๏ธ Improvements and Enhancements
3225cf18- Convert remaining metrics and detectors to ArrayLike5d88b82a- Add Torch and Tensorflow interop through ArrayLike protocol and to_numpy converterd3342275- Refactor linter and duplicates to call evaluate with data65d5aaa8- Refactor metrics to call evaluate with data
v0.61.0ยถ
๐ ๏ธ Improvements and Enhancements
cd59debb- Release DataEval v0.61.0!DAML is now officially rebranded as DataEval! New name, same great camel flavor.
v0.56.0ยถ
๐ Feature Release
64416675- Update clusterer class and documentationClustererdetector released
This class assists in exploratory data analysis of unlabeled data by identifying duplicates and outliers. Additional information on usage is available in our documentation.
v0.55.0ยถ
๐ Feature Release
278b4dc1- Release Linter, Duplicates, ImageStats, ChannelStats and ParityLinter,Duplicatesdetectors andImageStats,ChannelStats, andParitymetrics are now released. The existing metrics available have also been moved into different modules (detectorsandworkflows) that better reflect their functionality.detectorsDrift detectors:
DriftCVM,DriftKS,DriftMMD,DriftUncertaintyand supporting classesOut-of-distribution detectors:
OOD_AE,OOD_AEGMM,OOD_LLR,OOD_VAE,OOD_VAEGMMand supporting classesLinterDuplicates
metricsBERDivergenceParityImageStatsChannelStatsUAP
workflowsSufficiency
v0.54.0ยถ
๐ ๏ธ Improvements and Enhancements
58263ac7- Move niter param to evaluate and calculate and retain curve coefficients in output dictionaryThis change enhances the output of the
Sufficiencymetric to provide the coefficients for the learning curve by measure/class when running the metric. These parameters were previously recalculated each call to project and plot. The parameters are provided as aDict[str, np.ndarray]under the_CURVE_PARAMS_key in the output dictionary.
v0.53.0ยถ
๐ Feature Release
322fc830- Add parameterkto BER estimator for KNN to enablek>1for better consistency with ground truth in certain cases
v0.52.0ยถ
๐ ๏ธ Improvements and Enhancements
07b12ac2- Fully integrate outlier detectionOutlier Detection API has been changed. Additional details are available in our documentation.
v0.51.0ยถ
๐ Feature Release
2ed88a07- Implement Drift Detection MetricsThis change adds 4 types of Drift Detection metrics which allow for the detection of potential drift in the dataset.
Kolmogorov-Smirnov
Cramรฉr-von Mises
Maximum Mean Discrepancy
Classifier Uncertainty
The conceptual source is derived from Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift and the implementation is derived from Alibi-Detect v0.11.4.
v0.45.0ยถ
๐ง Deprecations and Removals
5cc48bec- Divergence metric naming corrected to HP DivergenceDivergence metric output now returns a dictionary of
{ "divergence": float, "error": int }instead of{ "dpdivergence": float, "error": int }. Code, documentation and tutorials have been updated to the correct nomenclature of HP (Henze-Penrose) divergence.
v0.44.6ยถ
๐ Feature Release
41b20d3a- Add rules for release label pipeline workflow and merge request release template
๐ ๏ธ Improvements and Enhancements
7ee53c9c- Update Divergence default to MST
v0.44.2ยถ
๐ ๏ธ Improvements and Enhancements
1468aa5c- Switch to markdown and updated docs
v0.43.0ยถ
๐ ๏ธ Improvements and Enhancements
670a0db5- Add support for classwise Sufficiency metricsb96ee099- Have sufficiency train and eval functions take indices and batch size instead of a DataLoader
v0.42.2ยถ
๐ ๏ธ Improvements and Enhancements
5225c491- Change output classes to dictionaries45040682- Make Sufficiency a stateful class and revise SufficiencyOutput7c5fdcff- Pass method as a parameter to determine metric algorithm to use2e883f6d- Add better optimizer to find global minimumc3c78680- Expose AETrainer to public API to use model multiple times after training
๐พ Fixes
93564b95- Updating pyproject.toml and lock file to set dependency less than numpy 2.0
v0.42.0ยถ
๐ ๏ธ Improvements and Enhancements
601cfae8- Sufficiency Plotting of Multiple Metrics during one run3d68a6f1- Add parameter to plot function for optional file output
๐ง Deprecations and Removals
a6ce3e72- Remove UAP_MST metric
v0.40.2ยถ
๐ ๏ธ Improvements and Enhancements
f3eddaed- Flavor 2 - Remove models from metrics entirely
v0.40.1ยถ
๐ง Deprecations and Removals
db888bb7- Remove usage of DamlDataset for ARiA metrics
v0.38.1ยถ
๐ ๏ธ Improvements and Enhancements
42617f43- Enable GPU functionality in pytorch features
v0.38.0ยถ
๐ Feature Release
c9b5116e- ARiA Autoencoder as PyTorch Model
๐ ๏ธ Improvements and Enhancements
8fe97232- Add export_model functionality and improve test coverage42cc77ea- Add empirical upper bound to UAP metric output
๐พ Fixes
636dfdaf- update project with version metadata
v0.36.1ยถ
๐ Feature Release
7d1a599f- Implement the uap class
v0.36.0ยถ
๐ ๏ธ Improvements and Enhancements
0799523b- Object detection model training
v0.29.0ยถ
๐ Feature Release
166df3b0- Implement Dataset Sufficiency Metric
๐ ๏ธ Improvements and Enhancements
5c4e6e06- Use convolutional autoencoder for BER and Divergence metrics
๐พ Fixes
c78e5502- Sufficiency typecheck bugfix
v0.28.5ยถ
๐ ๏ธ Improvements and Enhancements
9d1c354c- Add fit_dataset, format_dataset to DpDivergence & BER
v0.28.4ยถ
๐พ Fixes
c39e009e- Fix typecheck issues found with pyright-1.1.333
v0.26.13ยถ
๐ Feature Release
949e09bd- Add kNN BER implementation
v0.26.10ยถ
๐ ๏ธ Improvements and Enhancements
dab0a8ff- Handle MST edge cases
v0.26.4ยถ
๐ ๏ธ Improvements and Enhancements
bf31996f- BER lower bound capability
v0.25.11ยถ
๐ ๏ธ Improvements and Enhancements
dfe0bddb- Add support for python 3.11
v0.25.4ยถ
๐ ๏ธ Improvements and Enhancements
2ca285cc- update BER metric to return a dataclass instead of dict
v0.25.3ยถ
๐พ Fixes
67f08b27- Fix: Alibi-detect-models-have-fixed-architecture-shapes
v0.25.2ยถ
๐ ๏ธ Improvements and Enhancements
db4adaff- 69 convert metric output dictionary to dataclass
v0.24.8ยถ
๐ Feature Release
79614577- Implement Multiclass MST version of BER
v0.24.6ยถ
๐ Feature Release
2ad9fed5- Implement BER estimate
v0.23.1ยถ
๐ Feature Release
99d2fd22- Implement outlier detection metrics using the alibi-detect VAE method
v0.23.0ยถ
๐ Feature Release
85eb2c1f- Implement outlier detection metrics using the alibi-detect auto-encoder method