This package offers tools to analyze the unimodality of data sampled from multivariate distributions lying in the Euclidean Space. To read an independent explanation and summary of this paper, please refer to the write-up by AIModels.fyi here.
- The
mud-podtest: A multivariate unimodality test. - The
dip-meansclustering algorithm: A wrapper ofk-meansthat also detects the numbers of clusters.
To install mudpod, you can use pdm, which is a modern packaging tool that manages your Python packages without the need for creating a virtualenv in a traditional sense.
Ensure you have pdm installed on your system. If not, install it using the following command:
curl -sSL https://pdm-project.org/install-pdm.py | python3 -Please run:
pdm install -G coreNote: If you want to run the tests or the experiments, please install the additional dependencies, i.e., test and exps, respectively, using the following command:
pdm install -G GROUP_NAMEIf you find this code useful in your research, please cite:
@InProceedings{pmlr-v286-kolyvakis25a,
title = {A Multivariate Unimodality Test Harnessing the Dip Statistic of Mahalanobis Distances Over Random Projections},
author = {Kolyvakis, Prodromos and Likas, Aristidis},
booktitle = {Proceedings of the Forty-first Conference on Uncertainty in Artificial Intelligence},
pages = {2255--2268},
year = {2025},
editor = {Chiappa, Silvia and Magliacane, Sara},
volume = {286},
series = {Proceedings of Machine Learning Research},
month = {21--25 Jul},
publisher = {PMLR},
pdf = {https://raw.githubusercontent.com/mlresearch/v286/main/assets/kolyvakis25a/kolyvakis25a.pdf},
url = {https://proceedings.mlr.press/v286/kolyvakis25a.html},
abstract = {Unimodality, pivotal in statistical analysis, offers insights into dataset structures and drives sophisticated analytical procedures. While unimodality’s confirmation is straightforward for one-dimensional data using methods like Silverman’s approach and Hartigans’ dip statistic, its generalization to higher dimensions remains challenging. By extrapolating one-dimensional unimodality principles to multi-dimensional spaces through linear random projections and leveraging point-to-point distancing, our method, rooted in $\alpha$-unimodality assumptions, presents a novel multivariate unimodality test named $\textit{mud-pod}$. Both theoretical and empirical studies confirm the efficacy of our method in unimodality assessment of multidimensional datasets as well as in estimating the number of clusters.}
}
This project is licensed under the GNU Affero General Public License v3.0 - see the LICENSE file for details.