KADAIF is an anomaly detection method designed specifically for microbiome data.
This GitHub repository contains three folders:
- src - Contains the implementation of KADAIF (MicrobiomeIsolationForest.py and KADAIF.py), an implementation of CLOUD (published by Montassier et al.), and scripts for contamination and mislabeling tests as described in the manuscript.
- run_on_datasets - Bash and Python scripts for reproducing the specific mislabeling and contamination tests outlined in the manuscript. Due to GitHub file size limitations, raw data and results are not included.
- analysis - Jupyter notebooks with the analysis results from the manuscript.
For details on KADAIF's implementation, attributes and functions, see here: