Skip to content

ytabatabaee/ensemble-clustering-data

Repository files navigation

Datasets for FastEnsemble paper

This repository contains the datasets and scripts used in the following paper:

Y. Tabatabaee, E. Wedell, M. Park, T. Warnow. FastEnsemble: A new scalable ensemble clustering method. International Conference on Complex Networks and their Applications (CNA) 2024. Preprint available at https://arxiv.org/abs/2409.02077.

For experiments in this study, we generated a collection of artifical networks such as ring of cliques, Erdos-Renyi (ER) graphs, and combinations of ER graphs with LFR graphs and ring of cliques. All these datasets were generated using NetworkX. Additionally, we used a collection of 27 synthetic LFR graphs from Park et. al. (2024), that were generated based on the properties of a collection of real networks and their Leiden clusterings with different resolutions. These datasets are available at Illinois Data Bank.

This repository includes the new datasets generated for this study, as well as the output of different clustering methods in each experiment.

Releases

No releases published

Packages

No packages published