This is a chewBBACA-generated cgMLST scheme used for typing Klebsiella aerogenes complex (KAC) species to clonal cluster (CC) level.
- Install chewBBACA.
- Install cgmlst-dists.
- Pull all files in this repository.
- Run AlleleCall in chewBBACA with the downloaded files.
Example command:
chewBBACA.py AlleleCall -i /path/to/InputAssemblies -g /path/to/KAC_wgMLST -o /path/to/OutputFolderName --cpu 20 --genes-list KAC_cgMLST_2067
- Concatenate the output file named
results_alleles.tsv
with the pre-called dataset namedpre-called_cgMLST_dataset.tsv
. - Run cgmlst-dists on the combined dataset to obtain the pairwise cgMLST distances.
- You may gather the distance matrix, using csvtk or R, to find out the closest genome and the number of different loci in the pre-called dataset for each of your query genomes.
- Check pre-defined CC for each pre-called genome and evaluate the distance using the pre-defined cutoff for CC determination.
Please cite the following work when using this database:
Feng, Y., Yang, Y., Hu, Y., Xiao, Y., Xie, Y., Wei, L., Wen, H., Zhang, L., McNally, A., and Zong, Z. (2024). Population genomics uncovers global distribution, antimicrobial resistance, and virulence genes of the opportunistic pathogen Klebsiella aerogenes. Cell Reports, 43. https://doi.org/10.1016/j.celrep.2024.114602