Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some genome were not found in the clustering results #245

Open
xjhzjucas opened this issue Dec 2, 2024 · 2 comments
Open

Some genome were not found in the clustering results #245

xjhzjucas opened this issue Dec 2, 2024 · 2 comments

Comments

@xjhzjucas
Copy link

Hi developer,
Thank you for this nice tool! I met a problem that I put 577 genomes into dRep using dRep dereplicate ./ -g ./genome577/*.fa -pa 0.99, and it run successfully. I found that only 498 genomes were found to be clustered according to the Primary_clustering_dendrogram.pdf, 79 genomes were not clustered into any cluster, can you tell me why and how to make every genome clustered.
And in Cdb.csv, I only got 498 rows too.
Thank you so much!

@MrOlm
Copy link
Owner

MrOlm commented Dec 2, 2024

They are probably being filtered out by checkM. You can change the checkM filtering chritera, use "cluster" instead of "dereplicate", or add `--ignore_genome_quality" to your command.

Matt

@xjhzjucas
Copy link
Author

Thank you for your help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants