Replies: 2 comments
-
Hi Varada, Some answers that I hope might help.
If I were in your situation, I'd start by questioning whether you really need the C60 model. To do this, you could try fitting various simpler models on a subset of the loci, then comparing the things you are intersted in (e.g. topologies, bootstrap values) to determine how much benefit C60 really brings you. You coudl e.g. estimtae trees under C60 plus many other models, then us the AU/KH tests to ask whether the C60 model would actually reject any of those other trees produced under simpler models. It may be that C60 is not necessary to get what you need with the level of precision that you require. Rob |
Beta Was this translation helpful? Give feedback.
-
I've had a similar issue, but not nearly as extreme as yours. Not to detract from Rob's advice, but in parallel with using the C30 model like recommend you could also try another approach on your HPC. Rather than relying on IQTree to perform the gene tree search, I split up my alignment using AMAS and then used a slurm array to do the individual searches. This really helps with my time constraints on the HPC I use. Afterwards I would concat the treefiles. It would look something like this:
|
Beta Was this translation helpful? Give feedback.
-
Hi there,
I'm attempting to make gene trees (13000+ of them) with a mixture model. Some of the gene families have over 1000+ sequences and take a very long time to make even on an HPC cluster.
E.g. A gene family alignment with 418 sequences and ~500AA took 6GB RAM and a total of 11h30m with the following commands:
I need a 5000 bootstraps for a tree reconciliation in some later steps. Ideally I would use 10000 but I've already reduced this. I have about 1200 trees to go and would really appreciate some suggestions on speeding up the larger alignments. Here's what I've considered so far after scouring the manual.
Reducing the mixture model to C20 for the larger alignments however, I'm not sure whether it is appropriate to mix mixture models for some gene families like this? Or whether I should rerun all of them (rest 11800) with the C20 as well?
I saw from the manual that I can use iqtree-mpi - which we successfully installed and ran but I run into a multicoring/RAM usage issue since some of the larger alignments require 20GB ram... even if I have a total of 100GB ram for the tree then I can only split it into 5 processes. Plus it says in the manual that this will only be beneficial for longer alignments and mine range between 50AA - 3000AA.
Splitting the bootstrap into 1000 and concatenating the results. But from what I read, this was possible for the standard bootstrap - is it also possible for the ultra-fast bootstrap? The log files show that bootstrap goes through 100 iterations at a time and writes out to the .ufboot file multiple times - how would this work if I were to split the bootstrap into chunks?
Any input from the community is greatly appreciated!
Thanks,
Varada
Beta Was this translation helpful? Give feedback.
All reactions