make_pdb ESMFold jobs running even when input structure is provided #82
Comments
Thanks for letting us know about this! We've seen this occur occasionally with other analyses as well and haven't been able to identify the cause. We'll look into it.
One hypothesis that @mertcelebi shared is that this might occur when the timestamp of the FASTA file is later than that of the PDB file. If so, it's possible that this is a more general issue with how Snakemake detects files. Could you try running this code from within the
Separately from this issue, the experimentally determined PDB provided appears to contain multiple chains, which unfortunately isn't supported yet by the pipeline. We're working to resolve the current issue, but I expect you'll run into a new error once the replacement issue is resolved.
I just did the experiment you suggested and that seemed to be the issue: when the FASTA is older than the PDB file, it works as expected and no However, if you swap those two
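The timestamp behavior reported here is consistent with a make-style freshness check, where an output file older than its input is treated as out of date and rebuilt. Below is a minimal Python sketch of that comparison. It assumes the PDB is treated as the output of the make_pdb rule and the FASTA as its input. The function names and file paths are hypothetical and are not part of the pipeline.

```python
import os


def output_is_stale(input_path, output_path):
    """Return True if the output is missing or older than the input.

    This mirrors a modification-time comparison only; it is a sketch of
    the idea, not Snakemake's actual implementation.
    """
    if not os.path.exists(output_path):
        return True
    return os.path.getmtime(output_path) < os.path.getmtime(input_path)


def refresh_timestamp(path):
    """Set the file's access and modification times to now (like `touch`)."""
    os.utime(path, None)


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    fasta, pdb = "input/protein.fasta", "input/protein.pdb"
    if os.path.exists(fasta) and output_is_stale(fasta, pdb):
        print("PDB looks older than FASTA; a rebuild would be triggered.")
```

Under this assumption, updating the PDB's modification time after the FASTA is written (for example with `refresh_timestamp`) would make the provided structure look up to date again.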
Before you implement full support for multiple chains, would it be helpful to handle multi-chain PDB files in a nicer way? For example, automatically taking chain A, or raising a warning.
That's a helpful suggestion; we can explore whether an intermediate version like this would help.
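As a rough illustration of the intermediate approach suggested above, the following Python sketch keeps only the first chain of a multi-chain PDB and emits a warning. It relies on the fixed-column PDB format, where the chain identifier sits in column 22 of ATOM, HETATM, ANISOU and TER records. The function names are hypothetical, and this is not how the pipeline handles chains.

```python
import warnings

# Record types that carry a chain identifier in column 22 (index 21).
CHAIN_RECORDS = ("ATOM", "HETATM", "ANISOU", "TER")


def chain_ids(pdb_lines):
    """Return chain IDs in order of first appearance in ATOM/HETATM records."""
    seen = []
    for line in pdb_lines:
        if line.startswith(("ATOM", "HETATM")) and len(line) > 21:
            if line[21] not in seen:
                seen.append(line[21])
    return seen


def keep_first_chain(pdb_lines):
    """Drop coordinate records of every chain except the first, with a warning."""
    lines = list(pdb_lines)
    ids = chain_ids(lines)
    if len(ids) <= 1:
        return lines
    keep = ids[0]
    warnings.warn(
        f"PDB contains chains {ids}; keeping only chain {keep!r}."
    )
    return [
        line
        for line in lines
        if not (
            line.startswith(CHAIN_RECORDS)
            and len(line) > 21
            and line[21] != keep
        )
    ]
```

Taking the first chain that appears (typically chain A) keeps the behavior predictable, while the warning makes the silent loss of the other chains visible to the user.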
@naailkhan28 I think we've fixed this in #83, which should ensure that
Just tested the latest version, looks fixed! Great work guys :)
Description of the bug
When running the Search mode Snakemake pipeline, I see that the make_pdb rule is run and a request is made to the ESMFold API, even when I've provided a .pdb file containing my input structure. My experimentally solved crystal structure is being replaced with a less accurate ESMFold prediction, which is not ideal.
I've attached my full Snakemake log (it failed due to a Python error, but ignore this), and my input .yml config, .pdb and .fasta files.
Command used and terminal output
Relevant files
P49861_fasta_pdb_yml_config_snakemake_log.zip
System information
VM.Standard.E4.Flex instance in Oracle Cloud - 64 cores, 1024 GB RAM, 25 TB storage available