
HelixFold3 predicts unreasonable structures #343

Open
Garhorne0813 opened this issue Sep 5, 2024 · 2 comments

@Garhorne0813

Hello! I noticed that HelixFold3 tends to predict unreasonable structures for regions without templates (shown by the dashed lines in the image), in both the online service and the open-source version. Do you have any solutions for this issue?

Below is the amino acid sequence of the protein:

MANQALSVSVGNALRRVRSYLFLVRGMGQLLRRRLDPTVRAQPAVIVLSLGSKGSSARVAAAARARGYRVVVFCAELPFAEARYMDHYHRIDCVTDFDKALETARGYAPEAILLEGKNRLLPMQNNLAQTLGVTAVGNAAVKSSNSKIDLHASLDRAGLANLPWEILPEDGRSKLSFPVVSKPDVGTSSMGVQYLDSLDTFRNDKAYWDKVAQDTDIDGQIMLESYIDGRQFDVEGVARDGAFHILTVVEEYYQNAAPYFPPSWFLFNPPIPEEQRARLEKRVEEALKAFGVTVGGWHCESRFSDEKYGDGSLRPGIAGNEIYVLDYANRMGYNQLVSESCGADFAGAYVDTMLPRPFSPPQITRRSVLQIMIRDTETLRRAKALAQARPDVVHRGAFVPFEFSAHTYFGHIVLSCPDFETLRDALAAHDLIPDTWAGFYPDAMAGA

Here is the visualization of HelixFold3's prediction:

[Image: 20240905151055]

@magnusbauer

I get similar results when the MSAs are shallow. It is a strange issue: the sequence order is preserved in the PDB/CIF indexing, but the atom positions of some residues are swapped. That is why PyMOL has a hard time displaying the structure correctly (hence the dashed lines). In effect, a residue is placed at the correct position but carries the wrong residue identity. You won't see this from the sequence order in the PDB file, only by following the amino acid chain through the structure residue by residue, as shown below.

[Image: Screenshot 2024-09-08 at 3 29 07 PM]
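For anyone who wants to check their own predictions without opening a viewer, here is a minimal sketch of the "follow the chain residue by residue" idea. It reads a PDB file in index order and measures the distance between the backbone C of one residue and the backbone N of the next. A normal peptide bond is about 1.33 Å, so a much larger distance suggests that neighbors in the file are not neighbors in space, which is the situation PyMOL draws as dashed lines. The function names and the 1.5 Å cutoff are my own assumptions, not part of HelixFold3, and the parser only handles standard fixed-column `ATOM` records.

```python
import math


def parse_backbone(pdb_text):
    """Collect N, CA and C coordinates per residue, in file order."""
    residues = []  # entries: (chain, residue id, residue name, {atom: xyz})
    index = {}
    for line in pdb_text.splitlines():
        if not line.startswith("ATOM"):
            continue
        name = line[12:16].strip()
        if name not in ("N", "CA", "C"):
            continue
        # Chain id plus residue number (with insertion code, if any).
        key = (line[21], line[22:27].strip())
        if key not in index:
            index[key] = len(residues)
            residues.append((key[0], key[1], line[17:20].strip(), {}))
        xyz = (float(line[30:38]), float(line[38:46]), float(line[46:54]))
        residues[index[key]][3][name] = xyz
    return residues


def find_breaks(residues, max_cn=1.5):
    """Return (chain, res_i, res_j, distance) where C(i)-N(j) exceeds max_cn.

    max_cn is an assumed cutoff in angstroms, slightly above the
    typical 1.33 angstrom peptide bond length.
    """
    breaks = []
    for prev, nxt in zip(residues, residues[1:]):
        if prev[0] != nxt[0]:
            continue  # do not compare across different chains
        c = prev[3].get("C")
        n = nxt[3].get("N")
        if c is None or n is None:
            continue  # skip residues with missing backbone atoms
        d = math.dist(c, n)
        if d > max_cn:
            breaks.append((prev[0], prev[1], nxt[1], round(d, 2)))
    return breaks
```

Usage would be something like `find_breaks(parse_backbone(open("prediction.pdb").read()))`, where `prediction.pdb` is a placeholder file name. An empty list means every consecutive pair is within bonding distance. This only detects that the chain is broken in space, not which residues were swapped.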

@Fairly
Collaborator

Fairly commented Sep 9, 2024

@Garhorne0813 @magnusbauer
Thank you for the feedback. We are currently investigating the issue. We have observed that this behavior is more likely with a shallower MSA and a shorter sequence, where the predicted pLDDT of part of the structure tends to be very low and the model connects atoms incorrectly. We are not yet certain, but the problem may be due to underfitting of the model. We are actively working on improving it.
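To see whether the affected regions coincide with very low pLDDT, a rough sketch like the following can list contiguous low-confidence stretches. It assumes the per-residue pLDDT is stored in the B-factor column of the output PDB file, as is the convention for AlphaFold-style outputs. I have not confirmed this for HelixFold3, so please check your own files first. The function names, the threshold of 50, and the minimum run length of 3 are illustrative assumptions.

```python
def plddt_per_residue(pdb_text):
    """Read one value per residue from the B-factor column of CA atoms.

    Assumption: the B-factor column (columns 61-66) holds pLDDT.
    """
    scores = []
    for line in pdb_text.splitlines():
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            scores.append((line[22:27].strip(), float(line[60:66])))
    return scores


def low_confidence_segments(scores, threshold=50.0, min_len=3):
    """Return (first, last) residue ids of runs with pLDDT below threshold.

    threshold and min_len are assumed defaults, not values from HelixFold3.
    """
    segments, run = [], []
    for resid, value in scores:
        if value < threshold:
            run.append(resid)
            continue
        if len(run) >= min_len:
            segments.append((run[0], run[-1]))
        run = []
    if len(run) >= min_len:  # close a run that reaches the end of the chain
        segments.append((run[0], run[-1]))
    return segments
```

Comparing these segments with the positions of broken connections in the structure would show whether the misconnected atoms really sit in the low-pLDDT parts of the prediction.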
