Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Polishing errors close to window borders #76

Open
fmlt opened this issue Jun 2, 2023 · 1 comment
Open

Polishing errors close to window borders #76

fmlt opened this issue Jun 2, 2023 · 1 comment

Comments

@fmlt
Copy link

fmlt commented Jun 2, 2023

Hi
We are evaluting the quality of raven assemblies using Nanopore V14 raw reads and we noticed errors close to window borders that we belive may be caused by racon. More specifically, after mapping the reads to the polished unitigs, we can find places where almost all reads suggest that the assembled/polished sequence should be different.

error1
error2

We used a basic variant detector to find where these errors occur and they seem to happen "close to" the window border (it is hard to have a clear conclusion because there is no direct way to know where the borders though). The following tables show high frequency variants with windows of 300, 500 and 700 bp length.

300
variants_300
500
variants_500
700
variants_700

We tried a two-step polishing approach using 500 and 700 windows on the first and second step without better results. The "--no-trimming" option also did not help. Is there anything you can suggest to improve such errors?

Thank you in advance for the help.

@JWDebler
Copy link

JWDebler commented Jun 5, 2023

Interesting, because I just noticed something when the --no-trimming option is used (#77) (due to removal of telomeres otherwise), that BUSCO scores drop significantly. Maybe the introduced errors you show lead to broken gene models and therefore lower BUSCO scores.

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants