-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What's the meaning of adapter content? #138
Comments
Y axis is the percentage of reads where the 12 bp probe for the adapter has been detected. In your case it seems to be in about ~25% of the cases. (I am not couunting polyA as it is common in the human genome) Some further advice: do not use porechop. I recommend using cutadapt. Nanopore adapters are very hard to detect because they occur at the ends of the reads where a lot of errors are. Aligning the adapter is almost impossible in these regions. The best solution is to simly cut of the ends. That should also get rid of adapters.
Cutadapt has no issues with compressed input and also supports uBAM reading. |
Thanks for your reply! I have considered cutadapt, but due to my use of public data, some data did not provide adapter sequence. Do you have recommendation of adapter detector? |
I just test cutadapt with your recommend codes on the same data as before:
It seems that |
Well technically cutadapt can. But then you should take a look at its adapter features. PyChopper looks really nice! Thanks for pointing that out to me. I didn't know of that tool. It seems very to the point for nanopore adapter trimming. If I may ask, how did you hear of sequali? I am trying to get more users. One way is to write a paper about it, but this is not going very fast. (Also I find improving sequali more fun than writing about it). |
OK, then I will use pychopper to cut adapters. However, when the amount of data is large, the multi-threading feature of this tool seems to fail and no longer produces any results. |
I have noticed that many nanopore tools, with the exception of the basecaller (dorado) seem to be generally of very poor software quality. It was one of the reasons to start this project.
Good to know. Thanks! The adapter detection feature is basically the same what FastQC does. FastQC is very useful for illumina adapters, but it does not detect nanopore ones. Technically the nanopore adapters could have been added to FastQC, but FastQC's string searching algorithm does not scale that well. So Sequali uses state-of-the-art multiple pattern matching to enable fast matching of a multitude of adapter probes. |
Hi, I used your tool to check the performance of porechop. After I ran porechop, I checked the adapter content in my result file with sequali. But I don't understand the "adapter content" item in the html report. The figure below is my result. I have some questions. Hope you can help me. Thank you!
The text was updated successfully, but these errors were encountered: