Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MD tags with Nanopore data #9

Open
leonard-creator opened this issue Oct 25, 2023 · 3 comments
Open

MD tags with Nanopore data #9

leonard-creator opened this issue Oct 25, 2023 · 3 comments

Comments

@leonard-creator
Copy link

Hi Hongyu Ding,
I was succesgfully working with MAMnet on small training data. I moved on to use the GIAB OxfordNanopore data. You stated in your README that u tested with that data and the minimap2 aligner. Unfotunately, I ran inbto the following Problem:

I calculated the missing MD tags for the nanopore data with samtools calmd but the program was not able to calculate the MD tags for secondary alignments (flag 256 in Bam-file), because it was missing the SEQ, QUAL columns for those reads. Hence, MAMnet was throwing an error and the program aborted.

Since you were stating you where able to use OxfordNanopore data, what data did you use and how did you use it? Is there a workaround? Can MAMnet handle those errors without crashing?
I tried remapping with minimap2 and the -secondary-seq flag which was not giving the promised results unfortunately.

Bam file data from GIAB

@micahvista
Copy link
Owner

Dear leonard-creator:
I just updated the code. Now MAMnet allows running on alignments without MD tag.
Hongyu Ding

@leonard-creator
Copy link
Author

Thank you very much. I am still interested how you used the nanopore and did not encounter that error. What if I want to include the secondary alignment reads into the analysis?

@micahvista
Copy link
Owner

Dear leonard-creator:
I updated the code to skip secondary alignment. Recently, I have been studying long read mapping. For my experience, the secondary alignment often led to wrong result. But if you are interested in more challenging regions like repetitive regions, secondary alignment may be the correct one. However, MAMnet is not designed for these challenging regions, and may not benefit from the additional information within secondary alignments.
Hongyu, Ding

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants