Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How are ambiguity charaters handled in BWA? #422

Open
SaitouAsuka opened this issue Jun 26, 2024 · 0 comments
Open

How are ambiguity charaters handled in BWA? #422

SaitouAsuka opened this issue Jun 26, 2024 · 0 comments

Comments

@SaitouAsuka
Copy link

I have some reads with UMI(Unique Molecular Identifier). The reads contians same UMI are from the same sequence template. They will merged to a consensus reads which some bases are ambiguous. I use the consensus reads mapping to the reference genome, and here is mapping results:
image
As the seqence I highlighted with red rectangle, there is an ambiguity bases 'N' in the reads. I want to know why it is 'A' (base from reference genome ) but not 'N' ( base from reads) in 'MD' tag(0T85A5).
Is there any official documentation on how aligner handles the ambiguity bases?

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant