Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Only sample the read extremities rather than the entire read to prevent a lot of false positive overrepresented sequences. #205

Merged
merged 7 commits into from
Nov 20, 2024
Prev Previous commit
Next Next commit
Add overrepresented sequencing sampling to the changelog
rhpvorderman committed Nov 20, 2024
commit f347ae23317fea9306f043f950272cc9bf75b71d
8 changes: 8 additions & 0 deletions CHANGELOG.rst
Original file line number Diff line number Diff line change
@@ -7,6 +7,14 @@ Changelog
.. This document is user facing. Please word the changes in such a way
.. that users understand how the changes affect the new version.
version 0.13.0-dev
------------------
+ Only sample the first 100 bp from the beginning and end of the read by
default for the overrepresented sequences analysis. This prevents a lot of
false positives from common human genome repeats. The amount of base pairs
that are sampled from the beginning and end is user settable with an option
to sample everything.

version 0.12.0
------------------
+ Properly name percentiles as such in the sequence length distribution rather