Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clarify Strandedness Options for SummarizeOverlaps in Documentation #7

Open
groverj3 opened this issue Jul 25, 2019 · 1 comment
Open

Comments

@groverj3
Copy link

groverj3 commented Jul 25, 2019

Hey there,

It seems to me that due to how most RNAseq libraries are now constructed, most people who want to count reads per feature with summarizeOverlaps will actually want to use ignore.strand = FALSE and preprocess.reads = invertStrand. However, I only stumbled across this after realizing that my output was wildly different from what htseq-count with -s reverse gave me when I only used ignore.strand = FALSE.

That you need to use preprocess.reads = invertStrand with most modern stranded RNAseq libraries is not mentioned in the documentation for summarizeOverlaps or in the example RNAseq workflow (which is not this package, but it's still confusing). The only information that I could find to lead me to this was in the bioconductor support forums.

Could the documentation be clarified in some way? I could see this tripping up new users.

@michaeldshapiro
Copy link

Hear! Hear!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants