Old vs New Velocity Tutorial #719
Unanswered
ACastanza
asked this question in
Q&A (single-cell specific)
Replies: 2 comments
-
Hey, just bumping this to show wider interest :) What I would like is: an accurate, fast (ish), streamlined pipeline for mapping to unspliced / spliced / ambiguous reads. So the question that is most interesting to me is understanding how does decoy mode vs splici mode affect accuracy and timings. It would be great if you could comment on this! Thanks, Will |
Beta Was this translation helpful? Give feedback.
0 replies
-
I've just discovered that these questions were answered in the |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello @rob-p and all,
I have a couple questions about the old vs new RNA-velocity with Alevin tutorials.
After reading through them several times, and having fully implemented the old tutorial into a pipeline, I have a couple questions about converting.
The major benefit of the new tutorial seems to be that alevin-fry produces the "usa mode" result when fed a three column tx2gene map file which is then simple to parse into the h5ad spec expected by scanpy/scvelo. With the previous tutorial, instead there was a two column tx2gene map with -I suffixes to indicate the unspliced sequences. Is it possible to get standard alevin to produce the "usa mode" outputs without fully converting to alevin-fry?
(If not, I'd like to propose this as a feature request)
Additionally, it appears that the flank length has been changed from read length -1 in the old tutorial to read length -5 in the new tutorial. Is there a particular significance to this change?
The new tutorial uses a simple "splici reference" and no longer adds the full genome as a decoy sequence. Is is no longer advisable to use the full decoy or partial decoy indices when computing with either alevin or alevin-fry (assuming that speed/memory are not the highest concern)? If adding the decoys is not contraindicated, is it possible to run the new tutorial pipeline with an index which has decoy sequences added to it ala the old tutorial?
Edit: I see that decoys are not compatible with alevin-fry mode?
Parsing the USA mode file combines the Spliced and Ambiguous reads into a single layer rather than assigning the ambiguous reads to the expected ambiguous layer. Would it not be better to assign the ambiguous reads to the ambiguous layer (which tools like scvelo expect) and then let scvelo handle the interpretation of this layer? Combining S+A "early" seems to negate the benefit of the new USA mode over the old tutorial where alevin was allowed to assign these ambiguous reads as it saw fit (learning the expected proportions from the data). Assuming my understanding is correct here would it be possible to get instructions for extracting the three layers separately so that they can be assembled into the expected three layer h5ad file?
Since the previous velocity tutorial explicitly requested a decoy aware index and the new alevin-fry mode doesn't support this. It seems like an option to produce the USA mode result file from standard alevin would provide a "best of both worlds" situation.
Edit: additional question:
When making the intron aware transcritpome fasta the "joinOverlappingIntrons" parameter in
getFeatureRanges
has been changed toTRUE
where it was previouslyFALSE
, is there a particular reason for this change?I'm sure I'll have some more questions as I work through the new velocity tutorial but that's all I've got for now, thanks!
Beta Was this translation helpful? Give feedback.
All reactions