Skip to content

feature barcoding data (like CITE-seq) is taking too much time to process #630

Discussion options

You must be logged in to vote

Basically the answer lies in how complicated the UMI graph network is. Experiment with the antibody derived barcodes (ADT) with ~200 protein panel, generally, doesn't need the --naiveEqclass mode UMI deduplication, unless the experiment is super deeply sequenced. However, for very low diversity like 20 barcodes e.g. for HTO like sample barcodes, the graphical network becomes exponentially hard to solve and potentially increases the running time for alevin.

In general, we'd recommend if you expect very low diversity in the number of barcodes in your experiment, use --naiveEqclass otherwise prefer avoiding it. Generally, the experiment with low diversity barcodes results in such a highly de…

Replies: 1 comment

Comment options

k3yavi
Feb 12, 2021
Collaborator Author

You must be logged in to vote
0 replies
Answer selected by k3yavi
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
1 participant