Hi Tinyi,
Thank you for developing and maintaining BayesPrism — it's a powerful and well-documented tool.
I’m currently trying to use BayesPrism to deconvolute TCGA bulk RNA-seq data using single-cell RNA-seq data as a reference. I have a question regarding the impact of using 5’ versus 3’ single-cell RNA-seq data as the reference input.
Does the use of 5’ or 3’ single-cell RNA-seq (e.g., from 10x Genomics) significantly affect the deconvolution results when applying BayesPrism to bulk RNA-seq data (like TCGA)? Is one more suitable or recommended than the other?
I understand that BayesPrism models gene expression distributions, so I was wondering whether the capture bias introduced by 5'/3' protocols would skew the reference profiles or have any downstream impact.
Any guidance on best practices or relevant references would be greatly appreciated.
Thanks in advance for your time!
Best regards,
Jayden