Update performNormalization.R #111

LinearParadox · 2024-08-13T21:02:38Z

added parallelization to normalize function. This might be memory intensive, but still would offer a speed benefit over a strictly single threaded approach. Under the hood, bpvec splits the vector into the number of workers and then executes the function on them, before joining it back together.

I tested it against the equivalent single thread method and got an identical result.

added parallelization to normalize function. This might be memory intensive, but still would offer a speed benefit over a strictly single threaded approach. Under the hood, bpvec splits the vector into the number of workers and then executes the function on them, before joining it back together. I tested it against the equivalent single thread method and got an identical result.

improved efficiency in the case of a provided scale factor, fixed for cases where scale factor is not provided

Adding bpvec to importfrom

ncborcherding · 2024-08-15T15:07:46Z

Thanks for making a pull request - agreed that the speed for performNormalization() could be improved substantially. I am working through the testing here (see above) and will get to it this weekend

LinearParadox · 2024-08-15T18:45:21Z

If it passes checks, can we wait a bit to merge? I think the alist suffers on larger datasets with my internal testing, and I have some ideas on how to do it more efficiently after I slept on it. I'll try to push a few commits over the next 2-3 days if that's alright with you? Thank you so much for promptly responding to the push and providing the correction for the important!!

significantly speed up calculating scale factors

ncborcherding · 2024-08-15T23:13:36Z

Absolutely. It might make things easier if you run devtools::check() locally with the updated commit. Also if interested please feel free to add your name to the function (as @author) or the package DESCRIPTION.

Nick

LinearParadox · 2024-08-17T00:19:55Z

The tests seemed to have run locally! I ended up not parallelizing with bpparallel, mostly because it already runs fairly fast since most of it is now vectorized. If someone really wants to, it's fairly easy to just copy the function and change the mapply call to a bpmapply.

ncborcherding · 2024-08-20T19:11:55Z

Hey it looks like the R-CMD-check failed on this one due to mismatch in the new documentation. Would you mind updating the .Rd files on your end with devtools::document()? Then I think we will be golden.

Thanks so much for your help,
Nick

added documentation for groups. added ucell split data matrix since it is an internal function and can't be imported. minor code cleanup.

removed redundant split matrix function (already present). added matrix dependency to imports.

LinearParadox · 2024-08-20T22:56:18Z

Got it, should be done now. I benchmarked it on my dataset, and it seems like for around 50 gene sets on 9000 cells, runtime is down from around 10 mins to 3 seconds!

rolled back function renaming

ncborcherding · 2024-08-22T19:29:59Z

Looks good to me - thanks for all the help!!

LinearParadox and others added 3 commits August 13, 2024 14:00

Update performNormalization.R

685a12c

improved efficiency in the case of a provided scale factor, fixed for cases where scale factor is not provided

Update performNormalization.R

41d0798

Adding bpvec to importfrom

Update performNormalization.R

26e935c

significantly speed up calculating scale factors

rewrote normalize to be vectorized.

c4d3996

LinearParadox added 2 commits August 16, 2024 17:25

added contributor name

1245b1d

updated to pass chunking, fixing bug with scale factors

44e7faf

LinearParadox added 2 commits August 20, 2024 13:42

documentation. minor code cleanup

46541f2

added documentation for groups. added ucell split data matrix since it is an internal function and can't be imported. minor code cleanup.

minor refactor

cb97c24

removed redundant split matrix function (already present). added matrix dependency to imports.

Update densityEnrichment.R

a3116ff

rolled back function renaming

ncborcherding merged commit b5307d0 into BorchLab:master Aug 22, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update performNormalization.R #111

Update performNormalization.R #111

LinearParadox commented Aug 13, 2024

ncborcherding commented Aug 15, 2024

LinearParadox commented Aug 15, 2024

ncborcherding commented Aug 15, 2024

LinearParadox commented Aug 17, 2024

ncborcherding commented Aug 20, 2024

LinearParadox commented Aug 20, 2024

ncborcherding commented Aug 22, 2024

Update performNormalization.R #111

Update performNormalization.R #111

Conversation

LinearParadox commented Aug 13, 2024

ncborcherding commented Aug 15, 2024

LinearParadox commented Aug 15, 2024

ncborcherding commented Aug 15, 2024

LinearParadox commented Aug 17, 2024

ncborcherding commented Aug 20, 2024

LinearParadox commented Aug 20, 2024

ncborcherding commented Aug 22, 2024