
jpromeror

This PR includes a series of commits intended to reduce the computational time when running the algorithm in doublet mode. It is mostly intended for high-definition assays (a large number of spots), but is usable with all other methods.

SpaceXR_SpeedUp

Main changes:

1. Change parallelization approach

  • Use foreach + doParallel for the multicore implementation. This avoids launching multiple R sessions, and we also included a progress bar for a cleaner UI (a minimal sketch is included after this list).

2. Speed up gather_results

  • This was the main bottleneck in the algorithm. We fully vectorized the function to remove unnecessary loops, and added a progress bar as well (a vectorization sketch is included after this list).

3. General speed up

  • Modified existing functions to improve overall performance.

4. Add MIN_OBS as parameter to create.RCTD

  • Exposes MIN_OBS as a parameter so the algorithm can be run in specific scenarios. This allows more control and customized runs, but the user should be aware of the drawbacks (e.g. sampling noise). The original default value is kept (a usage example is included after this list).
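
For illustration, here is a minimal sketch of the foreach + doParallel pattern described in item 1. The process_bead() helper and the chunked progress bar are assumptions made for the sake of the example, not the exact code in this PR:

library(foreach)
library(doParallel)

run_doublet_parallel <- function(beads, process_bead, max_cores = 4) {
  registerDoParallel(cores = max_cores)  # fork-based workers on Unix; no separate R sessions are launched
  message("Multicore enabled using ", max_cores, " cores")

  # process the spots in chunks so a progress bar can be updated from the main session
  chunks  <- split(seq_along(beads), ceiling(seq_along(beads) / 100))
  pb      <- txtProgressBar(min = 0, max = length(chunks), style = 3)
  results <- vector("list", length(beads))

  for (k in seq_along(chunks)) {
    idx <- chunks[[k]]
    results[idx] <- foreach(i = idx) %dopar% process_bead(beads[[i]])  # parallel work per chunk
    setTxtProgressBar(pb, k)
  }
  close(pb)
  stopImplicitCluster()
  results
}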
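
To illustrate the kind of loop removal described in item 2, the sketch below contrasts a row-by-row construction of the results data frame with a vectorized one. The field names mirror the snippet discussed later in this thread; this is a sketch, not the package's actual implementation:

# slow: grows the data frame one row at a time inside a loop
gather_results_loop <- function(results, spot_levels) {
  results_df <- NULL
  for (X in results) {
    results_df <- rbind(results_df,
                        data.frame(spot_class  = factor(X$spot_class, levels = spot_levels),
                                   first_type  = X$first_type,
                                   second_type = X$second_type))
  }
  results_df
}

# fast: one sapply per column, a single data.frame call, no loop
gather_results_vectorized <- function(results, spot_levels) {
  data.frame(spot_class  = factor(sapply(results, `[[`, "spot_class"), levels = spot_levels),
             first_type  = sapply(results, `[[`, "first_type"),
             second_type = sapply(results, `[[`, "second_type"))
}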
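
Finally, an illustrative call using the new MIN_OBS argument of create.RCTD (item 4). The puck and reference objects are assumed to already exist, and the MIN_OBS value shown is only an example; the package default is unchanged:

library(spacexr)

# puck: SpatialRNA object; reference: Reference object (both assumed to exist)
myRCTD <- create.RCTD(spatialRNA = puck, reference = reference,
                      max_cores = 8, MIN_OBS = 3)  # value shown is illustrative
myRCTD <- run.RCTD(myRCTD, doublet_mode = "doublet")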

To do list:

  • Adapt the new parallelization approach to the other modes (full & multi)
  • Improve screen messages for easier progress tracking


results_df <- data.frame(spot_class = factor(sapply(results, function(X) { return(X$spot_class) }), levels = spot_levels),
                         first_type = sapply(results, function(X) { return(X$first_type) }),
                         scond_type = sapply(results, function(X) { return(X$second_type) }),


I think there is a typo here: 'scond_type' should be 'second_type'.

jpromeror (Author)


Thanks for catching that!


dpaysan commented Apr 25, 2025

I have tried running the branch, but in my case the function choose_sigma_c.R now runs substantially slower: with the version from dmcable, the 8 epochs on my Visium HD data complete within one hour using 28 cores, whereas with the updated version proposed in this pull request not even one epoch is processed within that time. Could that be due to the fact that no cluster is created with makeCluster? I also see very little CPU usage in general, with most threads in the S state and not running.


jpromeror commented Apr 25, 2025

Hi @dpaysan! It is a bit difficult to identify the issue without any code. Here are some suggestions to see if we can get it to work as intended:

  1. We only updated the algorithm when running in "doublet mode". Can you confirm you are using this mode?

  2. We changed the parallelization approach and no longer use makeCluster; instead we call registerDoParallel(cores = max_cores). If your multicore session is enabled, you should see a message like "Multicore enabled using <max_cores> cores". Can you confirm this is the case? A quick check is sketched below.
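
For example, a quick (generic, not part of this PR) way to confirm that the doParallel backend is actually registered is to query foreach after calling registerDoParallel:

library(doParallel)

registerDoParallel(cores = 28)
foreach::getDoParName()     # name of the registered backend
foreach::getDoParWorkers()  # number of registered workers; should match the cores requested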
