Skip to content

Memory and Time Issues with Xenium 5K samples #68

@atrull314

Description

@atrull314

Hi all,

We’ve been using BANKSY to perform analysis on Xenium samples. We’ve so far used it on two Xenium projects both of which used the standard ~300-400 gene panel, however, we’ve recently begun working with a project that contains 12 samples using the new 5K panel, and we’ve noticed a major increase in time and memory usage.

As background, we’ve normalized and integrated these samples and are running these through BANKSY with a number of combinations of lambdas, k values, pc values, and resolutions so we can determine the best values for this dataset. We’ve separated out the various BANKSY function calls into separate scripts to allow us to parallelize on the HPC we execute analyses on and to help determine where any memory/time bottlenecks are. We have found that executing clusterBanksy on this dataset for a single parameter combination has a peak memory at 512GB and the time to run this function can peak at 53 hours. This is causing a noticeable bottleneck in our workflow as a result, so we wanted to ask if there any plans on further optimization or if there were any additional suggestions on what we could do?

As some additional information, we are currently using the following 'flow' in order to execute Banksy:

computeBanksy() > runBanksyPCA() > RunHarmony() > runBanksyUMAP() > clusterBanksy()

And here's the code we are using to execute clusterBanksy (I'm happy to provide additional code snippets and any logging that may be informative). I'll also add that we are using SpatialExperiment objects to execute Banksy, and these have been converted from Seurat objects prior to starting the Banksy portion of the analysis.

spe_xenium_obj <- clusterBanksy(
    spe_xenium_obj,
    dimred = "BANKSY_harmony",
    resolution = opt$res,
    lambda = opt$lambda,
    ndims = opt$nPCs,
    seed = 1234,
    verbose = TRUE
)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions