Copy the Cell Ranger reference locally #103

@arteymix

Our Cell Ranger tasks spend a considerable amount of time copying the reference transcriptome over NFS.

The bulk RNA-Seq pipeline mitigates this by using shared memory and unloading the segment only when no other job is using it.

This is not possible with Cell Ranger, because its call to STAR is an internal implementation detail.

One way to mitigate the network cost would be to copy the reference locally. Care must be taken to coordinate the copies, since the local copy is a shared resource.

  • add a wrapper script that rewrites the --transcriptome argument to refer to a local file
  • copy the reference to a local scratch directory
  • add a lockfile to coordinate the copy
  • use os.execv to delegate the process to Cell Ranger with the file locked in shared mode
  • make this optional and make the scratch directory configurable
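The steps above could be sketched roughly as follows. This is only a minimal sketch under assumptions, not a proposed final implementation: the environment variables `CELLRANGER_SCRATCH_DIR` and `CELLRANGER_BIN`, the function names, and the `.lock`/`.tmp` suffixes are all hypothetical. It uses `fcntl.flock`, so it is Unix-only, and the exclusive-to-shared lock conversion is not guaranteed to be atomic.

```python
import fcntl
import os
import shutil


def find_transcriptome(args):
    """Return the value passed to --transcriptome, or None if absent."""
    for i, arg in enumerate(args):
        if arg == "--transcriptome" and i + 1 < len(args):
            return args[i + 1]
        if arg.startswith("--transcriptome="):
            return arg.split("=", 1)[1]
    return None


def rewrite_transcriptome(args, local_ref):
    """Return a copy of args with the --transcriptome value set to local_ref."""
    out, i = [], 0
    while i < len(args):
        if args[i] == "--transcriptome" and i + 1 < len(args):
            out += ["--transcriptome", local_ref]
            i += 2
            continue
        if args[i].startswith("--transcriptome="):
            out.append("--transcriptome=" + local_ref)
        else:
            out.append(args[i])
        i += 1
    return out


def ensure_local_copy(remote_ref, scratch_dir):
    """Copy remote_ref into scratch_dir once; return (local path, locked fd)."""
    os.makedirs(scratch_dir, exist_ok=True)
    name = os.path.basename(os.path.normpath(remote_ref))
    local_ref = os.path.join(scratch_dir, name)
    lock_fd = os.open(local_ref + ".lock", os.O_RDWR | os.O_CREAT, 0o644)
    # Python opens descriptors as non-inheritable; the lock must survive execv.
    os.set_inheritable(lock_fd, True)
    # Exclusive lock: only one job per node performs the copy.
    fcntl.flock(lock_fd, fcntl.LOCK_EX)
    if not os.path.isdir(local_ref):
        tmp = local_ref + ".tmp"
        shutil.rmtree(tmp, ignore_errors=True)  # discard an interrupted copy
        shutil.copytree(remote_ref, tmp)
        os.rename(tmp, local_ref)  # publish the finished copy in one step
    # Convert to a shared lock, held for as long as Cell Ranger runs.
    fcntl.flock(lock_fd, fcntl.LOCK_SH)
    return local_ref, lock_fd


def main(argv):
    args = list(argv[1:])
    scratch_dir = os.environ.get("CELLRANGER_SCRATCH_DIR")  # hypothetical name
    remote_ref = find_transcriptome(args)
    if scratch_dir and remote_ref:  # optional: unset variable means no copy
        local_ref, _lock_fd = ensure_local_copy(remote_ref, scratch_dir)
        args = rewrite_transcriptome(args, local_ref)
    # CELLRANGER_BIN is a hypothetical override for the real executable.
    cellranger = os.environ.get("CELLRANGER_BIN") or shutil.which("cellranger")
    if cellranger is None:
        raise SystemExit("cellranger executable not found")
    # Replace this process; the inherited descriptor keeps the shared lock.
    os.execv(cellranger, [cellranger] + args)
```

A real wrapper script would end by calling `main(sys.argv)`. Because the shared lock stays held for the lifetime of the Cell Ranger process, a separate cleanup job could take an exclusive lock on the same lockfile before deleting the local copy, which would only succeed once no job is using it.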
