-
Notifications
You must be signed in to change notification settings - Fork 5
Milestone
Description
Our Cells Ranger tasks spend considerable amount of time copying the reference transcriptome over NFS.
This is mitigated in the bulk RNA-Seq pipeline by using shared memory and unloading the segment only if no other jobs are using it.
This is not possible with Cell Ranger as calling STAR is an internal implementation detail.
Thus, a way of mitigating the networking cost would be to copy the reference locally. Care must be taken to coordinate copies since this is a shared resource.
- add a wrapper script that rewrites the --transcriptome argument to refer to a local file
- copy the reference to a local scratch directory
- add a lockfile to coordinate the copy
- use os.execv to delegate the process to Cell Ranger with the file locked in shared mode
- make this optional and make the scratch directory configurable
Metadata
Metadata
Assignees
Labels
No labels