Best practices for using CellBox on different datasets

For external users who want to use CellBox on their own dataset, what is the best practice for training the model? How many models, differing only by the random seed (`--working_index`), should be trained before the collection of models achieves statistical power? This question follows from the Network Interpretation part of the Methods section in the original CellBox paper, where 1000 models were trained for downstream analysis. CellBox and its ODE solver are susceptible to suboptimal weight initialization: changing the random seed (`--working_index`) while keeping all other configs and arguments the same can lead to very different results. Therefore, should new users with a new dataset train only one model, or multiple models with different random seeds, to get the best performance?
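One possible way to make the "how many models" question concrete is to check how much the ensemble-averaged interaction weights still move as more seeds are added. The sketch below is only an illustration under stated assumptions, not the procedure from the CellBox paper: it assumes the learned interaction matrices from each seed have already been loaded into a NumPy array of shape `(n_models, n_nodes, n_nodes)`, and it uses synthetic data in place of real CellBox outputs. The function names `ensemble_stability` and `standard_error` are hypothetical helpers, not part of CellBox.

```python
import numpy as np


def ensemble_stability(weights, step=10):
    """Track how much the ensemble mean changes as models are added.

    weights: array of shape (n_models, n_nodes, n_nodes), one interaction
    matrix per random seed (assumed to be loaded by the user beforehand).
    Returns a list of (k, max_abs_change), where max_abs_change is the
    largest absolute difference between the mean over the first k models
    and the mean over the first k - step models.
    """
    weights = np.asarray(weights, dtype=float)
    n_models = weights.shape[0]
    results = []
    prev_mean = None
    for k in range(step, n_models + 1, step):
        mean_k = weights[:k].mean(axis=0)
        if prev_mean is not None:
            change = float(np.abs(mean_k - prev_mean).max())
            results.append((k, change))
        prev_mean = mean_k
    return results


def standard_error(weights):
    """Per-interaction standard error of the mean across models."""
    weights = np.asarray(weights, dtype=float)
    return weights.std(axis=0, ddof=1) / np.sqrt(weights.shape[0])


# Synthetic stand-in for real results: 200 "models" on a 5-node network,
# each a noisy copy of one underlying matrix. Replace with real weights.
rng = np.random.default_rng(0)
true_w = rng.normal(size=(5, 5))
simulated = true_w + rng.normal(scale=0.5, size=(200, 5, 5))

stability = ensemble_stability(simulated, step=20)
sem = standard_error(simulated)

for k, change in stability:
    print(f"models={k:4d}  max change in mean weight={change:.4f}")
print(f"largest standard error across interactions: {sem.max():.4f}")
```

If the change in the mean keeps shrinking and the standard errors are small relative to the weights of interest, adding more seeds is unlikely to alter the downstream interpretation much. What counts as "small enough" depends on the dataset and on the analysis being run, so any threshold here would be a judgment call.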
