[Bug] convert_sync_batchnorm missing 'training' attribute #1624

@collinmccarthy

Description

Prerequisite

Environment

(Issue obvious from source)

Reproduces the problem - code sample

(Issue obvious from source)

Reproduces the problem - command or script

(Issue obvious from source)

Reproduces the problem - error message

(Issue obvious from source)

Additional information

In torch/nn/modules/batchnorm.py, the SyncBatchNorm.convert_sync_batchnorm() method copies over the training attribute with module_output.training = module.training.

The mmengine version is missing this line, even though it is present in the revert_sync_batchnorm() method directly above it.

Without this, an NCCL timeout occurs when the BN layers are kept in eval mode for fine-tuning while the rest of the model is in training mode. I figured this out from a similar issue/solution here
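For reference, here is a minimal sketch of what the conversion looks like with the fix applied. This mirrors the structure of torch.nn.SyncBatchNorm.convert_sync_batchnorm rather than mmengine's exact code; the one line that matters for this issue is the copy of module.training:

```python
import torch
from torch import nn


def convert_sync_batchnorm(module, process_group=None):
    """Recursively replace BatchNorm layers with SyncBatchNorm.

    Sketch following torch.nn.SyncBatchNorm.convert_sync_batchnorm;
    not mmengine's actual implementation.
    """
    module_output = module
    if isinstance(module, nn.modules.batchnorm._BatchNorm):
        module_output = nn.SyncBatchNorm(
            module.num_features,
            module.eps,
            module.momentum,
            module.affine,
            module.track_running_stats,
            process_group,
        )
        if module.affine:
            with torch.no_grad():
                module_output.weight = module.weight
                module_output.bias = module.bias
        module_output.running_mean = module.running_mean
        module_output.running_var = module.running_var
        module_output.num_batches_tracked = module.num_batches_tracked
        # The missing line: preserve train/eval mode on the converted layer,
        # so a BN layer frozen in eval mode stays in eval mode.
        module_output.training = module.training
    for name, child in module.named_children():
        module_output.add_module(name, convert_sync_batchnorm(child, process_group))
    del module
    return module_output


# A BN layer frozen in eval mode keeps its mode after conversion.
bn = nn.BatchNorm2d(8).eval()
sync_bn = convert_sync_batchnorm(bn)
assert sync_bn.training is False
```

Without the copy, the converted SyncBatchNorm defaults to training=True, so ranks disagree about whether to run the cross-process statistics sync, which is what leads to the NCCL timeout.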

Labels: bug (Something isn't working)