Skip to content

Conversation

juliusberner
Copy link
Contributor

@juliusberner juliusberner commented Jun 24, 2025

PhysicsNeMo Pull Request

Description

  • Diffusion models: consolidate ApexGroupNorm and GroupNorm in models/diffusion/layers.py with a factory get_group_norm that can be used to instantiate either one of them. this ensures that the state_dict remains consistent with previous versions.
  • Log missing and unexpected keys when loading checkpoints.
  • In SongUNet-based diffusion models, add persistent=False for deterministic, non-learnable positional embeddings.

Closes #1001 .

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • The CHANGELOG.md is up to date with these changes.
  • An issue is linked to this pull request.

Signed-off-by: Julius Berner <jberner@nvidia.com>
@CharlelieLrt CharlelieLrt self-requested a review June 24, 2025 21:05
@CharlelieLrt CharlelieLrt added bug Something isn't working 3 - Ready for Review Ready for review by team labels Jun 24, 2025
Julius Berner added 2 commits July 9, 2025 01:13
Signed-off-by: Julius Berner <jberner@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt
Copy link
Collaborator

Note: need to add tests (see this thread)

@CharlelieLrt CharlelieLrt added ! - Release PRs or Issues releating to a release 4 - In Review Currently Under Review and removed 3 - Ready for Review Ready for review by team labels Aug 1, 2025
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt CharlelieLrt self-requested a review August 11, 2025 18:49
@CharlelieLrt
Copy link
Collaborator

/blossom-ci

Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt
Copy link
Collaborator

/blossom-ci

Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt
Copy link
Collaborator

/blossom-ci

Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt
Copy link
Collaborator

/blossom-ci

Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt
Copy link
Collaborator

/blossom-ci

Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt
Copy link
Collaborator

/blossom-ci

@prem-krishnan
Copy link
Collaborator

@CharlelieLrt Can we associate this task with an epic?

Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt
Copy link
Collaborator

/blossom-ci

@CharlelieLrt
Copy link
Collaborator

/blossom-ci

@CharlelieLrt CharlelieLrt merged commit fb4140c into NVIDIA:main Aug 13, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
4 - In Review Currently Under Review bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

🐛[BUG]: GroupNorm creates unused parameters when use_apex_gn=True
5 participants