[TTS] MagpieTTS: Implement Frechet Codec Distance metric + some minor inference bugfixes #15223

rfejgin · 2025-12-23T06:47:36Z

What does this PR do ?

Adds the Frechet Codec Distance metric and integrates it in MagpieTTS inference scripts. Also fixes some minor MagpieTTS inference bugs.

Collection: TTS

Changelog

The Frechet Distance (FD) is commonly used to evaluate generative models (e.g. Frechet Inception Distance, Frechet Audio Distance). In this PR we implements FD in the embedding space of a neural codec. This is a metric that measures how closely the distributions of real and generated codec frames match, at the single frame level.

Changes:

frechet_codec_distance.py: An implementation of FD in codec embedding space. Builds on TorchMetrics' FID implementation. We provide the audio codec as a custom feature extractor.
test_frechet_coec_distance.py: Unit test
Integration of the FCD in MagpieTTS inference scripts. If desired, FCD calculation can be disabled using the --disable_fcd command line argument to magpietts_inference.py
Inference bugfixes
- fix a logging statement that was reporting errors due to incorrect formatting syntax
- disable logging of thousands of messages during loading of the titanet_small speaker representation model. This was present in earlier versions of the inference scripts and appears to have been accidentally lost in recent refactorings
- Fix an issue where filewise metrics were not being filtered to a spcified subset as intended

PR Type:

New Feature
Bugfix
Documentation

nemo/collections/tts/modules/magpietts_inference/evaluate_generated_audio.py

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

Instead of taking a codec instance, accept a codec name: local path or HF/NGC name. This simplifies the metric's integration in calling code. Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

* address some CI linting issues * include a file that was missed in last commit Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

nemo/collections/tts/metrics/frechet_codec_distance.py

* Add (optional) saving of generated codes and FCD calcualtion to longform version of inference * Clean up how disabling FCD is done: make it an explicit part of EvaluationConfig Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

rfejgin · 2026-01-07T02:10:07Z

@subhankar-ghosh Could you please review just the latest commit in this PR? That part touches EvaluationConfig and longform inference. I mean the commit titled Integrate FCD in longform inference and rework --disable_fcd. Thanks!

subhankar-ghosh · 2026-01-07T03:46:01Z

nemo/collections/tts/modules/magpietts_inference/evaluate_generated_audio.py


-    if codec_model_path is not None:
+    if with_fcd:
        fcd_metric = FrechetCodecDistance(codec_name=codec_model_path).to(device)


I am not sure if from torchmetrics.image.fid import FrechetInceptionDistance is installed in NeMo container by default. If it is not then we might want to check for it's availability and based on that log a warning message if it is not installed.

subhankar-ghosh · 2026-01-07T04:02:01Z

nemo/collections/tts/modules/magpietts_inference/evaluate_generated_audio.py

        fcd_metric.reset()
    else:
-        fcd = 0.0
+        fcd = float('nan')


Be careful about setting the None value for metrics, check how it is formatted in def compute_mean_with_confidence_interval. We should have a uniform default value for the condition if <any_metric> is None: case.

subhankar-ghosh

Left a few comments. They are the points where I thought things might break just for you to double check. Otherwise LGTM. Make sure the tests pass.

github-actions bot added the TTS label Dec 23, 2025

github-advanced-security bot found potential problems Dec 23, 2025

View reviewed changes

nemo/collections/tts/modules/magpietts_inference/evaluate_generated_audio.py Fixed Show fixed Hide fixed

rfejgin marked this pull request as ready for review December 23, 2025 06:58

rfejgin marked this pull request as draft December 23, 2025 07:11

rfejgin added 5 commits December 22, 2025 23:15

Add metric: Freceht Distance in codec embedding space

db86b81

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

Frechet Codec Distance API change

c91dd16

Instead of taking a codec instance, accept a codec name: local path or HF/NGC name. This simplifies the metric's integration in calling code. Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

Integrate Frechet Codec Distance in inference scripts

85fcb09

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

Add a __init__.py package marker to test directory

14a9a27

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

Cleanup and add missing files

3fc5f37

* address some CI linting issues * include a file that was missed in last commit Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

rfejgin force-pushed the magpietts_frechet_codec_distance branch from 8d997ac to 3fc5f37 Compare December 23, 2025 07:15

rfejgin added the Run CICD label Dec 23, 2025

chtruong814 added Run CICD and removed Run CICD labels Dec 23, 2025

chtruong814 had a problem deploying to test December 23, 2025 07:18 — with GitHub Actions Error

Comments and cleanup

78d64ed

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>

chtruong814 added Run CICD and removed Run CICD labels Dec 23, 2025

rfejgin marked this pull request as ready for review December 23, 2025 18:27

Merge branch 'main' into magpietts_frechet_codec_distance

570a806

rfejgin requested a review from blisc December 23, 2025 18:28

chtruong814 added Run CICD and removed Run CICD labels Dec 23, 2025

rfejgin requested a review from subhankar-ghosh December 23, 2025 18:28

chtruong814 temporarily deployed to test December 23, 2025 18:29 — with GitHub Actions Inactive

blisc requested changes Dec 30, 2025

View reviewed changes

nemo/collections/tts/metrics/frechet_codec_distance.py Show resolved Hide resolved

Merge branch 'main' into magpietts_frechet_codec_distance

5effe85

chtruong814 added Run CICD and removed Run CICD labels Jan 6, 2026

chtruong814 temporarily deployed to test January 6, 2026 05:43 — with GitHub Actions Inactive

blisc previously approved these changes Jan 6, 2026

View reviewed changes

blisc enabled auto-merge (squash) January 6, 2026 16:12

rfejgin marked this pull request as draft January 7, 2026 00:31

auto-merge was automatically disabled January 7, 2026 00:31
Pull request was converted to draft

rfejgin dismissed blisc’s stale review via 187c24b January 7, 2026 02:08

chtruong814 removed the Run CICD label Jan 7, 2026

rfejgin requested a review from blisc January 7, 2026 02:08

chtruong814 added the Run CICD label Jan 7, 2026

chtruong814 temporarily deployed to test January 7, 2026 02:09 — with GitHub Actions Inactive

rfejgin marked this pull request as ready for review January 7, 2026 02:10

subhankar-ghosh reviewed Jan 7, 2026

View reviewed changes

subhankar-ghosh approved these changes Jan 7, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[TTS] MagpieTTS: Implement Frechet Codec Distance metric + some minor inference bugfixes #15223

[TTS] MagpieTTS: Implement Frechet Codec Distance metric + some minor inference bugfixes #15223

Uh oh!

rfejgin commented Dec 23, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

rfejgin commented Jan 7, 2026

Uh oh!

subhankar-ghosh Jan 7, 2026

Uh oh!

subhankar-ghosh Jan 7, 2026

Uh oh!

subhankar-ghosh left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[TTS] MagpieTTS: Implement Frechet Codec Distance metric + some minor inference bugfixes #15223

Are you sure you want to change the base?

[TTS] MagpieTTS: Implement Frechet Codec Distance metric + some minor inference bugfixes #15223

Uh oh!

Conversation

rfejgin commented Dec 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do ?

Changelog

Uh oh!

Uh oh!

Uh oh!

rfejgin commented Jan 7, 2026

Uh oh!

subhankar-ghosh Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

subhankar-ghosh Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

subhankar-ghosh left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rfejgin commented Dec 23, 2025 •

edited

Loading