Skip to content

Conversation

@tbartley94
Copy link
Member

@tbartley94 tbartley94 commented Dec 9, 2024

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Before your PR is "Ready for review"

Pre checks:

  • Have you signed your commits? Use git commit -s to sign.
  • Do all unittests finish successfully before sending PR?
    1. pytest or (if your machine does not have GPU) pytest --cpu from the root folder (given you marked your test cases accordingly @pytest.mark.run_only_on('CPU')).
    2. Sparrowhawk tests bash tools/text_processing_deployment/export_grammars.sh --MODE=test ...
  • If you are adding a new feature: Have you added test cases for both pytest and Sparrowhawk here.
  • Have you added __init__.py for every folder and subfolder, including data folder which has .TSV files?
  • Have you followed codeQL results and removed unused variables and imports (report is at the bottom of the PR in github review box) ?
  • Have you added the correct license header Copyright (c) 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved. to all newly added Python files?
  • If you copied nemo_text_processing/text_normalization/en/graph_utils.py your header's second line should be Copyright 2015 and onwards Google, Inc.. See an example here.
  • Remove import guards (try import: ... except: ...) if not already done.
  • If you added a new language or a new feature please update the NeMo documentation (lives in different repo).
  • Have you added your language support to tools/text_processing_deployment/pynini_export.py.

PR Type:

  • New Feature
  • Bugfix
  • Documentation
  • Test

If you haven't finished some of the above items you can still open "Draft" PR.

Signed-off-by: tbartley94 <tbartley@nvidia.com>
Signed-off-by: tbartley94 <tbartley@nvidia.com>
@tbartley94 tbartley94 requested a review from mgrafu December 11, 2024 18:07
CONTRIBUTING.md Outdated

Naively, one may be tempted to simply include the property string `gender: "masc"` and check for this string during the verbalization phase. **This is not advised.** While the NeMo-Text-Processing library itself will permit any custom string in the tagger, Sparrowhawk limits permissible strings, and will fail with custom property strings. Given the performance loss in not providing Sparrowhawk support, we cannot integrate new graphs that cause Sparrowhawk failure. As such, tagged properties should be limited to Sparrowhawk supported strings.

For all classes, Sparrowhawk support the `morphosyntactic_features` property, and it is recommended to default to this property for tagging additional features. For example:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

supports?

Signed-off-by: tbartley94 <tbartley@nvidia.com>
@tbartley94 tbartley94 merged commit 942b26a into NVIDIA:main Dec 11, 2024
4 of 5 checks passed
ngachchi pushed a commit to ngachchi/NeMo-text-processing that referenced this pull request Jun 23, 2025
* contributing update

Signed-off-by: tbartley94 <tbartley@nvidia.com>

* adding edits

Signed-off-by: tbartley94 <tbartley@nvidia.com>

* spelling

Signed-off-by: tbartley94 <tbartley@nvidia.com>

---------

Signed-off-by: tbartley94 <tbartley@nvidia.com>
Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com>
FredHaa pushed a commit to FredHaa/NeMo-text-processing that referenced this pull request Aug 15, 2025
* contributing update

Signed-off-by: tbartley94 <tbartley@nvidia.com>

* adding edits

Signed-off-by: tbartley94 <tbartley@nvidia.com>

* spelling

Signed-off-by: tbartley94 <tbartley@nvidia.com>

---------

Signed-off-by: tbartley94 <tbartley@nvidia.com>
Rajanv307 pushed a commit to RajanPutty/NeMo-text-processing that referenced this pull request Jan 6, 2026
* contributing update

Signed-off-by: tbartley94 <tbartley@nvidia.com>

* adding edits

Signed-off-by: tbartley94 <tbartley@nvidia.com>

* spelling

Signed-off-by: tbartley94 <tbartley@nvidia.com>

---------

Signed-off-by: tbartley94 <tbartley@nvidia.com>
Signed-off-by: RajanPutty <rputty@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants