Skip to content

Add hi_en Code Switched#415

Merged
mgrafu merged 4 commits into
NVIDIA:staging/hi_en_itn_codeswitchedfrom
RajanPutty:hi_en_itn_codeswitched
May 27, 2026
Merged

Add hi_en Code Switched#415
mgrafu merged 4 commits into
NVIDIA:staging/hi_en_itn_codeswitchedfrom
RajanPutty:hi_en_itn_codeswitched

Conversation

@RajanPutty
Copy link
Copy Markdown
Contributor

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Before your PR is "Ready for review"

Pre checks:

  • Have you signed your commits? Use git commit -s to sign.
  • Do all unittests finish successfully before sending PR?
    1. pytest or (if your machine does not have GPU) pytest --cpu from the root folder (given you marked your test cases accordingly @pytest.mark.run_only_on('CPU')).
    2. Sparrowhawk tests bash tools/text_processing_deployment/export_grammars.sh --MODE=test ...
  • If you are adding a new feature: Have you added test cases for both pytest and Sparrowhawk here.
  • Have you added __init__.py for every folder and subfolder, including data folder which has .TSV files?
  • Have you followed codeQL results and removed unused variables and imports (report is at the bottom of the PR in github review box) ?
  • Have you added the correct license header Copyright (c) 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved. to all newly added Python files?
  • If you copied nemo_text_processing/text_normalization/en/graph_utils.py your header's second line should be Copyright 2015 and onwards Google, Inc.. See an example here.
  • Remove import guards (try import: ... except: ...) if not already done.
  • If you added a new language or a new feature please update the NeMo documentation (lives in different repo).
  • Have you added your language support to tools/text_processing_deployment/pynini_export.py.

PR Type:

  • New Feature
  • Bugfix
  • Documentation
  • Test

If you haven't finished some of the above items you can still open "Draft" PR.

RajanPutty and others added 2 commits April 17, 2026 19:47
Signed-off-by: RajanPutty <rputty@nvidia.com>
@RajanPutty RajanPutty marked this pull request as ready for review April 17, 2026 14:56
Comment thread nemo_text_processing/inverse_text_normalization/hi_en/data/en_whitelist.tsv Outdated
Comment thread nemo_text_processing/inverse_text_normalization/hi_en/data/hi_whitelist.tsv Outdated
Comment thread nemo_text_processing/inverse_text_normalization/run_evaluate.py
Comment thread tools/text_processing_deployment/pynini_export.py
Comment thread tools/text_processing_deployment/pynini_export.py
@github-actions
Copy link
Copy Markdown

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

@github-actions github-actions Bot added the Stale label May 12, 2026
@mgrafu mgrafu removed the Stale label May 12, 2026
@github-actions
Copy link
Copy Markdown

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

@github-actions github-actions Bot added the Stale label May 27, 2026
RajanPutty added a commit to RajanPutty/NeMo-text-processing that referenced this pull request May 27, 2026
…i_en tests, add hi_en CI

Signed-off-by: Rajan Putty <rputty@nvidia.com>
…i_en tests, add hi_en CI

Signed-off-by: Rajan Putty <rputty@nvidia.com>
@RajanPutty RajanPutty force-pushed the hi_en_itn_codeswitched branch from 94fc18e to 620b2f5 Compare May 27, 2026 17:32
@mgrafu mgrafu merged commit f629f59 into NVIDIA:staging/hi_en_itn_codeswitched May 27, 2026
2 checks passed
@mgrafu mgrafu mentioned this pull request May 27, 2026
14 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants