@andrusenkoau andrusenkoau commented Jul 2, 2025

Important

The Update branch button must only be pressed on very rare occasions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do?

Implements the GPU-accelerated phrase-boosting method (GPU-PB) in shallow-fusion mode for AED decoding (e.g., Canary), built on the NGPU-LM infrastructure. The method supports both greedy (beam=1) and beam search decoding. GPU-PB can be used together with NGPU-LM in the same decoding run.
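For readers new to shallow-fusion phrase boosting, here is a minimal CPU-only sketch of the idea, not the GPU implementation from this PR. It assumes a plain prefix tree over token ids, a flat per-token reward named `context_score`, and a simplified backoff that returns to the root. The names `BoostingTrie` and `greedy_decode` are hypothetical, and `depth_scaling` is deliberately left out because its exact definition is not given here.

```python
import math
from typing import Dict, List, Sequence


class BoostingTrie:
    """Toy prefix tree over token-id phrases (illustrative, not the NeMo class)."""

    def __init__(self, phrases: Sequence[Sequence[int]], context_score: float = 1.0):
        self.context_score = context_score
        # children[state] maps token id -> next state; state 0 is the root.
        self.children: List[Dict[int, int]] = [{}]
        for phrase in phrases:
            state = 0
            for token in phrase:
                if token not in self.children[state]:
                    self.children.append({})
                    self.children[state][token] = len(self.children) - 1
                state = self.children[state][token]

    def advance(self, state: int, token: int) -> int:
        """Follow the arc for `token`; otherwise restart from the root (simplified backoff)."""
        if token in self.children[state]:
            return self.children[state][token]
        return self.children[0].get(token, 0)

    def boost(self, state: int, token: int) -> float:
        """Reward a token that continues the current phrase prefix or starts a new phrase."""
        if token in self.children[state] or token in self.children[0]:
            return self.context_score
        return 0.0


def greedy_decode(step_log_probs: Sequence[Sequence[float]], trie: BoostingTrie, alpha: float) -> List[int]:
    """Greedy decoding with shallow fusion: fused score = log_prob + alpha * boost."""
    state, output = 0, []
    for log_probs in step_log_probs:
        fused = [lp + alpha * trie.boost(state, tok) for tok, lp in enumerate(log_probs)]
        best = max(range(len(fused)), key=fused.__getitem__)
        output.append(best)
        state = trie.advance(state, best)
    return output


# Two decoding steps over a 4-token vocabulary; the boosted phrase is the token sequence [1, 2].
steps = [[math.log(p) for p in row] for row in ([0.5, 0.4, 0.05, 0.05], [0.1, 0.1, 0.3, 0.5])]
trie = BoostingTrie([[1, 2]], context_score=1.0)
print(greedy_decode(steps, trie, alpha=0.0))  # decoding without boosting
print(greedy_decode(steps, trie, alpha=1.0))  # decoding with boosting
```

In this sketch `alpha` plays the role that `boosting_tree_alpha` plays in the usage commands: it scales how strongly the tree scores are added to the model's log-probabilities.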

Collection: [ASR]

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

Step 1 (Optional): Build a boosting tree for the specific ASR model
You can skip this step and pass a context file or a list of key phrases directly to speech_to_text_eval.py (Step 2).

python scripts/asr_context_biasing/build_gpu_boosting_tree.py \
        asr_model_path=${ASR_NEMO_MODEL_FILE} \
        key_phrases_file=${CONTEXT_BIASING_LIST} \
        save_to=${PATH_TO_SAVE_BTREE} \
        context_score=${CONTEXT_SCORE} \
        depth_scaling=${DEPTH_SCALING} \
        use_triton=True

Step 2: Run AED decoding with boosting tree

python examples/asr/speech_to_text_eval.py \
        model_path=${MODEL_NAME} \
        dataset_manifest=${EVAL_MANIFEST} \
        batch_size=${BATCH_SIZE} \
        output_filename=${OUT_MANIFEST} \
        multitask_decoding.strategy="beam" \
        multitask_decoding.beam.beam_size=${BEAM_SIZE} \
        multitask_decoding.beam.boosting_tree.key_phrases_file=${CONTEXT_BIASING_LIST} \
        multitask_decoding.beam.boosting_tree.context_score=1.0 \
        multitask_decoding.beam.boosting_tree.depth_scaling=1.0 \
        multitask_decoding.beam.boosting_tree_alpha=${BT_ALPHA} \
        gt_lang_attr_name="target_lang" \
        gt_text_attr_name="text" \
        +prompt.pnc=no \
        +prompt.task=asr \
        +prompt.source_lang=en \
        +prompt.target_lang=en
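When tuning `boosting_tree_alpha`, it can be convenient to generate the Step 2 command programmatically. The sketch below only assembles command lines from the overrides shown above and does not run anything. The helper names `build_eval_command` and `sweep_commands`, the default `batch_size`, and the per-alpha output file naming are assumptions made for illustration.

```python
import shlex
from typing import List, Sequence


def build_eval_command(
    model_path: str,
    eval_manifest: str,
    out_manifest: str,
    key_phrases_file: str,
    beam_size: int,
    bt_alpha: float,
    batch_size: int = 16,  # arbitrary default chosen for this sketch
) -> List[str]:
    """Assemble the Step 2 command; the overrides are copied from the usage text."""
    return [
        "python",
        "examples/asr/speech_to_text_eval.py",
        f"model_path={model_path}",
        f"dataset_manifest={eval_manifest}",
        f"batch_size={batch_size}",
        f"output_filename={out_manifest}",
        "multitask_decoding.strategy=beam",
        f"multitask_decoding.beam.beam_size={beam_size}",
        f"multitask_decoding.beam.boosting_tree.key_phrases_file={key_phrases_file}",
        "multitask_decoding.beam.boosting_tree.context_score=1.0",
        "multitask_decoding.beam.boosting_tree.depth_scaling=1.0",
        f"multitask_decoding.beam.boosting_tree_alpha={bt_alpha}",
        "gt_lang_attr_name=target_lang",
        "gt_text_attr_name=text",
        "+prompt.pnc=no",
        "+prompt.task=asr",
        "+prompt.source_lang=en",
        "+prompt.target_lang=en",
    ]


def sweep_commands(alphas: Sequence[float], out_prefix: str, **kwargs) -> List[str]:
    """Return one shell-quoted command per alpha, each writing to its own manifest."""
    return [
        shlex.join(build_eval_command(out_manifest=f"{out_prefix}_alpha{a}.json", bt_alpha=a, **kwargs))
        for a in alphas
    ]


for line in sweep_commands(
    [0.5, 1.0],
    out_prefix="decoded",
    model_path="model.nemo",
    eval_manifest="eval.json",
    key_phrases_file="phrases.txt",
    beam_size=4,
):
    print(line)
```

Each printed line can be pasted into a shell or passed to a job scheduler; writing a separate output manifest per alpha keeps the results from overwriting each other.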

Step 3: Evaluate F-score for context phrases

python scripts/asr_context_biasing/compute_key_words_fscore.py \
        --input_manifest=${DECODING_MANIFEST} \
        --key_words_file=${CONTEXT_PHRASES_LIST}
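To make the metric concrete, here is a simplified sketch of a key-phrase F-score. It is not the logic of compute_key_words_fscore.py: it counts whole-word phrase matches per utterance instead of aligning reference and hypothesis, and it assumes the manifest is JSON lines with `text` (reference) and `pred_text` (hypothesis) fields. The function names are hypothetical.

```python
import json
import re
from typing import Iterable, List, Sequence, Tuple


def count_phrase(phrase: str, text: str) -> int:
    """Count whole-word, case-insensitive occurrences of `phrase` in `text`."""
    pattern = r"(?<!\w)" + re.escape(phrase.lower()) + r"(?!\w)"
    return len(re.findall(pattern, text.lower()))


def key_phrase_fscore(pairs: Iterable[Tuple[str, str]], phrases: Sequence[str]) -> Tuple[float, float, float]:
    """Return (precision, recall, F-score) over (reference, hypothesis) pairs."""
    tp = fp = fn = 0
    for ref, hyp in pairs:
        for phrase in phrases:
            n_ref, n_hyp = count_phrase(phrase, ref), count_phrase(phrase, hyp)
            hit = min(n_ref, n_hyp)
            tp += hit          # phrase present in both reference and hypothesis
            fn += n_ref - hit  # phrase in the reference but missed by the model
            fp += n_hyp - hit  # phrase produced by the model but absent from the reference
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    fscore = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, fscore


def load_pairs(manifest_path: str) -> List[Tuple[str, str]]:
    """Read (text, pred_text) pairs from a JSON-lines manifest (field names are assumed)."""
    with open(manifest_path, encoding="utf-8") as f:
        return [(row["text"], row["pred_text"]) for row in map(json.loads, f) if row]


example_pairs = [
    ("play nemo canary now", "play nemo canary now"),  # phrase recognized
    ("open nvidia riva", "open in video river"),       # phrase missed
    ("hello world", "hello nvidia riva"),              # phrase hallucinated
]
print(key_phrase_fscore(example_pairs, ["nemo canary", "nvidia riva"]))
```

Recall here reflects how many reference key phrases were recognized, while precision penalizes phrases that boosting inserted where they were not spoken, which is the usual failure mode when the boosting weight is set too high.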

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI, remove the label and add it again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items, you can still open a "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines list specific people who can review PRs in various areas.

Additional Information

  • Related to # (issue)

Signed-off-by: andrusenkoau <[email protected]>
Signed-off-by: andrusenkoau <[email protected]>
Signed-off-by: andrusenkoau <[email protected]>
Signed-off-by: andrusenkoau <[email protected]>
Signed-off-by: andrusenkoau <[email protected]>
@github-actions github-actions bot added the ASR label Jul 2, 2025
andrusenkoau and others added 2 commits July 2, 2025 02:40
Signed-off-by: andrusenkoau <[email protected]>
andrusenkoau and others added 4 commits July 2, 2025 02:59
Signed-off-by: andrusenkoau <[email protected]>
Signed-off-by: andrusenkoau <[email protected]>
Signed-off-by: andrusenkoau <[email protected]>
@andrusenkoau andrusenkoau marked this pull request as ready for review July 2, 2025 10:49
@andrusenkoau andrusenkoau requested review from artbataev and Copilot July 2, 2025 10:49
@ko3n1g ko3n1g added Run CICD and removed Run CICD labels Jul 11, 2025
andrusenkoau and others added 2 commits July 12, 2025 00:51
Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>
Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>
@ko3n1g ko3n1g added Run CICD and removed Run CICD labels Jul 11, 2025

@artbataev artbataev left a comment

Thanks a lot! Great work!

@andrusenkoau andrusenkoau merged commit 93a77b1 into NVIDIA-NeMo:main Jul 12, 2025
130 checks passed
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Jul 23, 2025
…o#14108)

* add initial scripts

Signed-off-by: andrusenkoau <[email protected]>

* add boosting tree construction

Signed-off-by: andrusenkoau <[email protected]>

* add pb support to rnnt greedy decoding for python impl only

Signed-off-by: andrusenkoau <[email protected]>

* first step for the integration of PB for rnnt cuda decoding

Signed-off-by: andrusenkoau <[email protected]>

* some fixes

Signed-off-by: andrusenkoau <[email protected]>

* revert changes for rnnt decoding

Signed-off-by: andrusenkoau <[email protected]>

* add initial step for pb for aed model

Signed-off-by: andrusenkoau <[email protected]>

* fix a bug with fusion models integration for aed beam decoding

Signed-off-by: andrusenkoau <[email protected]>

* partial fix for eos score increasing after detection of context phrase

Signed-off-by: andrusenkoau <[email protected]>

* fix node score in the context graph

Signed-off-by: andrusenkoau <[email protected]>

* fix backoff weight for end node

Signed-off-by: andrusenkoau <[email protected]>

* add dummy boosting tree

Signed-off-by: andrusenkoau <[email protected]>

* add tests for the boosting tree

Signed-off-by: andrusenkoau <[email protected]>

* add more tests

Signed-off-by: andrusenkoau <[email protected]>

* minor fixes

Signed-off-by: andrusenkoau <[email protected]>

* Apply isort and black reformatting

Signed-off-by: andrusenkoau <[email protected]>

* minor fixes

Signed-off-by: andrusenkoau <[email protected]>

* Apply isort and black reformatting

Signed-off-by: andrusenkoau <[email protected]>

* minor fixes

Signed-off-by: andrusenkoau <[email protected]>

* fix not closed file

Signed-off-by: andrusenkoau <[email protected]>

* minor fix

Signed-off-by: andrusenkoau <[email protected]>

* Apply isort and black reformatting

Signed-off-by: andrusenkoau <[email protected]>

* Update tests/collections/asr/test_boosting_tree.py

Co-authored-by: Copilot <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>

* Update tests/collections/asr/decoding/test_multi_task_decoding.py

Co-authored-by: Copilot <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>

* Update scripts/asr_context_biasing/build_gpu_boosting_tree.py

Co-authored-by: Copilot <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>

* Update scripts/asr_context_biasing/compute_key_words_fscore.py

Co-authored-by: Copilot <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>

* add asr model path + name

Signed-off-by: andrusenkoau <[email protected]>

* add bt config

Signed-off-by: andrusenkoau <[email protected]>

* add loading boosting tree from config

Signed-off-by: andrusenkoau <[email protected]>

* add a new test for test_boosting_tree_model_from_config

Signed-off-by: andrusenkoau <[email protected]>

* Apply isort and black reformatting

Signed-off-by: andrusenkoau <[email protected]>

* minor fix

Signed-off-by: andrusenkoau <[email protected]>

* pr fixes

Signed-off-by: andrusenkoau <[email protected]>

* Apply isort and black reformatting

Signed-off-by: andrusenkoau <[email protected]>

* minor fixes

Signed-off-by: andrusenkoau <[email protected]>

* minor fix

Signed-off-by: andrusenkoau <[email protected]>

* Update scripts/asr_context_biasing/build_gpu_boosting_tree.py

Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>

* Update scripts/asr_context_biasing/build_gpu_boosting_tree.py

Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>

* Apply isort and black reformatting

Signed-off-by: andrusenkoau <[email protected]>

---------

Signed-off-by: andrusenkoau <[email protected]>
Signed-off-by: andrusenkoau <[email protected]>
Signed-off-by: Andrei Andrusenko <[email protected]>
Co-authored-by: andrusenkoau <[email protected]>
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Charlie Truong <[email protected]>
Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: Amir Hussein <[email protected]>
monica-sekoyan pushed a commit that referenced this pull request Aug 4, 2025
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Aug 5, 2025
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Aug 5, 2025
nasretdinovr pushed a commit to nasretdinovr/NeMo that referenced this pull request Aug 8, 2025
guyueh1 pushed a commit to guyueh1/NeMo that referenced this pull request Aug 25, 2025