@lilithgrigoryan lilithgrigoryan commented Apr 29, 2025

What does this PR do?

Adds batched beam search decoding for CTC models, with optional fusion of NGPU-LM (an n-gram LM that runs on the GPU).
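
For readers unfamiliar with the underlying algorithm, the following is a minimal, unbatched, pure-Python sketch of classic CTC prefix beam search with shallow LM fusion. It is not the batched GPU implementation from this PR: the function name, the `lm_score` callback, the log-sum-exp merging of paths, and the way `lm_weight` and `beta` are added once per emitted token are all assumptions made for illustration.

```python
import math
from collections import defaultdict

NEG_INF = float("-inf")


def log_add(a, b):
    """Numerically stable log(exp(a) + exp(b))."""
    if a == NEG_INF:
        return b
    if b == NEG_INF:
        return a
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))


def ctc_prefix_beam_search(log_probs, beam_size=4, blank=0,
                           lm_score=None, lm_weight=0.0, beta=0.0):
    """Toy CTC prefix beam search (hypothetical helper, not the NeMo API).

    log_probs: list of frames; each frame is a list of per-token log-probs.
    lm_score:  optional callable (prefix_tuple, token) -> LM log-prob.
    Returns (best_token_list, fused_log_score).
    """
    # prefix -> (log score of paths ending in blank, ... ending in non-blank)
    beams = {(): (0.0, NEG_INF)}
    for frame in log_probs:
        next_beams = defaultdict(lambda: (NEG_INF, NEG_INF))
        for prefix, (p_b, p_nb) in beams.items():
            p_total = log_add(p_b, p_nb)
            for token, lp in enumerate(frame):
                if token == blank:
                    # Blank keeps the prefix unchanged.
                    nb_b, nb_nb = next_beams[prefix]
                    next_beams[prefix] = (log_add(nb_b, p_total + lp), nb_nb)
                    continue
                new_prefix = prefix + (token,)
                # Assumed fusion: add beta and the weighted LM score
                # once for every newly emitted token.
                bonus = beta
                if lm_score is not None:
                    bonus += lm_weight * lm_score(prefix, token)
                if prefix and token == prefix[-1]:
                    # Repeated token without a blank collapses into the same prefix.
                    nb_b, nb_nb = next_beams[prefix]
                    next_beams[prefix] = (nb_b, log_add(nb_nb, p_nb + lp))
                    # A genuinely new copy of the token is only possible after a blank.
                    nb_b, nb_nb = next_beams[new_prefix]
                    next_beams[new_prefix] = (nb_b, log_add(nb_nb, p_b + lp + bonus))
                else:
                    nb_b, nb_nb = next_beams[new_prefix]
                    next_beams[new_prefix] = (nb_b, log_add(nb_nb, p_total + lp + bonus))
        # Keep only the best `beam_size` prefixes for the next frame.
        ranked = sorted(next_beams.items(), key=lambda kv: log_add(*kv[1]), reverse=True)
        beams = dict(ranked[:beam_size])
    best_prefix, (p_b, p_nb) = max(beams.items(), key=lambda kv: log_add(*kv[1]))
    return list(best_prefix), log_add(p_b, p_nb)
```

With two frames that each assign probability 0.4 to blank and 0.6 to token 1, the search sums the three alignments of the single-token hypothesis (0.36 + 0.24 + 0.24 = 0.84) instead of scoring only the best path.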

Results for nvidia/parakeet-ctc-1.1b on SLURP (out-of-domain):

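| Decoding Method | Beam Size | Beam Beta | LM Weight* | SLURP Test WER (%) ↓ | RTFx ↑ |
|-----------------|-----------|-----------|------------|----------------------|--------|
| greedy          | -         | -         | -          | 19.93                | 484    |
| beam_batch      | 8         | 1.0       | 0.6        | 15.84                | 439    |
| beam_batch      | 12        | 1.0       | 0.6        | 15.75                | 432    |
| beam_batch      | 16        | 1.0       | 0.6        | 15.72                | 427    |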

*LM weight: the value that gave the best WER on the SLURP dev set, searched with a step of 0.1.

Evaluation setup: batch size 32, high fp32 matmul precision, sorted manifest, A5000 GPU.
The 6-gram LM was built on the SLURP training texts.
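
To put the reported numbers in perspective, the short sketch below computes the relative change of each beam_batch row against the greedy baseline. The values are copied from the results table; the helper name `relative_change` is made up for this example.

```python
def relative_change(baseline, value):
    """Relative change of `value` with respect to `baseline`, in percent."""
    return 100.0 * (value - baseline) / baseline


# Numbers copied from the results table (greedy baseline and beam_batch rows).
greedy_wer, greedy_rtfx = 19.93, 484
beam_rows = {8: (15.84, 439), 12: (15.75, 432), 16: (15.72, 427)}

for beam_size, (wer, rtfx) in beam_rows.items():
    wer_delta = relative_change(greedy_wer, wer)
    rtfx_delta = relative_change(greedy_rtfx, rtfx)
    print(f"beam_size={beam_size}: WER {wer_delta:+.1f}% rel., RTFx {rtfx_delta:+.1f}% rel.")
```

For beam size 12 this works out to roughly a 21% relative WER reduction at the cost of about 10.7% lower RTFx.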

Collection: [ASR]

Usage

Usage without LM

python examples/asr/speech_to_text_eval.py \
   pretrained_name="nvidia/parakeet-ctc-1.1b" \
   dataset_manifest=<dataset_manifest>  \
   batch_size=32 \
   output_filename=decoded.jsonl \
   ctc_decoding.strategy="beam_batch" \
   ctc_decoding.beam.beam_size=12 \
   ctc_decoding.beam.beam_beta=0.3
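
The evaluation script reports WER itself; if you want to re-check the number from the output file, a sketch like the one below can help. It assumes that each line of `decoded.jsonl` is a JSON object with the reference under `text` and the hypothesis under `pred_text` (these key names are an assumption here, adjust them to your manifest), and it applies no text normalization.

```python
import json


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1]


def corpus_wer(lines, ref_key="text", hyp_key="pred_text"):
    """Corpus-level WER (%) over an iterable of JSON lines."""
    errors = words = 0
    for line in lines:
        if not line.strip():
            continue
        entry = json.loads(line)
        ref = entry[ref_key].split()
        hyp = entry[hyp_key].split()
        errors += edit_distance(ref, hyp)
        words += len(ref)
    if words == 0:
        raise ValueError("no reference words found")
    return 100.0 * errors / words
```

Since a file object iterates over its lines, `corpus_wer(open("decoded.jsonl"))` is enough to score the output of the command above.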

Usage with LM

Step 1: Build LM for ASR Model (tokenizer-dependent) on domain texts

python nemo/scripts/asr_language_modeling/ngram_lm/train_kenlm.py \
      nemo_model_file="nvidia/parakeet-ctc-1.1b" \
      train_paths=["<train_manifest>"] \
      kenlm_bin_path=$KENLM_BIN_PATH \
      kenlm_model_file=parakeet-ctc-1.1b_lm-o6.arpa \
      ngram_length=6 \
      preserve_arpa=true \
      save_nemo=true

Step 2: Run Decoding with LM

python examples/asr/speech_to_text_eval.py \
   pretrained_name="nvidia/parakeet-ctc-1.1b" \
   dataset_manifest=<dataset_manifest>  \
   batch_size=32 \
   output_filename=decoded.jsonl \
   ctc_decoding.strategy="beam_batch" \
   ctc_decoding.beam.ngram_lm_model="parakeet-ctc-1.1b_lm-o6.arpa.nemo" \
   ctc_decoding.beam.ngram_lm_alpha=0.4 \
   ctc_decoding.beam.beam_beta=0.4 \
   ctc_decoding.beam.beam_size=12
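
The LM weight in the results table was tuned on a dev set with a step of 0.1. A sweep of that kind can be scripted by generating one evaluation command per weight, as sketched below. The search range 0.0 to 1.0, the per-weight output file names, and the helper names are assumptions; only the overrides already shown in the command above are used.

```python
import shlex


def lm_weight_grid(start=0.0, stop=1.0, step=0.1):
    """Evenly spaced LM weights, rounded to avoid float drift (range is assumed)."""
    n = round((stop - start) / step)
    return [round(start + i * step, 10) for i in range(n + 1)]


def build_eval_command(lm_alpha, manifest, lm_path, beam_size=12, beam_beta=0.4):
    """Argument list for one decoding run with a given LM weight."""
    return [
        "python", "examples/asr/speech_to_text_eval.py",
        "pretrained_name=nvidia/parakeet-ctc-1.1b",
        f"dataset_manifest={manifest}",
        "batch_size=32",
        # Hypothetical naming scheme so runs do not overwrite each other.
        f"output_filename=decoded_alpha{lm_alpha:.1f}.jsonl",
        "ctc_decoding.strategy=beam_batch",
        f"ctc_decoding.beam.ngram_lm_model={lm_path}",
        f"ctc_decoding.beam.ngram_lm_alpha={lm_alpha:.1f}",
        f"ctc_decoding.beam.beam_beta={beam_beta}",
        f"ctc_decoding.beam.beam_size={beam_size}",
    ]


if __name__ == "__main__":
    # Print the commands; run them with subprocess or a job scheduler as needed.
    for alpha in lm_weight_grid():
        cmd = build_eval_command(alpha, "dev_manifest.json",
                                 "parakeet-ctc-1.1b_lm-o6.arpa.nemo")
        print(shlex.join(cmd))
```

Pick the weight with the lowest dev-set WER and reuse it unchanged for the test-set run.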

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI, remove the label and add it again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines list specific people who can review PRs in various areas.

Additional Information

  • Related to # (issue)

Signed-off-by: lilithgrigoryan <[email protected]>
@github-actions github-actions bot added the ASR label Apr 29, 2025
github-actions bot commented Jun 9, 2025

[🤖]: Hi @lilithgrigoryan 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully.

So it might be time to merge this PR or get some approvals.

//cc @chtruong814 @ko3n1g @pablo-garay @thomasdhc

artbataev previously approved these changes Jun 9, 2025
@artbataev artbataev left a comment

Thanks a lot! Approved. 🚀

@github-actions github-actions bot removed the Run CICD label Jun 9, 2025
@lilithgrigoryan lilithgrigoryan merged commit e0783ae into main Jun 10, 2025
1036 of 1051 checks passed
@lilithgrigoryan lilithgrigoryan deleted the lgrigoryan/ctc_beam_search_pr branch June 10, 2025 07:40
gdengk pushed a commit to gdengk/NeMo that referenced this pull request Jun 12, 2025
* add ctc beam decoding

Signed-off-by: lilithgrigoryan <[email protected]>

* add utils

Signed-off-by: lilithgrigoryan <[email protected]>

* first working

Signed-off-by: lilithgrigoryan <[email protected]>

* working cuda graphs

Signed-off-by: lilithgrigoryan <[email protected]>

* fix bugs with cudagraphs

Signed-off-by: lilithgrigoryan <[email protected]>

* working

Signed-off-by: lilithgrigoryan <[email protected]>

* small fix

Signed-off-by: lilithgrigoryan <[email protected]>

* minor fix

Signed-off-by: lilithgrigoryan <[email protected]>

* add logging

Signed-off-by: lilithgrigoryan <[email protected]>

* add print

Signed-off-by: lilithgrigoryan <[email protected]>

* to log sum exp

Signed-off-by: lilithgrigoryan <[email protected]>

* back to max score

Signed-off-by: lilithgrigoryan <[email protected]>

* fix bug in cudagraphs, save before refactor

Signed-off-by: lilithgrigoryan <[email protected]>

* rm log10

Signed-off-by: lilithgrigoryan <[email protected]>

* rm prints

Signed-off-by: lilithgrigoryan <[email protected]>

* add reallocation

Signed-off-by: lilithgrigoryan <[email protected]>

* rm logprobs from state

Signed-off-by: lilithgrigoryan <[email protected]>

* rm nexts from state

Signed-off-by: lilithgrigoryan <[email protected]>

* rm prev lm states

Signed-off-by: lilithgrigoryan <[email protected]>

* small clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up cuda graphs

Signed-off-by: lilithgrigoryan <[email protected]>

* cudagraph working

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up torch working

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* rm files

Signed-off-by: lilithgrigoryan <[email protected]>

* save

Signed-off-by: lilithgrigoryan <[email protected]>

* add flatten

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* add timestamps

Signed-off-by: lilithgrigoryan <[email protected]>

* rm file

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* rename file

Signed-off-by: lilithgrigoryan <[email protected]>

* add batched beam tests

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* add tests

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* changed return type

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* minor changes

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* renamed variables

Signed-off-by: lilithgrigoryan <[email protected]>

* changed is_tdt to model_type

Signed-off-by: lilithgrigoryan <[email protected]>

* unified batched beam hyps

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* Update cuda_program_name

Co-authored-by: Vladimir Bataev <[email protected]>
Signed-off-by: lilithgrigoryan <[email protected]>

* clean up and comments

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up and small fixes

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* fix

Signed-off-by: lilithgrigoryan <[email protected]>

* fix tests

Signed-off-by: lilithgrigoryan <[email protected]>

* added check on model type

Signed-off-by: lilithgrigoryan <[email protected]>

* minor change

Signed-off-by: lilithgrigoryan <[email protected]>

* rm repetitions LM scoring

Signed-off-by: lilithgrigoryan <[email protected]>

* add enum model type

Signed-off-by: lilithgrigoryan <[email protected]>

* add enum model type

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* fix lm repetitions for cudagraphs

Signed-off-by: lilithgrigoryan <[email protected]>

* Apply isort and black reformatting

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

* clean up

Signed-off-by: lilithgrigoryan <[email protected]>

---------

Signed-off-by: lilithgrigoryan <[email protected]>
Signed-off-by: lilithgrigoryan <[email protected]>
Signed-off-by: lilithgrigoryan <[email protected]>
Co-authored-by: lilithgrigoryan <[email protected]>
Co-authored-by: oliver könig <[email protected]>
Co-authored-by: Vladimir Bataev <[email protected]>
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Jul 23, 2025
Signed-off-by: Amir Hussein <[email protected]>
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Aug 5, 2025
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Aug 5, 2025
nasretdinovr pushed a commit to nasretdinovr/NeMo that referenced this pull request Aug 8, 2025