add CTC batched beam search #13337
Thanks a lot! Approved. 🚀
* add ctc beam decoding
* add utils
* first working
* working cuda graphs
* fix bugs with cudagraohs
* working
* small fix
* minor fix
* add logging
* add print
* to log sum exp
* back to max score
* fix bug in cudagraphs, save before refactor
* rm log10
* rm prints
* add reallocation
* rm logprobs from state
* rm nexts from state
* rm prev lm states
* small clean up
* clean up cuda graphs
* cudagraph working
* clean up torch working
* Apply isort and black reformatting
* rm files
* save
* add flatten
* Apply isort and black reformatting
* clean up
* clean up
* add timestamps
* rm file
* Apply isort and black reformatting
* rename file
* add batched beam tests
* Apply isort and black reformatting
* add tests
* Apply isort and black reformatting
* changed return type
* clean up
* Apply isort and black reformatting
* minor changes
* clean up
* clean up
* renamed variables
* changed is_tdt to model_type
* unified batched beam hyps
* Apply isort and black reformatting
* clean up
* clean up
* clean up
* clean up
* clean up
* clean up
* clean up
* Update cuda_program_name (Co-authored-by: Vladimir Bataev)
* clean up and and commments
* clean up and small fixes
* Apply isort and black reformatting
* fix
* fix tests
* added check on model type
* minor change
* rm repetitions LM scoring
* add enum model type
* add enum model type
* Apply isort and black reformatting
* fix lm repetitions for cudahraphs
* Apply isort and black reformatting
* clean up
* clean up
---------
Signed-off-by: lilithgrigoryan
Co-authored-by: lilithgrigoryan
Co-authored-by: oliver könig
Co-authored-by: Vladimir Bataev
What does this PR do?
Batched beam search for CTC with NGPU-LM (n-gram LM on GPU).
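For readers unfamiliar with the underlying algorithm, here is a minimal pure-Python sketch of textbook CTC prefix beam search for a single utterance. It is not the batched GPU implementation added by this PR: the function names, the per-prefix (blank, non-blank) score layout, and the use of log-sum-exp to merge paths are all assumptions made for illustration, and no LM is involved.

```python
import math
from collections import defaultdict

NEG_INF = float("-inf")


def logsumexp(*xs):
    """Numerically stable log(sum(exp(x))) that tolerates -inf inputs."""
    m = max(xs)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(x - m) for x in xs))


def ctc_prefix_beam_search(log_probs, beam_size=4, blank=0):
    """log_probs: list of frames, each a list of per-token log-probabilities.

    Returns (best_token_sequence, log_score). Illustrative only.
    """
    # Each prefix keeps two scores: paths ending in blank / ending in non-blank.
    beams = {(): (0.0, NEG_INF)}
    for frame in log_probs:
        nxt = defaultdict(lambda: (NEG_INF, NEG_INF))
        for prefix, (p_b, p_nb) in beams.items():
            total = logsumexp(p_b, p_nb)
            for token, lp in enumerate(frame):
                if token == blank:
                    # Blank keeps the prefix unchanged.
                    b, nb = nxt[prefix]
                    nxt[prefix] = (logsumexp(b, total + lp), nb)
                elif prefix and token == prefix[-1]:
                    # Repeat without a separating blank collapses into the same prefix.
                    b, nb = nxt[prefix]
                    nxt[prefix] = (b, logsumexp(nb, p_nb + lp))
                    # Repeat after a blank starts a new occurrence of the token.
                    ext = prefix + (token,)
                    b, nb = nxt[ext]
                    nxt[ext] = (b, logsumexp(nb, p_b + lp))
                else:
                    ext = prefix + (token,)
                    b, nb = nxt[ext]
                    nxt[ext] = (b, logsumexp(nb, total + lp))
        ranked = sorted(nxt.items(), key=lambda kv: logsumexp(*kv[1]), reverse=True)
        beams = dict(ranked[:beam_size])
    best_prefix, best_scores = max(beams.items(), key=lambda kv: logsumexp(*kv[1]))
    return list(best_prefix), logsumexp(*best_scores)


# Two frames, vocabulary {0: blank, 1: "a"}: greedy decoding picks blank twice,
# but the summed probability of all paths that collapse to "a" is higher.
frames = [[math.log(0.6), math.log(0.4)]] * 2
tokens, score = ctc_prefix_beam_search(frames)
```

In this toy case greedy decoding would output the empty sequence, while the beam search returns the single token 1 because it sums over all alignments of that prefix.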
Results for nvidia/parakeet-ctc-1.1b on SLURP (out-of-domain):
*LM weight - optimal on SLURP dev set with step 0.1
Batch size 32, high fp32 matmul precision, sorted manifest, A5000 GPU.
6-gram LM is built on SLURP train texts.
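The LM-weight selection described above can be sketched as a simple grid search. This is only an illustration: `sweep_lm_weight`, the `evaluate_wer` callback, and the 0.0 to 1.0 search range are hypothetical (the description states only the step of 0.1), and `toy_dev_wer` stands in for a real dev-set evaluation.

```python
def sweep_lm_weight(evaluate_wer, start=0.0, stop=1.0, step=0.1):
    """Return (best_weight, best_wer) over an inclusive grid of LM weights.

    evaluate_wer: callable mapping an LM weight to dev-set WER (hypothetical).
    """
    n_steps = int(round((stop - start) / step))
    best_weight, best_wer = None, float("inf")
    for i in range(n_steps + 1):
        # Round to avoid float drift such as 0.30000000000000004.
        weight = round(start + i * step, 10)
        wer = evaluate_wer(weight)
        if wer < best_wer:
            best_weight, best_wer = weight, wer
    return best_weight, best_wer


def toy_dev_wer(weight):
    """Stand-in for decoding the dev set and computing WER; minimum at 0.3."""
    return (weight - 0.3) ** 2 + 0.1


best_weight, best_wer = sweep_lm_weight(toy_dev_wer)
```

In practice `evaluate_wer` would run decoding on the dev set with the given weight, so each grid point costs one full decoding pass.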
Collection: [ASR]
Changelog
Usage
Usage without LM
Usage with LM
Step 1: Build LM for ASR Model (tokenizer-dependent) on domain texts
python nemo/scripts/asr_language_modeling/ngram_lm/train_kenlm.py \
    nemo_model_file="nvidia/parakeet-ctc-1.1b" \
    train_paths=["<train_manifest>"] \
    kenlm_bin_path=$KENLM_BIN_PATH \
    kenlm_model_file=parakeet-ctc-1.1b_lm-o6.arpa \
    ngram_length=6 \
    preserve_arpa=true \
    save_nemo=true
Step 2: Run Decoding with LM
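As a rough illustration of what decoding with an n-gram LM adds, the sketch below performs one shallow-fusion expansion step: each candidate extension is scored as the hypothesis score plus the acoustic log-probability plus `lm_weight` times the LM log-probability, and the top `beam_size` candidates are kept. Everything here is an assumption for illustration (`expand_beam`, `toy_lm_score`, the bigram table); it ignores CTC blank and repeat handling and is not the NeMo API or the NGPU-LM implementation.

```python
import heapq
import math

# Hypothetical bigram log-probabilities keyed by (previous_token, token).
BIGRAM = {
    (None, 0): math.log(0.5), (None, 1): math.log(0.5),
    (0, 0): math.log(0.1), (0, 1): math.log(0.9),
    (1, 0): math.log(0.5), (1, 1): math.log(0.5),
}


def toy_lm_score(tokens, token):
    """Log-probability of `token` given the last token of `tokens` (toy LM)."""
    prev = tokens[-1] if tokens else None
    return BIGRAM[(prev, token)]


def expand_beam(hyps, am_log_probs, lm_score, lm_weight=0.3, beam_size=2):
    """One fusion step. hyps: list of (tokens, score); returns the same shape."""
    candidates = []
    for tokens, score in hyps:
        for token, am_lp in enumerate(am_log_probs):
            fused = score + am_lp + lm_weight * lm_score(tokens, token)
            candidates.append((fused, tokens + (token,)))
    top = heapq.nlargest(beam_size, candidates, key=lambda c: c[0])
    return [(tokens, score) for score, tokens in top]


hyps = [((0,), 0.0)]
am = [math.log(0.55), math.log(0.45)]
without_lm = expand_beam(hyps, am, toy_lm_score, lm_weight=0.0, beam_size=1)
with_lm = expand_beam(hyps, am, toy_lm_score, lm_weight=1.0, beam_size=1)
```

With the LM weight at zero the acoustically preferred token 0 wins; with a weight of 1.0 the LM's strong preference for token 1 after token 0 flips the choice.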
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items, you can still open a "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines list specific people who can review PRs in various areas.
Additional Information