Add typing to speech_to_text_finetune.py by Garvys · Pull Request #15326 · NVIDIA-NeMo/NeMo

Garvys · 2026-01-27T17:58:44Z

What does this PR do?

This PR adds typing to the functions used in speech_to_text_finetune.py in order to make the code easier to navigate with an IDE.
Collection: [ASR]

Changelog

Added typing to speech_to_text_finetune.py

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

Improvements

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

Related to # (issue)

Signed-off-by: Alexandre Caulier <alexandre.caulier.a@gmail.com>

Garvys · 2026-01-27T18:03:57Z

Very quick PR if you have a bit of time @nithinraok
thanks a lot

Copilot

Pull request overview

This PR adds explicit type annotations to helper functions in speech_to_text_finetune.py, in line with its stated goal of making the code easier to navigate with an IDE and easier to maintain.

Changes:

- Added DictConfig import from omegaconf
- Added type annotations to get_base_model, check_vocabulary, update_tokenizer, and setup_dataloaders functions in speech_to_text_finetune.py
- Added partial type annotation to add_global_adapter_cfg function in train_asr_adapter.py

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
examples/asr/speech_to_text_finetune.py	Added type annotations (pl.Trainer, DictConfig, ASRModel) to four helper functions, with DictConfig import added to support the changes
examples/asr/asr_adapters/train_asr_adapter.py	Added partial type annotation (ASRModel) to the add_global_adapter_cfg function's model parameter


Copilot · 2026-01-27T18:56:13Z

examples/asr/speech_to_text_finetune.py



-def update_tokenizer(asr_model, tokenizer_dir, tokenizer_type):
+def update_tokenizer(asr_model: ASRModel, tokenizer_dir, tokenizer_type) -> ASRModel:


The type annotations for the tokenizer_dir and tokenizer_type parameters are missing. Based on the change_vocabulary method signature in the ASRModel classes (e.g., rnnt_bpe_models.py lines 340-344), these should be typed as:

tokenizer_dir: Union[str, DictConfig] (can be a directory path string or a DictConfig for 'agg' tokenizer type)

tokenizer_type: str

This would also require adding Union to the imports from the typing module.
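A minimal sketch of the fully annotated signature suggested above. To keep it runnable without NeMo or omegaconf installed, `DictConfig` and `ASRModel` here are stand-in classes (assumptions, not the real `omegaconf.DictConfig` or NeMo `ASRModel`), and the function body is a hypothetical reduction that only forwards to `change_vocabulary`; the actual body in speech_to_text_finetune.py is not shown in this thread.

```python
from typing import Optional, Union


class DictConfig(dict):
    """Stand-in for omegaconf.DictConfig so the sketch runs with the stdlib only."""


class ASRModel:
    """Stand-in for NeMo's ASRModel; it only records the last vocabulary change."""

    def __init__(self) -> None:
        self.tokenizer_dir: Optional[Union[str, DictConfig]] = None
        self.tokenizer_type: Optional[str] = None

    def change_vocabulary(
        self, new_tokenizer_dir: Union[str, DictConfig], new_tokenizer_type: str
    ) -> None:
        # Assumed shape of the method referenced in the review comment.
        self.tokenizer_dir = new_tokenizer_dir
        self.tokenizer_type = new_tokenizer_type


def update_tokenizer(
    asr_model: ASRModel,
    tokenizer_dir: Union[str, DictConfig],  # path string, or a config for 'agg' tokenizers
    tokenizer_type: str,
) -> ASRModel:
    # Hypothetical body: forward the arguments and return the same model.
    asr_model.change_vocabulary(
        new_tokenizer_dir=tokenizer_dir, new_tokenizer_type=tokenizer_type
    )
    return asr_model


model = update_tokenizer(ASRModel(), "/path/to/tokenizer", "bpe")
```

With all three parameters annotated, an IDE or type checker can flag a call that passes, for example, an integer as tokenizer_type.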

Copilot · 2026-01-27T18:56:13Z

examples/asr/asr_adapters/train_asr_adapter.py



-def add_global_adapter_cfg(model, global_adapter_cfg):
+def add_global_adapter_cfg(model: ASRModel, global_adapter_cfg):


The type annotation for the global_adapter_cfg parameter is missing. Based on the function body (lines 131-135), this parameter can be a dataclass, DictConfig, or dict. The appropriate type annotation would be Union[DictConfig, dict], since the function checks for these types explicitly. This would require ensuring that Union is imported from the typing module.
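The comment above describes a parameter that may arrive as a dataclass, a DictConfig, or a dict. The following sketch shows one way such a value could be normalized under the suggested `Union[DictConfig, dict]` annotation. It is not the code from train_asr_adapter.py: `DictConfig` is a stdlib-only stand-in, and `normalize_adapter_cfg` and `ExampleAdapterCfg` are hypothetical names introduced for illustration.

```python
from dataclasses import asdict, dataclass, is_dataclass
from typing import Union


class DictConfig(dict):
    """Stand-in for omegaconf.DictConfig (assumption: keeps the sketch stdlib-only)."""


def normalize_adapter_cfg(global_adapter_cfg: Union[DictConfig, dict]) -> DictConfig:
    """Hypothetical helper: coerce the accepted input kinds to DictConfig.

    Dataclass instances are also handled at runtime, although the
    Union[DictConfig, dict] annotation does not express that case.
    """
    # Dataclass instance (not the class itself): convert to a plain dict first.
    if is_dataclass(global_adapter_cfg) and not isinstance(global_adapter_cfg, type):
        global_adapter_cfg = asdict(global_adapter_cfg)
    # Plain dicts are wrapped; an existing DictConfig is returned unchanged.
    if not isinstance(global_adapter_cfg, DictConfig):
        global_adapter_cfg = DictConfig(global_adapter_cfg)
    return global_adapter_cfg


@dataclass
class ExampleAdapterCfg:
    # Hypothetical field, for illustration only.
    dim: int = 32


from_dict = normalize_adapter_cfg({"dim": 64})
from_dataclass = normalize_adapter_cfg(ExampleAdapterCfg())
```

Because dataclass instances share no common base class, covering them in the annotation as well would need something broader, such as `Any`; the sketch keeps the narrower annotation proposed in the review.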

Signed-off-by: Alexandre Caulier <alexandre.caulier.a@gmail.com>

nithinraok

Thanks, LGTM!

github-actions bot added the ASR label Jan 27, 2026

Garvys added 2 commits January 27, 2026 18:59

Fix typing

0ce400e

Signed-off-by: Alexandre Caulier <alexandre.caulier.a@gmail.com>

Add typing

6a525eb

Signed-off-by: Alexandre Caulier <alexandre.caulier.a@gmail.com>

Garvys force-pushed the typing-fixes branch from 8cd5c1f to 6a525eb January 27, 2026 17:59

Merge branch 'main' into typing-fixes

3c911ce

Garvys marked this pull request as ready for review January 27, 2026 18:02

github-actions bot added the community-request label Jan 27, 2026

nithinraok requested a review from Copilot January 27, 2026 18:49

Copilot started reviewing on behalf of nithinraok January 27, 2026 18:50

Copilot AI reviewed Jan 27, 2026


Garvys added 2 commits January 27, 2026 20:04

Add types following copilot review

17dd4ba

Signed-off-by: Alexandre Caulier <alexandre.caulier.a@gmail.com>

isort

92bf18d

Signed-off-by: Alexandre Caulier <alexandre.caulier.a@gmail.com>

nithinraok approved these changes Jan 27, 2026


nithinraok enabled auto-merge (squash) January 27, 2026 19:12

nithinraok added the Run CICD label Jan 27, 2026

nithinraok merged commit 09616b9 into NVIDIA-NeMo:main Jan 27, 2026
54 checks passed

nithinraok merged 5 commits into NVIDIA-NeMo:main from Blynt-AI:typing-fixes

3 participants


