Skip to content

Conversation

@ruchaa-apte
Copy link
Contributor

What does this PR do ?

This PR adds Part 2 of the tutorial, which covers the evaluation phase.

Usage

python prepare_dataset.py --datasets [mmlu, gpqa, gpqa_diamond, all]
python deploy_nemo_model.py --checkpoint_path <checkpoint_path> --dataset <dataset> --output_prefix <output_prefix>
python evaluate_model.py --input_file <input_file> --output_file <output_file> --model_name <model_name>

Before your PR is "Ready for review"

Pre checks:

  • [Y] Make sure you read and followed Contributor guidelines
  • [N] Did you write any new necessary tests?
  • [Y] Did you add or update any necessary documentation?
  • [N] Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • [Y] New Feature
  • [N] Bugfix
  • [N] Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

@ruchaa-apte
Copy link
Contributor Author

cc - @Maghoumi , @aasthajh

problem['Question'], problem['Choice 1'], problem['Choice 2'], problem['Choice 3'], problem['Choice 4']
)

response = get_response(prompt)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I noticed this loop takes a very long time to go through, and GPU utilization is pretty low.

Is this a legit triton server, with support for multiple parallel queries?

If so, we could easily parallelize this with multiple threads and significantly improve both runtime and GPU utilization.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hey @Maghoumi I checked with NeMo eng team and they mentioned that this cannot be parallelized, but there is another PR they are trying to raise which addresses this concern :(

ruchaa-apte and others added 2 commits June 11, 2025 09:23
Signed-off-by: Rucha Apte <[email protected]>
@ko3n1g ko3n1g added Run CICD and removed Run CICD labels Jun 12, 2025
@ko3n1g ko3n1g added Run CICD and removed Run CICD labels Jun 12, 2025
@suiyoubi suiyoubi merged commit 4517c34 into NVIDIA-NeMo:main Jun 12, 2025
57 checks passed
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Jul 23, 2025
* reasoning model evaluation mmlu gpqa

Signed-off-by: Rucha Apte <[email protected]>

* Apply isort and black reformatting

Signed-off-by: ruchaa-apte <[email protected]>

* Addressing PR Comments

Signed-off-by: Rucha Apte <[email protected]>

* Apply isort and black reformatting

Signed-off-by: ruchaa-apte <[email protected]>

* Add license

Signed-off-by: Rucha Apte <[email protected]>

---------

Signed-off-by: Rucha Apte <[email protected]>
Signed-off-by: ruchaa-apte <[email protected]>
Co-authored-by: ruchaa-apte <[email protected]>
Signed-off-by: Amir Hussein <[email protected]>
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Aug 5, 2025
* reasoning model evaluation mmlu gpqa

Signed-off-by: Rucha Apte <[email protected]>

* Apply isort and black reformatting

Signed-off-by: ruchaa-apte <[email protected]>

* Addressing PR Comments

Signed-off-by: Rucha Apte <[email protected]>

* Apply isort and black reformatting

Signed-off-by: ruchaa-apte <[email protected]>

* Add license

Signed-off-by: Rucha Apte <[email protected]>

---------

Signed-off-by: Rucha Apte <[email protected]>
Signed-off-by: ruchaa-apte <[email protected]>
Co-authored-by: ruchaa-apte <[email protected]>
Signed-off-by: Amir Hussein <[email protected]>
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Aug 5, 2025
* reasoning model evaluation mmlu gpqa

Signed-off-by: Rucha Apte <[email protected]>

* Apply isort and black reformatting

Signed-off-by: ruchaa-apte <[email protected]>

* Addressing PR Comments

Signed-off-by: Rucha Apte <[email protected]>

* Apply isort and black reformatting

Signed-off-by: ruchaa-apte <[email protected]>

* Add license

Signed-off-by: Rucha Apte <[email protected]>

---------

Signed-off-by: Rucha Apte <[email protected]>
Signed-off-by: ruchaa-apte <[email protected]>
Co-authored-by: ruchaa-apte <[email protected]>
Signed-off-by: Amir Hussein <[email protected]>
nasretdinovr pushed a commit to nasretdinovr/NeMo that referenced this pull request Aug 8, 2025
* reasoning model evaluation mmlu gpqa

Signed-off-by: Rucha Apte <[email protected]>

* Apply isort and black reformatting

Signed-off-by: ruchaa-apte <[email protected]>

* Addressing PR Comments

Signed-off-by: Rucha Apte <[email protected]>

* Apply isort and black reformatting

Signed-off-by: ruchaa-apte <[email protected]>

* Add license

Signed-off-by: Rucha Apte <[email protected]>

---------

Signed-off-by: Rucha Apte <[email protected]>
Signed-off-by: ruchaa-apte <[email protected]>
Co-authored-by: ruchaa-apte <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants