Improve reliability of PyTorch test reporting #3723

Flamefire · 2025-05-20T11:06:12Z

(created using eb --new-pr)

This fixes 2 blocking issues:

Test counts can be off when unittest.subTest is used. E.g.

<?xml version="1.0"?>
<testsuites>
  <testsuite name="pytest" errors="0" failures="1" skipped="0" tests="2" time="13.590" timestamp="2025-05-08T02:18:36.745086" hostname="c68">
    <testcase classname="MiscTests" name="test_pytree_tree_leaves" time="13.493" file="dynamo/test_misc.py">
      <failure message="torch._dynamo.exc.Unsupported: 'skip function isclass in file [snip]">Traceback (most recent call last):
[snip]</failure>
      <system-out>inline_call [snip]
</system-out>
    </testcase>
  </testsuite>
</testsuites>

--> tests="2" is reported, but there is only 1 <testcase> element
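To illustrate the mismatch, here is a minimal sketch that compares the declared tests attribute with the number of <testcase> elements actually present. It uses only the standard library on a trimmed copy of the report above. The helper name count_mismatch is hypothetical and this is not the code from the easyblock.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the report quoted above (message and traceback shortened).
REPORT = """<?xml version="1.0"?>
<testsuites>
  <testsuite name="pytest" errors="0" failures="1" skipped="0" tests="2">
    <testcase classname="MiscTests" name="test_pytree_tree_leaves">
      <failure message="Unsupported">Traceback</failure>
    </testcase>
  </testsuite>
</testsuites>
"""


def count_mismatch(xml_text):
    """Return (declared, found) test counts for the first <testsuite>.

    Hypothetical helper: 'declared' is the tests attribute,
    'found' is the number of direct <testcase> children.
    """
    root = ET.fromstring(xml_text)
    suite = root.find("testsuite")
    declared = int(suite.get("tests"))
    found = len(suite.findall("testcase"))
    return declared, found


declared, found = count_mismatch(REPORT)
print(f"tests attribute: {declared}, <testcase> elements: {found}")
```

A strict equality check between the two numbers would reject this file even though it is a valid report.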

A single test might have multiple <skipped> elements:

    <testcase classname="TestNestedTensorOpInfoCPU" name="test_compile_backward_xlogy_cpu_float32" time="9.881" file="test_nestedtensor.py">
      <skipped type="pytest.skip" message="Skipped!">PyTorch/2.7.0/foss-2024a/pytorch-v2.7.0/test/test_nestedtensor.py:8798: Skipped!</skipped>
      <skipped type="pytest.skip" message="Skipped!">PyTorch/2.7.0/foss-2024a/pytorch-v2.7.0/test/test_nestedtensor.py:8798: Skipped!</skipped>
    </testcase>
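
A minimal sketch of why this matters for counting: tallying <skipped> elements gives 2 here, while the number of skipped tests is 1. The helper is_skipped is a hypothetical name, and the element text is shortened. One way to stay robust is to decide per <testcase> whether it was skipped instead of counting child elements.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the test case quoted above.
TESTCASE = """<testcase classname="TestNestedTensorOpInfoCPU" name="test_compile_backward_xlogy_cpu_float32">
  <skipped type="pytest.skip" message="Skipped!">first</skipped>
  <skipped type="pytest.skip" message="Skipped!">second</skipped>
</testcase>"""


def is_skipped(testcase):
    """Hypothetical helper: True if the test case has at least one <skipped> child."""
    return testcase.find("skipped") is not None


case = ET.fromstring(TESTCASE)
# Counting elements over-counts: one test, two <skipped> children.
num_skipped_elements = len(case.findall("skipped"))
# Counting per test case gives the number of skipped tests.
num_skipped_tests = 1 if is_skipped(case) else 0
print(num_skipped_elements, num_skipped_tests)
```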

Additionally, I added a try-except in parse_test_result_file to report the failing file on error. Please view the diff with whitespace changes ignored: apart from added comments, the only change to that function is the removal of the if len(test_cases) != num_tests: check.
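
The idea of reporting the failing file can be sketched as follows. This is an illustration under assumptions, not the actual change: parse_with_context is a hypothetical name, and the exception type and message wording are chosen for the example only.

```python
import xml.etree.ElementTree as ET


def parse_with_context(path):
    """Parse an XML test result file, naming the file if parsing fails.

    Hypothetical helper: wraps the parser so that the error message
    says which result file was broken.
    """
    try:
        return ET.parse(path).getroot()
    except (ET.ParseError, OSError) as err:
        # Re-raise with the file name, keeping the original error as the cause.
        raise ValueError(f"Failed to parse test result file {path}: {err}") from err
```

With many result files per test run, the file name in the message is what makes a bare parser error actionable.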

lexming · 2025-05-20T14:22:08Z

Thanks for the quick PR, testing it...

boegel · 2025-05-21T07:47:34Z

@Flamefire For which PyTorch versions is this "blocking"?

Flamefire · 2025-05-21T08:06:22Z

At least 2.6+, but IIRC I've seen it for 2.3 too on one occasion.

boegel · 2025-05-21T17:56:48Z

Test report by @boegel

Overview of tested easyconfigs (in order)

SUCCESS PyTorch-2.1.2-foss-2023a.eb

Build succeeded (with --ignore-test-failure) for 1 out of 1 (1 easyconfigs in total)
node3505.doduo.os - Linux RHEL 9.4, x86_64, AMD EPYC 7552 48-Core Processor (zen2), Python 3.9.18
See https://gist.github.com/boegel/5a64b169d935099ba7a1b7d2c7f1aab7 for a full test report.

lexming

LGTM

lexming · 2025-06-16T13:16:28Z

Merging, thanks @Flamefire !

Improve reliability of PyTorch test reporting

dc36a59

Flamefire mentioned this pull request May 20, 2025

{tools}[foss/2024a] PyTorch v2.6.0, parameterized v0.9.0, optree v0.14.1, ... easybuilders/easybuild-easyconfigs#22824

Merged

boegel added the bug fix label May 21, 2025

boegel added this to the next release (5.1.0) milestone May 21, 2025

boegel modified the milestones: next release (5.1.0), release after 5.1.0 May 22, 2025

lexming approved these changes Jun 16, 2025

lexming merged commit bacc8b3 into easybuilders:develop Jun 16, 2025
17 checks passed

Flamefire deleted the 20250520130609_new_pr_pytorch branch June 16, 2025 14:03
