ci: Update result tracking with new golden format by rjodinchr · Pull Request #2706 · KhronosGroup/OpenCL-CTS

rjodinchr · 2026-05-27T08:57:19Z

The primary goal of this commit is to improve CI tracking by
introducing a new golden format that can differentiate test results
based on command-line arguments. To cleanly extract and pass these
arguments into the JSON result outputs, the command-line parsing
infrastructure across the CTS required a significant refactoring.

Key changes include:

Enhanced CI Tracking: Updates ci/compare_results.py,
ci/pocl/golden.json, and saveResultsToJson to include and evaluate
an args key. The golden JSON now uses a nested format mapping
specific argument strings (e.g., --wimpy -1) to their expected
results, allowing the CI to validate the same binary run under
different parameters.
Centralized Parsing Infrastructure: Introduces the ParseArgsFn
callback and runTestHarnessWithCheckAndParse. This offloads custom
argument parsing from individual test main() functions and safely
extracts the arguments used so they can be logged by the test harness.
Help Text Consolidation: Replaces fragmented printUsage()
functions with unified help string references populated directly by
the standard parsing callbacks.

The primary goal of this commit is to improve CI tracking by introducing a new golden format that can differentiate test results based on command-line arguments. To cleanly extract and pass these arguments into the JSON result outputs, the command-line parsing infrastructure across the CTS required a significant refactoring. Key changes include: * Enhanced CI Tracking: Updates `ci/compare_results.py`, `ci/pocl/golden.json`, and `saveResultsToJson` to include and evaluate an `args` key. The golden JSON now uses a nested format mapping specific argument strings (e.g., `--wimpy -1`) to their expected results, allowing the CI to validate the same binary run under different parameters. * Centralized Parsing Infrastructure: Introduces the `ParseArgsFn` callback and `runTestHarnessWithCheckAndParse`. This offloads custom argument parsing from individual test `main()` functions and safely extracts the arguments used so they can be logged by the test harness. * Help Text Consolidation: Replaces fragmented `printUsage()` functions with unified `help` string references populated directly by the standard parsing callbacks. [run-test: test_computeinfo] [run-test: test_bruteforce -1 -w] [run-test: test_cl_copy_images small_images --num-worker-threads 2 1D] [run-test: test_image_streams 1D --num-worker-threads 2 CL_R CL_FILTER_NEAREST]

rjodinchr · 2026-06-18T13:38:20Z

ref #2723

ahesham-arm · 2026-06-23T08:46:49Z

Hi @rjodinchr, is it possible to separate the work into multiple commits, e.g. one for each of the bullet points you listed in the description. There is a lot of value in your recent PRs but they can be exhuasting to review because of the volume of unrelated/non-core changes required.

rjodinchr · 2026-06-23T09:04:44Z

Hi @rjodinchr, is it possible to separate the work into multiple commits, e.g. one for each of the bullet points you listed in the description. There is a lot of value in your recent PRs but they can be exhuasting to review because of the volume of unrelated/non-core changes required.

I understand the difficulty of reviewing such a large PR.
It feels very complicated to break this one down into smaller pieces. Those commits would either not be able to compile or would add unused code.

I can suggest the following strategy for the review:

First, let's focus on the test_common/harness part.
Once we agree on that, each individual test makes use of the new harness changes. They are all independent but heavily depend on the harness changes.
We can have a look at the ci part at the end.

The harness which is the core of this PR should not be too big to start with:

4 files changed, 144 insertions(+), 72 deletions(-)

ahesham-arm · 2026-06-23T09:09:02Z

        else if (!strcmp(argv[i], "--wimpy") || !strcmp(argv[i], "-w"))
        {
            delArg++;
+            removed_args.push_back("--wimpy");


Why is this the only argument that gets pushed back as a hardcoded string and not argv[i]?

This ensures consistency, as we don't need both -w and --wimpy to be reported. It forces all reports to use a single format so that two identical runs aren't mistaken for being different just because they used different naming conventions.

ahesham-arm · 2026-06-23T09:11:01Z

+            if (!help)
+            {
+                help = true;
+                removed_args.push_back("--help");


And this one too I guess.

Same as for wimpy, even so I believe it is less of a problem with the help argument.

ahesham-arm · 2026-06-23T09:14:34Z

                                   cl_command_queue_properties queueProps,
                                   DeviceCheckFn deviceCheckFn);

+typedef test_status (*ParseArgsFn)(int &argc, const char *argv[],


Personal perference but I think using is easier to read for functions, i.e.

using ParseArgsFn = test_status (*)( int &argc, const char *argv[], std::vector<std::string> &removed_args, std::string &help_description );

I'm indifferent. clang-format should be trusted to handle repository requirements. If multiple formats are permitted, we should either configure clang-format to enforce a specific one or simply accept whatever clang-format allows.

ahesham-arm · 2026-06-23T09:23:07Z

+    {
+        log_info("\n");
+        log_info("**************************\n");
+        log_info("***    !! WARNING !!   ***\n");


Nit: this is different than the existing message. I don't mind it, just making sure this was intentional.

This is intentional. Some tests trigger an additional message when wimpy mode is enabled. I wanted to ensure that any tests not printing an additional message can rely on that message to explicitly signal the warning part that most other messages display.

rjodinchr force-pushed the ci branch 3 times, most recently from cf68937 to d773d37 Compare May 28, 2026 13:48

rjodinchr changed the title ~~ci: restructure golden.json to use nested command arguments~~ ci: Update result tracking with new golden format May 28, 2026

rjodinchr force-pushed the ci branch 2 times, most recently from fe486da to f6f47e4 Compare May 28, 2026 14:51

This was referenced May 28, 2026

math_brute_force: Refactor input generation & reduce default test size #2697

Draft

images: Remove global variables to allow for parallel execution #2696

Draft

rjodinchr added the focused review label May 28, 2026

rjodinchr force-pushed the ci branch 4 times, most recently from 8767292 to f6f47e4 Compare May 29, 2026 12:12

This was referenced Jun 2, 2026

conversions: Refactor tests to reduce dataset size #2711

Draft

half: Improve tests using special values migrated to harness #2718

Closed

integer-ops: Optimize tests to improve runtime #2719

Closed

select: Stop testing every integers as cmp value #2720

Draft

rjodinchr mentioned this pull request Jun 11, 2026

clCopyImage: improve performance by removing redundant copies #2721

Closed

rjodinchr force-pushed the ci branch from f6f47e4 to cb52cde Compare June 12, 2026 09:18

rjodinchr force-pushed the ci branch from cb52cde to 1473280 Compare June 12, 2026 09:39

ahesham-arm reviewed Jun 23, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: Update result tracking with new golden format#2706

ci: Update result tracking with new golden format#2706
rjodinchr wants to merge 1 commit into
KhronosGroup:mainfrom
rjodinchr:ci

rjodinchr commented May 27, 2026 •

edited

Loading

Uh oh!

rjodinchr commented Jun 18, 2026

Uh oh!

ahesham-arm commented Jun 23, 2026

Uh oh!

rjodinchr commented Jun 23, 2026

Uh oh!

ahesham-arm Jun 23, 2026

Uh oh!

rjodinchr Jun 23, 2026

Uh oh!

ahesham-arm Jun 23, 2026

Uh oh!

rjodinchr Jun 23, 2026

Uh oh!

ahesham-arm Jun 23, 2026

Uh oh!

rjodinchr Jun 23, 2026

Uh oh!

ahesham-arm Jun 23, 2026

Uh oh!

rjodinchr Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rjodinchr commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rjodinchr commented Jun 18, 2026

Uh oh!

ahesham-arm commented Jun 23, 2026

Uh oh!

rjodinchr commented Jun 23, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rjodinchr commented May 27, 2026 •

edited

Loading