[BUG] Fix RandomForest Builder Sampling by tarang-jain · Pull Request #7422 · rapidsai/cuml

tarang-jain · 2025-11-01T00:13:55Z

The initial value chosen for the mask is 0. As a result, the mask computed with SubtractLeft always marks feature 0 as "selected" even though it is not. Instead we set it to -1.

Failing tests that this PR adds to the xfail-list:

"sklearn.inspection.tests.test_permutation_importance::test_permutation_importance_correlated_feature_regression_pandas[0.5-1]"
"sklearn.inspection.tests.test_permutation_importance::test_permutation_importance_correlated_feature_regression_pandas[0.5-2]"
"sklearn.inspection.tests.test_permutation_importance::test_permutation_importance_correlated_feature_regression_pandas[1.0-1]"
"sklearn.inspection.tests.test_permutation_importance::test_permutation_importance_correlated_feature_regression_pandas[1.0-2]"

csadorf · 2025-11-03T15:48:25Z

Would it be possible to add a unit test that covers this?

csadorf · 2025-11-03T18:30:37Z

Can you investigate the test failures, please?

tarang-jain · 2025-11-03T20:48:25Z

@csadorf those failures are not related to this PR, it looks like some UMAP failures. Just merged upstream to see if they resolve on their own. As far as testing is concerned, I can potentially add a basic test on the C++ side that checks if every feature is sampled at least once using all the different sampling algorithms.

csadorf · 2025-11-03T23:20:39Z

As far as testing is concerned, I can potentially add a basic test on the C++ side that checks if every feature is sampled at least once using all the different sampling algorithms.

Whatever is appropriate, but we should make sure to prevent a future regression.

…ampling

…into fix-rf-sampling

csadorf · 2025-11-04T17:25:36Z

It looks like this PR is introducing a regression in permutation importance as indicated by the scikit-learn upstream tests. I am currently investigating the problem.

tarang-jain · 2025-11-04T17:33:29Z

There was also a problem in one of the SHAP tests, which had hardcoded values (as you had indicated earlier from the logs) -- that is fixed now.

csadorf

This patch appears correct to me, but we probably have a secondary sampling bias issue by setting excess items to n - 1, which is a valid index and thus is guaranteed to be included in the selection whenever we randomly drew less than k unique indices in the first sampling iteration. The probability of that is non-zero.

We should identify a clear MRE to demonstrate these sampling issues and expand our testing to ensure that this critical bug is covered to improve our confidence in correctness and prevent future regressions.

csadorf · 2025-11-04T22:58:51Z

+    // Use -1 as the initial value since it can't match any valid column index [0, n-1]
    BlockAdjacentDifferenceT(temp_storage.diff)
-      .SubtractLeft(items, mask, CustomDifference<IdxT>(), mask[0]);
+      .SubtractLeft(items, mask, CustomDifference<IdxT>(), IdxT(-1));


This appears correct to me. The previous implementation was comparing the first randomly selected column index against the initial value of mask[0] which is always zero. Outside the fact that comparing against a mask value makes absolutely no sense here, this also means it would never be selected, because the items array is sorted.

…into fix-rf-sampling

csadorf · 2025-11-05T21:01:15Z

Let's add the failing scikit-learn tests to the xfail list. We will remove them as we fix the wider problem in #7448 .

tarang-jain · 2025-11-06T17:35:58Z

This PR has been refactored to only address the issue wherein the first column (feature 0) was not being sampled at all. Other bugs are being tracked separately.

…into fix-rf-sampling

tarang-jain · 2025-11-07T17:37:32Z

/merge

fix SubtractLeft

bcba849

tarang-jain requested a review from a team as a code owner November 1, 2025 00:13

tarang-jain requested a review from vyasr November 1, 2025 00:13

github-actions Bot assigned tarang-jain Nov 1, 2025

github-actions Bot added the CUDA/C++ label Nov 1, 2025

tarang-jain added bug Something isn't working non-breaking Non-breaking change labels Nov 1, 2025

tarang-jain mentioned this pull request Nov 3, 2025

[FEA] Feature Importances for Random Forests #7275

Merged

Merge branch 'main' into fix-rf-sampling

155ef8e

Merge branch 'main' into fix-rf-sampling

961a8ba

tarang-jain added 2 commits November 3, 2025 17:52

fix shap test inputs

6a3dd1d

Merge branch 'main' of https://github.com/rapidsai/cuml into fix-rf-s…

e7d8dfe

…ampling

tarang-jain requested a review from a team as a code owner November 4, 2025 01:53

github-actions Bot added the Cython / Python Cython or Python issue label Nov 4, 2025

Merge branch 'fix-rf-sampling' of https://github.com/tarang-jain/cuml …

e4283b3

…into fix-rf-sampling

csadorf requested changes Nov 4, 2025

View reviewed changes

csadorf linked an issue Nov 5, 2025 that may be closed by this pull request

[BUG] RandomForest Builder Does Not Sample 0th Feature #7445

Closed

csadorf and others added 5 commits November 5, 2025 12:56

Add pytest that captures the undersampling for feature 0.

aeead10

Merge branch 'main' into fix-rf-sampling

56bce3a

handle n-1 bias

e80f3da

Merge branch 'fix-rf-sampling' of https://github.com/tarang-jain/cuml …

e6a0075

…into fix-rf-sampling

fix compilation

939136c

csadorf mentioned this pull request Nov 5, 2025

[BUG] RandomForest Builder Does Not Sample 0th Feature #7445

Closed

csadorf requested changes Nov 5, 2025

View reviewed changes

Comment thread cpp/src/decisiontree/batched-levelalgo/kernels/builder_kernels.cuh Outdated

csadorf mentioned this pull request Nov 5, 2025

Fix the RandomForest sampling bias #7449

Merged

tarang-jain and others added 3 commits November 6, 2025 09:14

rollback n-1 change

d9eed26

add failing tests to xfail-list

d5d96c7

Merge branch 'main' into fix-rf-sampling

eb9b9f4

tarang-jain added 2 commits November 6, 2025 10:45

rollback CustomDifference change

f9aa627

Merge branch 'fix-rf-sampling' of https://github.com/tarang-jain/cuml …

bb509c0

…into fix-rf-sampling

csadorf approved these changes Nov 6, 2025

View reviewed changes

tarang-jain and others added 2 commits November 6, 2025 13:54

correct instantiation for builder_kernel

3fce17f

Merge branch 'main' into fix-rf-sampling

6985f05

divyegala approved these changes Nov 6, 2025

View reviewed changes

Update xfail list

ea3fe82

rapids-bot Bot merged commit cc3ac08 into rapidsai:main Nov 7, 2025
106 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Fix RandomForest Builder Sampling#7422

[BUG] Fix RandomForest Builder Sampling#7422
rapids-bot[bot] merged 19 commits intorapidsai:mainfrom
tarang-jain:fix-rf-sampling

tarang-jain commented Nov 1, 2025 •

edited

Loading

Uh oh!

csadorf commented Nov 3, 2025

Uh oh!

csadorf commented Nov 3, 2025

Uh oh!

tarang-jain commented Nov 3, 2025 •

edited

Loading

Uh oh!

csadorf commented Nov 3, 2025

Uh oh!

csadorf commented Nov 4, 2025

Uh oh!

tarang-jain commented Nov 4, 2025

Uh oh!

csadorf left a comment

Uh oh!

csadorf Nov 4, 2025

Uh oh!

Uh oh!

csadorf commented Nov 5, 2025

Uh oh!

tarang-jain commented Nov 6, 2025

Uh oh!

tarang-jain commented Nov 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

tarang-jain commented Nov 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

csadorf commented Nov 3, 2025

Uh oh!

csadorf commented Nov 3, 2025

Uh oh!

tarang-jain commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

csadorf commented Nov 3, 2025

Uh oh!

csadorf commented Nov 4, 2025

Uh oh!

tarang-jain commented Nov 4, 2025

Uh oh!

csadorf left a comment

Choose a reason for hiding this comment

Uh oh!

csadorf Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

csadorf commented Nov 5, 2025

Uh oh!

tarang-jain commented Nov 6, 2025

Uh oh!

tarang-jain commented Nov 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tarang-jain commented Nov 1, 2025 •

edited

Loading

tarang-jain commented Nov 3, 2025 •

edited

Loading