
Add more estimators to the compatibility test suite #7069

Merged
rapids-bot[bot] merged 6 commits into rapidsai:branch-25.10 from betatim:more-common-tests
Aug 12, 2025

Conversation

betatim (Member) commented Jul 30, 2025

This adds more estimators and xfails for them to the compatibility test suite.

This is step 1 for #7061

Along the way I'm filing issues for problems that arise while running the checks that are more serious (i.e. we can't just mark the check as xfail). They all reference #7061, so you can find them there.
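As a rough sketch of what this kind of xfail bookkeeping can look like: a table keyed by estimator name, mapping each expected-to-fail check to a reason. All names below are illustrative, not the PR's actual entries or cuml's real table.

```python
# Hypothetical sketch of an expected-failure table for estimator checks.
# Estimator names and check names are illustrative only.
EXPECTED_FAILURES = {
    "TruncatedSVD": {
        "check_estimators_pickle": "pickling not supported yet",
    },
    "RandomForestClassifier": {
        "check_fit_idempotent": "refit produces different device buffers",
    },
}


def expected_failure_reason(estimator_name, check_name):
    """Return the xfail reason for this (estimator, check) pair,
    or None if the check is expected to pass."""
    return EXPECTED_FAILURES.get(estimator_name, {}).get(check_name)
```

In a pytest-based suite, the test body would look up the pair before running the check and call `pytest.xfail(reason)` when a reason is found, so known failures are reported as xfail rather than errors.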

copy-pr-bot (Bot) commented Jul 30, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

github-actions (Bot) added the Cython / Python (Cython or Python issue) label Jul 30, 2025
betatim (Member, Author) commented Jul 30, 2025

/ok to test

betatim added the improvement (Improvement / enhancement to an existing function) and non-breaking (Non-breaking change) labels Jul 31, 2025
betatim marked this pull request as ready for review July 31, 2025 11:34
betatim requested a review from a team as a code owner July 31, 2025 11:34
betatim requested review from cjnolet and dantegd July 31, 2025 11:34
betatim (Member, Author) commented Jul 31, 2025

IMHO we can already merge this, even though there are more estimators that could be added. The CI gods are looking favourably on this PR right now, and the diff is already large.

csadorf (Contributor) left a comment

LGTM, just one question.

Comment thread on python/cuml/cuml/tests/test_sklearn_compatibility.py (outdated)
csadorf (Contributor) left a comment

betatim (Member, Author) commented Aug 4, 2025

After fixing the imports, adding more xfails, and skipping more tests, I am now at a point where estimators like RandomForestClassifier that were previously in a good state fail the most basic "is this estimator cloneable" check (which you can't xfail or skip) with a "bad alloc" failure. It seems like failures in one estimator affect other estimators. In particular, some failed checks lead to an increased chance of later checks failing with MemoryErrors. Skipping those tests works to some extent, but eventually you get to the point RandomForestClassifier is now at: even the most basic check fails, yet if you run the test suite only on RandomForestClassifier, everything passes or is xfailed.

It seems like a dead end to keep working on this without fixing whatever underlying problem is causing it, because there isn't much point in skipping all the checks.

betatim (Member, Author) commented Aug 4, 2025

The memory error you see most often is a variation of: `MemoryError: std::bad_alloc: CUDA error (failed to allocate 1280 bytes) at: /home/thead/miniforge3/envs/cuml-20250729/include/rmm/mr/device/cuda_memory_resource.hpp`

You can run the tests with `pytest -sv --disable-warnings --tb=auto test_sklearn_compatibility.py`, which will eventually crash and produce a lot of MemoryErrors. You can also run only a subset, for example for TruncatedSVD with `pytest -sv --disable-warnings --tb=auto test_sklearn_compatibility.py -k TruncatedSVD`. If you run only the TruncatedSVD tests, everything will either pass, be skipped, or xfail. If you run all the tests, new failures appear within the TruncatedSVD tests.

csadorf (Contributor) commented Aug 4, 2025

I'd suggest splitting off the RandomForestClassifier tests for now so that we can unblock this, and then addressing those problems separately.

betatim (Member, Author) commented Aug 5, 2025

We can completely skip RandomForestClassifier and the estimators after it. However, the problem isn't with those estimators per se. The problem is that at some point a check breaks some global state, which leaves RMM unable to allocate any amount of memory. As a result we end up marking many checks as xfail even though they would pass if you tested only a specific estimator.

betatim force-pushed the more-common-tests branch from 364b8c6 to 040b9be August 5, 2025 11:52
betatim (Member, Author) commented Aug 5, 2025

OK, I like this solution better: we skip the naive Bayes estimators, as they are the ones that cause the problems. All the other estimators are tested.
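A minimal sketch of this kind of skip list: filter the naive Bayes family out of the estimator list before parametrizing the suite. The class names below are illustrative sklearn-style naive Bayes names, not necessarily the exact set skipped in the PR.

```python
# Hypothetical skip list: estimators excluded from the compatibility
# suite because their checks corrupt global state (see the discussion
# above about RMM allocation failures). Names are illustrative.
SKIPPED_ESTIMATORS = {
    "GaussianNB",
    "MultinomialNB",
    "BernoulliNB",
    "ComplementNB",
    "CategoricalNB",
}


def estimators_to_test(all_estimator_names):
    """Drop the estimators on the skip list so the remaining checks
    run against a clean global state, preserving the input order."""
    return [name for name in all_estimator_names if name not in SKIPPED_ESTIMATORS]
```

The filtered list would then be fed to whatever parametrization the test file uses, so the skipped family never executes at all instead of being xfailed check by check.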

csadorf (Contributor) left a comment

Pre-approving since this generally LGTM, but I'd ask that we reference relevant issues in-code wherever applicable.

Comment thread on python/cuml/tests/test_sklearn_compatibility.py (outdated)
Comment thread on python/cuml/tests/test_sklearn_compatibility.py (outdated)
betatim force-pushed the more-common-tests branch from 607d2ea to 587e6d5 August 11, 2025 15:41
betatim (Member, Author) commented Aug 12, 2025

/merge

rapids-bot (Bot) merged commit e8b1405 into rapidsai:branch-25.10 Aug 12, 2025
130 of 132 checks passed
betatim deleted the more-common-tests branch August 12, 2025 08:37

Labels

Cython / Python (Cython or Python issue), improvement (Improvement / enhancement to an existing function), non-breaking (Non-breaking change)
