Skip to content

HDBSCAN and KMeans API improvements for improving CPU interoperability #6181

Merged
rapids-bot[bot] merged 9 commits intorapidsai:branch-25.02from
dantegd:fix-cuml-sklearn-api-fixes
Dec 24, 2024
Merged

HDBSCAN and KMeans API improvements for improving CPU interoperability #6181
rapids-bot[bot] merged 9 commits intorapidsai:branch-25.02from
dantegd:fix-cuml-sklearn-api-fixes

Conversation

@dantegd
Copy link
Copy Markdown
Member

@dantegd dantegd commented Dec 13, 2024

No description provided.

@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented Dec 13, 2024

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@github-actions github-actions Bot added the Cython / Python Cython or Python issue label Dec 13, 2024
@dantegd dantegd changed the title KMeans API improvements for matching Scikit-learn API Estimator API improvements for matching Scikit-learn API Dec 15, 2024
@dantegd dantegd changed the title Estimator API improvements for matching Scikit-learn API HDBSCAN and KMeans API improvements for improving CPU interoperability Dec 17, 2024
@dantegd dantegd added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Dec 17, 2024
DOC comment explaining behavior
@dantegd dantegd marked this pull request as ready for review December 17, 2024 03:45
@dantegd dantegd requested a review from a team as a code owner December 17, 2024 03:45
@dantegd dantegd requested review from betatim and divyegala December 17, 2024 03:45
Comment on lines +572 to +573
self._gen_min_span_tree = \
GlobalSettings().accelerator_active or value
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In general we should not set attributes to values that are different from the constructor argument.

I tried to work out why doing this increases compatibility, but couldn't the default of gen_min_span_tree in HDBSCAN is also False. What was the thinking for making this default on when the accelerator is enabled?

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Over-zealousness of wanting the mst to be available always when I was testing locally, forgot that the original hdbscan package has the same gen_min_span_tree parameter, so I removed this logic :)

@dantegd
Copy link
Copy Markdown
Member Author

dantegd commented Dec 24, 2024

/merge

@rapids-bot rapids-bot Bot merged commit 7731ba2 into rapidsai:branch-25.02 Dec 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Cython / Python Cython or Python issue improvement Improvement / enhancement to an existing function non-breaking Non-breaking change

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants