[REVIEW]Add support and tests for cuML and XGBoost #330

VibhuJawa · 2021-11-23T23:42:57Z

Add support and tests for cuML and XGBoost

This PR addresses #309

TODO:

Ensure efficient path for non distributed models and allow single GPU xgboost to succeed

 For more context around this see issue: rapidsai/cuml#4406 .

 We were previously training on client which is:
   a. Very inefficient and possibly problematic in multi-node clusters and heterogeneous setup.
   b. Training non distributed xgboost models on dask collections is not supported.

Test for singe GPU cuml model
Test for multi gpu cuml model
Test for singe gpu xgboost model
Test for multi gpu xgboost model
Update GPU-CI environment with the right libraries
See PR: https://github.com/rapidsai/dask-build-environment/pull/16/files

Follow Up work to enable predict with multi GPU cuML models:

Triage single gpu cuml model failure
See issue: [BUG]Predict fails with multi GPU cuML models #332

tests/integration/test_model.py

codecov-commenter · 2021-11-23T23:51:44Z

Codecov Report

Merging #330 (2195a06) into main (96524ee) will increase coverage by 0.01%.
The diff coverage is 88.23%.

@@            Coverage Diff             @@
##             main     #330      +/-   ##
==========================================
+ Coverage   95.69%   95.70%   +0.01%     
==========================================
  Files          65       65              
  Lines        2854     2863       +9     
  Branches      534      536       +2     
==========================================
+ Hits         2731     2740       +9     
+ Misses         75       74       -1     
- Partials       48       49       +1

Impacted Files	Coverage Δ
dask_sql/physical/rel/custom/create_model.py	`93.22% <88.23%> (-2.94%)`	⬇️
dask_sql/physical/utils/sort.py	`90.62% <0.00%> (+7.29%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 96524ee...2195a06. Read the comment docs.

…xgboost

VibhuJawa · 2021-11-24T19:56:33Z

CC: @charlesbluca , How do i go about adding xgboost and cuML to the gpu-CI we have setup ?

charlesbluca · 2021-11-24T22:07:11Z

You can open a PR adding these packages to the environment created in this dockerfile:

https://github.com/rapidsai/dask-build-environment/blob/main/dask_sql.Dockerfile

VibhuJawa · 2021-11-30T19:01:47Z

@charlesbluca , This is now ready for review.

This PR is dependent on https://github.com/rapidsai/dask-build-environment/pull/16/files

VibhuJawa · 2021-11-30T19:08:00Z

dask_sql/physical/rel/custom/create_model.py

+            X_d = X.repartition(npartitions=1).to_delayed()
+            if y is not None:
+                y_d = y.repartition(npartitions=1).to_delayed()
+            else:
+                y_d = None


For more context around this see issue: rapidsai/cuml#4406 .

We were previously training on client which is:

a. Very inefficient and possibly problematic in multi-node clusters and heterogeneous setup.
b. Training non distributed xgboost models on dask collections is not supported.

charlesbluca

Thanks for the work here @VibhuJawa 😄 a couple comments, I'm not the most knowledgeable on Dask ML stuff so would feel good getting a second review on this:

tests/integration/test_model.py

ChrisJar

LGTM!

charlesbluca

Missed this. but generally things look good here! Happy to merge this once cuML / XGBoost are successfully added to gpuCI and tests are passing

tests/integration/fixtures.py

VibhuJawa · 2021-12-06T17:37:55Z

@GPUtester rerun tests .

charlesbluca

Think we need to mark these as fixtures to expose them to tests?

tests/integration/fixtures.py

charlesbluca · 2021-12-06T19:00:17Z

rerun tests

charlesbluca · 2021-12-06T19:19:06Z

rerun tests

Add tests with cuML and XGBoost

e4f51e2

VibhuJawa commented Nov 23, 2021

View reviewed changes

tests/integration/test_model.py Show resolved Hide resolved

Enabled efficient path for single GPU/CPU models and non distributed …

4f441c1

…xgboost

VibhuJawa changed the title ~~[WIP]Add tests with cuML and XGBoost~~ [WIP]Add support and tests for cuML and XGBoost Nov 24, 2021

VibhuJawa mentioned this pull request Nov 24, 2021

[BUG] Training cuML single GPU models on dask dataframe objects uses client instead of worker rapidsai/cuml#4406

Open

VibhuJawa mentioned this pull request Nov 29, 2021

[BUG]Predict fails with multi GPU cuML models #332

Closed

Merge branch 'dask-contrib:main' into cuml_test

9b89d26

VibhuJawa mentioned this pull request Nov 30, 2021

[REVIEW] Add xgboost and cuML as dependency rapidsai/dask-build-environment#16

Merged

uncommented cuml.dask prediction test

9b8ee7f

VibhuJawa changed the title ~~[WIP]Add support and tests for cuML and XGBoost~~ [REVIEW]Add support and tests for cuML and XGBoost Nov 30, 2021

VibhuJawa marked this pull request as ready for review November 30, 2021 19:01

VibhuJawa commented Nov 30, 2021

View reviewed changes

charlesbluca reviewed Nov 30, 2021

View reviewed changes

tests/integration/test_model.py Outdated Show resolved Hide resolved

tests/integration/test_model.py Show resolved Hide resolved

VibhuJawa added 4 commits November 30, 2021 14:35

Added @pytest.mark.gpu decorator

f583c14

removed extra pytest.importorskip("cudf")

19fc5fa

added import check for a pytest.fixture

f1f0911

test on cluster again

1e2921d

ChrisJar approved these changes Dec 1, 2021

View reviewed changes

charlesbluca added the blocked Blocked by work in another pull request label Dec 2, 2021

charlesbluca reviewed Dec 2, 2021

View reviewed changes

tests/integration/fixtures.py Show resolved Hide resolved

charlesbluca approved these changes Dec 3, 2021

View reviewed changes

charlesbluca mentioned this pull request Dec 6, 2021

Add cuML and XGBoost to dask-sql gpuCI images rapidsai/dask-build-environment#22

Merged

charlesbluca requested changes Dec 6, 2021

View reviewed changes

tests/integration/fixtures.py Show resolved Hide resolved

tests/integration/fixtures.py Show resolved Hide resolved

Add decorator to GPU cluster/client fixures

2195a06

charlesbluca removed the blocked Blocked by work in another pull request label Dec 6, 2021

charlesbluca approved these changes Dec 6, 2021

View reviewed changes

charlesbluca merged commit 1f48686 into dask-contrib:main Dec 6, 2021

VibhuJawa mentioned this pull request Mar 7, 2022

[ENH] Add tests for cuML+Dask-ML+dask-sql #309

Closed

[REVIEW]Add support and tests for cuML and XGBoost #330

[REVIEW]Add support and tests for cuML and XGBoost #330

Uh oh!

Conversation

VibhuJawa commented Nov 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Add support and tests for cuML and XGBoost

Follow Up work to enable predict with multi GPU cuML models:

Uh oh!

Uh oh!

codecov-commenter commented Nov 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

VibhuJawa commented Nov 24, 2021

Uh oh!

charlesbluca commented Nov 24, 2021

Uh oh!

VibhuJawa commented Nov 30, 2021

Uh oh!

VibhuJawa Nov 30, 2021

Choose a reason for hiding this comment

Uh oh!

charlesbluca left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ChrisJar left a comment

Choose a reason for hiding this comment

Uh oh!

charlesbluca left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

VibhuJawa commented Dec 6, 2021

Uh oh!

charlesbluca left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

charlesbluca commented Dec 6, 2021

Uh oh!

charlesbluca commented Dec 6, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

VibhuJawa commented Nov 23, 2021 •

edited

Loading

codecov-commenter commented Nov 23, 2021 •

edited

Loading