[GSK-1504] Integration with W&B #1288

rabah-khalek · 2023-08-02T11:58:16Z

Integration

Prerequisites

Create a W&B (wandb) account.
Start Docker; the local W&B server below runs from Docker images.

Install the Python package and start the server

pip install wandb
wandb login --relogin # input the API key you get from the website
wandb server start --upgrade # this will download the docker images if they're not already downloaded
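Before running the commands above, it can help to confirm that both prerequisites are visible from Python. The following is a minimal sketch, not part of this PR: `check_prerequisites` is a hypothetical helper name, and it only checks that the `wandb` package is importable and that a `docker` executable is on the PATH (it does not verify that the Docker daemon is actually running).

```python
import importlib.util
import shutil


def check_prerequisites():
    """Report whether the `wandb` package and the `docker` CLI are visible.

    Hypothetical helper for illustration; it does not contact W&B or Docker.
    """
    return {
        # True if `import wandb` would find the package in this environment.
        "wandb_installed": importlib.util.find_spec("wandb") is not None,
        # True if a `docker` executable is found on the PATH.
        "docker_on_path": shutil.which("docker") is not None,
    }


status = check_prerequisites()
for name, ok in status.items():
    print(f"{name}: {'ok' if ok else 'missing'}")
```

If either entry reports `missing`, fix that before running `wandb server start --upgrade`.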

Features

import giskard
import wandb
# [...] wrap model and dataset with giskard
scan_results = giskard.scan(giskard_model, giskard_dataset)
test_suite_results = scan_results.generate_test_suite().run()
shap_results = giskard.explain_with_shap(giskard_model, giskard_dataset)

wandb.login()
giskard_dataset.to_wandb() # log dataset
scan_results.to_wandb() # log scan results
test_suite_results.to_wandb() # log test suite results
shap_results.to_wandb() # log shap results as plots
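Each `to_wandb()` call above logs to a W&B run, and one of the commits in this PR mentions a "wandb run contextmanager". The sketch below shows one way such a context manager could reuse the active run, or start one if none exists, so that consecutive calls end up in the same run. It is an assumption about the general pattern, not the code merged here: `active_run`, the default project name, and the `fake_wandb` stand-in are all hypothetical, and only the module-level `wandb.run` attribute and `wandb.init(project=...)` are taken from the real library.

```python
from contextlib import contextmanager
from types import SimpleNamespace


@contextmanager
def active_run(wandb_module, project="giskard", **init_kwargs):
    """Yield the current W&B run, starting one only if none is active.

    Hypothetical helper: pass the real `wandb` module in practice.
    """
    run = wandb_module.run
    if run is None:
        run = wandb_module.init(project=project, **init_kwargs)
    yield run


# Stand-in for the `wandb` module so the sketch runs offline.
def _fake_init(project, **kwargs):
    fake_wandb.run = SimpleNamespace(project=project, logged=[])
    return fake_wandb.run


fake_wandb = SimpleNamespace(run=None, init=_fake_init)

# The first call starts a run; the second call reuses it.
with active_run(fake_wandb, project="demo") as first:
    first.logged.append("dataset")
with active_run(fake_wandb, project="other") as second:
    second.logged.append("scan results")

print(first is second, first.logged)
```

With the real library, `active_run(wandb, project="my-project")` would follow the same path, except that `wandb.init` requires a prior login and a reachable server.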

Todo

log scan results (@rabah-khalek)
log test suite results (@AbSsEnT)
log dataset artifact (@AbSsEnT)
think of other objects we can log (@AbSsEnT and @rabah-khalek)
- log Shapley values as interactive graphs (@AbSsEnT)
write tests (@rabah-khalek and @AbSsEnT)
write doc (@rabah-khalek and @AbSsEnT)
- write 1 demo notebook (credit scoring) (@AbSsEnT)
write article once PR is accepted (@AbSsEnT and @rabah-khalek)

linear · 2023-08-02T11:58:18Z

GSK-1505 Exploring the tool

Mykyta: go through https://docs.wandb.ai/quickstart and explore the platform a bit
Mykyta: go through this example: https://docs.deepchecks.com/stable/general/usage/exporting_results/auto_examples/plot_exports_output_to_wandb.html#exporting-a-suite-s-output-suiteresult-to-wandb
Once you're familiar with it, let's discuss https://docs.wandb.ai/guides/integrations


linear · 2023-08-04T12:52:02Z

GSK-1504 Integration with W&B


* Initial commit with the implementation of the SHAP explanation graphs logging to the WandB run.
* Changed logic of obtaining feature names and types.
* Removed redundant 'model.prepare_dataframe'. Small refactoring.
* Added sorting of logged dataset, test suite result and scan result to distinct panels.
* Moved 'explain' function below shap-related functions.
* Code refactoring.
* Changed naming for variables inside functions.
* Removed explainer return, as it is not needed.
* Moved 'prepare_df' to the separate utils.py file to avoid code duplication.
* Added docstring to the '_get_cls_prediction_explanation'
* Created dataclass ShapResult to store shap explanations there and encapsulate the logic of uploading SHAP charts to the WandB.
* Refactoring of the 'background_example' function.
* Refactoring.
* Refactoring.
* Changed enum class declaration.
* Refactored model_explanation.py to be able to perform testing of explanation results equality. Added unit-tests for the SHAP logging to the WandB.
* Small fix in comments.
* Uncommented fixture.
* Refactored "_get_highest_prob_shap" function. Made it more compact and self-explainable.
* Removed #noqa options from the shap imports. Optimized imports.
* Refactored _prepare_for_explanation function. Changed naming of the function output to highlight, that this data will be explained.
* Renamed explain_full(one) to "_calculate_dataset(sample)_shap_values"
* Refactored _get_background_example function.
* Refactored 'explain_with_shap' function and 'ShapResult' dataclass for better handling classification models explanation.
* Fixed bugs with unit-tests for wandb.
* Transferred '_compare_explain_functions' to the 'test_model_explanation.py'
* Refactoring. Renaming and functions replacement.
* Renaming.
* Transferred plotting functions from the shap_result.py to the wandb_utils.py to better handle wandb importing necessity.
* small update to error msg
* updated unit test
* small update

---------

Co-authored-by: Rabah Abdul Khalek <[email protected]>


andreybavt

Generally speaking, this looks good to me. I left a few non-major comments.

python-client/docs/integrations/wandb/index.md

python-client/pyproject.toml

python-client/giskard/models/shap_result.py

python-client/giskard/datasets/base/__init__.py

* Added docstrings to the "model_explanation.py".
* Added docstrings to the "shap_result.py".
* Fix in docstrings
* Added docstring to the 'Dataset.to_wandb'.
* Added docstring to the 'ScanResult.to_wandb'.
* Added docstring to the 'TestSuiteResult.to_wandb'.
* Resolved issues after PR review.
* updated docstrings

---------

Co-authored-by: Rabah Abdul Khalek <[email protected]>

andreybavt

LGTM

sonarqubecloud · 2023-08-30T07:53:34Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
1 Code Smell

91.6% Coverage
0.0% Duplication

rabah-khalek added 2 commits August 2, 2023 13:46

added wandb run contextmanager

16551d3

added to_wandb for scan results

7dfa8b1

rabah-khalek assigned rabah-khalek and AbSsEnT Aug 2, 2023

rabah-khalek added the Python Pull requests that update Python code label Aug 2, 2023

Merge branch 'main' into GSK-1505-wandb

a35b874

rabah-khalek marked this pull request as draft August 2, 2023 11:59

rabah-khalek added the Integrations label Aug 2, 2023

AbSsEnT and others added 4 commits August 3, 2023 19:10

Added new method to the Dataset class to log dataset to the WandB run. (#1294)

2ebe2d4

* Added new method to the Dataset class to log dataset to the WandB run.
* updated to_wandb

---------

Co-authored-by: Rabah Abdul Khalek <[email protected]>

Merge branch 'main' into GSK-1505-wandb

9dbbc0c

setting up the doc skeleton

40c6daa

Merge branch 'main' into GSK-1505-wandb

b590fa5

rabah-khalek changed the title ~~[GSK-1505] Integration with W&B~~ [GSK-1504] Integration with W&B Aug 4, 2023

rabah-khalek and others added 13 commits August 4, 2023 14:55

Merge branch 'main' into GSK-1505-wandb

c326588

updated pyproject and pdm lock with wandb

a0d5abf

Merge branch 'GSK-1505-wandb' of https://github.com/Giskard-AI/giskard into GSK-1505-wandb

55735e4

working on tests

b1e5a4b

GSK-1531 (#1301)

d70fb75

* Added new method to the TestSuiteResult class to log its execution results to the WandB run.
* Resolved issues.
* refactoring _parse_test_name

---------

Co-authored-by: Rabah Abdul Khalek <[email protected]>

functional tests implemented (GSK-1535)

dba9570

fixed code smell

27ca803

updated docs

091d0b8

Merge branch 'main' into GSK-1505-wandb

0290038

updated imports

80a2c52

Merge branch 'main' into GSK-1505-wandb

ec0e35e

added errors and telemetry

d9ba6ff

rabah-khalek marked this pull request as ready for review August 16, 2023 17:48

fixing code smells

c10735d

rabah-khalek requested a review from andreybavt August 17, 2023 07:57

rabah-khalek and others added 16 commits August 17, 2023 10:07

turned off validation of Dataset in model_explanation

b9b64a8

exposed explain_with_shap

052cea0

converted error to warning

b08a641

updated tests

500f08f

restored fixtures

b695154

New example notebook to show WandB integration functionality.

1be6b00

WandB notebook refactoring. Committing images.

2130bed

Removed blank cell.

a8aa900

Removed blank cell.

ada7d6a

updated docs

477f917

Replaced screenshots with the giskard scan result.

b7dd43d

Merge pull request #1314 from Giskard-AI/GSK-1538-wandb

00d54c9

GSK-1538

Fixed 'explain_with_shap' issue, when the model is the LGBM.

44422ff

Merge pull request #1318 from Giskard-AI/GSK-1566-fix-shap-lgbm

44508c8

GSK-1566

Updated screenshot with test-suite results comparison for multiple runs.

e40c757

Merge pull request #1319 from Giskard-AI/GSK-1580-update-wandb-screen

8514d08

GSK-1580

andreybavt suggested changes Aug 25, 2023


rabah-khalek added 3 commits August 28, 2023 09:56

Merge branch 'main' into GSK-1505-wandb

95b5e8a

updated pdm lock

246fa57

implementing AA's feedback

121ad25

rabah-khalek requested a review from andreybavt August 28, 2023 14:39

rabah-khalek and others added 4 commits August 28, 2023 16:39

Merge branch 'main' into GSK-1505-wandb

6cdd8fc

Merge branch 'main' into GSK-1505-wandb

a2ced58

Merge branch 'main' into GSK-1505-wandb

20ce024

andreybavt approved these changes Aug 30, 2023


andreybavt merged commit cb2a75a into main Aug 30, 2023

Hartorn deleted the GSK-1505-wandb branch September 22, 2023 10:46
