
Conversation

@rabah-khalek (Contributor) commented on Jun 21, 2023

1. MLflow plug-in via evaluate

Description

Integration of mlflow via the model_evaluator plugin.

Installation requirements:

pip install mlflow # the full mlflow package is needed for .evaluate(); mlflow-skinny, which ships with giskard, is not enough
pip install "git+https://github.com/Giskard-AI/giskard.git@gsk-1321/mlflow-integration#subdirectory=python-client" -q

Code example:

import mlflow

from giskard import demo

# Two demo Titanic classifiers that differ only in the number of training iterations
model1, df = demo.titanic(max_iter=5)
model2, df = demo.titanic(max_iter=100)

with mlflow.start_run(run_name="model1") as run1:
    # Log the sklearn model, then let the giskard evaluator plug-in scan it via mlflow.evaluate
    model1_uri = mlflow.sklearn.log_model(model1, "sklearn_model1", pyfunc_predict_fn="predict_proba").model_uri
    mlflow.evaluate(model=model1_uri, model_type="classifier", data=df, targets="Survived", evaluators="giskard", evaluator_config={"classification_labels": ["no", "yes"]})

with mlflow.start_run(run_name="model2") as run2:
    model2_uri = mlflow.sklearn.log_model(model2, "sklearn_model2", pyfunc_predict_fn="predict_proba").model_uri
    mlflow.evaluate(model=model2_uri, model_type="classifier", data=df, targets="Survived", evaluators="giskard", evaluator_config={"classification_labels": ["no", "yes"]})

Running mlflow ui in the terminal, one gets:

  • the HTML results of the scan embedded as artifacts,
  • the test-suite results of the scan logged as metrics,
  • the model and dataset artifacts (a sketch for pulling these back programmatically follows this list).
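
For reference, a minimal sketch of how the logged metrics and artifacts could be pulled back programmatically once the two runs above have finished. It assumes the run1/run2 objects from the example and the default local tracking store; the exact metric and artifact names depend on what the giskard evaluator logs:

from mlflow import MlflowClient

client = MlflowClient()

for run in (run1, run2):
    run_id = run.info.run_id
    run_data = client.get_run(run_id).data
    print(run_id, run_data.metrics)                  # test-suite results logged as metrics
    for artifact in client.list_artifacts(run_id):   # e.g. the embedded HTML scan report
        print("  -", artifact.path)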

Two notebooks to test this feature:

Run them locally in order to launch the mlflow ui.

2. Giskard API via to_mlflow

Description

Logging artifacts and metrics from giskard to MLflow.

Code example:

import giskard

# giskard_model and giskard_dataset are assumed to be wrapped giskard objects (see the sketch further below)
scan_results = giskard.scan(giskard_model, giskard_dataset)
test_suite = scan_results.generate_test_suite("My first test suite")
test_suite_results = test_suite.run()

import mlflow

# Option 1 (via the fluent API)
with mlflow.start_run() as run:
    giskard_model.to_mlflow()
    giskard_dataset.to_mlflow()
    scan_results.to_mlflow()
    test_suite_results.to_mlflow()

# Option 2 (via MlflowClient)
from mlflow import MlflowClient

client = MlflowClient()
experiment_id = "0"
run = client.create_run(experiment_id)

giskard_model.to_mlflow(client, run.info.run_id) 
giskard_dataset.to_mlflow(client, run.info.run_id) 
scan_results.to_mlflow(client, run.info.run_id) 
test_suite_results.to_mlflow(client, run.info.run_id) 
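
The snippet above assumes giskard_model and giskard_dataset already exist. A minimal sketch of one way to build them from the Titanic demo used earlier -- the exact giskard.Model / giskard.Dataset constructor arguments may differ between giskard versions, so treat this as an assumption rather than the canonical API:

import giskard
from giskard import demo

model, df = demo.titanic(max_iter=100)

giskard_dataset = giskard.Dataset(df=df, target="Survived", name="titanic")
giskard_model = giskard.Model(
    model=model.predict_proba,  # prediction function wrapped by giskard (assumed wrapping style)
    model_type="classification",
    classification_labels=["no", "yes"],
    feature_names=[col for col in df.columns if col != "Survived"],
    name="titanic-classifier",
)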

Notebook to test this feature:

Run it locally in order to launch the mlflow ui:

Open questions

Todo:

Tech

  • better HTML rendering -- fixed in "Make scan widget optionally embeddable" (#1209)
  • better metrics logging
  • use native mlflow saving when possible -- implemented in #1189, commit e83667df6674c229bb922c797f338b2a5d2b4bd3
    • remove save_model from MLflowBasedModel and introduce a dataclass with flags to disable the saving/loading validation during the scan (PyFunc models are not meant to be saved, and the model going through mlflow.evaluate is a PyFunc model)
  • implement the above notebooks as functional-tests
  • tempfile.NamedTemporaryFile doesn't seem to delete files after context
  • add text as model_type
  • ability to compare two models (via scan_summary.json)
  • log scan results as a JSON file to be rendered in the artifact view of mlflow and allow comparison (a rough sketch follows this list)
  • implement to_mlflow() on the following (see doc):
    • ScanResult
    • TestSuiteResult
    • giskard.Model
    • giskard.Dataset
  • artifact view similar to doc
  • Allow people to add any Model.__init__() argument to evaluator_config, not only classification_labels
  • Better error handling and corresponding unit tests
  • run pdm lock -G:all
  • Implement EvaluationResult
  • implement telemetry tracking
  • Implement pushing an output message to the mlflow ui
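
On the scan_summary.json idea above, a rough sketch of what logging a comparable JSON summary per run could look like. mlflow.log_dict is an existing MLflow API; the scan_summary dictionary below is purely illustrative and its keys are not an actual giskard schema:

import mlflow

# Illustrative summary only -- in practice this would be derived from the scan results
scan_summary = {
    "model": "sklearn_model1",
    "issues": [
        {"detector": "PerformanceBiasDetector", "level": "major", "slice": "Sex == 'male'"},
    ],
}

with mlflow.start_run(run_name="model1-scan-summary"):
    # Stored under the run's artifacts so the MLflow UI can render it and diff it across runs
    mlflow.log_dict(scan_summary, "scan_summary.json")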

Writing

LLM model comparison
import pandas as pd
from langchain import PromptTemplate, LLMChain
from langchain.llms import OpenAI
import mlflow
import openai
import os

df = pd.read_csv('https://raw.githubusercontent.com/sunnysai12345/News_Summary/master/news_summary_more.csv')
df_filtered = pd.DataFrame(df["text"].sample(10, random_state=11))

prompt = PromptTemplate(template="Create a reader comment according to the following article summary: '{text}'",
                        input_variables=["text"])

openai.api_key = os.getenv("OPENAI_API_KEY")

llm1 = OpenAI(openai_api_key=openai.api_key,
              request_timeout=20,
              max_retries=100,
              temperature=0,
              model_name="text-ada-001")  # Possibility to select another model

chain1 = LLMChain(prompt=prompt, llm=llm1)

with mlflow.start_run(run_name="text-ada-001") as run1:
    model_uri = mlflow.langchain.log_model(chain1, "langchain").model_uri
    mlflow.evaluate(model=model_uri, model_type="text", data=df_filtered, evaluators="giskard")

llm2 = OpenAI(openai_api_key=openai.api_key,
              request_timeout=20,
              max_retries=100,
              temperature=0,
              model_name="text-embedding-ada-002")  # Possibility to select another model

chain2 = LLMChain(prompt=prompt, llm=llm2)

with mlflow.start_run(run_name="text-embedding-ada-002") as run2:
    model_uri = mlflow.langchain.log_model(chain2, "langchain").model_uri
    mlflow.evaluate(model=model_uri, model_type="text", data=df_filtered, evaluators="giskard")
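
To compare the two runs side by side without opening the UI, one possible follow-up is to query the tracking store directly. This assumes both runs were logged to the active (default) experiment; metric columns appear with a "metrics." prefix:

import mlflow

runs = mlflow.search_runs()  # one row per run in the active experiment (pandas DataFrame)
cols = ["tags.mlflow.runName"] + [c for c in runs.columns if c.startswith("metrics.")]
print(runs[cols])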

Screenshot 2023-07-10 at 15 01 19

  • doc
  • Write an article

Optional

  • Populate MLflow's evaluation examples with the giskard evaluator.
  • Capture the error log giskard.scanner.logger ERROR "Detector LLMToxicityDetector failed with error: 'PyFuncModel' object has no attribute 'rewrite_prompt'" and output instead something like giskard.scanner.logger ERROR "Detector LLMToxicityDetector is not supported by the giskard plug-in through mlflow.evaluate"

@rabah-khalek rabah-khalek self-assigned this Jun 21, 2023
@linear (bot) commented Jun 21, 2023

@rabah-khalek rabah-khalek marked this pull request as draft June 21, 2023 14:49
rabah-khalek and others added 15 commits July 11, 2023 22:40
# Conflicts:
#	python-client/giskard/core/suite.py
#	python-client/giskard/models/__init__.py
#	python-client/giskard/models/base/__init__.py
#	python-client/giskard/models/catboost/__init__.py
#	python-client/giskard/models/huggingface/__init__.py
#	python-client/giskard/models/langchain.py
#	python-client/giskard/models/sklearn/__init__.py
#	python-client/giskard/scanner/result.py
#	python-client/tests/models/automodel/test_infer_giskard_cls.py
@andreybavt (Contributor) commented:

> @andreybavt I am currently wrapping the PyFunc model I have access to in mlflow.evaluate with a custom giskard CloudPickleBasedModel. I only needed to customise model_predict, which was the advantage, since PyFunc standardises predict().
>
> The drawback of not having access to the underlying model object is that we can't run all scan detectors. This has mainly happened only once, for the toxicity detector, where we needed the underlying langchain model object to regenerate prompts, see:
>
> https://github.com/Giskard-AI/giskard/blob/116e1fd4f652cedcc0497a54eb7a78ecba089c89/python-client/giskard/models/langchain/__init__.py#L51-L71
>
> We can put a pin in this, but I think we might be forced to re-think the unwrapping of the model object from the artifact logged by MLflow for our own purposes (scan).

I think we can look at the model_type argument (or the value of model_type in model_config) to wrap it either with just a PyFuncModel or with a LangchainModel, and then be able to call rewrite_prompt in case it's a text_generation model (a rough sketch of this dispatch follows).
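
A rough illustration of the dispatch suggested here -- the helper name and how model_type would be read from the evaluator config are assumptions, not the actual implementation; PyFuncModel and LangchainModel refer to the wrappers discussed above:

def wrap_for_scan(pyfunc_model, model_type, **kwargs):
    # Hypothetical helper: choose the giskard wrapper based on the declared model type,
    # so text-generation models keep access to prompt rewriting (rewrite_prompt).
    if model_type == "text_generation":
        return LangchainModel(pyfunc_model, **kwargs)  # wrapper exposing rewrite_prompt
    return PyFuncModel(pyfunc_model, **kwargs)         # generic wrapper, predict() only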

@rabah-khalek (Contributor, Author) commented:

> @andreybavt I am currently wrapping the PyFunc model I have access to in mlflow.evaluate with a custom giskard CloudPickleBasedModel. I only needed to customise model_predict, which was the advantage, since PyFunc standardises predict().
>
> The drawback of not having access to the underlying model object is that we can't run all scan detectors. This has mainly happened only once, for the toxicity detector, where we needed the underlying langchain model object to regenerate prompts, see:
> https://github.com/Giskard-AI/giskard/blob/116e1fd4f652cedcc0497a54eb7a78ecba089c89/python-client/giskard/models/langchain/__init__.py#L51-L71
>
> We can put a pin in this, but I think we might be forced to re-think the unwrapping of the model object from the artifact logged by MLflow for our own purposes (scan).
>
> I think we can look at the model_type argument (or the value of model_type in model_config) to wrap it either with just a PyFuncModel or with a LangchainModel, and then be able to call rewrite_prompt in case it's a text_generation model.

As it's not a blocker, I think it's a good idea to merge this branch and to take care of this point in a new one. This will unlock some marketing actions. WDYT?

@sonarqubecloud commented:

Kudos, SonarCloud Quality Gate passed!

  • Bugs: 0 (rated A)
  • Vulnerabilities: 0 (rated A)
  • Security Hotspots: 0 (rated A)
  • Code Smells: 1 (rated A)
  • Coverage: 80.4%
  • Duplication: 0.0%

@rabah-khalek rabah-khalek merged commit 38da0e5 into main Jul 26, 2023
@rabah-khalek rabah-khalek added Python Pull requests that update Python code Integrations labels Aug 2, 2023
@Hartorn Hartorn deleted the gsk-1321/mlflow-integration branch September 22, 2023 10:47