-
-
Notifications
You must be signed in to change notification settings - Fork 380
feature/debug #968
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feature/debug #968
Conversation
|
In this commit 19de6c6: details (deprecated)
Update 1Update: regarding the second point, I think that's a more fair comparison to do (that favours int masks): Update 2Python bool is 1 byte, int is 4 bytes. So if the model is at least 75% correct it’s cheaper to store int line numbers and not bool map. |
|
|
||
| repeated uint32 actual_slices_size = 21; | ||
| repeated uint32 reference_slices_size = 22; | ||
| string output_df_id = 23; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
where is it used?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Googleton could you answer this one?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
According to @Googleton :
it’s what we use for the frontend to know which debugging session to open
80c1113 to
be66b69
Compare
…giskard into feature/debug_output
# Conflicts: # python-client/giskard/core/suite.py # python-client/giskard/ml_worker/server/ml_worker_service.py
# Conflicts: # frontend/src/views/main/project/Datasets.vue
… model if none found
# Conflicts: # frontend/src/api.ts # frontend/src/views/main/project/Datasets.vue # frontend/src/views/main/project/modals/SuiteTestInfoModal.vue # python-client/giskard/datasets/base/__init__.py
|
Kudos, SonarCloud Quality Gate passed! |










ToDo
comment (deprecated)
I found couple of issues with the previous implementation:
https://github.com/Giskard-AI/giskard/tree/GSK-106_Reduce_the_number_of_test_outputs
datasetwe use. Once we have CV, we might not havepandas.DataFrameanymore. Therefore, the slicing operation shouldn't be only valid for the latter.datasetand not a df that can be ready to be rendered in the UI (with new name, uuid, etc.).debug_filtersalong which we define theslices_to_debugshould not be duplicated and defined per test, rather collected somewhere for readability and better handling.Here's my first preliminary proposal:
https://github.com/Giskard-AI/giskard/blob/3d9ae1e0b20185979b5fab8ec6d977147e2a23a1/python-client/giskard/ml_worker/testing/tests/debug_utils.py#L13-L17
https://github.com/Giskard-AI/giskard/blob/3d9ae1e0b20185979b5fab8ec6d977147e2a23a1/python-client/giskard/ml_worker/testing/tests/debug_utils.py#L20-L26
slices_to_debugto theTestResultby using thedebug_filterscatalogue:https://github.com/Giskard-AI/giskard/blob/3d9ae1e0b20185979b5fab8ec6d977147e2a23a1/python-client/giskard/ml_worker/testing/tests/performance.py#L45-L49
so basically:
debug_filters.get(test_name)will get the filter for the corresponding test.(gsk_dataset, prediction)are the parameters needed by this filter.Datasets) assigned toslices_to_debug.This will result in:

@andreybavt and @jmsquare let me know what you think about this.
minutes of 12/04/2023
After the 12/04/2023 discussion with @jmsquare and @andreybavt :
debugarg to the testsThe name slice for(deprecated)Datasetmight not be ideal -- to specThe(abandonned)catalogueidea to review by @andreybavtsliceper type of tests:https://linear.app/giskard/project/test-debug-feature-0024fb6da8c5?filter=eyJhbmQiOlt7InN0YXRlIjp7Im5hbWUiOnsibmluIjpbIkRvbmUiXX19fV19