chore: enforce type hints across attacks, config, and safemodel modules by shamykyzer · Pull Request #422 · AI-SDC/SACRO-ML

shamykyzer · 2026-03-10T19:35:34Z

Linting & CI:

Enabled the Ruff ANN rule set to require type hints on all function signatures
Disabled ANN401 to permit Any in cases where interfaces are intentionally dynamic
Added a mypy (v1.19.1) pre-commit hook for static type analysis

Annotations:

Added type annotations to function signatures across 17 files in attacks/, config/, and safemodel/
Refined local variable annotations — kept where they add clarity, removed where redundant

Cleanup:

Removed unused imports (PyTorchDataHandler, SklearnDataHandler) from target.py

Closes #415

…tic data

for more information, see https://pre-commit.ci

rpreen · 2026-03-10T22:30:51Z

Looks like a good start - anything and everything we can lock down the better - adding them to the variable definitions inside functions too would be good because it's such an easy way to catch any unexpected behaviour.

codecov · 2026-03-11T02:04:57Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.51%. Comparing base (a9524af) to head (107401b).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #422      +/-   ##
==========================================
- Coverage   99.51%   99.51%   -0.01%     
==========================================
  Files          23       23              
  Lines        2687     2686       -1     
==========================================
- Hits         2674     2673       -1     
  Misses         13       13

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

for more information, see https://pre-commit.ci

Signed-off-by: Shamy <110725453+shamykyzer@users.noreply.github.com>

shamykyzer · 2026-03-15T23:17:21Z

hello @rpreen, can you have a look at this and let me know if there is anything you want me to add or change?

sacroml/attacks/utils.py

rpreen · 2026-03-16T21:22:59Z

This also needs a slight change to report.py to deal with the conflict from the PR just merged.

shamykyzer · 2026-03-19T03:50:44Z

Hi @rpreen , I was going through the type hints and ran into a couple of things worth noting:

Renamed indices_train to train_set in likelihood_attack.py:248 — mypy flagged it as a redefinition because indices_train is first returned as np.ndarray from get_shadow_model on line 239, then converted to set[int] on line 248. Renaming avoids reusing the same name with two different types.
Wrapped min_child_weight in structural_attack.py:206 to fit the 88-char line limit in ruff.

Also found what I think is a pre-existing bug: get_shadow_model in utils.py:148 loads indices_train.pkl twice, the second one should be indices_test.pkl.

It works correctly when saving but not during loading, was this intentional?

rpreen · 2026-03-20T00:10:30Z

Also found what I think is a pre-existing bug: get_shadow_model in utils.py:148 loads indices_train.pkl twice, the second one should be indices_test.pkl.

Good find - we need a bug fix PR for this as high priority.

rpreen · 2026-03-20T00:28:51Z

Also found what I think is a pre-existing bug: get_shadow_model in utils.py:148 loads indices_train.pkl twice, the second one should be indices_test.pkl.

Good find - we need a bug fix PR for this as high priority.

Thankfully it looks like this doesn't actually effect anything because the test indices aren't used when computing the LiRA scores (the check is simply if not in train indices) - but still important to fix in case they get used in future code.

jim-smith · 2026-03-20T11:31:53Z

@shamykyzer nice find!. this is why collaborative s/w is best

shamykyzer · 2026-03-27T04:27:10Z

Hello @rpreen, can you review this PR when you get a chance?

Type hints are at 99% coverage with 0 missing annotation violations from ruff and 16 remaining are ANN401 in 4 files for dynamic interfaces.

Should I expand the mypy files list beyond the current 8 files and are the 16 Any usages fine or should I narrow them?

Also, current ruff configuration only flags missing hints, wouldn't it be worth adding monkeytype or autotyping to auto-generate them going forward?

Or is it good as it is? Thanks.

rpreen · 2026-03-30T10:28:19Z

This looks good as it is I think - I don't think we need to have everything perfectly typed, just adding as much as we can sensibly helps. I don't know enough about auto-generating types to comment on whether that would be useful to add - but I would think that is not something that would be added to Ruff/pre-commit, but would be done in a special PR like this (or a future one) and carefully reviewed. I suspect that since you have 99% done here that an auto-generator would likely struggle with the remaining complex cases and possibly just put Any for the areas we currently do need dynamic types anyway.

My only last question here: is there a reason for removing the existing types from sacroml/config/target.py and sacroml/config/attack.py?

We can expand more files later.

shamykyzer · 2026-03-31T12:38:54Z

This looks good as it is I think - I don't think we need to have everything perfectly typed, just adding as much as we can sensibly helps. I don't know enough about auto-generating types to comment on whether that would be useful to add - but I would think that is not something that would be added to Ruff/pre-commit, but would be done in a special PR like this (or a future one) and carefully reviewed. I suspect that since you have 99% done here that an auto-generator would likely struggle with the remaining complex cases and possibly just put Any for the areas we currently do need dynamic types anyway.

My only last question here: is there a reason for removing the existing types from sacroml/config/target.py and sacroml/config/attack.py?

We can expand more files later.

Well, I thought since every function now has type annotations on its parameters and return values, mypy will infer the local variable types from the function call so annotating the variable too would've been redundant.

  def _get_defaults(name: str) -> dict:                                                                                                                    
      ...                                                                                                                                                  
   
  params: dict = _get_defaults(name)  # redundant — mypy already knows it's a dict                                                                         
  params = _get_defaults(name)        # mypy infers dict from the return type

Here _get_defaults is annotated with -> dict, so when mypy sees params = _get_defaults(name) it already knows params is a dict from the return type and adding : dict on the variable just states the same thing twice.

Same with name: str = prompt(...)
the prompt() returns str, so mypy will already know name is a str without the explicit annotation.

This applies to all the removed annotations in both files, happy to restore them if you prefer?

Also I agree on expanding later, I think the safemodel/classifiers would be a good candidate for a future issue.

shamykyzer linked an issue Mar 10, 2026 that may be closed by this pull request

[Chore] Enforce type hints #415

Open

shamykyzer self-assigned this Mar 10, 2026

shamykyzer changed the title ~~Chore/enforce type hints~~ chore: enforce type hints Mar 10, 2026

shamykyzer and others added 5 commits March 10, 2026 19:56

test: replace OpenML fetches in tests with deterministic local synthe…

8439bd9

…tic data

relaxed factory accuracy assetion

b662454

chore: enforce Ruff type annotations and mypy pre-commit checks added

7a3ad0e

[pre-commit.ci] auto fixes from pre-commit.com hooks

be5543d

for more information, see https://pre-commit.ci

fix: bug resolved for failed checks

ec98cae

shamykyzer force-pushed the chore/enforce-type-hints branch from 1cb8014 to ec98cae Compare March 10, 2026 19:57

shamykyzer marked this pull request as ready for review March 10, 2026 20:09

chore: revert unrelated test changes and keep OpenML paths

0219599

shamykyzer requested a review from rpreen March 10, 2026 20:48

shamykyzer added 2 commits March 10, 2026 22:18

Merge branch 'main' into chore/enforce-type-hints

6fd54b4

Merge branch 'main' into chore/enforce-type-hints

c04a211

shamykyzer and others added 7 commits March 11, 2026 09:04

chore: enforcing type hints attacks

12ebe87

[pre-commit.ci] auto fixes from pre-commit.com hooks

2454cf6

for more information, see https://pre-commit.ci

fixed a failed check

a0d36c0

chore: enforce type hints across attacks, config, and safemodel modules

e70663f

[pre-commit.ci] auto fixes from pre-commit.com hooks

e36f09b

for more information, see https://pre-commit.ci

Merge branch 'main' into chore/enforce-type-hints

dee7726

comment out PD901 rule in pyproject.toml instead of deleting

94b6210

Signed-off-by: Shamy <110725453+shamykyzer@users.noreply.github.com>

shamykyzer changed the title ~~chore: enforce type hints~~ chore: enforce type hints across attacks, config, and safemodel modules Mar 15, 2026

rpreen reviewed Mar 16, 2026

View reviewed changes

sacroml/attacks/utils.py Outdated Show resolved Hide resolved

shamykyzer added 2 commits March 19, 2026 03:05

merge: resolve conflict in report.py with origin/main

61508a2

chore: restore and add type hints across attack modules

d3422f9

shamykyzer requested a review from rpreen March 19, 2026 03:50

shamykyzer and others added 2 commits March 27, 2026 03:34

merge: resolve conflicts with origin/main

1acdfd2

style: pre-commit fixes

f10d503

shamykyzer requested a review from jim-smith March 27, 2026 04:31

Merge branch 'main' into chore/enforce-type-hints

107401b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: enforce type hints across attacks, config, and safemodel modules#422

chore: enforce type hints across attacks, config, and safemodel modules#422
shamykyzer wants to merge 20 commits intomainfrom
chore/enforce-type-hints

shamykyzer commented Mar 10, 2026 •

edited

Loading

Uh oh!

rpreen commented Mar 10, 2026

Uh oh!

codecov bot commented Mar 11, 2026 •

edited

Loading

Uh oh!

shamykyzer commented Mar 15, 2026

Uh oh!

Uh oh!

rpreen commented Mar 16, 2026

Uh oh!

shamykyzer commented Mar 19, 2026 •

edited

Loading

Uh oh!

rpreen commented Mar 20, 2026

Uh oh!

rpreen commented Mar 20, 2026

Uh oh!

jim-smith commented Mar 20, 2026

Uh oh!

shamykyzer commented Mar 27, 2026

Uh oh!

rpreen commented Mar 30, 2026 •

edited

Loading

Uh oh!

shamykyzer commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

shamykyzer commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rpreen commented Mar 10, 2026

Uh oh!

codecov bot commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

shamykyzer commented Mar 15, 2026

Uh oh!

Uh oh!

rpreen commented Mar 16, 2026

Uh oh!

shamykyzer commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rpreen commented Mar 20, 2026

Uh oh!

rpreen commented Mar 20, 2026

Uh oh!

jim-smith commented Mar 20, 2026

Uh oh!

shamykyzer commented Mar 27, 2026

Uh oh!

rpreen commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shamykyzer commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

shamykyzer commented Mar 10, 2026 •

edited

Loading

codecov bot commented Mar 11, 2026 •

edited

Loading

shamykyzer commented Mar 19, 2026 •

edited

Loading

rpreen commented Mar 30, 2026 •

edited

Loading