Fixes for test_unique narwhals test by scott-routledge2 · Pull Request #920 · bodo-ai/Bodo

scott-routledge2 · 2025-11-10T22:16:14Z

Changes included in this PR

Adds an extra projection in the "subset" case so that the column ordering matches pandas.
Adds error checking for duplicated in the JIT fallback path.
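
The projection in the first item can be pictured with a small, library-free sketch. It rests on an assumption this PR does not confirm: that the deduplication step emits the subset key columns first, followed by the remaining columns. The helpers dedup_keys_first and project are hypothetical and are not Bodo functions. pandas' drop_duplicates keeps the original column order, so a final projection onto the original column list would restore parity.

```python
def dedup_keys_first(columns, subset):
    """Hypothetical dedup: keep the first row per key, emit key columns first."""
    names = list(columns)
    n_rows = len(columns[names[0]])
    seen = set()
    kept = []
    for i in range(n_rows):
        key = tuple(columns[name][i] for name in subset)
        if key not in seen:
            seen.add(key)
            kept.append(i)
    # Assumed output layout: subset keys first, then everything else.
    out_order = list(subset) + [n for n in names if n not in subset]
    return {name: [columns[name][i] for i in kept] for name in out_order}


def project(columns, order):
    """Reorder columns to the requested order (the extra projection)."""
    return {name: columns[name] for name in order}


data = {"a": [1, 3, 2], "b": [4, 4, 6], "z": [7.0, 8.0, 9.0]}
raw = dedup_keys_first(data, subset=["b"])
fixed = project(raw, list(data))

print(list(raw))    # ['b', 'a', 'z']
print(list(fixed))  # ['a', 'b', 'z']
```

The projection only reorders columns and never touches row contents, so it is independent of which keep option is used.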

Testing strategy

narwhals: test_unique -- all params now passing (keep="none" falls back to pandas)

User facing changes

Checklist

Pipelines passed before requesting review. To run CI you must include [run CI] in your commit message.
I am familiar with the Contributing Guide
I have installed + ran pre-commit hooks.

codecov · 2025-11-10T23:06:46Z

Codecov Report

❌ Patch coverage is 37.50000% with 5 lines in your changes missing coverage. Please review.
✅ Project coverage is 68.79%. Comparing base (c33fbb5) to head (a90b4bb).
⚠️ Report is 115 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #920      +/-   ##
==========================================
+ Coverage   66.68%   68.79%   +2.10%     
==========================================
  Files         186      195       +9     
  Lines       66795    67557     +762     
  Branches     9507     9594      +87     
==========================================
+ Hits        44543    46474    +1931     
+ Misses      19572    18251    -1321     
- Partials     2680     2832     +152

scott-routledge2 · 2025-11-11T17:28:59Z

The results don't seem to be deterministic with 3 workers. The original dataframe is {'a': [1, 3, 2], 'b': [4, 4, 6], 'z': [7.0, 8.0, 9.0]}. Sometimes df.unique(subset=["b"], keep="any") returns
{'a': [1, 2], 'b': [4, 6], 'z': [8.0, 9.0]} (the a and z values come from different rows), and sometimes it's correct.

Edit: oh, this could be related to how dataframes are evaluated. For b=4, the correct (a, z) pair is either (1, 7.0) or (3, 8.0), but if the columns are evaluated independently, each column could pick a different row, which would explain the occasional mismatch.
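
The hypothesis in the edit above can be modeled in a few lines of plain Python. This is only a sketch of the suspected failure mode, not Bodo's actual evaluation logic; pick_rows, take, and rows are hypothetical helpers. When every column uses the same surviving row indices, each output row exists in the input. When column a takes the first occurrence per key and column z takes the last, the output reproduces the mismatched result reported above.

```python
data = {"a": [1, 3, 2], "b": [4, 4, 6], "z": [7.0, 8.0, 9.0]}


def pick_rows(keys, keep):
    """Return one row index per distinct key: its first or last occurrence."""
    chosen = {}
    for i, key in enumerate(keys):
        if keep == "last" or key not in chosen:
            chosen[key] = i
    return list(chosen.values())


def take(columns, indices_by_column):
    """Gather each column using that column's own list of row indices."""
    return {
        name: [columns[name][i] for i in indices_by_column[name]]
        for name in columns
    }


def rows(columns):
    """View a dict of columns as a set of row tuples."""
    return set(zip(*columns.values()))


first = pick_rows(data["b"], keep="first")
last = pick_rows(data["b"], keep="last")

# Consistent evaluation: every column shares the same row selection.
consistent = take(data, {name: first for name in data})

# Independent evaluation: column z resolves the tie differently from a and b.
mixed = take(data, {"a": first, "b": first, "z": last})

print(consistent)                 # {'a': [1, 2], 'b': [4, 6], 'z': [7.0, 9.0]}
print(mixed)                      # {'a': [1, 2], 'b': [4, 6], 'z': [8.0, 9.0]}
print(rows(mixed) <= rows(data))  # False: (1, 4, 8.0) is not an input row
```

A subset check like the last line is one way a test could accept either valid answer for keep="any" while still rejecting rows stitched together from different inputs.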

ehsantn

Looks good to me.

DrTodd13

LGTM! Thanks.

scott-routledge2 added 2 commits November 10, 2025 17:15

fixes for test_unique narwhals test

fec8960

[run ci]

a90b4bb

scott-routledge2 marked this pull request as ready for review November 11, 2025 15:57

scott-routledge2 requested review from DrTodd13 and ehsantn November 11, 2025 15:58

ehsantn approved these changes Nov 11, 2025

DrTodd13 approved these changes Nov 11, 2025

scott-routledge2 merged commit 2257ef4 into main Nov 11, 2025
50 of 54 checks passed

scott-routledge2 deleted the scott/drop_duplicates_fixes branch November 11, 2025 20:55
