Skip to content

Enable random subsampling#122

Merged
rapids-bot[bot] merged 22 commits intorapidsai:branch-24.08from
tfeher:re_enable_subsampling
Aug 1, 2024
Merged

Enable random subsampling#122
rapids-bot[bot] merged 22 commits intorapidsai:branch-24.08from
tfeher:re_enable_subsampling

Conversation

@tfeher
Copy link
Copy Markdown
Contributor

@tfeher tfeher commented May 15, 2024

Random sampling of training set for IVF methods was reverted in rapidsai/raft#2144 due to the large memory usage of the subsample method.

Since then, PR rapidsai/raft#2155 has implemented a new random sampling method with improved memory utilization. Using that we can now enable random sampling of IVF methods (rapidsai/raft#2052 and rapidsai/raft#2077).

Random subsampling has measurable overhead for IVF-Flat, therefore it is only enabled for IVF-PQ.

@tfeher tfeher requested a review from a team as a code owner May 15, 2024 23:32
@tfeher tfeher self-assigned this May 15, 2024
@github-actions github-actions Bot added the cpp label May 15, 2024
@tfeher tfeher added improvement Improves an existing functionality non-breaking Introduces a non-breaking change and removed cpp labels May 15, 2024
@tfeher tfeher requested a review from a team as a code owner May 16, 2024 16:46
@tfeher tfeher changed the base branch from branch-24.06 to branch-24.08 June 25, 2024 12:14
@tfeher tfeher requested a review from a team as a code owner June 25, 2024 12:51
@tfeher tfeher requested a review from raydouglass June 25, 2024 12:51
Copy link
Copy Markdown
Member

@cjnolet cjnolet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks @tfeher

@tfeher tfeher requested a review from a team as a code owner July 30, 2024 21:16
@tfeher
Copy link
Copy Markdown
Contributor Author

tfeher commented Aug 1, 2024

/merge

@rapids-bot rapids-bot Bot merged commit e599ab2 into rapidsai:branch-24.08 Aug 1, 2024
divyegala pushed a commit to divyegala/cuvs that referenced this pull request Aug 7, 2024
Random sampling of training set for IVF methods was reverted in rapidsai/raft#2144 due to the large memory usage of the subsample method.

Since then, PR rapidsai/raft#2155 has implemented a new random sampling method with improved memory utilization.  Using that we can now enable random sampling of IVF methods (rapidsai/raft#2052 and rapidsai/raft#2077).

Random subsampling has measurable overhead for IVF-Flat, therefore it is only enabled for IVF-PQ.

Authors:
  - Tamas Bela Feher (https://github.com/tfeher)
  - Corey J. Nolet (https://github.com/cjnolet)

Approvers:
  - Corey J. Nolet (https://github.com/cjnolet)

URL: rapidsai#122
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cpp improvement Improves an existing functionality non-breaking Introduces a non-breaking change Python

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

2 participants