Enable random subsampling #122

Merged Aug 1, 2024 (22 commits)

Conversation

tfeher
Contributor

@tfeher tfeher commented May 15, 2024

Random sampling of the training set for IVF methods was reverted in rapidsai/raft#2144 because the subsample method used too much memory.

Since then, rapidsai/raft#2155 has implemented a new random sampling method with improved memory utilization. With it, we can now enable random sampling for the IVF methods (rapidsai/raft#2052 and rapidsai/raft#2077).

Random subsampling has measurable overhead for IVF-Flat, so it is enabled only for IVF-PQ.
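
For readers unfamiliar with the idea, here is a minimal sketch of random training-set subsampling in plain Python. It is an illustration only and not the cuVS or RAFT implementation: the helper name `subsample_rows` and its `fraction` and `seed` parameters are hypothetical. It draws row indices without replacement and gathers only the selected rows, so the full dataset is never shuffled or copied.

```python
import random


def subsample_rows(dataset, fraction, seed=0):
    """Return a random subset of rows, drawn without replacement.

    Hypothetical helper for illustration; not part of cuVS or RAFT.
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    n_rows = len(dataset)
    if n_rows == 0:
        return []
    n_train = max(1, int(n_rows * fraction))
    rng = random.Random(seed)
    # Sample indices only; the dataset itself is never shuffled or copied.
    # Sorting keeps the gathered rows in their original order.
    indices = sorted(rng.sample(range(n_rows), n_train))
    return [dataset[i] for i in indices]


# Toy dataset: 1000 two-dimensional vectors.
dataset = [[float(i), float(i) * 2.0] for i in range(1000)]
trainset = subsample_rows(dataset, fraction=0.1, seed=42)
```

The extra memory here is proportional to the number of sampled rows rather than to the full dataset, which is the kind of property that motivated the change.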

@tfeher tfeher requested a review from a team as a code owner May 15, 2024 23:32
@tfeher tfeher self-assigned this May 15, 2024
@github-actions github-actions bot added the cpp label May 15, 2024
@tfeher tfeher added the improvement and non-breaking labels and removed the cpp label May 15, 2024
@tfeher tfeher requested a review from a team as a code owner May 16, 2024 16:46
@tfeher tfeher changed the base branch from branch-24.06 to branch-24.08 June 25, 2024 12:14
@tfeher tfeher requested a review from a team as a code owner June 25, 2024 12:51
@tfeher tfeher requested a review from raydouglass June 25, 2024 12:51
Member

@cjnolet cjnolet left a comment


LGTM! Thanks @tfeher

@tfeher tfeher requested a review from a team as a code owner July 30, 2024 21:16
@tfeher
Contributor Author

tfeher commented Aug 1, 2024

/merge

@rapids-bot rapids-bot bot merged commit e599ab2 into rapidsai:branch-24.08 Aug 1, 2024
54 checks passed
divyegala pushed a commit to divyegala/cuvs that referenced this pull request Aug 7, 2024

Authors:
  - Tamas Bela Feher (https://github.com/tfeher)
  - Corey J. Nolet (https://github.com/cjnolet)

Approvers:
  - Corey J. Nolet (https://github.com/cjnolet)

URL: rapidsai#122
Labels
cpp, improvement, non-breaking, Python

2 participants