ANN_BENCH #130

achirkin · 2024-05-17T11:29:59Z

Porting the ANN benchmarks from RAFT.

Make it build

Sanity check that benchmarks work (runs and gives reasonable recall for Deep-1M dataset)

NB: the indices built using the old ANN_BENCH in raft tend to crash in cuvs search benchmarks during index deserialization - don't forget to build the indexes anew when testing.

…archs

…dspan

Signed-off-by: Mickael Ide <mide@nvidia.com>

Co-authored-by: Tamas Bela Feher <tfeher@nvidia.com>

achirkin · 2024-06-06T12:23:53Z

I've just realized the benchmarks are not compiled during conda-cpp-build CI. @cjnolet what's the best way to add the benchmark component to CI build?

achirkin · 2024-06-06T19:15:46Z

NB: some algorithms are likely to fail with OOM thrown by the limiting_resource_adapter with very large datasets (e.g. DEEP-1B); the fix is in #181

…marks when the --limit-bench-ann argument is not passed to build.sh

achirkin · 2024-06-11T15:20:21Z

I tried to add benchmarks to the test build in CI, but got the error from faiss not being able to find BLAS libraries, even though I added openblas as a conda dependency. @cjnolet, @benfred, could you please have a look?

tfeher

Thanks Artem for the updates. This PR ports the existing infrastructure from raft and enables us to run all the existing the benchmarks. I believe it would be useful to have this merged, and work o follow up improvements in separate PRs. I have added open discussion points to the tracker issue #160 (comment).

The PR looks good to me. I have also contributed to the PR, so I shall not be the only approver.

jameslamb

I've reviewed on behalf of packaging-codeowners. Left 2 small comments, neither needs to block merging this.

conda/recipes/libcuvs/meta.yaml

jameslamb · 2024-06-17T15:23:27Z

conda/recipes/libcuvs/build_libcuvs_tests.sh

@@ -1,5 +1,5 @@
 #!/usr/bin/env bash
 # Copyright (c) 2022-2024, NVIDIA CORPORATION.

-./build.sh tests --allgpuarch --no-nvtx --build-metrics=tests_bench --incl-cache-stats
+./build.sh tests bench-ann --allgpuarch --no-nvtx --build-metrics=tests_bench --incl-cache-stats


Over in RAFT, bench-ann is its own package with its own dependencies, build scripts, etc.:

https://github.com/rapidsai/raft/tree/877644a423c0268746af62cecb7150afa65d8386/python/raft-ann-bench

https://github.com/rapidsai/raft/blob/877644a423c0268746af62cecb7150afa65d8386/conda/recipes/raft-ann-bench/meta.yaml

For my own understanding in reviewing this... why is this being added to the libcuvs-tests package here instead of creating a new standalone package as exists in RAFT?

Having these separate things be their own packages can be helpful for parallelizing development, limiting the potential impact of packaging changes, and speeding up debugging.

I agree that benchmarks should probably be a separate package. I'm only hesitating to add this because I'm not very familiar with this conda/CI setup. I hoped to postpone this question till a follow-on PR together with the naming decision (#160 (comment)). @benfred, @cjnolet , what do you think?

benfred · 2024-06-17T17:34:15Z

cpp/CMakeLists.txt

@@ -55,6 +55,7 @@ option(BUILD_SHARED_LIBS "Build cuvs shared libraries" ON)
 option(BUILD_TESTS "Build cuvs unit-tests" ON)
 option(BUILD_C_LIBRARY "Build raft C API library" OFF)
 option(BUILD_C_TESTS "Build raft C API tests" OFF)
+option(BUILD_ANN_BENCH "Build cuVS ann benchmarks" OFF)


Can we add an option to build this in CI?

It looks to me that this is off by default (which is fine), but that does mean that we don't have any guarantee that any of this is compiling.

We enable this currently together with tests in build.sh (see #130 (comment))

mfoerste4 and others added 2 commits May 17, 2024 12:35

enable asynchronous host-refinement for cagra index build

4416b0b

Copy the benchmark scaffolding from RAFT

118906c

github-actions bot added cpp CMake labels May 17, 2024

achirkin changed the title ~~ANN_BENCH~~ [WIP] ANN_BENCH May 17, 2024

achirkin and others added 4 commits May 17, 2024 14:05

GGNN/HNSW/FAISS_CPU are compiled

7814d35

Fix FAISS_GPU builds

f8d8b68

move alloc out of OMP region

946e106

added test for refinement -- disabled until detail::build API is exposed

7c6644e

cjnolet assigned achirkin May 17, 2024

cjnolet added improvement Improves an existing functionality non-breaking Introduces a non-breaking change benchmarking labels May 17, 2024

tfeher and others added 17 commits May 21, 2024 20:40

Merge remote-tracking branch 'origin/branch-24.06' into fea-ann-bench

0619207

cuvs_ivf_flat bench compiles

9313d1f

IVF compiles, linker error

2e61d61

Disabled ivf_pq refinement, as a workaround for linking error

8c6f9fe

expose more cagra build parameters

2c3f5e4

clarify code

4d5ff0c

fix merge conflict

545032b

Accept host_mdspan for IVF-PQ build and extend

53caff9

slightly reduce expected recall to catch refinement 1 results on all …

d2c1dad

…archs

Fix extend API

9f31182

Merge remote-tracking branch 'origin/branch-24.06' into ivf_pq_host_m…

292e947

…dspan

init

f1297da

Signed-off-by: Mickael Ide <mide@nvidia.com>

CAGRA bench compiles

9445708

Fix refine_host duplication, add test

2401fcd

fix style

0d214c0

Update cpp/include/cuvs/neighbors/refine.hpp

5efe1c6

Co-authored-by: Tamas Bela Feher <tfeher@nvidia.com>

Add half, update doc

4b9ec70

achirkin and others added 4 commits June 4, 2024 19:53

Merge branch 'branch-24.08' into fea-ann-bench

0fbf825

Apply clang-tidy suggestions

6a20121

Remove the unused dev_list parameter and add a few more clang-tidy fixes

1e64d05

Merge branch 'branch-24.08' into fea-ann-bench

baf8863

achirkin requested a review from cjnolet June 5, 2024 18:50

achirkin and others added 2 commits June 6, 2024 10:22

Merge branch 'branch-24.08' into fea-ann-bench

c047931

Fix using missing (de)serialize_file

887a656

Merge branch 'branch-24.08' into fea-ann-bench

001108d

achirkin requested a review from a team as a code owner June 10, 2024 07:58

achirkin requested a review from jameslamb June 10, 2024 07:58

achirkin added 5 commits June 10, 2024 20:28

Fix: serialize_to_hnswlib_file -> serialize_to_hnswlib

449be8c

Properly add instances of serialize_to_hnswlib

a71dcb0

Add an aggregate target CUVS_ANN_BENCH_ALL to build all enabled bench…

8887225

…marks when the --limit-bench-ann argument is not passed to build.sh

Include the bench-ann component into the CI alongside with the tests

a32abd5

Add openblas dependency to test environment

c22035c

achirkin and others added 5 commits June 11, 2024 19:47

Merge branch 'branch-24.08' into fea-ann-bench

512cff6

Add openblas requirement to libcuvs-tests recipe (meta.yaml)

c8ae918

Temporarily disable all algos except cuvs

3fd97f6

Re-enable all benchmarks but GGNN.

3daa597

Merge branch 'branch-24.08' into fea-ann-bench

10171cf

tfeher approved these changes Jun 17, 2024

View reviewed changes

jameslamb approved these changes Jun 17, 2024

View reviewed changes

benfred reviewed Jun 17, 2024

View reviewed changes

achirkin and others added 3 commits June 17, 2024 19:46

Merge branch 'branch-24.08' into fea-ann-bench

398a808

Merge branch 'branch-24.08' into fea-ann-bench

d7c25bd

Move openblas conda dependency from build to host

660c40d

achirkin requested a review from benfred June 19, 2024 08:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ANN_BENCH #130

ANN_BENCH #130

achirkin commented May 17, 2024 •

edited

achirkin commented Jun 6, 2024

achirkin commented Jun 6, 2024 •

edited

achirkin commented Jun 11, 2024

tfeher left a comment

jameslamb left a comment

jameslamb Jun 17, 2024

achirkin Jun 19, 2024

benfred Jun 17, 2024

achirkin Jun 17, 2024

ANN_BENCH #130

Are you sure you want to change the base?

ANN_BENCH #130

Conversation

achirkin commented May 17, 2024 • edited

achirkin commented Jun 6, 2024

achirkin commented Jun 6, 2024 • edited

achirkin commented Jun 11, 2024

tfeher left a comment

Choose a reason for hiding this comment

jameslamb left a comment

Choose a reason for hiding this comment

jameslamb Jun 17, 2024

Choose a reason for hiding this comment

achirkin Jun 19, 2024

Choose a reason for hiding this comment

benfred Jun 17, 2024

Choose a reason for hiding this comment

achirkin Jun 17, 2024

Choose a reason for hiding this comment

achirkin commented May 17, 2024 •

edited

achirkin commented Jun 6, 2024 •

edited