Support for fp16 in CAGRA and IVF-PQ #2085

achirkin · 2024-01-10T15:29:16Z

Add fp16 (CUDA half) support to CAGRA and its dependencies.

…n the raft_objs component

…instances

achirkin · 2024-01-11T15:15:12Z

NB: this does not add the fp16 capabilities to the ann-bench executable. I suggest we add that in a follow-on PR.

tfeher

Thanks @achirkin for this PR!

We have here a large amount of boilerplate code. Fortunately the nontrivial changes are relatively small, and are confined to

mdspan_numpy_serializer.hpp
device_load_stores.cuh
test/neighbors/ann_cagra.cuh

The PR looks good to me!

cpp/test/neighbors/ann_cagra.cuh

cpp/include/raft/util/device_loads_stores.cuh

tfeher · 2024-01-17T21:20:24Z

cpp/include/raft/util/device_loads_stores.cuh

@@ -148,6 +149,26 @@ DI void sts(int32_t* addr, const int32_t (&x)[4])
               : "l"(s4), "r"(x[0]), "r"(x[1]), "r"(x[2]), "r"(x[3]));
 }

+DI void sts(half* addr, const half& x)


Tagging @mdoijade to have a look at the changes in this file, since the load and store ops here are mostly used by IVF-Flat and contractions.cuh.

The additions to device_loads_stores.cuh looks good to me, I agree it is good to have matching sts function call for lds for larger fp16 vector sizes.

cpp/test/neighbors/ann_cagra.cuh

Co-authored-by: tsuki <12711693+enp1s0@users.noreply.github.com>

achirkin · 2024-01-19T06:21:21Z

/merge

This reverts commit 72f48ae.

RAFT C++ tests were not running for a portion of the 24.02 development cycle, until the merger of rapidsai/rapids-cmake#533. This PR fixes some failing tests and reverts PRs that caused test failures that were silent until now, specifically #2097 and #2085. These features will be revisited in a subsequent release. Authors: - Malte Förster (https://github.com/mfoerste4) - Corey J. Nolet (https://github.com/cjnolet) Approvers: - Ben Frederickson (https://github.com/benfred) - Bradley Dice (https://github.com/bdice)

Add fp16 (CUDA half) support to CAGRA and its dependencies. Authors: - Artem M. Chirkin (https://github.com/achirkin) Approvers: - Tamas Bela Feher (https://github.com/tfeher) - tsuki (https://github.com/enp1s0) URL: rapidsai#2085

1. Add fp16 (CUDA half) support to CAGRA and its dependencies (#2085). 2. Fix the shared memory size error in the ivf-flat that got exposed by new tests in #2085. Regarding the point (2): Warp-sort top-k queue uses shared memory; the module provides the required shmem size calculation function decoupled from the queue object itself. As a result, it's easy to plug-in wrong types and get the calculation incorrectly. IVF-Flat scan kernel always kept the distances in the queue as floats, but we calculated the shmem size as if it used `AccT` (IVF-Flat's internal accumulation type). Hence, with adding the tests with fp16 inputs (and `AccT`), the allocated shmem became too small, which resulted in memory access violation errors. Authors: - Artem M. Chirkin (https://github.com/achirkin) Approvers: - Ben Frederickson (https://github.com/benfred) URL: #2172

Initial support and specializations

94ddf0c

achirkin added feature request New feature or request non-breaking Non-breaking change 2 - In Progress Currenty a work in progress labels Jan 10, 2024

achirkin self-assigned this Jan 10, 2024

github-actions bot added cpp CMake labels Jan 10, 2024

achirkin and others added 10 commits January 10, 2024 17:26

Add missing device_loads_stores overloads

ffa0be9

Tweak the uniformInt range to a valid value in tests

756d03e

Merge branch 'branch-24.02' into fea-cagra-fp16

adf8e09

Undo copyright-only changes introduced by generator scripts

5ab5304

Undo copyright-only changes introduced by generator scripts

bda13b1

Add -v flag for more verbose build to debug the linker error

d75fccc

Update build_libraft.sh

3895754

Make the tests PIE and remove object files that are already present i…

a776329

…n the raft_objs component

Add PIC to the raft_lib and raft_lib_static

ec4c1e6

Mirror the previous CAGRA float tests more closely with the new half …

7bbff83

…instances

Revert debug changes in build_libraft.sh

1af6ebb

achirkin marked this pull request as ready for review January 11, 2024 17:02

achirkin requested review from a team as code owners January 11, 2024 17:02

achirkin added 3 - Ready for Review and removed 2 - In Progress Currenty a work in progress labels Jan 11, 2024

achirkin added 4 commits January 11, 2024 20:38

Merge branch 'branch-24.02' into fea-cagra-fp16

23bad7e

Merge branch 'branch-24.02' into fea-cagra-fp16

96bf1a6

Merge branch 'branch-24.02' into fea-cagra-fp16

b28a9b7

Merge branch 'branch-24.02' into fea-cagra-fp16

6947fb5

tfeher approved these changes Jan 17, 2024

View reviewed changes

enp1s0 reviewed Jan 18, 2024

View reviewed changes

cpp/test/neighbors/ann_cagra.cuh Outdated Show resolved Hide resolved

cpp/test/neighbors/ann_cagra.cuh Outdated Show resolved Hide resolved

cpp/test/neighbors/ann_cagra.cuh Outdated Show resolved Hide resolved

achirkin and others added 6 commits January 18, 2024 09:52

Update cpp/test/neighbors/ann_cagra.cuh

1a60963

Co-authored-by: tsuki <12711693+enp1s0@users.noreply.github.com>

Update cpp/test/neighbors/ann_cagra.cuh

602516f

Co-authored-by: tsuki <12711693+enp1s0@users.noreply.github.com>

Update cpp/test/neighbors/ann_cagra.cuh

bdaecbb

Co-authored-by: tsuki <12711693+enp1s0@users.noreply.github.com>

Skip the tests that may fail due to fp-rounding errors

48c1454

Add a few more sts overloads

274cd40

Merge branch 'branch-24.02' into fea-cagra-fp16

3aa2a75

enp1s0 approved these changes Jan 19, 2024

View reviewed changes

rapids-bot bot merged commit 72f48ae into rapidsai:branch-24.02 Jan 19, 2024
61 checks passed

tfeher mentioned this pull request Jan 19, 2024

[FEA] Enable FP16 in CAGRA index #1890

Closed

cjnolet added a commit to cjnolet/raft that referenced this pull request Feb 8, 2024

Revert "Support for fp16 in CAGRA and IVF-PQ (rapidsai#2085)"

e087fd9

This reverts commit 72f48ae.

bdice mentioned this pull request Feb 8, 2024

Fix failing C++ tests and revert #2097, #2085. #2168

Merged

achirkin mentioned this pull request Feb 11, 2024

Reapply: Support for fp16 in CAGRA and IVF-PQ #2172

Merged

shekhars-li mentioned this pull request Apr 2, 2024

[QST] Check rapids/raft version when installing through pip #2272

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for fp16 in CAGRA and IVF-PQ #2085

Support for fp16 in CAGRA and IVF-PQ #2085

achirkin commented Jan 10, 2024 •

edited

Loading

achirkin commented Jan 11, 2024

tfeher left a comment

tfeher Jan 17, 2024

mdoijade Jan 18, 2024

achirkin commented Jan 19, 2024

Support for fp16 in CAGRA and IVF-PQ #2085

Support for fp16 in CAGRA and IVF-PQ #2085

Conversation

achirkin commented Jan 10, 2024 • edited Loading

achirkin commented Jan 11, 2024

tfeher left a comment

Choose a reason for hiding this comment

tfeher Jan 17, 2024

Choose a reason for hiding this comment

mdoijade Jan 18, 2024

Choose a reason for hiding this comment

achirkin commented Jan 19, 2024

achirkin commented Jan 10, 2024 •

edited

Loading