Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[KT] Get rid of excessive code in onesweep #1001

Conversation

dmitriy-sobolev
Copy link
Contributor

Measurements show that this does not affect performance, at least on Intel® Data Center GPU Max. So the changes might be treated as refactoring.

Nontrivial changes:

  • ResetBinCounters is removed. RankSLM does the same, but without using fence which is unnecessary as well.

dmitriy-sobolev and others added 28 commits June 29, 2023 12:36
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Use verification functions from the testing harness

Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
* Add missed headers
* Remove unnecessary 'inline' specifiers
* Avoid unnecessary SFINAE
* Align the function semantics (parameter order, types, and names)
* Take offsets in the # of data elements, convert internally to bytes as needed

Co-authored-by: Sergey Kopienko <sergey.kopienko@intel.com>
SergeyKopienko and others added 16 commits June 29, 2023 12:37
Also we made change in one_wg_kernel function:
- we replace block of code
            #pragma unroll
            for (uint32_t s = 0; s<PROCESS_SIZE; s+=64){
                keys.template select<64, 1>(s) = lsc_slm_block_load<KeyT, 64>(slm_reorder_this_thread+s*sizeof(KeyT));
            }
to
            keys = utils::BlockLoad<KeyT, PROCESS_SIZE>(slm_reorder_this_thread);
to avoid compile error:
../include/sycl/ext/intel/experimental/esimd/memory.hpp:381:3: error: static assertion failed due to requirement '_DS == lsc_data_size::u32 || _DS == lsc_data_size::u64': Transposed load is supported only for data size u32 or u64
…output into console in test_all_view function
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Fix event dependency in onesweep

Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Do not corrupt memory outside of sync_buffer borders

Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
@SergeyKopienko SergeyKopienko force-pushed the dev/dmitriy-sobolev/esimd-radix-sort branch from 3251a7a to 77639f5 Compare June 29, 2023 10:41
* [KT] Test 8-bit types

Thin out types to test due to long compilation time

Get rid of 'k' suffix in constant names

Disable sort of 8-bit types with onesweep

---------

Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
@dmitriy-sobolev dmitriy-sobolev force-pushed the dev/dmitriy-sobolev/excessive-code branch from 0b3a6ce to a21ce36 Compare June 30, 2023 07:45
Signed-off-by: Dmitriy Sobolev <dmitriy.sobolev@intel.com>
@dmitriy-sobolev dmitriy-sobolev force-pushed the dev/dmitriy-sobolev/excessive-code branch from a21ce36 to ccb7253 Compare June 30, 2023 07:56
@dmitriy-sobolev dmitriy-sobolev force-pushed the dev/dmitriy-sobolev/esimd-radix-sort branch from bfa6e39 to 90400aa Compare July 11, 2023 16:00
@SergeyKopienko SergeyKopienko force-pushed the dev/dmitriy-sobolev/esimd-radix-sort branch from 90400aa to da624f3 Compare July 17, 2023 09:24
@SergeyKopienko SergeyKopienko force-pushed the dev/dmitriy-sobolev/esimd-radix-sort branch 5 times, most recently from 62a9672 to 9f9c1f7 Compare July 31, 2023 15:47
@dmitriy-sobolev
Copy link
Contributor Author

This will be handled as a part of #1024

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants