-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for composite devices #1993
Conversation
Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. 🤞 |
Array API standard conformance tests for dpctl= ran successfully. |
2d5942d
to
76f2fc2
Compare
Array API standard conformance tests for dpctl=0.19.0dev0=py310h93fe807_544 ran successfully. |
76f2fc2
to
00cb089
Compare
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_25 ran successfully. |
00cb089
to
cc5ef08
Compare
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_27 ran successfully. |
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_31 ran successfully. |
7ea744b
to
b8c765f
Compare
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_31 ran successfully. |
4ab4cb0
to
8327887
Compare
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_34 ran successfully. |
1 similar comment
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_34 ran successfully. |
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_34 ran successfully. |
8327887
to
d27f1d0
Compare
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_82 ran successfully. |
This leverages oneAPI extension for composite devices to add the free function `ext_oneapi_get_composite_devices` to the main dpctl namespace
This method is only applicable for level_zero backend, returning an empty list for all other backend types
…aspect_component` Aligns with the rest of the SyclDevice properties Adds tests for the aspects
d27f1d0
to
9cb7b50
Compare
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_85 ran successfully. |
@antonwolfy |
…vices_from_composite
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_99 ran successfully. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @ndgrigorian, LGTM!
Array API standard conformance tests for dpctl=0.20.0dev0=py310h93fe807_100 ran successfully. |
DPCTLDevice_GetComponentDevices(__dpctl_keep const DPCTLSyclDeviceRef DRef) | ||
{ | ||
using vecTy = std::vector<DPCTLSyclDeviceRef>; | ||
vecTy *ComponentDevicesVectorPtr = nullptr; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can use unique_ptr
here to avoid calling delete
by hands.
__dpctl_give DPCTLDeviceVectorRef
DPCTLDevice_GetComponentDevices(__dpctl_keep const DPCTLSyclDeviceRef DRef)
{
using vecTy = std::vector<DPCTLSyclDeviceRef>;
if (!DRef) {
return nullptr;
}
auto D = unwrap<device>(DRef);
try {
auto componentDevices =
D->get_info<sycl::ext::oneapi::experimental::info::device::
component_devices>();
auto ComponentDevicesVectorPtr = std::make_unique(new vecTy());
ComponentDevicesVectorPtr->reserve(componentDevices.size());
for (const auto &cd : componentDevices) {
ComponentDevicesVectorPtr->emplace_back(
wrap<device>(new device(cd)));
}
return wrap<vecTy>(ComponentDevicesVectorPtr.release());
} catch (std::exception const &e) {
error_handler(e, __FILE__, __func__, __LINE__);
}
return nullptr;
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is something that might be smart to go back and do throughout the libsyclinterface codebase
__dpctl_give DPCTLSyclDeviceRef | ||
DPCTLDevice_GetCompositeDevice(__dpctl_keep const DPCTLSyclDeviceRef DRef) | ||
{ | ||
auto D = unwrap<device>(DRef); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This function feels a bit overcomplicated. Is this done for purpose?
__dpctl_give DPCTLSyclDeviceRef
DPCTLDevice_GetCompositeDevice(__dpctl_keep const DPCTLSyclDeviceRef DRef)
{
if (!DRef) {
return nullptr;
}
auto D = unwrap<device>(DRef);
try {
bool is_component = D->has(sycl::aspect::ext_oneapi_is_component);
if (is_component) {
const auto &compositeDevice =
D->get_info<sycl::ext::oneapi::experimental::info::device::
composite_device>();
return wrap<device>(new device(compositeDevice));
}
} catch (std::exception const &e) {
error_handler(e, __FILE__, __func__, __LINE__);
}
return nullptr;
}
*/ | ||
__dpctl_give DPCTLDeviceVectorRef DPCTLDeviceMgr_GetCompositeDevices() | ||
{ | ||
using vecTy = std::vector<DPCTLSyclDeviceRef>; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same here. unique_ptr
+ combine two try...catch
into one:
__dpctl_give DPCTLDeviceVectorRef DPCTLDeviceMgr_GetCompositeDevices()
{
using vecTy = std::vector<DPCTLSyclDeviceRef>;
try {
auto Devices = std::make_unique(new vecTy());
auto composite_devices =
ext::oneapi::experimental::get_composite_devices();
Devices->reserve(composite_devices.size());
for (const auto &CDev : composite_devices) {
Devices->emplace_back(wrap<device>(new device(std::move(CDev))));
}
return wrap<vecTy>(Devices.release());
} catch (std::exception const &e) {
error_handler(e, __FILE__, __func__, __LINE__);
}
return nullptr;
}
@@ -316,3 +316,39 @@ DPCTLPlatform_GetDevices(__dpctl_keep const DPCTLSyclPlatformRef PRef, | |||
return nullptr; | |||
} | |||
} | |||
|
|||
__dpctl_give DPCTLDeviceVectorRef | |||
DPCTLPlatform_GetCompositeDevices(__dpctl_keep const DPCTLSyclPlatformRef PRef) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same here:
__dpctl_give DPCTLDeviceVectorRef
DPCTLPlatform_GetCompositeDevices(__dpctl_keep const DPCTLSyclPlatformRef PRef)
{
if (!PRef) {
error_handler("Cannot retrieve composite devices from "
"DPCTLSyclPlatformRef as input is a nullptr.",
__FILE__, __func__, __LINE__);
return nullptr;
}
auto P = unwrap<platform>(PRef);
using vecTy = std::vector<DPCTLSyclDeviceRef>;
try {
auto DevicesVectorPtr = std::unique_ptr<>(new vecTy());
auto composite_devices = P->ext_oneapi_get_composite_devices();
DevicesVectorPtr->reserve(composite_devices.size());
for (const auto &Dev : composite_devices) {
DevicesVectorPtr->emplace_back(
wrap<device>(new device(std::move(Dev))));
}
return wrap<vecTy>(DevicesVectorPtr.release());
} catch (std::exception const &e) {
error_handler(e, __FILE__, __func__, __LINE__);
}
return nullptr;
}
{ | ||
using vecTy = std::vector<DPCTLSyclDeviceRef>; | ||
vecTy *ComponentDevicesVectorPtr = nullptr; | ||
if (DRef) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In some functions we are checking for XRef
to be non nullptr
. (https://github.com/IntelPython/dpctl/pull/1993/files#diff-93f8a41da82dd5dd2ae6b05e019d8104c87c244e6077d5a819625a2fc5e3a2f1R858)
In others we are checking that result of auto X = unwrap<...>(XRef)
is non nullptr
. (https://github.com/IntelPython/dpctl/pull/1993/files#diff-93f8a41da82dd5dd2ae6b05e019d8104c87c244e6077d5a819625a2fc5e3a2f1R883, https://github.com/IntelPython/dpctl/pull/1993/files#diff-2cee49fe41f858d191076f530e6c6ce6ca841be844b8bca525509ed3deddbbaeR324)
Is that intentionally?
Are these checks interchangeable?
Can we unwrap<...>(XRef)
if XRef
is nullptr?
Can result of unwrap<...>(XRef)
be nullptr if XRef
is not nullptr?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
They are treated as more or less interchangeable throughout the file, and throughout libsyclinterface.
As to if we can unwrap
a nullptr, yes, there are a few tests which more or less do this throughout the libsyclinterface tests.
And the last question, I can't think of such a case. Given it's treated as interchangeable: probably not.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A bit late, but I have some comments:
- try to avoid calling
delete
by hands and useunique_ptr
instead - In most of the functions two
try...catch
blocks could be combined into one. - I have question about equivalence of two checks:
if (XRef) {
auto X = unwrap<...>(XRef);
...
}
vs
auto X = unwrap<...>(XRef);
if (X) {
...
}
Suggestions themselves make sense, I think it may be most sensible to open up (a) PR(s) doing some refactoring and clean-up in libsyclinterface |
This PR proposes supporting the oneAPI composite devices extension in dpctl, which exposes multi-tile GPU devices as root devices for each tile, but also provides access to the composite devices from level zero prior to
ZE_FLAT_DEVICE_HIERARCHY=FLAT
becoming the default. i.e., withZE_FLAT_DEVICE_HIERARCHY=COMPOSITE
, the composite devices are root devices, but withZE_FLAT_DEVICE_HIERARCHY=COMBINED
, a user gains access to these composite devices through dedicated APIs.