[maintenance] lazy load dpnp.tensor/dpnp and prepare for array_api lazy importing #2509

icfaust · 2025-06-05T12:40:15Z

Description

Dpctl and dpnp are quasi-dependencies which will silently error out if not installed. This is done at import time throughout the codebase, meaning that it is mixed into the codebase in a difficult manner. As the number of supported data frameworks are increased, such a strategy is unsustainable. Lazy loading of the necessary packages must be done, as the load time of follow-on frameworks like PyTorch are non-negligible (>1s). If we were to follow the same strategy, load times of sklearnex would be even longer even if pytorch isn't used but is available. This will compound as we would add framework support. Cleanly separating and isolating their use is necessary.

Therefore we need to first move dpnp and dpctl.tensor support to a lazy loading approach which will then be extended by follow-on frameworks. The next step will be pytorch queue extraction, which will require this infrastructure.

The strategy will follow that of array_api_compat which can check for namespaces without importing the actual modules, and for the direct use of the frameworks, a depedency injection + monkeypatching scheme is used with decorator lazy_import.

A new test is added to test_common.py which verifies for all estimators that when they are imported, that they nor the underlying infrastructure actively load data frameworks which aren't numpy or pandas.

NOTE TO REVIEWERS: Let me know if I should do a performance benchmarks for this.

PR should start as a draft, then move to ready for review state after CI is passed and all applicable checkboxes are closed.
This approach ensures that reviewers don't spend extra time asking for regular requirements.

You can remove a checkbox as not applicable only if it doesn't relate to this PR in any way.
For example, PR with docs update doesn't require checkboxes for performance while PR with any change in actual code should have checkboxes and justify how this code change is expected to affect performance (or justification should be self-evident).

Checklist to comply with before moving PR from draft:

PR completeness and readability

I have reviewed my changes thoroughly before submitting this pull request.
I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with update and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have added a respective label(s) to PR if I have a permission for that.
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

Performance

I have measured performance for affected algorithms using scikit-learn_bench and provided at least summary table with measured data, if performance change is expected.
I have provided justification why performance has changed or why changes are not expected.
I have provided justification why quality metrics have changed or why changes are not expected.
I have extended benchmarking suite and provided corresponding scikit-learn_bench PR if new measurable functionality was introduced in this PR.

…rn-intelex into dev/lazy_load

david-cortes-intel · 2025-06-05T14:45:59Z

sklearnex/utils/validation.py

+    try:
+        too_small = X.size < 32768
+    except TypeError:
+        too_small = math.prod(X.shape) < 32768


Could also use np.prod, since numpy is already imported throughout the codebase.

https://github.com/scikit-learn/scikit-learn/blob/73a8a656b8df6d02cf88ef8f9cf98373a3f42051/sklearn/utils/_array_api.py#L215 Not entirely sure how numpy would interact with pytorch in that case. Could check that if you want, but its following the precedent set by sklearn itself

onedal/utils/_third_party.py

onedal/datatypes/_sycl_usm.py

Copilot

Pull Request Overview

This PR implements lazy-loading for dpnp, dpctl.tensor, and array_api support to mitigate import-time performance overhead and decouple heavy dependencies from estimator initialization. Key changes include refactoring import paths from deprecated helper modules to a new _third_party module, updating functions in several modules (e.g. logistic regression, ensemble forests, device offload) to use lazy evaluation, and adding a test in test_common.py to verify that only numpy and pandas are loaded on estimator import.

Reviewed Changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
tests/run_examples.py	Updated import path for dpctl availability.
sklearnex/tests/test_memory_usage.py	Removed unused dpctl/dpnp imports from dpep_helpers.
sklearnex/tests/test_common.py	Added new test to validate lazy import behavior for data frameworks.
onedal/ensemble/_forest.py	Replaced get_unique_values_with_dpep with new inline unique extraction.
onedal/_device_offload.py	Updated handling of output conversion and lazy data extraction.
onedal/utils/_third_party.py	Introduced new helper functions for lazy importing and third-party checks.
onedal/utils/_array_api.py	Added caching mechanism for mapping array types to SYCL namespaces.
onedal/tests/utils/_dataframes_support.py	Modified dpnp availability checks using try/except.
onedal/linear_model/logistic_regression.py	Updated unique value extraction using _get_sycl_namespace.
onedal/datatypes/*	Various adjustments to imports and data conversion functions.
onedal/common/tests/test_sycl.py	Updated dpctl availability checks to use new module.

Comments suppressed due to low confidence (1)

onedal/ensemble/forest.py:321

The variable 'xp' is used without being defined. It should be initialized by extracting the array namespace from X (e.g. by adding '_, xp, _ = _get_sycl_namespace(X)' before using xp.unique).

                self.classes_ = xp.unique(y)

icfaust · 2025-06-23T04:55:55Z

Pull Request Overview

This PR implements lazy-loading for dpnp, dpctl.tensor, and array_api support to mitigate import-time performance overhead and decouple heavy dependencies from estimator initialization. Key changes include refactoring import paths from deprecated helper modules to a new _third_party module, updating functions in several modules (e.g. logistic regression, ensemble forests, device offload) to use lazy evaluation, and adding a test in test_common.py to verify that only numpy and pandas are loaded on estimator import.

Reviewed Changes

Copilot reviewed 20 out of 20 changed files in this pull request and generated no comments.

Show a summary per file
File Description
tests/run_examples.py Updated import path for dpctl availability.
sklearnex/tests/test_memory_usage.py Removed unused dpctl/dpnp imports from dpep_helpers.
sklearnex/tests/test_common.py Added new test to validate lazy import behavior for data frameworks.
onedal/ensemble/_forest.py Replaced get_unique_values_with_dpep with new inline unique extraction.
onedal/_device_offload.py Updated handling of output conversion and lazy data extraction.
onedal/utils/_third_party.py Introduced new helper functions for lazy importing and third-party checks.
onedal/utils/_array_api.py Added caching mechanism for mapping array types to SYCL namespaces.
onedal/tests/utils/_dataframes_support.py Modified dpnp availability checks using try/except.
onedal/linear_model/logistic_regression.py Updated unique value extraction using _get_sycl_namespace.
onedal/datatypes/* Various adjustments to imports and data conversion functions.
onedal/common/tests/test_sycl.py Updated dpctl availability checks to use new module.
Comments suppressed due to low confidence (1)
onedal/ensemble/forest.py:321

The variable 'xp' is used without being defined. It should be initialized by extracting the array namespace from X (e.g. by adding '_, xp, _ = _get_sycl_namespace(X)' before using xp.unique).
                self.classes_ = xp.unique(y)

Low confidence recommendation for onedal/ensemble/forest.py is incorrect, as xp is defined beforehand.

icfaust · 2025-06-23T05:15:23Z

/intelci: run

Alexsandruss

I assume PreCommit issues are caused by infra only.

icfaust · 2025-06-23T12:11:04Z

Will rerun because of private-CI issues.

icfaust · 2025-06-23T12:11:10Z

/intelci: run

Alexsandruss · 2025-06-23T13:11:28Z

/intelci: run

ethanglaser · 2025-06-23T21:13:37Z

/intelci: run

icfaust · 2025-06-24T04:10:27Z

We are having private CI infrastructure issues, especially with the GPU runners, so this will be on hold until those run properly.

icfaust · 2025-06-24T10:17:20Z

/intelci: run

icfaust · 2025-06-24T20:44:31Z

/intelci: run

icfaust added 9 commits June 3, 2025 09:44

starting point

7d14b79

Merge branch 'dev/lazy_load' of https://github.com/icfaust/scikit-lea…

4a83297

…rn-intelex into dev/lazy_load

first cut

523e84b

rename

6f4775f

fix various testing imports

219e26f

don't get ahead of my skis

54af074

attempt to further move things apart

f3c5d5b

remove get_unique_values_with_dpep

bfdd3e0

remove actually

436405c

david-cortes-intel reviewed Jun 5, 2025

View reviewed changes

icfaust and others added 20 commits June 5, 2025 23:33

Update _array_api.py

55eab86

try to fix

a7c8fb0

Update _device_offload.py

982c7c4

Update _device_offload.py

e975a4f

Update _device_offload.py

d46d175

Update _device_offload.py

8e8b6d9

Update _sycl_usm.py

125e727

Update _third_party.py

fc6fa24

Update _device_offload.py

c9244b8

Update _device_offload.py

c171175

Update _device_offload.py

18308b2

Update _device_offload.py

603e7d3

Update _sycl_usm.py

bc1c0e3

Update _sycl_usm.py

51a6b06

Update _third_party.py

1f1648c

Update _third_party.py

0ec3ed8

Update _sycl_usm.py

5688076

Update _third_party.py

39d300e

Update _third_party.py

62611c0

Update _array_api.py

3688b1b

icfaust added 17 commits June 21, 2025 12:09

Update test_common.py

869e036

Update test_common.py

400aad8

Update test_common.py

556cb16

Update test_common.py

de1c1ea

Update test_common.py

ab76a76

Update test_common.py

03b99a0

Update test_common.py

733511e

Update test_common.py

24fa5ba

Update test_common.py

a37bacb

Update test_common.py

f31c296

Update test_common.py

c6a7a4d

Update test_common.py

f2baef3

Update _data_conversion.py

d706419

Update _data_conversion.py

b40f59f

Update test_common.py

e155ca3

Update _data_conversion.py

abd8b32

Update _data_conversion.py

2541e21

icfaust requested a review from Copilot June 23, 2025 04:51

Copilot AI reviewed Jun 23, 2025

View reviewed changes

Alexsandruss approved these changes Jun 23, 2025

View reviewed changes

icfaust merged commit 6632523 into uxlfoundation:main Jun 25, 2025
31 of 32 checks passed

[maintenance] lazy load dpnp.tensor/dpnp and prepare for array_api lazy importing #2509

[maintenance] lazy load dpnp.tensor/dpnp and prepare for array_api lazy importing #2509

Uh oh!

Conversation

icfaust commented Jun 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

david-cortes-intel Jun 5, 2025

Choose a reason for hiding this comment

Uh oh!

icfaust Jun 6, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

icfaust commented Jun 23, 2025

Pull Request Overview

Reviewed Changes

Uh oh!

icfaust commented Jun 23, 2025

Uh oh!

Alexsandruss left a comment

Choose a reason for hiding this comment

Uh oh!

icfaust commented Jun 23, 2025

Uh oh!

icfaust commented Jun 23, 2025

Uh oh!

Alexsandruss commented Jun 23, 2025

Uh oh!

ethanglaser commented Jun 23, 2025

Uh oh!

icfaust commented Jun 24, 2025

Uh oh!

icfaust commented Jun 24, 2025

Uh oh!

icfaust commented Jun 24, 2025

Uh oh!

Uh oh!

Uh oh!

icfaust commented Jun 5, 2025 •

edited

Loading