BUG: fixing circular import in daal4py/sklearnex device_offloading #1832

Merged: 35 commits merged on Jun 27, 2024

Conversation

@samir-nasibli (Contributor) commented May 13, 2024

Description

Fixes a circular import in daal4py/sklearnex device offloading.
The best way to resolve the issue is to refactor the code so the cycle cannot occur. This PR rethinks the design of the sklearnex/daal4py/onedal4py import modules and their dependencies.

Changes proposed in the PR:

  • add a _config module to onedal4py, exposing some of sklearnex's config settings at the onedal4py level
  • remove the circular import across daal4py/onedal4py/sklearnex:
    • remove the _device_offload module from daal4py, since after KMeans OOP #1770 and ENH: Adding Ridge Regression support into sklearnex.preview #1843 there is no need for GPU offloading via the daal4py sycl_context.
    • move most of the device-offload functionality to the onedal4py level.
    • create a _config module in onedal4py that exposes some sklearnex config settings at the onedal4py level, and reuse it at the sklearnex level
    • sklearnex now depends on the onedal4py _config and _device_offload modules (see the sketch after this list).
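
For illustration only, here is a minimal sketch of the layering this list describes (module boundaries and names are simplified, not the exact sklearnex/onedal API): the config state lives at the lowest level, the sklearnex-level setter writes into it, and the device-offload helper only reads it, so every import points downward and no cycle can form.

```python
# --- onedal-level _config (sketch): the lowest layer, imports nothing above it
_default_global_config = {
    "target_offload": "auto",
    "allow_fallback_to_host": False,
}

def _get_config():
    # Return a copy so callers cannot mutate shared state by accident.
    return dict(_default_global_config)

# --- sklearnex-level config (sketch): user-facing setter that pushes values
# down into the lower layer; the dependency points downward only.
def set_config(target_offload=None, allow_fallback_to_host=None):
    if target_offload is not None:
        _default_global_config["target_offload"] = target_offload
    if allow_fallback_to_host is not None:
        _default_global_config["allow_fallback_to_host"] = allow_fallback_to_host

# --- onedal-level device-offload helper (sketch): reads config from its own
# layer and never needs to import anything from sklearnex.
def _get_offload_target():
    return _get_config()["target_offload"]

set_config(target_offload="gpu")
print(_get_offload_target())  # -> "gpu"
```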

Note: It is expected that KMeans GPU validation on the main branch will not offload, since those tests rely on daal4py GPU offloading, and that call path is removed here.
#1770 is considered the fix for this.
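
For context, the underlying failure mode is Python's usual partial-initialization problem: two modules that import from each other at import time. A tiny self-contained reproduction with hypothetical module names (nothing to do with the real package layout):

```python
# pkg_a.py
from pkg_b import helper_b  # runs while pkg_a is still initializing

def helper_a():
    return "a -> " + helper_b()

# pkg_b.py
from pkg_a import helper_a  # pkg_a is only partially initialized at this point

def helper_b():
    return "b"

# Importing either module fails with something like:
#   ImportError: cannot import name 'helper_a' from partially initialized
#   module 'pkg_a' (most likely due to a circular import)
# The fix in this PR is the standard one: move the shared pieces (config and
# device-offload helpers) to the lowest layer so all imports point downward.
```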

TODO

  • I have reviewed my changes thoroughly before submitting this pull request.
  • I have commented my code, particularly in hard-to-understand areas.
  • I have updated the documentation to reflect the changes, if necessary (updated in # - add PR number)
  • The unit tests pass successfully.
  • I have run it locally and tested the changes extensively.
  • I have resolved any merge conflicts that might occur with the base branch.
  • Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details)
  • I have added the respective label(s) to the PR if I have permission to do so.

@samir-nasibli added the bug (Something isn't working) label on May 13, 2024
@samir-nasibli (Contributor Author)

/intelci: run

@samir-nasibli (Contributor Author)

/intelci: run

@samir-nasibli marked this pull request as ready for review on June 8, 2024 23:44
@icfaust (Contributor) left a comment

In the case that we drop the daal4py oneapi interfaces, is this movement of device_offload necessary? Couldn't we just delete the sklearnex import of various aspects in daal4py/sklearn/_device_offload.py, and wouldn't that remove the circular import problem? If true, how far away are we from being able to do that? As far as I can see, none of the estimators imported from daal4py in sklearnex, outside of Ridge, can offload to GPU via daal4py. It's a question over aspects of:
Ridge
Lasso
ElasticNet
PairwiseDistances
train_test_split
roc_auc_score

or am I missing something?

In general just some minor naming convention stuff. Would like the thoughts of other reviewers on that.

Resolved review threads (outdated): daal4py/sklearn/_config.py (3), daal4py/sklearn/_device_offload.py
@samir-nasibli (Contributor Author)

In the case that we drop the daal4py oneapi interfaces, is this movement of device_offload necessary? Couldn't we just delete the sklearnex import of various aspects in daal4py/sklearn/_device_offload.py, and wouldn't that remove the circular import problem? If true, how far away are we from being able to do that? As far as I can see, none of the estimators imported from daal4py in sklearnex, outside of Ridge, can offload to GPU via daal4py. It's a question over aspects of: Ridge Lasso ElasticNet PairwiseDistances train_test_split roc_auc_score

or am I missing something?

In general just some minor naming convention stuff. Would like the thoughts of other reviewers on that.

@icfaust Thank you for the review!
I think there are several things mixed up here.
Perhaps this is due to a lack of clarity in my PR description. Let me bring some clarity.

  • This PR tries to keep everything as it is; it does not change anything in the device offloading flow.
  • For the oneapi interface we have onedal4py + sklearnex. If we drop daal4py, then all the necessary pieces will simply be moved to the sklearnex _device_offload module, since daal4py will no longer be needed.

Some minor updates are required, but overall the PR is ready.

@samir-nasibli (Contributor Author)

As far as I can see, none of the estimators imported from daal4py in sklearnex, outside of Ridge, can offload to GPU via daal4py. It's a question over aspects of: Ridge Lasso ElasticNet PairwiseDistances train_test_split roc_auc_score

or am I missing something?

If I understood the question correctly, and you are asking why we need the daal4py sycl_context: KMeans and other estimators from daal4py.sklearn that are used on the sklearnex main branch do their GPU offloading via sycl_context, so we need to leave it as is for now.

If anything needs to be dropped, I would like to do that reduction in the next steps.
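
For readers following along, the daal4py GPU offloading path referred to here is the sycl_context mechanism, roughly as below. This is a sketch only; the exact import path and availability depend on having a SYCL-enabled daal4py build, and this is the path the PR stops relying on in sklearnex.

```python
import numpy as np

# Available only in SYCL-enabled daal4py builds.
from daal4py.oneapi import sycl_context
from daal4py.sklearn.cluster import KMeans

X = np.random.default_rng(0).random((1000, 10), dtype=np.float32)

# Wrapping the call in a sycl_context pushes the daal4py-backed computation
# to the selected device; this is how KMeans on main currently reaches GPU.
with sycl_context("gpu"):
    KMeans(n_clusters=3, random_state=0).fit(X)
```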

@samir-nasibli (Contributor Author) commented Jun 10, 2024

Dear reviewers,
In my PR I tried to get rid of the circular imports, so I slightly changed the import hierarchy in the daal4py/onedal4py/sklearnex namespaces.
But @icfaust raised an interesting question.
If we look at this issue more broadly, should we retain the support_usm_ndarray functionality for daal4py at all?

Which estimators with the daal4py backend do we use for GPU offloading? Correct me if it is anything other than KMeans. KMeans is actively being moved out of preview (#1770), which should happen in the near future, perhaps before the next release. In that case, does it make sense to keep supporting usm_ndarray interop for daal4py at all? If not, then it makes sense to update this PR to move the pieces needed for device offloading to the onedal4py backend and reuse them in sklearnex.
Removing this functionality from daal4py._device_offload and adding it to onedal4py._device_offload also solves the circular imports problem.

After the discussion we will decide whether to integrate the PR as is, or update it for the onedal4py backend only.
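
To make the support_usm_ndarray question concrete: the interop layer under discussion is essentially a decorator that detects USM arrays among the arguments, converts them to host arrays before calling the native backend, and (in the real implementation) converts results back onto the originating device queue. A simplified, hypothetical sketch of that pattern, not the actual onedal code:

```python
from functools import wraps

import numpy as np

def support_usm_ndarray_sketch(func):
    # Convert any dpctl.tensor.usm_ndarray arguments to numpy before calling
    # ``func``. The real decorator also remembers the SYCL queue of the inputs
    # and converts the results back to USM memory on the same device.
    @wraps(func)
    def wrapper(*args, **kwargs):
        def to_host(x):
            # usm_ndarray exposes __sycl_usm_array_interface__; anything else
            # is passed through unchanged.
            if hasattr(x, "__sycl_usm_array_interface__"):
                import dpctl.tensor as dpt
                return dpt.to_numpy(x)
            return x

        args = tuple(to_host(a) for a in args)
        kwargs = {k: to_host(v) for k, v in kwargs.items()}
        return func(*args, **kwargs)

    return wrapper

@support_usm_ndarray_sketch
def column_means(X):
    # Stand-in for a daal4py/onedal computation that expects host memory.
    return np.asarray(X).mean(axis=0)
```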

@samir-nasibli (Contributor Author) commented Jun 11, 2024

Dear reviewers,

I am returning this PR to development.
Generally I will remove all usm_ndarray support from daal4py and use sklearnex device offloading for daal4py calls. All device_offload primitives will be located in the onedal4py module.
Since KMeans is the only estimator still using daal4py's GPU offloading, only its tests are expected to fail until KMeans on onedal4py is integrated (#1770).

@samir-nasibli marked this pull request as draft on June 11, 2024 07:43
Review threads: daal4py/sklearn/linear_model/_coordinate_descent.py (2, outdated), daal4py/sklearn/linear_model/_ridge.py (outdated), sklearnex/linear_model/coordinate_descent.py (2), sklearnex/dispatcher.py
remove unnecessary docstring assignments
@samir-nasibli (Contributor Author)

/azp run CI

Azure Pipelines successfully started running 1 pipeline(s).

@samir-nasibli (Contributor Author)

/intelci: run

@samir-nasibli (Contributor Author) commented Jun 21, 2024

@ethanglaser @icfaust @Alexsandruss thank you for the review!
Dear reviewers, I have addressed all your comments. Please mark them as resolved where applicable.
Assuming green CI.

@ethanglaser (Contributor)

No major objections from my side at this point, looking good, seems like the queue/device offload will functionally remain the same. One remaining question I have is what separates functionality defined in onedal/_device_offload.py vs. sklearnex/_device_offload.py at this point? Seems like maybe these could all be moved to onedal4py. But if there is functionality in sklearnex for a reason then I guess not.

@samir-nasibli (Contributor Author)

No major objections from my side at this point, looking good, seems like the queue/device offload will functionally remain the same. One remaining question I have is what separates functionality defined in onedal/_device_offload.py vs. sklearnex/_device_offload.py at this point? Seems like maybe these could all be moved to onedal4py. But if there is functionality in sklearnex for a reason then I guess not.

Thank you!
I left sklearnex-specific functionality in the sklearnex._device_offload module. I think it should be fine now.
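
As a rough illustration of the split being described (names are illustrative, not the actual module contents): queue selection and array conversion sit at the onedal level, while the sklearnex level keeps only the logic that needs scikit-learn semantics, such as falling back to stock scikit-learn when the backend cannot handle a case.

```python
# onedal-level helper (sketch): knows about devices/queues, not about sklearn.
def resolve_queue(config):
    target = config.get("target_offload", "auto")
    return None if target == "auto" else target

# sklearnex-level dispatch (sketch): decides between the oneDAL backend and
# stock scikit-learn, reusing the lower-level helper.
def dispatch(config, onedal_call, sklearn_call, *args):
    queue = resolve_queue(config)
    if onedal_call is not None:
        return onedal_call(*args, queue=queue)
    return sklearn_call(*args)
```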

@samir-nasibli (Contributor Author)

/intelci: run

@ethanglaser (Contributor) left a comment

LGTM, good work on this and good overall review process, let's wait for internal CI to make sure

@samir-nasibli (Contributor Author)

/intelci: run

1 similar comment
@samir-nasibli (Contributor Author)

/intelci: run

@samir-nasibli (Contributor Author)

/intelci: run

@samir-nasibli (Contributor Author)

CI looks good. The failures are not related to the changes in this PR. Merging it.
Many thanks to the reviewers for the excellent work done.
