Enable training on XPU devices in OTX2.0 #3094

kprokofi · 2024-03-13T22:49:15Z

Summary

How to test

Checklist

I have added unit tests to cover my changes.
I have added integration tests to cover my changes.
I have added e2e tests for validation.
I have added the description of my changes into CHANGELOG in my target branch (e.g., CHANGELOG in develop).
I have updated the documentation in my target branch accordingly (e.g., documentation in develop).
I have linked related issues.

License

I submit my code changes under the same Apache License that covers the project.
Feel free to contact the maintainers if that's a concern.
I have updated the license header for each file (see an example below).

# Copyright (C) 2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

kprokofi · 2024-03-22T00:17:08Z

I updated the strategy in 98e0b69: it removes torch.xpu.optimize for Semantic Segmentation. IPEX reported that they have a bug in that case, so the optimization has to be removed to be able to train a segmentation model.
I hope it will be fixed in the next IPEX releases.
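
The workaround described in the comment above could be sketched roughly as follows. This is not the code from the PR: `maybe_optimize`, the task label strings, and the injected `optimize_fn` are hypothetical names used only for illustration. In an environment with IPEX installed, `optimize_fn` would presumably be `torch.xpu.optimize`, which is assumed here to take the model plus an `optimizer` keyword and return the optimized pair.

```python
from typing import Any, Callable, Tuple

# Hypothetical task label; the real project may identify tasks differently.
TASKS_WITHOUT_XPU_OPTIMIZE = {"SEMANTIC_SEGMENTATION"}


def maybe_optimize(
    model: Any,
    optimizer: Any,
    task: str,
    optimize_fn: Callable[..., Tuple[Any, Any]],
) -> Tuple[Any, Any]:
    """Apply optimize_fn unless the task is on the skip list."""
    if task in TASKS_WITHOUT_XPU_OPTIMIZE:
        # Skip the optimization step, so the model and optimizer are
        # returned unchanged and training proceeds without it.
        return model, optimizer
    return optimize_fn(model, optimizer=optimizer)


if __name__ == "__main__":
    # Stand-in for torch.xpu.optimize so the sketch runs without IPEX.
    def fake_optimize(model: Any, optimizer: Any = None) -> Tuple[Any, Any]:
        return f"optimized({model})", optimizer

    print(maybe_optimize("seg_model", "sgd", "SEMANTIC_SEGMENTATION", fake_optimize))
    print(maybe_optimize("det_model", "sgd", "DETECTION", fake_optimize))
```

Passing the optimization function in as an argument keeps the skip logic testable on machines without an XPU, and makes it easy to drop the skip list once the IPEX bug is fixed.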

kprokofi · 2024-03-22T00:30:25Z

@harimkang, it seems that a problem with installing mmdet occurs while setting up the environment for unit tests. Could you take a look?

tests/unit/algo/plugins/test_plugins.py

harimkang · 2024-03-22T04:09:01Z

> @harimkang, it seems that a problem with installing mmdet occurs while setting up the environment for unit tests. Could you take a look?

There was a brief issue with the CI network. I reran the test and the install works fine.

* add raising an error when metric is None
* added accelerators
* fix packages
* fix assigning model
* debug on MAX
* change precision
* update MixedPrecisionXPUPlugin
* debug
* added monkey patching
* minor
* minor
* added patch for mmengine
* fix OD and IS
* benchmark debug
* change device
* quick fix for instance seg
* fix pre-commit
* fix pre-commit
* clean the code
* added additional flag for mmcv
* added unit tests
* fixed unit test
* fix linter
* added unit tests and replied comments
* fix pre-commit
* minor fix
* added documentation
* fix unit test
* add workaround for semantic segmentation
* remove RoiAlignTest due to unstability
* minor
* remove strategy back
* try to patch SingleDeviceStrategy
* added auto xpu configuration
* patch strategy
* small fix
* reply to comments
* move patching xpu packages to accelerator
* fix test_xpu test
* remove do-not-install-mmcv
* fix pre-commit
* remove torch.xpu.optimize for segmentation

---------

Co-authored-by: Emily <emily.chun@intel.com>

* Enable training on XPU devices in OTX2.0 (#3094)
* add raising an error when metric is None
* added accelerators
* fix packages
* fix assigning model
* debug on MAX
* change precision
* update MixedPrecisionXPUPlugin
* debug
* added monkey patching
* minor
* minor
* added patch for mmengine
* fix OD and IS
* benchmark debug
* change device
* quick fix for instance seg
* fix pre-commit
* fix pre-commit
* clean the code
* added additional flag for mmcv
* added unit tests
* fixed unit test
* fix linter
* added unit tests and replied comments
* fix pre-commit
* minor fix
* added documentation
* fix unit test
* add workaround for semantic segmentation
* remove RoiAlignTest due to unstability
* minor
* remove strategy back
* try to patch SingleDeviceStrategy
* added auto xpu configuration
* patch strategy
* small fix
* reply to comments
* move patching xpu packages to accelerator
* fix test_xpu test
* remove do-not-install-mmcv
* fix pre-commit
* remove torch.xpu.optimize for segmentation

---------

Co-authored-by: Emily <emily.chun@intel.com>

* Add exporter/demo unit tests (#3218)
* added unit tests. Need to clean up
* move tests
* fix pre-commit
* return demo back
* minor
* delete unnecessery comments
* fix unit test
* fix pre-commit
* fix pre-commit 2
* fix test_postprocess_openvino_model
* fix unit tests
* test_precommit
* Fix a bug that engine.test doesn't work with XPU (#3293)
* fix bug
* align with pre-commit

---------

Co-authored-by: Emily <emily.chun@intel.com>

* fix merge conflicts for pre-commit
* fix precommit 2
* fix unit test
* fix pre-commit
* fix export tests
* fix pre-commit
* fix tox
* fix pre-commit

---------

Co-authored-by: Emily <emily.chun@intel.com>
Co-authored-by: Eunwoo Shin <eunwoo.shin@intel.com>

kprokofi and others added 29 commits January 14, 2024 23:05

add raising an error when metric is None

0abdf10

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

8756968

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

e171e9d

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

4e6e21e

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

c0abe24

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

1868961

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

d33e66e

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

35e925f

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

3de253f

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

3f0ce95

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

bddffa6

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

8b69f62

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

7efa031

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

1f8b9ce

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

6e0f0b6

Merge branch 'v2' of https://github.com/openvinotoolkit/training_extensions into v2

4619997

added accelerators

96104fc

fix packages

d09392c

fix assigning model

8c59e04

debug on MAX

57b0f33

change precision

1471fbb

update MixedPrecisionXPUPlugin

0a4a69a

debug

d77b84b

merge

37601e2

Merge branch 'kp/xpu_otx2.0' of https://github.com/openvinotoolkit/training_extensions into kp/xpu_otx2.0

272a534

added monkey patching

79ec108

minor

1558295

minor

d71492a

Merge branch 'kp/xpu_otx2.0' of https://github.com/openvinotoolkit/training_extensions into kp/xpu_otx2.0

1e7f005

github-actions bot added the DEPENDENCY label ("Any changes in any dependencies (new dep or its version) should be produced via Change Request on PM") on Mar 13, 2024

harimkang added this to the 2.0.0 milestone Mar 21, 2024

kprokofi added 8 commits March 21, 2024 18:26

try to patch SingleDeviceStrategy

028190a

added auto xpu configuration

c1deb52

patch strategy

52e529a

small fix

8892171

reply to comments

2489581

move patching xpu packages to accelerator

8fb52df

fix test_xpu test

d3b9a6d

Merge branch 'releases/2.0.0' into kp/xpu_otx2.0

a47e884

kprokofi dismissed jaegukhyun’s stale review via 2489581 March 21, 2024 16:58

remove do-not-install-mmcv

f99f022

kprokofi requested review from jaegukhyun and eunwoosh March 21, 2024 18:11

kprokofi added 2 commits March 22, 2024 06:29

fix pre-commit

a368e9e

remove torch.xpu.optimize for segmentation

98e0b69

harimkang previously approved these changes Mar 21, 2024

kprokofi dismissed harimkang’s stale review via 98e0b69 March 22, 2024 00:12

jaegukhyun approved these changes Mar 22, 2024

eunwoosh approved these changes Mar 22, 2024

vinnamkim reviewed Mar 22, 2024

tests/unit/algo/plugins/test_plugins.py

eugene123tw approved these changes Mar 22, 2024

kprokofi enabled auto-merge (squash) March 24, 2024 22:45

kprokofi merged commit 9c746da into releases/2.0.0 Mar 24, 2024
16 checks passed

kprokofi deleted the kp/xpu_otx2.0 branch March 24, 2024 23:30

kprokofi mentioned this pull request Apr 9, 2024

Move XPU/Opt related PRs from releases2.0 to develop #3295

Merged

8 tasks
