Skip to content

[AUTO] Check if actual device cached blob exists before model compilation#29636

Merged
peterchen-intel merged 14 commits intoopenvinotoolkit:masterfrom
yangwang201911:yang/enable_checking_if_actual_device_cached_blob_exists_before_model_compilation_in_AUTO
May 11, 2025
Merged

[AUTO] Check if actual device cached blob exists before model compilation#29636
peterchen-intel merged 14 commits intoopenvinotoolkit:masterfrom
yangwang201911:yang/enable_checking_if_actual_device_cached_blob_exists_before_model_compilation_in_AUTO

Conversation

@yangwang201911
Copy link
Contributor

@yangwang201911 yangwang201911 commented Mar 24, 2025

Details:

  • Enable AUTO to check if cache blob existed at the beginning of compilation.
  • Disable startup fallback and runtime fallback if any cache blob existed for actual device
  • Add an new property for AUTO to indicate whether the core is querying the cache ability from virtual device.

Tickets:

…pilation.

2. Add an new property to indicate whether the core is quering the cache ability from virtual device.
@github-actions github-actions bot added category: inference OpenVINO Runtime library - Inference category: samples OpenVINO Runtime Samples category: AUTO OpenVINO AUTO device selection plugin category: CPP API OpenVINO CPP API bindings and removed category: samples OpenVINO Runtime Samples labels Mar 24, 2025
@wangleis wangleis self-requested a review March 31, 2025 04:20
…ing the cache ability from AUTO plugin.

2. update the test cases of caching model for AUTO when cache is enabled by user.
@github-actions github-actions bot removed the category: CPP API OpenVINO CPP API bindings label Apr 14, 2025
…ed_blob_exists_before_model_compilation_in_AUTO
@yangwang201911 yangwang201911 marked this pull request as ready for review April 14, 2025 08:29
@yangwang201911 yangwang201911 requested review from a team as code owners April 14, 2025 08:29
}

if (enable_startup_cpu && num_blob_files == devices_to_be_merged.size() - 1) {
LOG_DEBUG_TAG("Will disable CPU for acclerator when all blob files found");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe more concrete debug message for why you are disabling CPU

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

btw, if GPU.0 does not have the blob, GPU.1 does have, it will still have cpu start up enabled right?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@wangleis , disable CPU on both startup fallback and runtime fallback if any cache blob found. Is this expected behavior? or we need to check if cache blobs exist for all of device in candidate list?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

When cache blob exist for candidate device, AUTO will pass model path to target device as pass through mode. Then AUTO need to disable startup fallback and runtime fallback.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

will disable CPU if any blob file found for a device in candidate list.

@yangwang201911 yangwang201911 requested a review from songbell May 6, 2025 09:02
…ed_blob_exists_before_model_compilation_in_AUTO
@peterchen-intel peterchen-intel added this pull request to the merge queue May 11, 2025
Merged via the queue into openvinotoolkit:master with commit e07b97c May 11, 2025
189 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: AUTO OpenVINO AUTO device selection plugin category: inference OpenVINO Runtime library - Inference

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants