-
Notifications
You must be signed in to change notification settings - Fork 10.6k
[Dependency Scanning] Refactor batch clang dependency query to preserve only partial results #84929
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@swift-ci test |
struct BatchClangModuleLookupResult { | ||
llvm::StringMap<clang::tooling::dependencies::ModuleDeps> | ||
discoveredDependencyInfos; | ||
llvm::StringMap<std::vector<std::string>> visibleModules; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
StringMap is not thread-safe. A normal insert might be but it can expand when full. We need a lock for this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We do have a lock for this, it is only ever modified in performParallelClangModuleLookup
in a critical section guarded with a lock.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I will add a helper function to the class to ensure the correct lock usage. It might even be cleaner to move the lock here to avoid introduce regression for future refactoring.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Function wise, it looks good for me. Some comments inline for style and nits.
struct BatchClangModuleLookupResult { | ||
llvm::StringMap<clang::tooling::dependencies::ModuleDeps> | ||
discoveredDependencyInfos; | ||
llvm::StringMap<std::vector<std::string>> visibleModules; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I will add a helper function to the class to ensure the correct lock usage. It might even be cleaner to move the lock here to avoid introduce regression for future refactoring.
…ve only partial results Previously, with the change to bridge Clang dependency scanning results on-demand, the scanner would execute Clang dependency scanning queries for each unresolved import, in parallel, and aggregate all of the results to post-process (including on-demand bridging) later. As a consequence of that change, all of the Clang scanner queries' results ('TranslationUnitDeps') got aggregated during a scan and had their lifetimes extended until a later point when they got processed and added to the scanner's cache. This change refactors the Clang dependency scanner invocation to, upon query completion, accumulate only the 'ModuleDeps' nodes which have not been registered by a prior scan, discarding the rest of the 'TranslationUnitDeps' graph. The arrgegated 'ModuleDeps' objects are still bridged on-demand downstream. This change further splits up the 'resolveAllClangModuleDependencies' method's functionality to improve readability and maintainability, into: - 'gatherUnresolvedImports' method which collects all of collected Swift dependents' imports which did not get resolved to Swift dependencies - 'performParallelClangModuleLookup' which actually executes the parallel queries and includes the new logic described above - 'cacheComputedClangModuleLookupResults' method which takes the result of the parallel Clang scanner query and records in in the Swift scanner cache - 'reQueryMissedModulesFromCache' method which covers the scenario where Clang scanner query returned no result because either the dependency can only be found transitively, or the query is for a dependency previously-queried.
98fb1ff
to
8335a03
Compare
@swift-ci smoke test |
@swift-ci smoke test Windows platform |
Previously, with the change to bridge Clang dependency scanning results on-demand, the scanner would execute Clang dependency scanning queries for each unresolved import, in parallel, and aggregate all of the results to post-process (including on-demand bridging) later. As a consequence of that change, all of the Clang scanner queries' results (
TranslationUnitDeps
) got aggregated during a scan and had their lifetimes extended until a later point when they got processed and added to the scanner's cache.This change refactors the Clang dependency scanner invocation to, upon query completion, accumulate only the
ModuleDeps
nodes which have not been registered by a prior scan, discarding the rest of theTranslationUnitDeps
graph. The arrgegatedModuleDeps
objects are still bridged on-demand downstream.This change further splits up the
resolveAllClangModuleDependencies
method's functionality to improve readability and maintainability, into:gatherUnresolvedImports
method which collects all of collected Swift dependents' imports which did not get resolved to Swift dependenciesperformParallelClangModuleLookup
which actually executes the parallel queries and includes the new logic described abovecacheComputedClangModuleLookupResults
method which takes the result of the parallel Clang scanner query and records in in the Swift scanner cachereQueryMissedModulesFromCache
method which covers the scenario where Clang scanner query returned no result because either the dependency can only be found transitively, or the query is for a dependency previously-queried.