Use fallback libraries for archs without optimized logic (ROCm#1862)
Fixes ROCm#1757. Enables architectures that have no optimized logic files to also produce libraries when `--separate-architectures` or `--lazy-library-loading` is turned on. Previously, both flags had to be disabled for rocBLAS to run on architectures such as `gfx1010`.
GZGavinZhao committed Jan 24, 2024
1 parent 4dd0550 commit efbe0c0
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions Tensile/TensileCreateLibrary.py
@@ -940,11 +940,18 @@ def generateLogicDataAndSolutions(logicFiles, args):
  # logicData[problemType].append((scheduleName, deviceNames, \
  # solutionsForSchedule, indexOrder, exactLogic, rangeLogic ))

  (archs, _) = splitArchs()
  if globalParameters["SeparateArchitectures"] or globalParameters["LazyLibraryLoading"]:
    if "fallback" in masterLibraries.keys():
      for key, value in masterLibraries.items():
        if key != "fallback":
          value.merge(deepcopy(masterLibraries["fallback"]))
      for archName in archs:
        archName = archName.split('-', 1)[0]
        if archName not in masterLibraries:
          print1("Using fallback for arch: " + archName)
          masterLibraries[archName] = deepcopy(masterLibraries["fallback"])
          masterLibraries[archName].version = args.version

      masterLibraries.pop("fallback")
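
The fallback handling in this hunk can be tried in isolation with the rough sketch below. It is not Tensile code: `StubLibrary`, `apply_fallback`, the architecture names, and the version string are hypothetical stand-ins, and `merge` is reduced to appending missing entries. It only mirrors the control flow, assuming architecture names may carry a `-` suffix (for example `gfx90a-xnack+`) that is stripped before the lookup.

```python
from copy import deepcopy


class StubLibrary:
    """Hypothetical stand-in for a Tensile master library (not the real class)."""

    def __init__(self, solutions, version=None):
        self.solutions = list(solutions)
        self.version = version

    def merge(self, other):
        # Simplified merge: append solutions that are not already present.
        for solution in other.solutions:
            if solution not in self.solutions:
                self.solutions.append(solution)


def apply_fallback(master_libraries, archs, version):
    """Mirror the control flow of the patched block on a plain dict."""
    if "fallback" not in master_libraries:
        return master_libraries

    fallback = master_libraries["fallback"]

    # Pre-existing behaviour: tuned architectures also receive the fallback solutions.
    for key, library in master_libraries.items():
        if key != "fallback":
            library.merge(deepcopy(fallback))

    # Behaviour added by this commit: architectures without logic files
    # get their own copy of the fallback library.
    for arch_name in archs:
        arch_name = arch_name.split("-", 1)[0]  # drop any feature suffix
        if arch_name not in master_libraries:
            master_libraries[arch_name] = deepcopy(fallback)
            master_libraries[arch_name].version = version

    master_libraries.pop("fallback")
    return master_libraries


libs = {
    "gfx900": StubLibrary(["tuned_gfx900"]),
    "fallback": StubLibrary(["generic"]),
}
result = apply_fallback(libs, ["gfx900", "gfx1010", "gfx90a-xnack+"], "0.0.0")
print(sorted(result))
```

In this sketch the architectures without a tuned entry (`gfx1010`, `gfx90a`) end up with their own copy of the fallback library, while `gfx900` keeps its tuned solutions and gains the generic one. Each untuned architecture gets a `deepcopy` so that later changes to one library do not leak into another.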
