This repository was archived by the owner on Dec 9, 2024. It is now read-only.
LS optimization: early stop, const arrays in moduleGapSize, simpler logic in isTighterTiltedModules (master version) #301
Equivalent to PR #298, but targeting the master branch. This PR has been tested on the V100 of lnx7188.
Timing
This PR (a5de4d4):
Master (b498de8):
The timing, which the profiler also confirms, is cut by almost 2/3. Part of this large reduction likely comes from the lower register usage, which raises the theoretical occupancy by 20%; the achieved occupancy also goes up by 12.75%.
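To illustrate why fewer registers per thread can raise the theoretical occupancy, here is a minimal sketch of a register-only occupancy model. It assumes the per-SM limits of a Volta-class GPU (65536 32-bit registers, 64 resident warps) and ignores allocation granularity, shared memory, and block-size limits. The function names and the register counts in the usage note are hypothetical and are not taken from this PR's profiler output.

```cpp
#include <algorithm>

// Assumed per-SM limits for a Volta-class GPU such as the V100.
const int kRegistersPerSM = 65536;  // 32-bit registers per SM
const int kMaxWarpsPerSM = 64;      // maximum resident warps per SM
const int kThreadsPerWarp = 32;

// Hypothetical helper: resident warps per SM when registers are the only
// limiting resource (allocation granularity is deliberately ignored).
inline int registerLimitedWarps(int regsPerThread) {
  const int regsPerWarp = regsPerThread * kThreadsPerWarp;
  return std::min(kMaxWarpsPerSM, kRegistersPerSM / regsPerWarp);
}

// Hypothetical helper: theoretical occupancy as a fraction of the
// maximum number of resident warps.
inline double theoreticalOccupancy(int regsPerThread) {
  return static_cast<double>(registerLimitedWarps(regsPerThread)) /
         kMaxWarpsPerSM;
}
```

Under this simplified model, a kernel using 128 registers per thread is limited to 16 resident warps (25% occupancy), while one using 64 registers per thread can hold 32 warps (50%). The real figures for this PR should be read from the profiler reports, since the compiler's actual register allocation and the launch configuration also matter.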
Profiler reports

This PR (a5de4d4) is shown in blue and master (b498de8) in green, with the comparison in parentheses:
Validation plots