CUDA build common lines #1167

garrettwrong · 2024-08-15T20:22:19Z

Adds a CUDA kernel for building common lines. Approx order of magnitude faster before tuning/optimization. Matches Python clmatrix. Have some concerns whether Python matches MATLAB. TBD

Currently has changes from other branches in review.

garrettwrong · 2024-08-19T19:09:11Z

Implemented a better PF memory layout for a pretty good speedup. This takes the 89px JSB2017 problem from ~10hours for a common matrix line build to about 10 minutes.

I'll need to address the unit tests etc next.

garrettwrong · 2024-09-24T18:43:56Z

Happy to report that for JSB 80S 179px class averages I am achieving mean aligned angular distance of 0.36 degrees with the CUDA implementations
at commit a79f8c4 as compared with the published MATLAB code. This is up to transposing the image data, and without S weighting or J weighting on. That is, just the base Sync3N algorithm which include building CL matrix, voting procedure, building S, and the global handedness sync.

These kernels can definitely be further optimized. I plan to revisit the S weighting soon. Couple other things coming up...

garrettwrong · 2024-09-27T19:33:36Z

This will be in 13.1

codecov · 2024-10-10T18:14:52Z

Codecov Report

Attention: Patch coverage is 29.24528% with 75 lines in your changes missing coverage. Please review.

Project coverage is 86.92%. Comparing base (c8a51b8) to head (c47b626).
Report is 45 commits behind head on develop.

Files with missing lines	Patch %	Lines
src/aspire/abinitio/commonline_base.py	37.70%	38 Missing ⚠️
src/aspire/abinitio/commonline_sync3n.py	13.95%	37 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #1167      +/-   ##
===========================================
- Coverage    87.37%   86.92%   -0.46%     
===========================================
  Files          132      132              
  Lines        13639    13735      +96     
===========================================
+ Hits         11917    11939      +22     
- Misses        1722     1796      +74

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

garrettwrong · 2024-10-10T20:00:15Z

First round of self review today. Want to look at it one more time and rerun larger manual tests. Then will open it up.

garrettwrong · 2024-10-11T17:33:21Z

Total (unweighted) Sync3N algorithm is now right around 30 minutes on caf (A100 GPU) for 3000 179x179 single precision images. Doubles around 45m. (80S JSB class averages).

j-c-c

This looks great! Just a couple comments/questions.

src/aspire/abinitio/commonline_base.py

src/aspire/abinitio/commonline_sync3n.py

tests/test_commonline_sync3n.py

garrettwrong · 2024-10-18T14:14:02Z

Looks good. Just a few things.

Also, codecov seems to complain that GPU paths are not tested? I assume this is due to the auth problems you were talking about last week.

Yeah I did see that with the last push. I was hoping 1199 resolved but only fixed some of the problems there.

The GPU tests are running on caf/decaf.

That auth issue impacted all uploads and is why we had no codecov reports at all for a while. However, it isn't why the caf/decaf report isn't there.

In this case, the caf/decaf codecov upload is failing (says report is not found). I will look into it and try to patch in another tiny PR. I think it is probably relating to not using default directories. (defaults probably assumed by codecov). I just enabled ampere reports in 1199 with the hope to reports for the GPU code we've been adding (its not coverage reporting we previously had).

garrettwrong added enhancement New feature or request Optimization Performance or Resource Optimzation GPU labels Aug 15, 2024

garrettwrong self-assigned this Aug 15, 2024

garrettwrong force-pushed the sync3n branch from 1b8eca1 to 21e738c Compare August 19, 2024 13:10

garrettwrong force-pushed the pcl branch from 1e87705 to c20c236 Compare August 19, 2024 19:06

garrettwrong force-pushed the sync3n branch from be1f0b2 to bd34d3d Compare August 27, 2024 15:29

Base automatically changed from sync3n to develop August 28, 2024 14:08

garrettwrong force-pushed the pcl branch 3 times, most recently from 998fb0e to a79f8c4 Compare September 24, 2024 18:31

garrettwrong mentioned this pull request Oct 3, 2024

CUDA/CUPY kernel for building Common Line matrix #1114

Closed

garrettwrong force-pushed the develop branch from a01d211 to 01fdc11 Compare October 9, 2024 13:42

garrettwrong force-pushed the pcl branch from a79f8c4 to 4667f75 Compare October 10, 2024 16:23

garrettwrong force-pushed the pcl branch from 2ab3b24 to 91f79ac Compare October 10, 2024 19:58

This was referenced Oct 10, 2024

Restore Sync3N S_weighting unit test #1190

Closed

Redundant CL and rotation calculations #1198

Closed

garrettwrong changed the title ~~WIP: CUDA build common lines~~ CUDA build common lines Oct 11, 2024

garrettwrong force-pushed the pcl branch from 91f79ac to be1089c Compare October 11, 2024 17:34

garrettwrong marked this pull request as ready for review October 11, 2024 17:37

garrettwrong requested a review from janden as a code owner October 11, 2024 17:37

garrettwrong requested a review from j-c-c October 11, 2024 19:41

j-c-c reviewed Oct 15, 2024

View reviewed changes

src/aspire/abinitio/commonline_base.py Outdated Show resolved Hide resolved

src/aspire/abinitio/commonline_base.py Show resolved Hide resolved

src/aspire/abinitio/commonline_sync3n.py Outdated Show resolved Hide resolved

tests/test_commonline_sync3n.py Show resolved Hide resolved

garrettwrong requested a review from j-c-c October 16, 2024 19:46

garrettwrong added 23 commits October 18, 2024 09:29

split kernels

306e35e

general cleanup

18b0548

threads over k

2f54330

continue cleanup threads over k

4b663fd

fix j<i bound bug

0e219cc

fix adative param oversight bug

3eaef56

parallel case bug

1748a69

C order, sigh

90ff5b3

remove unused vars from build cl kernel

627d4c5

remove unused vars from build cl kernel

230fd0f

continue removing unused vars

b64164e

update constants

58fdb60

add single precision build CL kernel and launching code

7e6415d

revert accidental config commit

a570155

cleanup base cuda code a little

bc4c3b8

self review cleanup

c4ca481

use adaptive width mode for sync3n tests

1631cff

add additional sync3n code paths

706c2bd

must use smaller shift step for unit test size problem

f2025c8

Remove missed debug string change

13cc09f

Remove range(0,...)

78030fe

change var name from dist to xcorr

6a99b05

Remove kernel timing

c47b626

garrettwrong dismissed j-c-c’s stale review via c47b626 October 18, 2024 14:02

garrettwrong force-pushed the pcl branch from ed4374c to c47b626 Compare October 18, 2024 14:02

garrettwrong requested a review from janden October 18, 2024 15:29

janden approved these changes Oct 21, 2024

View reviewed changes

garrettwrong merged commit 8011e3a into develop Oct 21, 2024
36 checks passed

garrettwrong deleted the pcl branch October 21, 2024 13:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA build common lines #1167

CUDA build common lines #1167

Uh oh!

garrettwrong commented Aug 15, 2024

Uh oh!

garrettwrong commented Aug 19, 2024

Uh oh!

garrettwrong commented Sep 24, 2024 •

edited

Loading

Uh oh!

garrettwrong commented Sep 27, 2024

Uh oh!

codecov bot commented Oct 10, 2024 •

edited

Loading

Uh oh!

garrettwrong commented Oct 10, 2024

Uh oh!

garrettwrong commented Oct 11, 2024

Uh oh!

j-c-c left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

garrettwrong commented Oct 18, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

CUDA build common lines #1167

CUDA build common lines #1167

Uh oh!

Conversation

garrettwrong commented Aug 15, 2024

Uh oh!

garrettwrong commented Aug 19, 2024

Uh oh!

garrettwrong commented Sep 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

garrettwrong commented Sep 27, 2024

Uh oh!

codecov bot commented Oct 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

garrettwrong commented Oct 10, 2024

Uh oh!

garrettwrong commented Oct 11, 2024

Uh oh!

j-c-c left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

garrettwrong commented Oct 18, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

garrettwrong commented Sep 24, 2024 •

edited

Loading

codecov bot commented Oct 10, 2024 •

edited

Loading