Sync3n Initial Add #1108

garrettwrong · 2024-04-02T16:02:46Z

See #1107. This adapts some of the CL code brought over for Cn molecules, which used the 3N sync matrix method, to the C1 case.

Todo

Add S weighting, as was optional in the Matlab code.
Add J weighting, as was optional in the Matlab code.
Speedup; it's borderline unusable for experimental runs and gives little feedback...
Docstrings
Tests
Refactor with other 3N based classes. Probably they (Cn) could inherit from this Sync3N one.

I quickly hacked this together, just enough to function minimally. I'm planning to take on todo items 1-3 (S weighting, J weighting, speedup) while Josh is out, then pass it along to him to finish integrating.

garrettwrong · 2024-04-12T16:28:22Z

Adding some notes.

I've created a CUDA kernel for the slow operation signs_times_v, minimal (cupy) launching code, and some simple auto-enabling configuration. The speedups are 100x+ for relevant problems. Presently I am keeping that work in this branch (off of sync3n): https://github.com/ComputationalCryoEM/ASPIRE-Python/tree/sync3n_cupy
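For readers unfamiliar with this pattern, here is a minimal sketch of a cupy-launched CUDA kernel with automatic host fallback. It is not the signs_times_v kernel from this branch: the `scale` kernel, the helper names, and the `GPU_AVAILABLE` variable are hypothetical stand-ins chosen for illustration. The `disable_gpu` argument only mirrors the name of the flag added later in this PR. The sketch assumes a contiguous 1-D, non-empty float64 input.

```python
import numpy as np

try:
    import cupy as cp

    # Treat "cupy imports but no CUDA device is visible" like "no cupy".
    GPU_AVAILABLE = cp.cuda.runtime.getDeviceCount() > 0
except Exception:
    cp = None
    GPU_AVAILABLE = False

# Toy CUDA kernel (hypothetical stand-in, not the kernel from this PR).
_SCALE_SRC = r"""
extern "C" __global__
void scale(const double* x, const double a, const int n, double* out)
{
    const int i = blockDim.x * blockIdx.x + threadIdx.x;
    if (i < n) {
        out[i] = a * x[i];
    }
}
"""


def _scale_host(x, a):
    """Reference NumPy implementation."""
    return a * x


def _scale_gpu(x, a):
    """Launch the toy kernel through cupy and copy the result back to the host."""
    kernel = cp.RawKernel(_SCALE_SRC, "scale")
    x_dev = cp.asarray(x, dtype=cp.float64)
    out_dev = cp.empty_like(x_dev)
    n = x_dev.size
    threads = 256
    blocks = (n + threads - 1) // threads  # enough blocks to cover n elements
    kernel((blocks,), (threads,), (x_dev, np.float64(a), np.int32(n), out_dev))
    return cp.asnumpy(out_dev)


def scale(x, a, disable_gpu=False):
    """Use the GPU path when a device is available, unless explicitly disabled."""
    if GPU_AVAILABLE and not disable_gpu:
        return _scale_gpu(x, a)
    return _scale_host(x, a)
```

Calling `scale(np.arange(5.0), 2.0)` returns the same values on either path, which is what makes a host-versus-GPU comparison test possible.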

There are some concerns/bugs between the Python, MATLAB, and MEX versions that need to be addressed, but they would be minor changes. Josh might be working them out in his current D2 work now anyway.

The original MEX codes with this thread-loop structure all appear to have a common threading ABA bug... I have fixed that in my CUDA version.

I will be able to mimic the kernel and supporting code I wrote for the other similar MEX codes (probability and triangle scores) for the same speedups, but I need to work on a few other projects first before I come back to that.

garrettwrong · 2024-04-24T21:14:48Z

Some updates.

Today I was able to complete a rough draft of the pairs_probabilities CUDA kernel, cupy launcher, and dispatch code for that function. Speedups are similar to the last kernel, 100x+. Next is the triangle_scores kernel, which should be similar.

After that I can move on to merging the _cupy branch into this branch and cleaning things up. With the accelerated code we should be able to actually test the Sync3N features. FWIW, the large experiment using the pure python/numpy sync3n implementation was kicked off before I started writing the kernel code... and the handedness synchronization was only halfway done yesterday...

garrettwrong · 2024-04-24T23:18:27Z

For 1000 images ...

Size:	1000
Allclose inds?  True
Allclose arb?  True
gpu_time: 7.763748619996477
host_time: 8093.168460645
speedup: 1042.4305135022098
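The speedup figure is simply host_time divided by gpu_time. A rough sketch of how such a comparison could be produced is below. The helper names `timed`, `speedup`, and `compare` are hypothetical, and the two functions being compared are NumPy stand-ins rather than the real host and GPU implementations. When timing a real GPU function, the result should be copied back to the host (or the device synchronized) before the timer stops, since kernel launches are asynchronous.

```python
import time

import numpy as np


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed seconds)."""
    start = time.perf_counter()
    out = fn(*args)
    return out, time.perf_counter() - start


def speedup(host_time, gpu_time):
    """Ratio of host wall time to GPU wall time."""
    return host_time / gpu_time


def compare(host_fn, gpu_fn, *args):
    """Time both implementations and check that their outputs agree."""
    host_out, host_time = timed(host_fn, *args)
    gpu_out, gpu_time = timed(gpu_fn, *args)
    return {
        "allclose": bool(np.allclose(host_out, gpu_out)),
        "host_time": host_time,
        "gpu_time": gpu_time,
    }


# Stand-in functions: two equivalent NumPy cumulative sums.
x = np.linspace(0.0, 1.0, 1000)
report = compare(np.cumsum, np.add.accumulate, x)
print("Allclose?", report["allclose"])

# Reproduce the ratio from the timings reported above.
print("speedup:", speedup(8093.168460645, 7.763748619996477))
```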

garrettwrong · 2024-04-26T14:53:09Z

Triangle scores is also fast now:

Size:   1000
Allclose cum_scores?  True
Allclose hist_scores?  True
gpu_time: 4.986900245989091
host_time: 6058.410942732007
speedup: 1214.8650752749106

Added a test file to compare between the host python and cupy launched CUDA codes.

Broke the CUDA codes out into their own source file so that it is easier to keep it clean.

Merged sync3n_cupy into this branch.

garrettwrong · 2024-04-29T18:23:51Z

Fixed up some typing concerns.

Added some minimal docstrings.

Added a unit test that conditionally compares the host (numpy) implementation to the GPU one (cupy-launched CUDA). This test is coded to skip automatically when cupy cannot be imported.
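As an illustration of the conditional skip, here is a small sketch using only the standard library's unittest. It is not the test added in this PR: the class, the `host_sum_of_squares` function, and `run_suite` are hypothetical. It also assumes a slightly stricter condition than described above, skipping when no CUDA device is visible as well as when cupy cannot be imported.

```python
import unittest

import numpy as np

try:
    import cupy as cp

    # Stricter than "importable": also require a visible CUDA device.
    HAVE_GPU = cp.cuda.runtime.getDeviceCount() > 0
except Exception:
    cp = None
    HAVE_GPU = False


def host_sum_of_squares(x):
    """Host (NumPy) reference implementation."""
    return float(np.sum(x * x))


@unittest.skipUnless(HAVE_GPU, "cupy or a CUDA device is not available")
class HostVsGpuCase(unittest.TestCase):
    def test_matches_host(self):
        x = np.linspace(0.0, 1.0, 101)
        gpu = float(cp.sum(cp.asarray(x) ** 2))
        self.assertTrue(np.isclose(gpu, host_sum_of_squares(x)))


def run_suite():
    """Run the case; a skipped test still counts as a successful run."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(HostVsGpuCase)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

On a machine without cupy the case is reported as skipped rather than failed, so the same test file can run in CPU-only CI.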

garrettwrong · 2024-05-07T17:52:44Z

Added 3 tests to compare dummy data with results from MATLAB as a sanity check of the low level function port.

Added some minimal upcasting to allow running the underlying sync3n methods in both singles and doubles. This is tested with a parameterized fixture. I would still recommend using doubles with our CL codes for now.
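The upcasting idea can be sketched as follows: accept single or double precision input, do the numerically sensitive work in float64, and cast the result back to the caller's dtype. This toy power iteration is an assumption for illustration only; it is not the sync3n code, and the function name and parameters are hypothetical.

```python
import numpy as np


def power_iteration(A, n_iter=200):
    """Leading eigenvector of a symmetric matrix.

    Computes in float64 regardless of input dtype, then returns
    the result in the dtype of the input.
    """
    in_dtype = A.dtype
    A64 = A.astype(np.float64, copy=False)  # upcast singles, no copy for doubles
    v = np.ones(A64.shape[0], dtype=np.float64)
    v /= np.linalg.norm(v)
    for _ in range(n_iter):
        w = A64 @ v
        v = w / np.linalg.norm(w)
    return v.astype(in_dtype, copy=False)  # cast back to the caller's dtype


# The same call works for singles and doubles.
A32 = np.array([[2.0, 0.0], [0.0, 1.0]], dtype=np.float32)
v32 = power_iteration(A32)
v64 = power_iteration(A32.astype(np.float64))
```

The toy start vector of all ones would fail if it were orthogonal to the leading eigenvector; a real implementation would choose its initial vector more carefully.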

garrettwrong · 2024-07-03T14:29:15Z

Added a unit test based on CL Sync 2N and tried to complete most of the docstrings.

Found a small bug in the CL sync tests (it doesn't appear they were actually testing both dtypes, just running doubles twice).

j-c-c

This is great! Just a few things.

src/aspire/abinitio/commonline_sync3n.py

tests/test_orient_sync_voting.py

garrettwrong · 2024-07-22T15:27:45Z

Going to wait until after hack24 is reviewed/merged to retest and move this forward. (They both touch/have gpu/cupy code.)

garrettwrong · 2024-07-30T19:15:52Z

Running with JSB2017 Matlab class averages.

garrettwrong · 2024-08-01T13:38:19Z

Running with JSB2017 Matlab class averages.

For the 80s run at 89 pixels the S weighting solve was infeasible (I'll look into that more; it happens too much). Without S weighting we did produce a reasonable molecule. Average aligned rotation error was ~16° wrt MATLAB, FSC was 80 Å wrt EMDB. Not very good, but better than before...

I noticed MATLAB was run using twice as many n_theta and must also have been using non-default shift steps. I am retrying with those changes. Unfortunately I have not found the actual configuration used in the MATLAB logs yet... still looking. I'm basing the values on lines from areas of the log, but it's not certain they were the same values for the different functions. Anyway, I'm hoping this will at least yield some improvement.

garrettwrong added the enhancement (New feature or request) and Optimization (Performance or Resource Optimization) labels Apr 2, 2024

garrettwrong self-assigned this Apr 2, 2024

garrettwrong force-pushed the sync3n branch 2 times, most recently from b522147 to acf553d April 4, 2024 15:37

garrettwrong force-pushed the sync3n branch from ae439a1 to f20c45b April 9, 2024 19:49

garrettwrong added the GPU label Apr 12, 2024

garrettwrong force-pushed the sync3n branch 2 times, most recently from c36fec1 to b05d4c2 April 26, 2024 19:41

garrettwrong force-pushed the develop branch from 7c0c43b to e8c2d8d July 1, 2024 12:09

garrettwrong force-pushed the sync3n branch from a8827cb to 47e6594 July 3, 2024 14:24

garrettwrong changed the title ~~WIP: Sync3n~~ Sync3n Initial Add Jul 3, 2024

garrettwrong requested a review from j-c-c July 3, 2024 15:58

j-c-c requested changes Jul 18, 2024

garrettwrong force-pushed the sync3n branch from 47e6594 to b79af20 July 19, 2024 12:54

garrettwrong force-pushed the sync3n branch 3 times, most recently from 97b4368 to df83b6e July 30, 2024 13:10

garrettwrong force-pushed the sync3n branch from df83b6e to 2cd7cf6 August 9, 2024 16:33

garrettwrong marked this pull request as ready for review August 9, 2024 19:11

garrettwrong added 22 commits August 27, 2024 09:36

looks like this actually needs double precision.

8161a21

fix precision bug in CL sync3n power method.

accdc93

fixup some of the dtypes

14dfb1c

conditionally run host-gpu comparison

1af210f

add MATLAB comparison tests

a53cd20

Allow sync3n methods to run in singles via upcasting

73e3614

Update some docstrings

0a91eec

initial add cl sync3n test

51ffdac

add minimal test

4c5102c

actually test the different dtypes

d0c2c0d

mark float64 and odd sync3n as expensive

52a099e

first pass addressing review remarks

1124033

move initial rotation estimate lines into estimate_rotations

61c6a89

important progress bar

d8b03bf

Use trust region method for S weight least squares

d4bf0bb

use class mangled names for gpu methods

e177486

typo

8a2cd53

Add disable_gpu sync3n flag

a89fb8f

P->W typo

3d8da44

use more specific language instead of resolution

ff56876

Replace histogram logic

a7f77b9

factor out sync3n score body

bd34d3d

garrettwrong force-pushed the sync3n branch from be1f0b2 to bd34d3d August 27, 2024 15:29

garrettwrong requested a review from janden August 27, 2024 17:07

janden previously approved these changes Aug 27, 2024

garrettwrong added 2 commits August 28, 2024 08:01

Revert "factor out sync3n score body"

068ec9a

This reverts commit bd34d3d.

black style

f094f03

garrettwrong dismissed janden’s stale review via f094f03 August 28, 2024 12:04

garrettwrong merged commit 758df6b into develop Aug 28, 2024

garrettwrong deleted the sync3n branch August 28, 2024 14:08
