Batch class avg #1216
Codecov Report
Attention: Patch coverage is

Additional details and impacted files

@@            Coverage Diff             @@
##           develop    #1216     +/-   ##
===========================================
- Coverage    90.66%   90.63%   -0.04%
===========================================
  Files          132      132
  Lines        13707    13702      -5
===========================================
- Hits         12428    12419      -9
- Misses        1279     1283      +4
Passing along for initial review. I believe #1214 should go in first, then this will need to be rebased, maybe resolving some conflicts along the way.
d70ad16 to 6d6f889
Looks good!
Looks great! Just two things.
# for the argmax alignment test.
base_img = _coef[0].reshape(self.alignment_basis.complex_count, 1)

# (cnt, n_transl) * (cnt, 1) -> (cnt, n_transl)
(cnt, n_rot) * (cnt, 1) -> (cnt, n_rot)?
Sure :), changed.
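For readers following the shape comment, here is a minimal NumPy sketch of the broadcast being discussed. It is not the code from this PR: `rot_table`, the sizes `cnt` and `n_rot`, and the random data are assumptions chosen only to illustrate how a `(cnt, n_rot)` array multiplied by a `(cnt, 1)` column yields a `(cnt, n_rot)` result.

```python
import numpy as np

cnt, n_rot = 4, 6  # hypothetical sizes, for illustration only

rng = np.random.default_rng(0)

# Hypothetical stand-in for a table of per-coefficient rotation phases.
rot_table = np.exp(1j * rng.uniform(0, 2 * np.pi, size=(cnt, n_rot)))

# Column vector of complex coefficients, reshaped to (cnt, 1) like base_img.
base_img = (rng.standard_normal(cnt) + 1j * rng.standard_normal(cnt)).reshape(cnt, 1)

# (cnt, n_rot) * (cnt, 1) -> (cnt, n_rot)
# The single column of base_img is broadcast across all n_rot columns.
rotated = rot_table * base_img

print(rotated.shape)  # (4, 6)
```

Each column of `rotated` is the base coefficient vector multiplied elementwise by one column of the table, which is why the second axis in the comment should name whatever the table's columns index.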
I want to discuss/confirm the class hierarchy during our meeting, and if no changes come out of that I'll merge this then.
Factored the base image rotation table outside the shift loop like we discussed in the meeting.
Cool. Any speedup?
Yes. In GPU mode, for 179px images and 50 nbrs, this saves about 0.3 s per class (44.5 s vs 44.8 s). I'll call it 1%. I think the vector-vector broadcast multiplication is very fast on the GPU relative to the matmuls. On the host the improvement is much larger, more like 10%. Either way, across 3-100k classes I'll definitely take it :). Thanks.
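The refactor described above is a loop-invariant hoist: the rotation table depends only on the base image, so it can be built once rather than once per shift. The sketch below is a hypothetical illustration of that pattern, not the implementation in this PR; the function names, the scoring rule (real part of an inner product), and the array shapes are all assumptions.

```python
import numpy as np


def best_alignment_naive(base, rot_phases, shift_phases, nbr):
    """Rebuild the base rotation table inside the shift loop."""
    best_score, best_idx = -np.inf, None
    for s, shift in enumerate(shift_phases):
        # (cnt, n_rot) * (cnt, 1) -> (cnt, n_rot), recomputed on every pass
        rot_table = rot_phases * base[:, None]
        scores = np.real(rot_table.conj().T @ (nbr * shift))  # (n_rot,)
        r = int(np.argmax(scores))
        if scores[r] > best_score:
            best_score, best_idx = scores[r], (s, r)
    return best_idx


def best_alignment_hoisted(base, rot_phases, shift_phases, nbr):
    """Build the base rotation table once, outside the shift loop."""
    # Does not depend on the shift, so it is computed a single time.
    rot_table = rot_phases * base[:, None]
    best_score, best_idx = -np.inf, None
    for s, shift in enumerate(shift_phases):
        scores = np.real(rot_table.conj().T @ (nbr * shift))  # (n_rot,)
        r = int(np.argmax(scores))
        if scores[r] > best_score:
            best_score, best_idx = scores[r], (s, r)
    return best_idx


# Hypothetical sizes and random data, for illustration only.
cnt, n_rot, n_shift = 8, 12, 5
rng = np.random.default_rng(1)
base = rng.standard_normal(cnt) + 1j * rng.standard_normal(cnt)
nbr = rng.standard_normal(cnt) + 1j * rng.standard_normal(cnt)
rot_phases = np.exp(1j * rng.uniform(0, 2 * np.pi, size=(cnt, n_rot)))
shift_phases = np.exp(1j * rng.uniform(0, 2 * np.pi, size=(n_shift, cnt)))

naive = best_alignment_naive(base, rot_phases, shift_phases, nbr)
hoisted = best_alignment_hoisted(base, rot_phases, shift_phases, nbr)
print(naive == hoisted)
```

Both functions return the same `(shift_index, rotation_index)` pair; the hoisted version only skips the repeated broadcast multiply, which is consistent with the saving being small when matmuls dominate the runtime.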
Stashing WIP before vacation. Still needs a lot of cleanup and testing.