Convert accelerated dice coefficient wrapper code to Cython #363

hardbyte · 2020-10-19T11:09:15Z

Creates a cython wrapper for the similarity comparison C++ code and removes the cffi compiler.

I noticed that the hot loop still grows three dynamically resizing arrays (the similarities array and the two indices arrays). Moving that into the C/C++ code is not trivial, but it would speed things up.

Closes #168

Create a cython wrapper for the similarity comparison CPP code. Closes #168

codecov · 2020-10-19T11:22:53Z

Codecov Report

Merging #363 into master will decrease coverage by 0.06%.
The diff coverage is 93.75%.

@@            Coverage Diff             @@
##           master     #363      +/-   ##
==========================================
- Coverage   94.63%   94.57%   -0.07%     
==========================================
  Files          16       16              
  Lines         802      792      -10     
==========================================
- Hits          759      749      -10     
  Misses         43       43

- drop numpy in favour of using array.array

Raise the threshold dynamically. Once we've found k scores above the threshold use the lowest of those as the new threshold.

wilko77 · 2020-10-20T04:39:55Z

/AzurePipelines run

azure-pipelines · 2020-10-20T04:40:17Z

Azure Pipelines successfully started running 1 pipeline(s).

Update libpopcount.h

hardbyte · 2020-10-21T01:24:20Z

So it looks like all the tests now pass on Windows, OSX, and Linux. The CI failure comes from the step that sends code coverage.

wilko77

Thanks for that. Looks pretty good, well, not that I am an authority when it comes to cython...
You may want to add a line to the changelog.
Otherwise, see comments.

setup.py

anonlink/benchmark.py

wilko77 · 2020-10-29T03:20:46Z

setup.py

@@ -34,46 +33,59 @@

 current_os = platform.system()
 if current_os == "Windows":
-    extra_compile_args = ['/std:c++17', '/O2', '/arch:AVX512']
+    # '/arch:AVX512' or '/arch:AVX2'


why did you take the arch:AVX512 out? Maybe expand the comment somewhat...

I read the compilation instructions in https://github.com/kimwalisch/libpopcnt - turns out the flags are not required as the CPU support checks are done at runtime.

anonlink/similarities/_dice.pyx

wilko77 · 2020-10-29T03:25:15Z

anonlink/similarities/_dice.pyx

+        # Note array.extend_buffer requires the gil to extend the array
+        # `.data.as_chars` gives us direct access to the underlying contiguous C array
+        # array.extend_buffer(result_sims, c_scores.data.as_chars, matches)
+        # array.extend_buffer(result_indices0, i_buffer.data.as_chars, matches)
+        # array.extend_buffer(result_indices1, c_indices.data.as_chars, matches)


what's with this section?

Something I was trying. It added complexity and didn't let me make this function GIL-free or any faster, so I went back to the simpler array.extend method below. I'll remove the commented-out code.
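For readers following along, here is a rough Python sketch of the "simpler array.extend" pattern being discussed: three growable `array.array` buffers that collect scores and the two index columns. The names `result_sims`, `result_indices0` and `result_indices1` come from the diff above; the function name `collect_matches`, the typecodes, and the shape of the input batches are assumptions made for illustration, not the code in `_dice.pyx`.

```python
from array import array


def collect_matches(candidate_batches):
    """Accumulate per-row matches into three flat, growable arrays.

    `candidate_batches` is a hypothetical input: one (scores, indices)
    pair per row of the first dataset.
    """
    result_sims = array('d')      # similarity scores (C doubles)
    result_indices0 = array('I')  # row index in the first dataset
    result_indices1 = array('I')  # row index in the second dataset

    for i, (scores, indices) in enumerate(candidate_batches):
        # array.extend grows the underlying buffer as needed; in Cython this
        # step needs the GIL, which is the limitation mentioned in the review.
        result_sims.extend(scores)
        result_indices0.extend([i] * len(scores))
        result_indices1.extend(indices)

    return result_sims, result_indices0, result_indices1
```

Each call to `extend` may reallocate, which is why the PR description suggests doing the accumulation in C/C++ as a possible follow-up.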

wilko77 · 2020-10-29T03:26:58Z

anonlink/similarities/_dice_x86.py

+    carr1.frombytes(b''.join(memoryview(f) for f in filters1))
+
+    # Only worth popcounting in C for a large number of filters1
+    if len(filters1) < 10000:


I'll add a comment. The cutoff was found by benchmarking and noting the size at which the throughput (in cmp/s) dropped.
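As a minimal sketch of the two paths this size check chooses between: packing the filters into one contiguous buffer (as the `frombytes` line in the diff does) for the C popcount, versus counting bits directly in Python for small inputs. The helper names `pack_filters` and `popcounts_python` and the `'B'` typecode are assumptions for illustration; the real `_dice_x86.py` may differ.

```python
from array import array


def pack_filters(filters):
    """Concatenate equal-length byte filters into one contiguous array."""
    packed = array('B')  # unsigned chars; the typecode is an assumption
    packed.frombytes(b''.join(memoryview(f) for f in filters))
    return packed


def popcounts_python(filters):
    """Pure-Python popcount, plausibly cheap enough for a small number of filters."""
    return [bin(int.from_bytes(f, 'big')).count('1') for f in filters]
```

The crossover point between the two (10000 in the diff) is empirical, so it would likely need re-benchmarking on different hardware.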

wilko77 · 2020-10-29T03:52:08Z

anonlink/similarities/dice.cpp

                    top_k_scores.pop();
+                    // threshold can now be raised
+                    dynamic_threshold = temp_node.score;


wait, the top_k_scores is a priority queue where the lowest score has the highest priority? Thank god that's so well documented, otherwise this would be quite confusing...

Added a comment where the priority queue is created
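To make the priority-queue behaviour concrete, here is a small Python sketch of the dynamic threshold idea from this PR, using `heapq` as a min-heap so the lowest retained score sits at the top. It is an illustration under assumptions, not a port of `dice.cpp`: the function name `top_k_above_threshold` and the `(score, index)` tuple layout are hypothetical.

```python
import heapq


def top_k_above_threshold(scores, k, threshold):
    """Return up to k (score, index) pairs with score >= threshold, best first."""
    heap = []  # min-heap: heap[0] is always the lowest score kept so far
    dynamic_threshold = threshold

    for index, score in enumerate(scores):
        if score < dynamic_threshold:
            continue  # cannot make the top k, skip early
        heapq.heappush(heap, (score, index))
        if len(heap) > k:
            heapq.heappop(heap)  # evict the current lowest score
        if len(heap) == k:
            # k candidates found: the lowest of them becomes the new threshold
            dynamic_threshold = heap[0][0]

    return sorted(heap, reverse=True)
```

Keeping the lowest score at the top of the queue is what makes both the eviction and the threshold update cheap: each only needs to look at the head of the heap.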

wilko77

Thanks. That looks great now.

Convert dice_coefficient_x86 glue code to Cython

22431c0

Create a cython wrapper for the similarity comparison CPP code. Closes #168

hardbyte added 3 commits October 20, 2020 13:57

Improve the cython wrapper and update benchmarks

048195a

- drop numpy in favour of using array.array

Small optimization to cpp code

36bf7c4

Raise the threshold dynamically. Once we've found k scores above the threshold use the lowest of those as the new threshold.

Update dependencies

02d4d3b

hardbyte added 2 commits October 21, 2020 00:47

Make it work on windows

399c2ed

Update libpopcount.h

typechecking and testing

43264c9

hardbyte mentioned this pull request Oct 21, 2020

Feature arbitrary size encodings #366

Merged

wilko77 requested changes Oct 29, 2020

hardbyte added 2 commits October 31, 2020 09:11

Address review comments

5d7bd10

Merge branch 'master' into cython-similarities

828fbf5

hardbyte requested a review from wilko77 October 31, 2020 19:41

wilko77 approved these changes Nov 2, 2020

wilko77 merged commit d8ec9df into data61:master Nov 2, 2020
