[Small Feature] Parallelize KD Tree build #4559

sgiraudot · 2020-03-05T10:57:22Z

Rationale

Many algorithms (notably in the Point Set Processing package) rely on the Spatial Searching package for fast neighbor queries. Although many PSP algorithms take advantage of trivial parallelisation to speed-up computation by querying the kd tree in parallel for each point, a substantial time can be spend building the kd tree, especially for very large point clouds. Up to a third of the computation time of algorithms like CGAL::jet_estimate_normals() can be spent building the kd tree when queries are done in parallel.

This feature adds a parallel variant of the KD_tree::build() member function. On a quad-core processor, this parallel algorithm is experimentally 2 to 3 times faster than the sequential one, depending on the point cloud.

Summary of API changes

A template parameter ConcurrencyTag is added to the KD_tree:build() member function, with possible values being CGAL::Sequential_tag, CGAL::Parallel_tag and CGAL::Parallel_if_available_tag.

For backward compatibility, an overload of KD_tree::build() without template parameter is provided, falling back to the sequential version. The build() called internally if the tree is not built at first query is also the sequential one. The rational is that parallelism should always be explicitly specified by users (also to avoid hidden changes of behavior in existing codes).

License and copyright ownership

(No change)

CHANGES.md

(todo)

Submission

Version 1 (outdated)

User manual:
- The first example is updated with an explicit call to build() with CGAL::Parallel_if_available_tag
- A paragraph is added to the kd tree design section
Reference manual

Version 2

User manual:
- An additional example is given with an explicit call to build() with CGAL::Parallel_tag and a TBB loop for queries
- The Performance section is updated and a new array for the parallel version is added
Reference manual

Status

Developed and locally tested (GNU/Linux)
Small Feature Pre-approved -- Andreas Fabri 2 April 2020 (CEST)

Spatial_searching/include/CGAL/Kd_tree.h

afabri · 2020-03-11T16:00:29Z

I would suggest to remove the parallel tag in the very first example. There should be a section at the end on parallelism, where you build with the parallel tag and at the same time show a parallel for loop for performing parallel queries.

afabri · 2020-03-11T16:06:59Z

Why is build() called before a removal function? I know this is also written in master.

sgiraudot · 2020-03-16T10:36:06Z

I updated the doc with a Version 2 following @afabri's review.

I'm currently investigating the performances behavior with respect to the bucket size: we present results with the default value (10) and with 20, and the performances with 20 are always better in our performance table (it was already the case before this small feature), which probably makes users wonder why 20 is not the default value.

sgiraudot · 2020-03-16T13:17:59Z

So I did some more experiment regarding the computation time VS bucket size, and it turns out that for the 800k points case used in the manual, the bucket size that gives the shortest computation time is more around 100:

(Violet is sequential, green is parallel.)

The manual does explain that the bucket size should increase with the size of the point cloud, which is confirmed by the following experiment: if I randomly simplify the point cloud to keep only 10% of points (80k), then the optimal bucket size drops to around 50.

I'm not sure what we should do about this. It looks like the bucket size could be roughly estimated as a factor of the logarithm of the number of points, but do we want to include such an automatic selection in the KD Tree? Or do we keep it as is, meaning it's the responsibility of users to choose the best bucket size according to their needs?

…ree_build-GF' into Spatial_searching-Parallelize_kd_tree_build-GF

mglisse · 2020-04-16T15:47:03Z

The optimal bucket size may also depend on the kernel (and in particular the dimension).

mglisse · 2020-04-16T15:56:21Z

I've checked the code, and remove() does not actually call build(),

remove calls root which calls build if necessary.

…lize_kd_tree_build-GF [Small Feature] Parallelize KD Tree build

maxGimeno · 2020-04-21T15:12:58Z

Successfully tested in https://cgal.geometryfactory.com/CGAL/testsuite/results-5.1-Ic-130.shtml

sloriot

Change.md must be updated + I agree that the commented code in remove should be enabled.

Spatial_searching/doc/Spatial_searching/Spatial_searching.txt

sloriot · 2020-04-27T08:04:25Z

Spatial_searching/include/CGAL/Kd_tree_node.h

@@ -51,15 +51,12 @@ namespace CGAL {
    typedef typename Kdt::iterator iterator;
    typedef typename Kdt::D D;

-    bool leaf;


is that change required in this PR ?

it is not clear to me that this will not negatively impact the performance so except if you show that it is faster you should undo that change.

Co-Authored-By: Sebastien Loriot <sloriot.ml@gmail.com>

…ree_build-GF' into Spatial_searching-Parallelize_kd_tree_build-GF

mglisse · 2020-04-27T09:22:53Z

I agree that the commented code in remove should be enabled.

Did you read my comment in this PR? Do you disagree with it?

sloriot · 2020-04-27T09:35:47Z

I agree that the commented code in remove should be enabled.

Did you read my comment in this PR? Do you disagree with it?

Sorry I missed it.

maxGimeno · 2020-05-04T15:06:45Z

Successfully tested in https://cgal.geometryfactory.com/CGAL/testsuite/results-5.1-Ic-139.shtml.

sgiraudot added 6 commits March 4, 2020 11:05

First version of parallel Kd_tree:build()

6e629a9

Remove useless boolean

fac53dc

Some notes about parallelism in KD Tree

0661542

Fix kd tree node

2818986

Example with parallel build

3f28ea9

Document parallel build

ab3f714

sgiraudot added Not yet approved The feature or pull-request has not yet been approved. Small feature Pkg::Spatial_searching tested manually on Linux labels Mar 5, 2020

Update doc

e17378e

sloriot reviewed Mar 5, 2020

View reviewed changes

Spatial_searching/include/CGAL/Kd_tree.h Outdated Show resolved Hide resolved

sloriot reviewed Mar 5, 2020

View reviewed changes

Spatial_searching/include/CGAL/Kd_tree.h Outdated Show resolved Hide resolved

sgiraudot added 3 commits March 5, 2020 13:40

Clean garbage

bd1c509

Remove now useless workaround

e716d90

Include parallel KD tree build in classification

fe90d1c

MaelRL added the CHANGES.md not updated label Mar 9, 2020

MaelRL reviewed Mar 9, 2020

View reviewed changes

Spatial_searching/include/CGAL/Kd_tree.h Show resolved Hide resolved

sgiraudot added 4 commits March 12, 2020 12:21

Improve doc from review with new example for parallel KD tree

35c838d

Use emplace_back()

9ab9081

Update performance section

4bc2e46

Fix markdown

857eb65

sloriot added 2 commits March 26, 2020 19:35

Update branch from master after trailing whitespaces and tabs removal

f1e5569

extra run of the script to remove tabs and trailing whitespaces

d42113b

MaelRL changed the base branch from master to releases/CGAL-4.14-branch March 26, 2020 19:59

MaelRL changed the base branch from releases/CGAL-4.14-branch to master March 26, 2020 19:59

MaelRL added Accepted small feature and removed Tests failing labels Apr 15, 2020

maxGimeno added the Under Testing label Apr 16, 2020

Merge remote-tracking branch 'mine/Spatial_searching-Parallelize_kd_t…

95b9f05

…ree_build-GF' into Spatial_searching-Parallelize_kd_tree_build-GF

sgiraudot changed the base branch from master to releases/CGAL-5.0-branch April 16, 2020 15:04

sgiraudot changed the base branch from releases/CGAL-5.0-branch to master April 16, 2020 15:04

maxGimeno added a commit to maxGimeno/cgal that referenced this pull request Apr 21, 2020

Merge pull request CGAL#4559 from sgiraudot/Spatial_searching-Paralle…

ae51708

…lize_kd_tree_build-GF [Small Feature] Parallelize KD Tree build

maxGimeno added the Tested label Apr 21, 2020

sloriot reviewed Apr 27, 2020

View reviewed changes

sgiraudot and others added 3 commits April 27, 2020 10:15

Fix version

a6dc66f

Co-Authored-By: Sebastien Loriot <sloriot.ml@gmail.com>

Merge remote-tracking branch 'mine/Spatial_searching-Parallelize_kd_t…

7702f57

…ree_build-GF' into Spatial_searching-Parallelize_kd_tree_build-GF

Update CHANGES.md

e0936d2

Reintroduce bool leaf in Kd_tree_node

bd08ba8

sloriot changed the base branch from master to releases/CGAL-4.14-branch April 27, 2020 09:34

sloriot changed the base branch from releases/CGAL-4.14-branch to master April 27, 2020 09:34

sloriot removed the Tested label Apr 27, 2020

maxGimeno added the Tested label May 4, 2020

sloriot removed the CHANGES.md not updated label May 5, 2020

sloriot self-assigned this May 5, 2020

sloriot removed the Under Testing label May 5, 2020

sloriot merged commit c602038 into CGAL:master May 5, 2020

sloriot deleted the Spatial_searching-Parallelize_kd_tree_build-GF branch May 5, 2020 12:35

MaelRL removed the Ready to be tested label May 5, 2020

lrineau added the Merged_in_5.1 label Sep 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Small Feature] Parallelize KD Tree build #4559

[Small Feature] Parallelize KD Tree build #4559

sgiraudot commented Mar 5, 2020 •

edited by MaelRL

afabri commented Mar 11, 2020

afabri commented Mar 11, 2020

sgiraudot commented Mar 16, 2020 •

edited

sgiraudot commented Mar 16, 2020

mglisse commented Apr 16, 2020

mglisse commented Apr 16, 2020

maxGimeno commented Apr 21, 2020

sloriot left a comment •

edited

sloriot Apr 27, 2020

sgiraudot Apr 27, 2020

sloriot Apr 27, 2020

mglisse commented Apr 27, 2020

sloriot commented Apr 27, 2020

maxGimeno commented May 4, 2020

[Small Feature] Parallelize KD Tree build #4559

[Small Feature] Parallelize KD Tree build #4559

Conversation

sgiraudot commented Mar 5, 2020 • edited by MaelRL

Rationale

Summary of API changes

License and copyright ownership

CHANGES.md

Submission

Version 1 (outdated)

Version 2

Status

afabri commented Mar 11, 2020

afabri commented Mar 11, 2020

sgiraudot commented Mar 16, 2020 • edited

sgiraudot commented Mar 16, 2020

mglisse commented Apr 16, 2020

mglisse commented Apr 16, 2020

maxGimeno commented Apr 21, 2020

sloriot left a comment • edited

Choose a reason for hiding this comment

sloriot Apr 27, 2020

Choose a reason for hiding this comment

sgiraudot Apr 27, 2020

Choose a reason for hiding this comment

sloriot Apr 27, 2020

Choose a reason for hiding this comment

mglisse commented Apr 27, 2020

sloriot commented Apr 27, 2020

maxGimeno commented May 4, 2020

sgiraudot commented Mar 5, 2020 •

edited by MaelRL

sgiraudot commented Mar 16, 2020 •

edited

sloriot left a comment •

edited