
Implement parallel::partition_copy. #2716

Merged: 14 commits into STEllAR-GROUP:master on Jul 10, 2017

Conversation

taeguk (Member) commented Jun 25, 2017

Check Box

Issue List

Note

2017/6/25

I implemented partition_copy with reference to copy_if.
The behavior is correct, but the benchmark results are very bad.
I have to find a more efficient approach and re-implement partition_copy.

2017/7/4

The bad performance is due to #2325.
When policy.executor() is used instead of hpx::launch::sync in the scan_partitioner, the performance is good (https://gist.github.com/taeguk/6abe03f9b4cb878872d2bb634cae65b0).
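
For illustration only (this is not the actual scan_partitioner code, and the spawn_stage helper as well as the exact hpx::async overloads used here are assumptions), the change described above boils down to whether a stage is forced to run synchronously or handed to the policy's executor:

#include <hpx/include/async.hpp>

#include <utility>

// Rough sketch of the idea only, not HPX's partitioner code.
template <typename ExPolicy, typename F>
auto spawn_stage(ExPolicy && policy, F && f)
{
    // Before: the stage is forced to execute synchronously on the calling thread.
    // return hpx::async(hpx::launch::sync, std::forward<F>(f));

    // After: the stage is scheduled through the policy's executor and can overlap
    // with the other stages.
    return hpx::async(policy.executor(), std::forward<F>(f));
}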

hkaiser (Member) commented Jun 25, 2017

@taeguk: invoking the predicate twice for each element is not allowed, see here: http://en.cppreference.com/w/cpp/algorithm/partition_copy

Complexity

Exactly distance(first, last) applications of p.

taeguk (Member Author) commented Jun 25, 2017

@hkaiser You're right, I did not know that.
Still, it does not seem good to allocate additional memory, but unfortunately I see no way to avoid it.
It would be nice to be able to reuse the memory of the container pointed to by the output iterator, but that does not seem easy given how general the algorithm interface is.

hkaiser (Member) commented Jun 25, 2017

Yes, I agree. I think the only viable solution is to allocate that additional array of Booleans as you had it in the first place.
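
A minimal sequential sketch of that idea (this is not the HPX implementation; partition_copy_with_flags is a made-up name): the predicate results go into a temporary array of Booleans, so pred is applied exactly once per element, and the flags then route the copies. A parallel version would run both phases through HPX's partitioners.

#include <cstddef>
#include <iterator>
#include <utility>
#include <vector>

// Sketch only: phase 1 evaluates pred exactly once per element into a flag
// array, phase 2 copies each element to the matching destination.
template <typename FwdIter, typename OutIter1, typename OutIter2, typename Pred>
std::pair<OutIter1, OutIter2>
partition_copy_with_flags(FwdIter first, FwdIter last,
    OutIter1 dest_true, OutIter2 dest_false, Pred pred)
{
    std::vector<bool> flags;
    flags.reserve(static_cast<std::size_t>(std::distance(first, last)));

    for (FwdIter it = first; it != last; ++it)
        flags.push_back(pred(*it));    // exactly one application of pred per element

    std::size_t i = 0;
    for (FwdIter it = first; it != last; ++it, ++i)
    {
        if (flags[i])
            *dest_true++ = *it;
        else
            *dest_false++ = *it;
    }
    return std::make_pair(dest_true, dest_false);
}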

// sequential partition_copy with projection function
template <typename InIter, typename OutIter1, typename OutIter2,
    typename Pred, typename Proj>
inline std::pair<OutIter1, OutIter2>

Contributor commented:

Just a small hint: member functions and templated functions are implicitly inline.
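
As a small aside illustrating the hint (this example is not from the PR): a function template may be defined in every translation unit that uses it, so the explicit inline adds nothing:

// Sketch: 'inline' is redundant on a function template definition.
template <typename T>
T twice(T value)    // behaves the same with or without 'inline'
{
    return value + value;
}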

{
    while (first != last)
    {
        if (hpx::util::invoke(pred, hpx::util::invoke(proj, *first)))

Naios (Contributor) commented:

You should perfect-forward callable objects, since those can be overloaded on r-value references:

struct my_callable {
  void operator()()&& {
    // ...
  }
};

hkaiser (Member) commented:

@Naios: Not sure if all of our compilers support that. We should add a feature test if we want to use this. Also, if we do, this could be applied in many places.

hkaiser (Member) commented:

@Naios: To clarify, I meant the operator()() &&

hkaiser (Member) commented:

@Naios: Also, in this context, perfect forwarding wouldn't work, as pred (and proj) shouldn't be invalidated (as they might be if moved); pred will be used more than once.
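
An illustration of that concern (not from the PR; stateful_pred and use_twice are made-up names): if the callable were perfect-forwarded into each invocation, an rvalue-qualified overload could consume its state on the first call and leave a moved-from object for the second:

#include <utility>
#include <vector>

// Made-up example: a predicate whose rvalue-qualified call operator may
// consume its internal state.
struct stateful_pred
{
    std::vector<int> cache;

    bool operator()(int x) const & { return x > 0; }   // safe to call repeatedly

    bool operator()(int x) &&                          // may steal 'cache'
    {
        std::vector<int> stolen = std::move(cache);
        (void) stolen;
        return x > 0;
    }
};

template <typename Pred>
void use_twice(Pred && pred)
{
    // std::forward<Pred>(pred)(1);   // would pick the && overload for rvalues,
    // std::forward<Pred>(pred)(2);   // so the second call could see a moved-from pred

    pred(1);    // calling through the lvalue keeps pred intact ...
    pred(2);    // ... for the second application
}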

Naios (Contributor) commented Jun 25, 2017:

@hkaiser I guess all required compiler versions should support it:

  • GCC since 4.8 or 4.9
  • Clang since 3.3 or 3.4
  • MSVC since 2015 (19.0)

However, supporting perfect forwarding for callable objects would be the safer choice for the future.

EDIT (regarding the second comment): yes, that's right.

hkaiser (Member) commented Jun 26, 2017

Also, the inspect tool is not happy with your code: https://7095-4455628-gh.circle-artifacts.com/0/tmp/circle-artifacts.KS6bV0d/hpx_inspect_report.html

hkaiser (Member) left a comment:

I'd suggest keeping the work on the scan_partitioner separate from the partition_copy algorithm implementation.

taeguk (Member Author) commented Jul 2, 2017

@hkaiser Sorry, this is my mistake. This is just for backup. This commit will be removed.

hkaiser (Member) commented Jul 8, 2017

@taeguk: could we merge this step by step? Could we merge the current state of partition_copy?

Regarding the bad performance: I'd suggest making the change from hpx::launch::sync to policy.executor() and watching the regression tests (rostam.cct.lsu.edu).

taeguk (Member Author) commented Jul 8, 2017

@hkaiser I had already run such a test. With policy.executor() I could get acceptable performance.
I have some commits that are not pushed yet, and I have to fix some small things like inspection errors.

I stopped work on this PR because of #2733.
My plan was to add commits adapting this PR to #2733 once #2733 is merged, so I am waiting for that merge.

Do you want me to finish this PR without adapting it to #2733?

hkaiser (Member) commented Jul 8, 2017

@taeguk ok, understood. I applied the first batch of name changes to #2733. I will try to apply the rest asap. In the meantime it might be helpful if you added your review to #2733 as our policy is to have at least one review before merging.

taeguk changed the title from "[Working] Implement parallel::partition_copy." to "Implement parallel::partition_copy." on Jul 8, 2017
taeguk (Member Author) commented Jul 8, 2017

@hkaiser This PR is ready to be merged if you are willing to adapt it to #2733 yourself.
Otherwise, if you want me to do that, I will push more commits here after #2733 is merged.

taeguk (Member Author) commented Jul 8, 2017

@hkaiser Sorry, I have one more thing to do for the Ranges TS: I will add partition_copy to container_algorithms/partition.hpp.

I also have a question: which do you think is better, a single PR containing both the implementation of a parallel algorithm and its adaptation for the Ranges TS, or splitting that into two separate PRs?

taeguk changed the title from "Implement parallel::partition_copy." to "[Working] Implement parallel::partition_copy." on Jul 8, 2017
… unit tests for parallel is_heap, is_heap_until, and partition_copy.
…s of parallel::partition_copy. And fix some tiny things.
taeguk changed the title from "[Working] Implement parallel::partition_copy." to "Implement parallel::partition_copy." on Jul 9, 2017
taeguk (Member Author) commented Jul 9, 2017

@hkaiser Finally, this PR is ready to be merged!

hkaiser (Member) left a comment:

LGTM, thanks a lot!

hkaiser (Member) commented Jul 10, 2017

Quoting @taeguk: "Benchmark results are inconsistent. (They are different whenever I benchmark.)" (https://gist.github.com/taeguk/e40b628fefe025aa3ffc1335bbeed4ee)

@taeguk: Have you tried whether you get repeatable results when forcing the same seed? The different results could be caused by a different amount of work needed to perform the partitioning.
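
For illustration (assuming the benchmark fills its input from a pseudo-random generator; make_input is a made-up helper), fixing the seed makes the generated input, and therefore the partitioning work, identical across runs:

#include <algorithm>
#include <cstddef>
#include <random>
#include <vector>

// Sketch: generate the benchmark input with a fixed seed so that every run
// partitions exactly the same data.
std::vector<int> make_input(std::size_t n, unsigned seed = 12345)
{
    std::mt19937 gen(seed);                              // fixed seed -> repeatable input
    std::uniform_int_distribution<int> dist(0, 1000000);

    std::vector<int> v(n);
    std::generate(v.begin(), v.end(), [&] { return dist(gen); });
    return v;
}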

taeguk (Member Author) commented Jul 10, 2017

@hkaiser Oh, I got almost the same results when forcing the same seed! Great :)

@hkaiser hkaiser merged commit 11bce88 into STEllAR-GROUP:master Jul 10, 2017
hkaiser (Member) commented Jul 10, 2017

@taeguk: Thanks a lot for your work on this. That's much appreciated! Please get in contact with aserio on IRC so he can send you a STE||AR-group t-shirt ;)

hkaiser moved this from "Work in progress" to "Merged to master" in Standard Algorithms on Jul 21, 2017
Projects: Standard Algorithms (Merged to master)
Linked issues: none yet
Participants: 3