Accumulate for Expressions of 2 Particle Labels #32
Have you tried `double maxVal = eval(max(a, max(b, SomeFunction(a,b))));`?
Thanks! That works! However, it seems to be running in serial, and extremely slowly...
You are right: the outer max() isn't being parallelised, which is an oversight on my part. I should be able to fix that up. By "extremely slowly", do you mean slowly compared to running in parallel, or very slow even for serial code?
Thanks for the quick reply! I would say slow even for serial code, as other, much more involved parts of the code take only about 1/10th of the time it takes to calculate the eval expression in question.
Ok, there might be other problems with this then; I'll look into it. In the meantime, you could swap to the lower-level interface. This would just involve a double loop to evaluate all the particle pairs, which you could make parallel with OpenMP:

```cpp
double my_max = 0;
#pragma omp parallel for reduction(max : my_max)
for (size_t i = 0; i < particles.size(); ++i) {
    for (size_t j = 0; j < particles.size(); ++j) {
        const double val = function(particles[i], particles[j]);
        if (val > my_max) {
            my_max = val;
        }
    }
}
```
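As a self-contained sketch of this reduction pattern (plain doubles stand in for Aboria particles, and `pair_value` is a hypothetical placeholder for the real pairwise function; the `max` reduction needs OpenMP 3.1+, compiled with e.g. `-fopenmp`, and the pragma is simply ignored otherwise):

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the pairwise function; any function of two
// "particles" works here -- plain doubles play the role of particles.
inline double pair_value(double a, double b) { return a * b; }

// Maximum of pair_value over all ordered pairs, parallelised with an
// OpenMP max reduction over the outer index loop.
double max_over_pairs(const std::vector<double>& xs) {
    double my_max = 0.0;
    #pragma omp parallel for reduction(max : my_max)
    for (std::size_t i = 0; i < xs.size(); ++i) {
        for (std::size_t j = 0; j < xs.size(); ++j) {
            const double val = pair_value(xs[i], xs[j]);
            if (val > my_max) my_max = val;
        }
    }
    return my_max;
}
```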
Thanks! That works, although that OpenMP `reduction` clause does not seem to work for my case and would not compile. This is not a huge deal here, since this loop only adds about 1% to the total run time, but it would of course be nice to be able to run such loops in parallel, in case such a loop ever needs to do more work. For your reference, I was trying to calculate the viscous diffusion constraint on the time step for SPH according to Monaghan. Below is the code I ended up with:

```cpp
for (auto n = container.get_query().get_subtree(); n != false; ++n) {
    for (auto i = container.get_query().get_bucket_particles(*n);
         i != false; ++i) {
        // i is the first particle of the pair
        auto& Xi = get<position>(*i);
        auto& Vi = get<cVelocity>(*i);
        auto& RHOi = get<cDensity>(*i);
        for (auto j = i + 1; j != false; ++j) {
            auto& Xj = get<position>(*j);
            auto& Vj = get<cVelocity>(*j);
            auto dx = Xj - Xi;
            auto Vij = Vj - Vi;
            double ldxl2 = dx.squaredNorm();  // squared distance |dx|^2
            if (ldxl2 < 4*h*h) {              // neighbours within 2h
                thisMaxVisc = h * Vij.dot(dx) / (ldxl2 + epsilon);
                maxVisc = std::max(maxVisc, thisMaxVisc);
            }
        }
        thisMINcv = h / (ss0*std::pow(RHOi/rho0, eta) + maxVisc);
        MINcv = std::min(MINcv, thisMINcv);
    }
}
```
Any optimization suggestions or comments are welcome!
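For what it's worth, the same time-step reduction can be sketched with plain arrays, which makes the structure easy to test in isolation. This is only a 1-D sketch mirroring the logic above: `h` and `eps` are the same constants as in the snippet, and the density-dependent sound-speed term is collapsed into a single assumed constant `c` for brevity:

```cpp
#include <algorithm>
#include <cstddef>
#include <limits>
#include <vector>

// 1-D sketch of the viscous time-step constraint: for each particle i,
//   maxVisc_i = max over neighbours j within 2h of
//               h * (v_j - v_i) * (x_j - x_i) / (|x_j - x_i|^2 + eps)
// and the constraint is dt = min over i of h / (c + maxVisc_i).
double min_viscous_dt(const std::vector<double>& x,
                      const std::vector<double>& v,
                      double h, double c, double eps) {
    double min_dt = std::numeric_limits<double>::max();
    for (std::size_t i = 0; i < x.size(); ++i) {
        double max_visc = 0.0;
        for (std::size_t j = 0; j < x.size(); ++j) {
            if (j == i) continue;
            const double dx = x[j] - x[i];
            const double r2 = dx * dx;
            if (r2 < 4.0 * h * h) {  // neighbour check on the squared distance
                max_visc = std::max(max_visc,
                                    h * (v[j] - v[i]) * dx / (r2 + eps));
            }
        }
        min_dt = std::min(min_dt, h / (c + max_visc));
    }
    return min_dt;
}
```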
I notice that you are only looping through particles within each bucket, so you are not taking into account interactions between particles that are in different buckets. You could write something like this to loop through all neighbour pairs within a given radius:

```cpp
for (auto i: particles) {
    for (auto j = euclidean_search(particles.get_query(), get<position>(i), radius);
         j != false; ++j) {
        // do something with i and j
    }
}
```

Note that OpenMP will only be able to parallelise the outer loop if it is an index loop, so you can change to:

```cpp
for (int ii = 0; ii < particles.size(); ++ii) {
    auto& i = particles[ii];
    for (auto j = euclidean_search(particles.get_query(), get<position>(i), radius);
         j != false; ++j) {
        // do something with i and j
    }
}
```

Note that if you want to do something on a per-bucket basis, don't want to double-evaluate pairs of particles, and are using the cell-list data structure, then you can use the fast cell-list neighbour search described here: https://martinjrobins.github.io/Aboria/aboria/neighbourhood_searching.html#aboria.neighbourhood_searching.fast_cell_list_neighbour_search
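The index-loop pattern can be sketched self-contained; here hypothetical brute-force 2-D points stand in for Aboria particles and for `euclidean_search`, and counting the pairs within `radius` is a placeholder for the real per-pair work (note each unordered pair is visited twice, matching the double evaluation discussed above):

```cpp
#include <cstddef>
#include <vector>

struct Point { double x, y; };

// Count ordered neighbour pairs within `radius`, using an integer-indexed
// outer loop so that OpenMP can parallelise it (the pragma is ignored
// without -fopenmp). The brute-force inner loop stands in for
// euclidean_search over a spatial data structure.
std::size_t count_pairs_within(const std::vector<Point>& pts, double radius) {
    std::size_t count = 0;
    const double r2 = radius * radius;
    #pragma omp parallel for reduction(+ : count)
    for (std::size_t ii = 0; ii < pts.size(); ++ii) {
        for (std::size_t jj = 0; jj < pts.size(); ++jj) {
            if (jj == ii) continue;
            const double dx = pts[jj].x - pts[ii].x;
            const double dy = pts[jj].y - pts[ii].y;
            if (dx * dx + dy * dy < r2) ++count;  // each pair counted twice
        }
    }
    return count;
}
```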
Thank you for spotting the mistake! It works well now. I am not sure I understand the concept of a "bucket". Is it the same as a cell in a cell-list data structure, a list of all particles that are within some radius of each other, or something else? It is honestly the most confusing bit in the documentation; the rest is pretty well written, I must say. I tried to use the fast cell-list neighbour search you recommended, but I seem to get faster results with the second code from your last post with OpenMP.
Yes, "bucket" is the same as "cell". A cell list is sometimes referred to as a "bucket-search" algorithm, and I started out using the bucket terminology, then swapped to "cells" once I realised this was more prevalent. I need to go through and make the terminology consistent! I'd say go for the second code with OpenMP then: it takes extra operations, since you are evaluating each pair twice, but you can make it parallel, which is good. It's also a lot easier to read :)
I see now! |
Dear @martinjrobins,
I have been trying to evaluate the result of an accumulation expression (such as a maximum) involving two particle labels. This is straightforward when SomeFunction only deals with one particle label, i.e.

`eval(max(a, SomeFunction(a)));`

I tried looking at Symbolic.h, but I couldn't figure out the syntax to use when the expression involves more than one label. It seems like `dx` has to be in there somewhere?