ShenEOS scaling issues #378
[reported by manderson] [Trac time Tue Mar 27 22:32:58 2012] The shen eos component now performs comparably to the standalone Fortran equivalent from C. Ott when run on 1 thread. However, it does not scale at all:
./sheneos_test --num-workers 1 --hpx:threads 1 --num-partitions 1 gives 0.0405 sec
Varying the number of partitions alters these numbers; however, the optimal result is to keep the same number of partitions as the number of threads, as shown here.
I have eliminated the time overhead associated with shen_connect in these measurements as well as all other unrelated overhead. I have removed the srand and random_shuffle in case they were affecting the scaling.
The shen eos component does not scale at all. In fact, running the same amount of work on one thread is nearly twice as fast as running it on eight threads:
./sheneos_test --num-workers 8 --hpx:threads 1 --num-partitions 1 gives 0.2350 sec
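For anyone comparing timings like the ones quoted in this report, the arithmetic can be sketched as below. This is only an illustration: the helper names `speedup` and `efficiency` are hypothetical and not part of sheneos_test or HPX, and the ticket does not say whether the two quoted runs perform the same total work, so the ratio is just the quotient of the two wall-clock times.

```python
def speedup(t_serial, t_parallel):
    """Ratio of serial to parallel wall-clock time; > 1 means the parallel run is faster."""
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, n_threads):
    """Speedup divided by thread count; 1.0 would be ideal linear scaling."""
    return speedup(t_serial, t_parallel) / n_threads


# The two timings quoted in this report, in seconds.
t_first = 0.0405   # --num-workers 1 run
t_second = 0.2350  # --num-workers 8 run

# Plain ratio of the two wall-clock times (assumption: no normalization by work).
slowdown = t_second / t_first
print(f"second run takes {slowdown:.2f}x the time of the first")
```

An efficiency well below 1.0 for fixed total work, or a time ratio well above 1.0 as here, is the symptom this ticket describes.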
[comment by manderson] [Trac time Wed Mar 28 17:52:43 2012] the latest numbers...
40 x 40 x 40
control case: 0.0549
100 x 100 x 100