[GSoC] Add vectorization improvement tests #6548

coodie · 2017-06-27T12:10:41Z

This PR adds vectorization tests that cover the changes related to #6533.

The tests use FileCheck. It would be best to merge this PR after FileCheck is integrated into Chapel's test suite, because the current solution adds a PREDIFF script that runs FileCheck, and an empty .good file has to be added for every test.

mppf · 2017-06-27T14:37:48Z

test/llvm/vectorization/simple_loop_novec.chpl

+// This loop shouldn't be vectorized, because
+// LLVM backend cannot check whether A and B overlap
+// And runtime check for overlap was turned off
+// And --vectorize option too


I think that 'vectorize' should be on by default for the LLVM backend. Therefore the .compopts file for this test should have a --no-vectorize.

mppf · 2017-07-05T12:59:33Z

test/llvm/PREDIFF

+
+mv $OUTFILE $TMPFILE
+$FILECHECK --input-file $TMPFILE $TESTNAME.chpl 2> $OUTFILE
+rm $TMPFILE


Can you put this script in util/test and then symbolically link to it, in the event we need to use it in more places? I'd still like to see FileCheck as a first-class alternative to a .good file in the testing system, but until then, let's use a symbolic link to a shared implementation of this file.

I think it would be best to merge this PR after I add FileCheck to Chapel's test system and refactor the tests not to use a PREDIFF script, because the current way we use FileCheck is really a quick hack to see whether this tool makes sense (it does!).

I'd be comfortable merging this PR as-is or with the symbolic link. I think we should adjust the test system in a separate PR.
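A shared helper along the lines discussed above might look like the following sketch. It mirrors the three commands from the diff, but the argument order (test name, then output file), the temporary file name, and the FILECHECK environment override are assumptions made for illustration, not the script that was merged.

```shell
#!/bin/sh
# Sketch only. Assumptions: the caller passes the test name and the output
# file as the first two arguments, and FILECHECK (if set) names the FileCheck
# binary; otherwise "FileCheck" is looked up on PATH.
prediff_filecheck() {
  testname=$1
  outfile=$2
  tmpfile="$outfile.prediff.tmp"   # hypothetical temporary file name

  # Move the captured compiler output aside so FileCheck can read it.
  mv "$outfile" "$tmpfile" || return 1

  # FileCheck writes its diagnostics to stderr. Redirecting them into the
  # output file means a passing run leaves that file empty, which is why
  # each test needs an empty .good file.
  "${FILECHECK:-FileCheck}" --input-file "$tmpfile" "$testname.chpl" 2> "$outfile"

  rm -f "$tmpfile"
}
```

A real PREDIFF built on this would end with a call such as `prediff_filecheck "$1" "$2"`, again assuming that argument order.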

mppf · 2017-07-05T13:00:11Z

test/llvm/parallel_loop_access/PREDIFF

@@ -0,0 +1 @@
+../PREDIFF


This one too can be a symbolic link to the script in util/test.
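As a small illustration of the symbolic-link approach, the sketch below creates a PREDIFF link inside a test directory. The function name is hypothetical; the relative target `../PREDIFF` is the one shown in the diff above, and a shared script in util/test would simply use a different relative path.

```shell
# link_prediff SHARED DIR
# Make DIR/PREDIFF a symbolic link whose target is SHARED. A relative SHARED
# path is resolved relative to DIR, not to the current directory.
link_prediff() {
  shared=$1
  dir=$2
  ln -sf "$shared" "$dir/PREDIFF"
}
```

For the directory in this review, the call would be `link_prediff ../PREDIFF test/llvm/parallel_loop_access`.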

mppf · 2017-07-05T13:04:35Z

test/llvm/parallel_loop_access/different_numbers.chpl

+      start_loop3();
+
+      //CHECK: llvm.mem.parallel_loop_access ![[LOOP3:[0-9]+]]
+      //CHECK-NOT: llvm.mem.parallel_loop_access ![[LOOP2]]


I would have thought this access should be marked with parallel loop access for both loops? I.e., I'm surprised to see the CHECK-NOT for LOOP2.

I realized that it is probably not what we want by default. Here is one example of nested loops which are both parallel, but aren't a group of parallel loops:

for i in vectorizeOnly(1..n) {
  A[i] = A[i]*B[i];
  for j in vectorizeOnly(A[i], B[i]) {
    C[j] = 3*D[j];
  }
}

There is a dependency, because the result of the inner loop depends on the multiplication A[i] = A[i]*B[i], but once the range is known the inner loop is obviously parallel. It's not a big deal to add this anyway, but keep in mind that, according to the example in the documentation, only the innermost loop has metadata referring to the group of loops, so this test is still valid for this case. My example might not be the best, but in the most general case the innermost loop might not be 'group parallel', and we shouldn't mark code as such. I'm unable to test whether this makes a difference because, as far as I know, LLVM currently does not use 'group parallel' metadata for anything.

Can you point me to some LLVM documentation explaining what you mean by group parallel? As I understand llvm.mem.parallel_loop_access, it indicates:

no loop carried memory dependence exist between it and other instructions denoted with the same loop identifier

In your example, the inner loop and the outer loop both have no loop-carried dependencies (that is, it would be a user error if the program depended upon the loop iteration order). There is a dependency between the first statement in the outer loop and the inner loop. If I understand correctly, my Allen & Kennedy textbook calls such a thing a loop-independent dependency.
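To make the question concrete, it can help to list which loop identifiers the memory accesses in the printed IR actually carry. The sketch below is a rough stand-in built on sed, not FileCheck, and the function name is hypothetical. It assumes each instruction sits on one line and references at most one loop identifier in the form `!llvm.mem.parallel_loop_access !N`.

```shell
# parallel_loop_ids: read LLVM IR on stdin and print, one per line, the
# distinct numeric metadata ids referenced by llvm.mem.parallel_loop_access.
# Lines without that annotation are ignored.
parallel_loop_ids() {
  sed -n 's/.*!llvm\.mem\.parallel_loop_access !\([0-9][0-9]*\).*/\1/p' | sort -n -u
}
```

Feeding it the IR for a single loop body shows at a glance whether its accesses refer to one loop identifier or to several, which is the property the CHECK and CHECK-NOT lines above assert.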

mppf · 2017-07-05T13:07:33Z

test/llvm/vectorization/simple_loop.compopts

@@ -0,0 +1 @@
+--llvm --fast --vectorize --llvm-print-ir loop --llvm-print-ir-stage full --mllvm -force-vector-width=4 --mllvm -force-vector-interleave=1


GitHub is showing a funny line ending here, maybe, there's an extra newline or something?

This is in fact the marker indicating that there is no newline character at the end of the file.
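A quick way to confirm this locally is to look at the last byte of the file. The helper below is a minimal sketch with a hypothetical name; note that it also reports success for an empty file.

```shell
# ends_with_newline FILE
# Succeeds when the last byte of FILE is a newline. Command substitution
# strips a trailing newline, so the captured text is empty exactly when the
# final byte is a newline (or the file has no bytes at all).
ends_with_newline() {
  [ -z "$(tail -c 1 "$1")" ]
}
```

For example, `ends_with_newline simple_loop.compopts || echo "no trailing newline"` would print the message for the file shown in the diff above.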

mppf · 2017-07-05T13:09:04Z

hi @coodie - these are looking great, thanks! Besides my specific comments, my one request is to put a comment at the top of each of your tests to briefly describe what the purpose of the test is. I can tell you're trying to encode that in the test names (which is fine) but I think it'll help maintenance of these tests if a comment indicates what the intent was. Thanks!

mppf · 2017-07-10T16:03:55Z

Passed test/llvm with CHPL_LLVM=llvm.
Didn't pass test/llvm with CHPL_LLVM=none.

@coodie - start_test test/llvm fails the tests in llvm/parallel_loop_access/ when CHPL_LLVM=none. I believe you are missing a SKIPIF in that directory.
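One possible shape for such a check is sketched below. It assumes the test system runs an executable SKIPIF and skips the directory when the script prints True; it also assumes that an unset CHPL_LLVM should be treated like none. Both points are assumptions for illustration and should be checked against the test system documentation, and the function name is hypothetical.

```shell
# skipif_no_llvm: print "True" when the LLVM backend is unavailable, so that
# tests needing --llvm can be skipped, and "False" otherwise.
# Assumption: an unset or empty CHPL_LLVM is treated as "none".
skipif_no_llvm() {
  if [ "${CHPL_LLVM:-none}" = "none" ]; then
    echo True
  else
    echo False
  fi
}
```

An executable SKIPIF built on this would simply call `skipif_no_llvm` as its last line.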

mppf · 2017-07-10T16:43:34Z

Passed test/llvm with CHPL_LLVM=llvm.
Passed test/llvm with CHPL_LLVM=none.
Passed test/llvm with CHPL_LLVM=none and no install in third-party/llvm.

Add tests

47ff591

mppf reviewed Jun 27, 2017

coodie added 4 commits July 3, 2017 14:45

Add if, double and zipped loop tests

23c0723

Update if_loop test

03461fa

Add parallel_loop_access tests, make symlink to PREDIFF, add --no-vectorize

fee08b5

Modify parallel_loop test

ebfd621

mppf reviewed Jul 5, 2017

coodie mentioned this pull request Jul 7, 2017

Integrate FileCheck into chapel's test suite #6628

Open

Add descriptions for each test and add README.md

e3420cc

Add SKIPIF in parallel_loop_access

2d2e30f

mppf merged commit 2a86c82 into chapel-lang:master Jul 10, 2017

coodie changed the title ~~Add vectorization improvement tests~~ [GSoC] Add vectorization improvement tests Aug 28, 2017
