Scalarization transformation #2499

LonelyCat124 · 2024-02-08T13:29:15Z

In parts of the physics codes for LFRic we come across loop patterns such as this:

do i = ...
  do l = ...
    temp_in(l) = array(l,i) * array2(l,i)
  end do
  call exp_v(n, temp, temp_in)
  do l = ...
    ! do something based on temp(l)
  end do
end do

Once we inline and fuse this loop structure, we get loops like this:

do i = ...
  do l = ...
    temp_in(l) = array(l,i) * array2(l,i)
    temp(l) = exp(temp_in(l))
    ! do something based on temp(l)
  end do
end do

For cases such as this, temp_in and temp can be scalarised, provided that nothing outside the loop depends on their values (which would already be a strange implementation choice, since only the values from the final iteration of i would survive). This would remove some false dependencies: if we use collapse on this loop nest, every iteration of i writes to the same temp(l), which gives a write-write dependency. That dependency is unnecessary, since temp can simply be a local scalar.

The goal of this transformation would be to take code like the above (after all the other inlining and loop fusion transformations) and generate:

do i = ...
  do l = ...
    temp_in_scalar = array(l,i) * array2(l,i)
    temp_scalar = exp(temp_in_scalar)
    ! do something based on temp_scalar
  end do
end do

At this point, we can apply target + loop with collapse, which will lead to fewer kernel launches and less synchronization, and probably better performance on GPU.
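To make the equivalence concrete, here is a minimal Python sketch (not PSyclone code) that mirrors the fused loop nest with array temporaries and its scalarised form, and checks that they produce the same output. The function names, the `out` array and the `2.0 * temp` expression are assumptions standing in for "do something based on temp(l)".

```python
import math


def with_array_temporaries(array, array2):
    """Fused loop nest that still uses the 1D temporaries temp_in and temp."""
    n_l, n_i = len(array), len(array[0])
    temp_in = [0.0] * n_l
    temp = [0.0] * n_l
    out = [[0.0] * n_i for _ in range(n_l)]
    for i in range(n_i):
        for l in range(n_l):
            temp_in[l] = array[l][i] * array2[l][i]
            temp[l] = math.exp(temp_in[l])
            # Hypothetical stand-in for "do something based on temp(l)".
            out[l][i] = 2.0 * temp[l]
    return out


def with_scalars(array, array2):
    """Same loop nest after scalarising temp_in and temp."""
    n_l, n_i = len(array), len(array[0])
    out = [[0.0] * n_i for _ in range(n_l)]
    for i in range(n_i):
        for l in range(n_l):
            temp_in_scalar = array[l][i] * array2[l][i]
            temp_scalar = math.exp(temp_in_scalar)
            out[l][i] = 2.0 * temp_scalar
    return out


# Small made-up input, indexed as array[l][i].
array = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]
array2 = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
results_match = with_array_temporaries(array, array2) == with_scalars(array, array2)
```

In the scalar version no storage is shared between (l, i) iterations, so the write-write dependency on temp(l) across i disappears.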


LonelyCat124 · 2024-04-18T10:00:49Z

First step - on Reference: Find next (and previous?) Reference to this symbol.

hiker · 2024-04-18T10:20:49Z

> First step - on Reference: Find next (and previous?) Reference to this symbol.

The VariableAccess information already contains all accesses in order.

LonelyCat124 · 2024-04-18T10:22:11Z

> First step - on Reference: Find next (and previous?) Reference to this symbol.

> The VariableAccess information already contains all accesses in order.

Yeah - the plan is to use that data with this transformation.

LonelyCat124 · 2024-04-19T10:07:42Z

@hiker I'm a bit confused about how to use the VariablesAccessInfo - is there a linkage between VariablesAccessInfo and the node for a given read/write access? E.g. for a given routine, if I wanted to find (in order) all the accesses/dependencies on a given symbol (or rather signature), can I do that with the VariablesAccessInfo? I can find the sequence of reads/writes, but if I wanted to refer back to the relevant Reference, is that currently possible?

Ah, I guess it's .node in AccessInfo?

hiker · 2024-04-19T11:47:09Z

> @hiker I'm a bit confused about how to use the VariablesAccessInfo - is there a linkage between VariablesAccessInfo and the node for a given read/write access? E.g. for a given routine, if I wanted to find (in order) all the accesses/dependencies on a given symbol (or rather signature), can I do that with the VariablesAccessInfo? I can find the sequence of reads/writes, but if I wanted to refer back to the relevant Reference, is that currently possible?

> Ah, I guess it's .node in AccessInfo?

Yes :) I saw the comments in the wrong order, and commented elsewhere :)
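For readers following along, the idea of "accesses in order, each linked back to its node" can be sketched in plain Python. This is only an illustration: `ToyAccess`, `accesses_for` and `next_access` are hypothetical names, not PSyclone's API, and the node is represented by a source string rather than a real tree node.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ToyAccess:
    """One read or write of a variable (hypothetical, not PSyclone's class)."""
    signature: str  # variable name, e.g. "temp"
    kind: str       # "READ" or "WRITE"
    node: str       # stand-in for the tree node; here just the source text


def accesses_for(all_accesses: List[ToyAccess], signature: str) -> List[ToyAccess]:
    """Return the accesses to one variable, preserving execution order."""
    return [acc for acc in all_accesses if acc.signature == signature]


def next_access(all_accesses: List[ToyAccess], current: ToyAccess) -> Optional[ToyAccess]:
    """Return the access to the same variable that follows `current`, if any."""
    same_variable = accesses_for(all_accesses, current.signature)
    for index, acc in enumerate(same_variable):
        if acc is current:
            if index + 1 < len(same_variable):
                return same_variable[index + 1]
            return None
    return None


# Accesses of the fused loop body from the issue description, in execution order.
accesses = [
    ToyAccess("temp_in", "WRITE", "temp_in(l) = array(l,i) * array2(l,i)"),
    ToyAccess("temp_in", "READ", "temp(l) = exp(temp_in(l))"),
    ToyAccess("temp", "WRITE", "temp(l) = exp(temp_in(l))"),
    ToyAccess("temp", "READ", "use of temp(l)"),
]
following = next_access(accesses, accesses[0])
```

Here `following.node` gives the statement that next touches temp_in, which is the kind of link back to the tree that the question above is asking about.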


LonelyCat124 · 2024-04-29T10:06:27Z

@sergisiso If the next access to an array reference (that is otherwise a potential target for scalarization) is contained within an IfBlock that isn't also an ancestor of the Loop I'm "scalarizing", I will just ignore it rather than deal with the if condition, unless you think we should specifically try to handle if blocks here?
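The ancestor check described here could look roughly like the following plain-Python sketch. `ToyNode` and `inside_if_below_loop` are hypothetical names for illustration, not PSyclone classes, and the sketch assumes the access node is a descendant of the loop.

```python
class ToyNode:
    """Minimal tree node with a parent pointer (hypothetical, not PSyclone)."""

    def __init__(self, kind, parent=None):
        self.kind = kind
        self.parent = parent


def inside_if_below_loop(node, loop):
    """Return True if an IfBlock sits between `node` and `loop`.

    IfBlocks that are ancestors of the loop itself are not counted,
    because the walk stops as soon as the loop is reached.
    Assumes `node` is a descendant of `loop`.
    """
    current = node.parent
    while current is not None and current is not loop:
        if current.kind == "IfBlock":
            return True
        current = current.parent
    return False


# Tree: outer IfBlock -> Loop -> { plain reference, inner IfBlock -> guarded reference }
outer_if = ToyNode("IfBlock")
loop = ToyNode("Loop", parent=outer_if)
plain_ref = ToyNode("Reference", parent=loop)
inner_if = ToyNode("IfBlock", parent=loop)
guarded_ref = ToyNode("Reference", parent=inner_if)

guarded = inside_if_below_loop(guarded_ref, loop)
unguarded = inside_if_below_loop(plain_ref, loop)
```

With this check, `guarded_ref` would be flagged, while `plain_ref` would not be, even though the whole loop sits inside `outer_if`.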

LonelyCat124 · 2024-04-29T10:39:04Z

Also I realise that I probably need to be careful with next_access, since I think next_access for something like:

a(i) = a(i) + 1

will point to the LHS of the assignment, so I should also check the RHS of the assignment in this case for scalarization.
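The underlying point, that one statement can hold both a read and a write of the same element, can be illustrated with Python's own `ast` module. This is an analogy only: it parses the Python statement `a[i] = a[i] + 1`, not Fortran, and says nothing about what PSyclone's next_access actually returns. The helper name is hypothetical.

```python
import ast


def accesses_in_execution_order(statement, name):
    """List the READ/WRITE accesses to array `name` in one assignment.

    The right-hand side is visited first because it is evaluated
    before the store to the left-hand side takes place.
    """
    assign = ast.parse(statement).body[0]
    assert isinstance(assign, ast.Assign)
    ordered = []
    for part in [assign.value] + assign.targets:
        for node in ast.walk(part):
            if (isinstance(node, ast.Subscript)
                    and isinstance(node.value, ast.Name)
                    and node.value.id == name):
                kind = "WRITE" if isinstance(node.ctx, ast.Store) else "READ"
                ordered.append(kind)
    return ordered


order = accesses_in_execution_order("a[i] = a[i] + 1", "a")
```

Both accesses belong to the same statement, so a check that only follows one of them would miss the other; that is why the RHS needs its own check.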

LonelyCat124 added the enhancement label Feb 8, 2024

LonelyCat124 mentioned this issue Apr 18, 2024

Add nowait to OMPTarget and compute required barriers to satisfy dependencies. #2551

Open

sergisiso added a commit that referenced this issue Apr 26, 2024

Merge pull request #2553 from stfc/2499_reference_next_access

4d44aab

(Towards #2499) Initial implementation of next_access function on Reference

LonelyCat124 added a commit that referenced this issue Apr 29, 2024

First code towards #2499

59d9875

LonelyCat124 added the NG-ARCH label (Issues relevant to the GPU parallelisation of LFRic and other models expected to be used in NG-ARCH) May 3, 2024

LonelyCat124 self-assigned this May 22, 2024
