WiP: Support derived types and make kernels look more like original Fortran code #40

domcharrier · 2021-12-14T17:23:00Z

TBC

Major changes:

New GPUFORTRT runtime, replaces gpufort_acc_runtime: Complete C++ rewrite of the gpufort_acc_runtime
- Adds C header file gpufortrt_api.h and a Fortran module gpufortrt_api.f90,
  so that it can be accessed from C/C++ and Fortran
- Adds logging with 5+ log levels (controllable via environment variable)

Other changes:

bin/gpufort and bin/gpufortfc: Add options to convert a file only partially, e.g. to convert only OpenACC compute constructs while leaving all other OpenACC directives untranslated.

Changes (unsorted):

Rewrite and break up monolithic templates into individual macros. Use code generation to create Python render methods from the macros. This makes it possible to:
- use templates more flexibly.
- test template-based code generation more easily.
- use templates in other GPUFORT Python modules.
- use GPUFORT functionality in other Python apps.
Anticipate new kernel code generation backends:
- Split fort2hip into generic fort2x part and fort2x.hip part
- Put abstract codegen base classes and generators into fort2x folder
  and HIP specific parts into subfolder fort2x/hip.
Add module fort2x.hip that provides routines for creating HIP code generators based on string input
(Inputs: Fortran declaration list snippet & annotated Fortran loop snippet)
Replace hip_auto launcher by hip_ps (ps: problem size), where the first argument is a problem size dim3 that is derived from the range of the translated loop nest.
Improve performance of linemapper's preprocessor by using python string & regex features instead of pyparsing when evaluating macros or expressions.
CUDA Fortran
- Support of fixed size device and pinned arrays in programs and procedures.
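
As a rough illustration of the hip_ps launcher item in the list above: the description only states that the problem size is derived from the range of the translated loop nest. The sketch below assumes the usual approach of taking the iteration count of each loop as one problem size dimension and rounding up to whole thread blocks. The function names and the block size are hypothetical and are not taken from GPUFORT.

```python
import math

def loop_extent(lbound, ubound, step=1):
    """Iteration count of a Fortran loop `do i = lbound, ubound, step`."""
    return max((ubound - lbound + step) // step, 0)

def grid_from_problem_size(problem_size, block_size):
    """Number of blocks per dimension needed to cover the problem size.

    Assumption: one thread per loop iteration, grid rounded up.
    """
    return tuple(math.ceil(n / b) for n, b in zip(problem_size, block_size))

# Loop nest: do k = 1, 10 / do j = 1, 1000 (innermost loop is dimension x)
problem_size = (loop_extent(1, 1000), loop_extent(1, 10), 1)
grid = grid_from_problem_size(problem_size, block_size=(128, 4, 1))
```

Because the grid is rounded up, a kernel launched this way would still need a bounds check against the problem size inside the kernel body.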


* The token-based allocate parser can decompose deallocate/allocate statements into a list of variable tuples, each consisting of the variable's name plus a list of its range triples (lbound, ubound, stride). It further parses the `stat` expression if present.
* The token-based use statement parser now also parses module attribute lists, e.g. `use, intrinsic ::`, plus renamings, e.g. `use, intrinsic :: iso_c_binding, renamed_local_type => c_ptr`.
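
The decomposition described for the allocate parser could look roughly like the sketch below. It works on the statement string rather than on a token stream, and `parse_allocate` and `split_top_level` are hypothetical names, not the routines in util/parsing. Unspecified parts of a range triple are returned as `None`.

```python
import re

def split_top_level(text, sep):
    """Split text at occurrences of sep that are not nested in parentheses."""
    parts, depth, current = [], 0, []
    for ch in text:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == sep and depth == 0:
            parts.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
    parts.append("".join(current).strip())
    return parts

def parse_allocate(statement):
    """Return ([(name, [(lbound, ubound, stride), ...]), ...], stat)."""
    m = re.fullmatch(r"\s*(?:de)?allocate\s*\((.*)\)\s*", statement, re.IGNORECASE)
    if m is None:
        raise ValueError("not an allocate/deallocate statement")
    variables, stat = [], None
    for item in split_top_level(m.group(1), ","):
        if re.match(r"stat\s*=", item, re.IGNORECASE):
            stat = item.split("=", 1)[1].strip()
            continue
        name, _, rest = item.partition("(")
        ranges = []
        if rest:
            # rest still carries the closing parenthesis of the shape spec
            for dim in split_top_level(rest.rstrip()[:-1], ","):
                parts = split_top_level(dim, ":")
                if len(parts) == 1:  # "n" only gives the upper bound
                    parts = [None, parts[0]]
                parts += [None] * (3 - len(parts))
                ranges.append(tuple(p if p else None for p in parts))
        variables.append((name.strip(), ranges))
    return variables, stat

variables, stat = parse_allocate("allocate(a(1:n), b(-1:m, k), stat=ierr)")
```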


* Groups multiple use statements into a single use statement if detecting a group of statements with an 'only' var list. Example:
    use mymod, only: a
    use mymod, only: b_local => b
    use mymod, only: c_local => c
  becomes:
    use mymod, only: a, b_local => b, c_local => c
* Groups multiple use statements into two use statements if detecting a group of statements with a variable renaming list. Example:
    use mymod, a_local => a
    use mymod, b_local => b # 'a_local' is available again as 'a'
    use mymod, c_local => c # 'a_local', 'b_local' are available again as 'a', 'b'; 'c' is only available as 'c_local'
  becomes:
    use mymod, only: a_local => a, b_local => b
    use mymod, c_local => c
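
A minimal sketch of the first grouping rule ('only' lists), assuming the statements are available as plain strings. `group_only_use_statements` is a hypothetical helper. It merges all statements per module rather than only adjacent groups, which simplifies the behaviour described in the commit message.

```python
import re

USE_ONLY = re.compile(r"\s*use\s+(\w+)\s*,\s*only\s*:\s*(.*)", re.IGNORECASE)

def group_only_use_statements(statements):
    """Merge `use <mod>, only: ...` statements that refer to the same module."""
    grouped = {}  # module name -> list of only-items, in first-seen order
    for stmt in statements:
        m = USE_ONLY.fullmatch(stmt)
        if m is None:
            raise ValueError(f"not a use statement with only list: {stmt!r}")
        items = grouped.setdefault(m.group(1).lower(), [])
        for item in (i.strip() for i in m.group(2).split(",")):
            if item and item not in items:
                items.append(item)
    return [f"use {mod}, only: {', '.join(items)}" for mod, items in grouped.items()]

merged = group_only_use_statements([
    "use mymod, only: a",
    "use mymod, only: b_local => b",
    "use mymod, only: c_local => c",
])
```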


* util/parsing.py:is_assignment: Do not join the operands after splitting the tokens on the highest level with respect to the `=` operator before passing the first operand to `parse_lvalue`.
* This commit fixes issues where do-loop related token streams like ["do","100","i"] were incorrectly identified as identifiers like "do100i".
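
The failure mode can be reproduced with a small sketch. `is_lvalue` below is a hypothetical stand-in for `parse_lvalue` and only accepts an identifier followed by index lists or `%` member accesses. It is not the GPUFORT implementation.

```python
import re

IDENT = re.compile(r"[a-z_]\w*", re.IGNORECASE)

def split_at_top_level_equals(tokens):
    """Split a token list at the first `=` that is not nested in parentheses."""
    depth = 0
    for i, tok in enumerate(tokens):
        if tok == "(":
            depth += 1
        elif tok == ")":
            depth -= 1
        elif tok == "=" and depth == 0:
            return tokens[:i], tokens[i + 1:]
    return None

def is_lvalue(tokens):
    """Check an lvalue token list without joining the tokens first."""
    if not tokens or not IDENT.fullmatch(tokens[0]):
        return False
    i = 1
    while i < len(tokens):
        if tokens[i] == "(":  # skip a balanced index list
            depth, i = 1, i + 1
            while i < len(tokens) and depth > 0:
                depth += (tokens[i] == "(") - (tokens[i] == ")")
                i += 1
            if depth != 0:
                return False
        elif tokens[i] == "%" and i + 1 < len(tokens) and IDENT.fullmatch(tokens[i + 1]):
            i += 2  # derived type member access
        else:
            return False
    return True

def is_assignment(tokens):
    parts = split_at_top_level_equals(tokens)
    return parts is not None and is_lvalue(parts[0])

do_loop = ["do", "100", "i", "=", "1", ",", "n"]
joined_lhs = "".join(do_loop[:3])  # "do100i" would pass an identifier check
```

Checking the token list directly rejects the do-loop header, while the joined string `"do100i"` looks like a valid identifier.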


... of numbers with user-defined precision
* Example: "a>1._dp.and." should be tokenized as ["a",">","1._dp",".and."]. Before this commit, it was incorrectly tokenized as ["a",">","1","._dp.","and","."], as any \.\w+\. was identified as a Fortran operator.
* This commit hardcodes all named Fortran operators in the regular expression used by the tokenizer.
Further:
* Makes certain private routines in util.parsing.parsing.py public to make it possible to test them. Further renames them to have a `_fortran_` substring.
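
The effect of hardcoding the named operators can be shown with two small regular expressions. Both patterns are illustrative assumptions and not the expressions used in GPUFORT's tokenizer. The operator list also includes the logical literals `.true.` and `.false.`, which are not operators but use the same dotted form.

```python
import re

NAMED = ["neqv", "eqv", "and", "or", "not", "eq", "ne", "lt", "le", "gt", "ge",
         "true", "false"]
_OPS = "|".join(NAMED)

# Naive pattern: any \.\w+\. counts as an operator.
NAIVE_RE = re.compile(r"\.\w+\.|\d+|\w+|\S")

# Pattern with hardcoded named operators. The negative lookahead keeps the
# dot in "1.eq.2" from being consumed as part of a real literal.
TOKEN_RE = re.compile(
    rf"\.(?:{_OPS})\."                                            # .and., .eq., ...
    rf"|\d+(?:\.(?!(?:{_OPS})\.)\d*)?(?:[ed][+-]?\d+)?(?:_\w+)?"  # 1, 1., 1.5e3, 1._dp
    r"|\.\d+(?:[ed][+-]?\d+)?(?:_\w+)?"                           # .5, .5_dp
    r"|[a-z_]\w*"                                                 # identifiers
    r"|\*\*|//|==|/=|<=|>=|=>|::"                                 # multi-char symbols
    r"|\S",                                                       # anything else
    re.IGNORECASE,
)

def tokenize(text):
    return TOKEN_RE.findall(text)

tokens = tokenize("a>1._dp.and.")
naive_tokens = NAIVE_RE.findall("a>1._dp.and.")
```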

* translator: Support case statements with multiple values
* python/converter: Remove "-std=f2008" from `--print-gfortran-config` flags, as forcing the standard is an issue in codes where non-standard intrinsics such as `dfloat` are used
Minor:
* namespacegen: Ensure config flags are actually used; pass subprocess command as string, not as list of strings

* Consider gfortran intrinsics in indexer.scope
* indexer.scope: Add routine to check if a name matches that of an intrinsic.
* indexer.scope: Add routine to solely lookup implicitly declared variables according to given scope's implicit spec.

... when checking if an expression is a tensor access or function call before generating C code. Now checking in the following order:
1. Is explicitly declared array / implicitly declared array (via DIMENSION)?
2. Is explicitly declared procedure?
3. Is intrinsic?
If the expression cannot be associated with any of the above, it must be a variable from an ignored module, or an external procedure, which will be available only at the linking stage. This check is still missing.
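
The lookup order above can be sketched as follows. The sets stand in for whatever the indexer scope provides, and `classify_call_like` and `CallLikeKind` are hypothetical names. The point of the ordering is that a user-declared array or procedure hides an intrinsic of the same name.

```python
from enum import Enum

class CallLikeKind(Enum):
    ARRAY_ACCESS = "array access"
    FUNCTION_CALL = "function call"
    INTRINSIC_CALL = "intrinsic call"
    UNRESOLVED = "unresolved"  # ignored module or external procedure

def classify_call_like(name, arrays, procedures, intrinsics):
    """Classify `name(...)` by checking arrays, then procedures, then intrinsics."""
    key = name.lower()  # Fortran names are case-insensitive
    if key in arrays:
        return CallLikeKind.ARRAY_ACCESS
    if key in procedures:
        return CallLikeKind.FUNCTION_CALL
    if key in intrinsics:
        return CallLikeKind.INTRINSIC_CALL
    return CallLikeKind.UNRESOLVED

arrays = {"a", "max"}        # the user declared an array named `max`
procedures = {"compute"}
intrinsics = {"max", "min", "abs"}

kind_of_max = classify_call_like("MAX", arrays, procedures, intrinsics)
```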


domcharrier added 30 commits March 23, 2022 14:48

(WiP) memcpy check is backend specific

be82104

(WiP) Workaround: Do not lookup C type for character, always assume char

1c5498d

(WiP) Do not identify 'module procedure' as 'module' statement

389731c

(WiP) Add '\b' around end/else in util.parsing.tokenize regexes

c853345

(WiP) Improve declaration parsing

0d63de9

(WiP) More modules to ignore

c8bb972

(WiP) Scanner should also not interpret module procedure as module

f782eee

util/parsing: Break apart expression group as matching expressions wi…

cf4e7fa

…ll appear twice in token stream

indexer: Adopt default python package as it appears robuster than orj…

dc440e4

…son when reading large files

WiP: Expanding array assignment operations (unstable)

f40a093

(WiP) Add kernels example. Builds & passes

97a303e

Add previously missing file

43105e4

(WiP) Added accidentally removed line

9fcc8c6

(WiP) Streamlined code generation further

13c7909

* Move more analysis functionality from translator tree node into dedicated module

WiP on loopnest identification

57e3f80

(WiP) Scanner workaround for functions with prefixed return value

823501a

(WiP) Add integer c_char to default conv. table

9c751c9

(WiP): Derive c_type only if needed in C++ kernel

ccc1082

(WiP) converter: Do not read other gpufort mod files if we just want …

3149cc3

…to write them

(WiP) Minimized ACC kernels example

87f52f1

(WiP) Add first impl of host_data via associate + example

8903825

(WiP) Fix tokenizer issue that prevented splitting end(if|do) elseif

946d0d7

* Identify literal strings as single token. Not used yet in indexer.

(WiP) translator: Only collect tensors/call nodes that have been iden…

1e0c8a6

…tified as tensors in all vars routine

acc kernels: Fix kernelgen when bounds unspecified

fc8c0d8

Makefile: Fix target name, regenerate shared gpufort_array.h

26b4c0d

indexer/scope: More robust tag comparison

3407722

* Previous version identified <name> and <name>(\w+) as the same tag.

(WiP,unstable) Commit because dev machine might be migrated tomorrow

2b520e9

(WiP, unstable) Last changes

e9db4b4

domcharrier added 30 commits August 30, 2022 02:11

fort2x/codegen: C++ file for toplevel procedures

de0290b

* Generate a C++ file for toplevel device procedures as well.

fort2x/namespacegen: Change default flags to support arbitrary line l…

4820c98

…ength

gpufort.h: Add alog10/dlog10 device functions

05888bc

scope: Consider implicit rule in variable lookup

84b9d1d

codegen: Add c_int8t to default interfaces

14be046

Add two stability tests

149e133

gpufort.h: Add device function dexp

b0886b2

py*/gpuf*/index*: Support dimension statement

729c4cf

scanner/*/nodes: Lookup of implicit func retty

e48e24a

* Result type of an (accelerator) procedure might also be implicitly defined.
* This commit tries to resolve the type of an accelerator procedure also by checking the implicit spec of the current scope.
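
For reference, a minimal sketch of implicit type resolution. Without an `implicit` statement, Fortran types names starting with `i` through `n` as default integer and all others as default real. The dictionary representation of the implicit spec and the function name are assumptions made for this example. They do not reflect the data structures of the indexer.

```python
def implicit_type(name, implicit_spec=None, implicit_none=False):
    """Return the implicitly defined type of `name`, or None if there is none.

    implicit_spec maps a first letter to a type string (hypothetical format),
    e.g. {"a": "double precision"} for `implicit double precision (a)`.
    """
    if implicit_none:
        return None  # `implicit none`: the name must be declared explicitly
    first = name[0].lower()
    if implicit_spec and first in implicit_spec:
        return implicit_spec[first]
    return "integer" if "i" <= first <= "n" else "real"

# Result type of `function nelem(...)` without an explicit type prefix:
result_type = implicit_type("nelem")
```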

fort2x.hip: Emit prototype for accel. procedure

06c7ef6

* Emit prototype for accelerator procedures so that they can be defined in any particular order afterwards and still depend on each other.

util/parsing: Add parser for external stmt

0fd3495

* Add test
* Integration into indexer still missing ...

Accelerator routines: Take implicit variables into account

daa6234

studies/gfortran: Add study on hiding present intrinsic

0a75f44

hipcodegen: Fix error when catching exception

f6964b2

grammar_f03: Fix grammar of \.not\. op

97ee73e

fort2x/codegen: Reduce number of namespaces

fc4ddb5

* Only introduce a C++ parameter namespace for non-device procedures and program if that program unit contains compute constructs.
* Significant translation time reduction possible.

translator: Improve classific. of call-like expr

310fb03

* Classify function call/array access expressions as array access, function call, or intrinsic call.
* Improve detection of intrinsics

studies: +study on Fortran vs C loop control statements

324b9d6

Add stability test

3ab82ba

Fix:gpufort_api_core.f90: Add missing value attrib

1a77277

* Add missing value attributes to interfaces declared in gpufortrt_wait* Fortran runtime function.

Fix: gpufortrt_api_core.f90: F90 fun bound to wrong C fun

8e8bf8b

Fix: Not map min intrinsic to max fun

19f4c2d

maintain(*/parser.py): support recent Python versions

643ea4a

fix(src/rules.mk): allow HIPCC/HIPFC override

0197a95

maintain(README.md): update install information.

3a05b65
