Mesh refinement: fix MR on GPUs by SeverinDiederichs · Pull Request #564 · Hi-PACE/hipace

SeverinDiederichs · 2021-07-14T20:47:15Z

Currently, using level 1 does not work on GPUs. This PR is resolving the issue.

The problem lies within the field solver: the staging area, which is used to conduct the DST, was based on the real box array. However, on level 1, this boxes have an offset because not all cells are valid on level 1. This causes out of memory accesses in the field solver.
To resolve the issue, the staging area is created like the complex array, without an offset. All functions interacting with the staging area needed to be adapted to take into account the offset.

Using a version which includes #561, I ran the full MR test:

amr.n_cell = 128 128 300
hipace.patch_lo = -1 -1 -3.5
hipace.patch_hi =  1  1 -1
amr.ref_ratio_vect =  8 8 1

hipace.normalized_units=1
hipace.predcorr_max_iterations = 30
hipace.predcorr_B_mixing_factor = 0.05
hipace.predcorr_B_error_tolerance = 4e-2

amr.blocking_factor = 4
amr.max_level = 1

max_step = 0
hipace.output_period = 1

hipace.numprocs_x = 1
hipace.numprocs_y = 1

hipace.depos_order_xy = 2

geometry.coord_sys   = 0                  # 0: Cartesian
geometry.is_periodic =  1     1     0      # Is periodic?
geometry.prob_lo     = -8.   -8.   -6    # physical domain
geometry.prob_hi     =  8.    8.    6

beams.names = beam beam2
beam.injection_type = fixed_weight
beam.num_particles = 1000000
beam.profile = gaussian
beam.zmin = -5.9
beam.zmax = 5.9
beam.radius = 1.2
beam.density = 20.
beam.u_mean = 0. 0. 2000
beam.u_std = 0. 0. 0.
beam.position_mean = 0. 0. 2
beam.position_std = 0.3 0.3 0.5
beam.ppc = 1 1 1
beam.finest_level = 0

beam2.injection_type = fixed_weight
beam2.num_particles = 1000000
beam2.profile = can
beam2.zmin = -1.5
beam2.zmax = -3.0
beam2.radius = 1.2
beam2.density = 20000.
beam2.u_mean = 0. 0. 2000
beam2.u_std = 0. 0. 0.
beam2.position_mean = 0. 0. 0
beam2.position_std = 0.1 0.1 0.2
beam2.ppc = 1 1 1
beam2.finest_level = 1

plasmas.names = plasma ions
plasma.density = 1.
plasma.ppc = 1 1
plasma.u_mean = 0.0 0.0 0.
plasma.element = electron
plasma.level = 0

ions.density = 1.
ions.ppc = 1 1
ions.u_mean = 0.0 0.0 0.
ions.element = proton
ions.level = 1
ions.neutralize_background = 0

diagnostic.diag_type = xyz

Previously, I found some differences between CPU and GPU, however, they were caused by different beam initialization. Reading a beam from file, both give the same result:

This was tested both in Debug and in normal mode.

Using just this PR (so without #561, therefore no plasma is allowed on level 1) and a grid current example, both CPU and GPU give the same result:

Small enough (< few 100s of lines), otherwise it should probably be split into smaller PRs
Tested (describe the tests in the PR description)
Runs on GPU (basic: the code compiles and run well with the new module)
Contains an automated test (checksum and/or comparison with theory)
Documented: all elements (classes and their members, functions, namespaces, etc.) are documented
Constified (All that can be const is const)
Code is clean (no unwanted comments, )
Style and code conventions are respected at the bottom of https://github.com/Hi-PACE/hipace
Proper label and GitHub project, if applicable

MaxThevenet · 2021-07-21T08:45:27Z

Awesome, thanks for this PR!

SeverinDiederichs added 3 commits July 14, 2021 19:16

change staging area to begin at 0

bbf0c74

fix copy to StagingArea from rho and jz

29732be

cleaning

2ad29f6

SeverinDiederichs mentioned this pull request Jul 14, 2021

Implementation of mesh refinement #528

Open

15 tasks

SeverinDiederichs added 8 commits July 14, 2021 23:15

add only transverse offset

f6586c7

fix longitudinal component of staging area

8cd6e2d

fix typo in long derivative

038c8c7

remove white space

8f839d9

get lo directly via the src fabs

d8ecb0e

unify CopyToStagingArea

2c1fbde

Merge branch 'development' into mr_fix_gpus_clean

8fec27e

grow offset box in PoissonSolver

1e2c3f7

SeverinDiederichs requested a review from MaxThevenet July 20, 2021 15:46

MaxThevenet approved these changes Jul 21, 2021

View reviewed changes

add asserts

f1893e4

SeverinDiederichs merged commit 664d769 into Hi-PACE:development Jul 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mesh refinement: fix MR on GPUs#564

Mesh refinement: fix MR on GPUs#564
SeverinDiederichs merged 12 commits into
Hi-PACE:developmentfrom
SeverinDiederichs:mr_fix_gpus

SeverinDiederichs commented Jul 14, 2021 •

edited

Loading

Uh oh!

MaxThevenet commented Jul 21, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SeverinDiederichs commented Jul 14, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MaxThevenet commented Jul 21, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SeverinDiederichs commented Jul 14, 2021 •

edited

Loading