Fetching latest commit…
Cannot retrieve the latest commit at this time.
|Failed to load latest commit information.|
Build info: make [ build with double precision ] make sp [ build for single precision with no debug information ] make debug [ build for debug ] make clean [ removes objects ] Build variable: TIMING=TRUE [ build with kernel execution times ] Ex: make TIMING=TRUE make sp TIMING=TRUE Run: ./lulesh -h [ help] --------------------------------------------------------------------------------- This is the README for LULESH 2.0 Beta More information including LULESH 1.0 can be found at https://codesign.llnl.gov/lulesh.php If you have any questions or problems please contact: Ian Karlin <email@example.com> Jeff Keasler <firstname.lastname@example.org> or Rob Neely <email@example.com> Also please send any notable results to Ian Karlin <firstname.lastname@example.org> as we are still evaluating the performance of this code. *** Notable changes in LULESH 2.0 *** Split functionality into different files lulesh.cc - where most (all?) of the timed functionality lies lulesh-init.cc - Setup code lulesh-util.cc - Non-timed functions CLsetup.cpp - OpenCL init code kernels.cl - OpenCL kernels The concept of "regions" was added, although every region is the same ideal gas material, and the same sedov blast wave problem is still the only problem its hardcoded to solve. Regions allow two things important to making this proxy app more representative: Four of the LULESH routines are now performed on a region-by-region basis, making the memory access patterns non-unit stride Artificial load imbalances can be easily introduced that could impact parallelization strategies. * The load balance flag changes region assignment. Region number is raised to the power entered for assignment probability. Most likely regions changes with MPI process id. * The cost flag raises the cost of ~45% of the regions to evaluate EOS by the entered multiple. The cost of 5% is 10x the entered multiple. MPI and OpenMP were added, and coalesced into a single version of the source that can support serial builds, MPI-only, OpenMP-only, and MPI+OpenMP Added support to write plot files using "poor mans parallel I/O" when linked with the silo library, which in turn can be read by VisIt. Enabled variable timestep calculation by default (courant condition), which results in an additional reduction. Also, seeded the initial timestep based on analytical equation to allow scaling to arbitrary size. Therefore steps to solution will differ from LULESH 1.0. Default domain (mesh) size reduced from 45^3 to 30^3 Command line options to allow for numerous test cases without needing to recompile Performance optimizations and code cleanup uncovered during study of LULESH 1.0 Added a "Figure of Merit" calculation (elements solved per microsecond) and output in support of using LULESH 2.0 for the 2017 CORAL procurement Possible Differences in Final Release (other changes possible as discovered) * High Level mesh structure to allow data structure transformations * Different default parameters * Minor code performance changes and cleanup TODO in future versions * Add reader for (truly) unstructured meshes, probably serial only * CMake based build system