Switch branches/tags
Nothing to show
Find file History
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
..
Failed to load latest commit information.
CLsetup.cc
CLsetup.hpp
Makefile
Makefile.open64
README
commons.hpp
kernels.cl
lulesh-init.cc
lulesh-util.cc
lulesh.cc
lulesh.h

README

Build info:

	make				[ build with double precision ]
	make sp				[ build for single precision with no debug information ]
	make debug			[ build for debug ]
	make clean			[ removes objects ]

Build variable:
	TIMING=TRUE	[ build with kernel execution times ]
	Ex:
		make TIMING=TRUE
		make sp TIMING=TRUE

Run:

	./lulesh -h			[ help]

---------------------------------------------------------------------------------
This is the README for LULESH 2.0 Beta

More information including LULESH 1.0 can be found at https://codesign.llnl.gov/lulesh.php

If you have any questions or problems please contact:

Ian Karlin <karlin1@llnl.gov>
Jeff Keasler <keasler1@llnl.gov> or
Rob Neely <neely4@llnl.gov>

Also please send any notable results to Ian Karlin <karlin1@llnl.gov> as we are still evaluating the performance of this code.

*** Notable changes in LULESH 2.0 ***

Split functionality into different files
lulesh.cc - where most (all?) of the timed functionality lies
lulesh-init.cc - Setup code
lulesh-util.cc - Non-timed functions
CLsetup.cpp	   - OpenCL init code
kernels.cl     - OpenCL kernels

The concept of "regions" was added, although every region is the same ideal gas material, and the same sedov blast wave problem is still the only problem its hardcoded to solve. Regions allow two things important to making this proxy app more representative:

Four of the LULESH routines are now performed on a region-by-region basis, making the memory access patterns non-unit stride

Artificial load imbalances can be easily introduced that could impact parallelization strategies.  
   * The load balance flag changes region assignment.  Region number is raised to the power entered for assignment probability.  Most likely regions changes with MPI process id.
   * The cost flag raises the cost of ~45% of the regions to evaluate EOS by the entered multiple.  The cost of 5% is 10x the entered
 multiple.

MPI and OpenMP were added, and coalesced into a single version of the source that can support serial builds, MPI-only, OpenMP-only, and MPI+OpenMP

Added support to write plot files using "poor mans parallel I/O" when linked with the silo library, which in turn can be read by VisIt.

Enabled variable timestep calculation by default (courant condition), which results in an additional reduction.  Also, seeded the initial timestep based on analytical equation to allow scaling to arbitrary size.  Therefore steps to solution will differ from LULESH 1.0.

Default domain (mesh) size reduced from 45^3 to 30^3

Command line options to allow for numerous test cases without needing to recompile

Performance optimizations and code cleanup uncovered during study of LULESH 1.0

Added a "Figure of Merit" calculation (elements solved per microsecond) and output in support of using LULESH 2.0 for the 2017 CORAL procurement

Possible Differences in Final Release (other changes possible as discovered)

* High Level mesh structure to allow data structure transformations
* Different default parameters
* Minor code performance changes and cleanup

TODO in future versions
* Add reader for (truly) unstructured meshes, probably serial only
* CMake based build system