No description, website, or topics provided.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
demo Updated demo Oct 1, 2012
sample reorg repository Oct 1, 2012
src fix bugs of the variance Apr 28, 2013 Updated docs Oct 1, 2012 Updated docs Oct 1, 2012 update about outputing the variance Apr 26, 2013 Docs update Oct 1, 2012 Docs update Oct 1, 2012
example_commands.txt update about outputing the variance Apr 26, 2013
make.dist Updated docs Oct 1, 2012

simform: MapReduce based Simulation Informatics

Written by Austin Benson, Paul Constantine, David F. Gleich, and Yangyang Hou


This code is designed to work with a database of exodus files produced by the ARIA thermal code in the Sierra Mechanics toolbox. For such a database, it will:

  • convert the data into a format to make it easier to process in Hadoop
  • compute the SVD of the database
  • interpolate the solution at any set of new points using a linear interpolation routine
  • interpolate the solution at any set of new points using a non-linear model reduction based interpolation routine that includes a estimate of the error
  • convert all the MapReduce based predictions to exodus files.


Here are some commands I ran to do a quick analysis on the EC2 cluster:


$ make dir=hdfs://nebula/data/exodus-runs

$ make setup_database name=runs variable=TEMP dir=hdfs://
$ make -f runs preprocess
$ make -f runs convert timestepfile=timesteps.txt

In this case, we had to normalize time-steps across the different files as the default step-length is variable.

Compute the global variance

$ make -f runs var
$ make -f runs outputvar

Simple interpolation

$ make -f runs predict design=design_points.txt points=new_points.txt

and then dump out exodus files

$ make -f runs seq2exodus  numExodusfiles=10 OutputName=output/thermal_maze

SVD based Model Reduction

$ make -f runs seq2mseq
$ make -f runs model numExodusfiles=6
$ make -f runs interpsvd design=design_points.txt points=new_points.txt


mrsimform requires quite a few libraries as it combines the functionality of many different pieces of code. Please see for help on setup. The code does not actually require any pieces to be installed on the system. But this makes it much easier. We've provided a list of commands to setup a demostration cluster on Amazon's EC2 system and process a small collection of files. See the demo directory.


These codes are based on a few active research projects from the authors.