Visual Analysis Tool for Fault Characterization of Supercomputers

This tool is designed to visualize and analyze large-scale heterogeneous logs on supercomputers, in order to characterize faults. Currently, the tool provides a user interface to explore logs on Mira. Three types are involved, including

RAS (Reliability, Availability, Serviceability) logs
Cobalt and backend job logs
Darshan logs (not yet supported)

Software architecture

The server provides services through node.js. There are two backend databases: the hand-written C++ data cube engine, and MongoDB. The data cube provides high performance query interface that responses a data cube query in less than a second. MongoDB serves as a general data retrieval tool to access logs.

Build Guidelines

Prerequisites

A decent compiler that supports C++11
node.js (6.9.4)
mongodb (3.2.11)

Data preparation

MongoDB directory
The pregenerated data cube file (raslog)

Installation

Install node.js and node-gyp. We recommend to install node.js in home directory and set $PATH to the installation path.

npm install -g node-gyp

Clone the repo and install dependencies with npm

git clone git@bitbucket.org:hanqiguo/catalogvis.git
cd catalogvis
npm install

Build the C++ data cube (you may need to modify binding.gyp to add C++11 arguments)

cd cpp
node-gyp configure
node-gyp build

Start the MongoDB daemon

cd $your_mongodb_dir
mongod --dbpath=. &> log &

Copy the data (raslog) to the root directory of the project, and then run the server

node server.js

You may need a process manager, such as pm2.js or forever.js to keep the server running. After the server is started, you can visit the server through

http://your_ip:8081

Security

You can limit the access with a preshared key. Uncomment the line in server.js:

app.use(basicAuth("catalog", "catalog1"));

You may also need to secure MongoDB to only allow connection from localhost.

Name		Name	Last commit message	Last commit date
Latest commit History 359 Commits
cpp		cpp
public		public
.gitignore		.gitignore
README.md		README.md
analysis.js		analysis.js
buildPartitionInfo.js		buildPartitionInfo.js
buildProjInfo.js		buildProjInfo.js
buildUserInfo.js		buildUserInfo.js
color.js		color.js
cube.js		cube.js
cubeHeaderConverter.js		cubeHeaderConverter.js
graph.js		graph.js
graphRM.csv		graphRM.csv
graphRMN.csv		graphRMN.csv
graphRMNJ.csv		graphRMNJ.csv
icons.js		icons.js
import.md		import.md
importBackendJobLog.js		importBackendJobLog.js
importCobaltLog.js		importCobaltLog.js
importCube.js		importCube.js
importLog2.js		importLog2.js
importRAS.js		importRAS.js
level.js		level.js
machine.csv		machine.csv
machineViewGenerator.js		machineViewGenerator.js
maintenance.csv		maintenance.csv
mira.js		mira.js
miscMap.js		miscMap.js
package.json		package.json
parser.js		parser.js
partitionInfo.csv		partitionInfo.csv
profiles.js		profiles.js
projProfiles.csv		projProfiles.csv
rasbook.js		rasbook.js
run.sh		run.sh
server.js		server.js
testQuery.js		testQuery.js
torus.js		torus.js
torusRMNJ.csv		torusRMNJ.csv
userProfiles.csv		userProfiles.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Visual Analysis Tool for Fault Characterization of Supercomputers

Software architecture

Build Guidelines

Prerequisites

Data preparation

Installation

Security

About

Releases

Packages

Languages

hguo/LaValse

Folders and files

Latest commit

History

Repository files navigation

Visual Analysis Tool for Fault Characterization of Supercomputers

Software architecture

Build Guidelines

Prerequisites

Data preparation

Installation

Security

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages