fti_cuda

This project demonstrates how FTI can be used with CUDA. It performs a simple vector addition of C = A + B by dividing the vector size evenly among the number of MPI processes. Each process then launches a CUDA kernel to compute their partition of the vector.

Compiling

To compile the following environment variables need to be set:

MPI_HOME
CUDA_HOME
FTI_HOME

These variables should point to the home directory of MPI, CUDA and FTI respectively. To compile run make.

Running

Execute the binary with the following two arguments

vector-size
iterations

vector-size Specifies the length of the vector
iterations Specifies how many times each MPI process should launch its kernel

You will need to have FTI built and configured for a successful run. For more information on FTI see their github repository.

Example Run

The following will spawn 8 MPI processes and each process will execute their kernel 10 times.

mpirun -np 8 ./fti_cuda.out 10000 10

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
config.fti		config.fti
fti_cuda.cu		fti_cuda.cu
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fti_cuda

Compiling

Running

Example Run

About

Releases

Packages

Languages

License

maxbaird/fti_cuda

Folders and files

Latest commit

History

Repository files navigation

fti_cuda

Compiling

Running

Example Run

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages