Skip to content

FluidNumerics/rcc-tf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Research Computing Cloud (RCC) - Terraform Module

Copyright 2021 Fluid Numerics LLC

About

The Research Computing Cloud (RCC) Terraform Module is infrastructure-as-code that will quickly get you started with a complete HPC/RC cluster on Google Cloud. Features of this deployment include:

  • Slurm job scheduler hosted on a controller instance
  • Multiple login nodes
  • Multi-region, multi-zone compute partitions
  • (Optional) Cloud SQL Slurm Database
  • (Optional) Lustre file system (mounts to /mnt/lustre)
  • (Optional) NFS Filestore file system (mounts to /mnt/filestore)

We recommend that you use this infrastructure as code with Fluid Numerics' RCC VM Image Library image families.

Images

Free Images

Fluid Numerics makes some VM images available for free. These images provide a good starting point, but do not grant you access to support from Fluid Numerics.

Supported Images

If you would like to obtain support from Fluid Numerics for using this solution, you can use the following VM Images :

Use of these images incurs a licensing fee of $0.01 USD/vCPU/hour and $0.09 USD/GPU/hour. If you are interested in obtaining a different usage-based pricing model for support, reach out to support@fluidnumerics.com.

The use of these images are subject to the terms of Fluid Numerics' RCC EULA

Quick Start

Deploy your cluster

  1. Navigate to Google Cloud Shell
  2. Clone this repository
git clone https://github.com/fluidnumerics/rcc-tf ~/rcc-tf
  1. Set your project ID, replacing PROJECT-ID with your Google Cloud project ID
gcloud config set project PROJECT-ID
  1. (Optional) Enable Lustre parallel file system
export USE_LUSTRE=true
  1. (Optional) Enable Filestore
export USE_FILESTORE=true
  1. (Optional) Enable CloudSQL for Slurm database
export USE_CLOUDSQL=true
  1. Create a terraform plan
cd ~/rcc-tf/
make plan
  1. Deploy you're infrastructure when ready
make apply

Tear down your cluster

When you're done using your cluster, you can tear down resources to avoid accruing additional costs on Google Cloud.

  1. Navigate to Google Cloud Shell
  2. Navigate to the ~/rcc-tf/ directory
cd ~/rcc-tf/
  1. Delete your infrastructure
make destroy

Dive Deeper

You can learn more about configuring your cluster at the rcc-tf readthedocs.

Reporting Issues

You can report any issues associated with the rcc-tf repository using our issue tracker