Skip to content

Latest commit

 

History

History
36 lines (20 loc) · 3.53 KB

hardware-suggestions.md

File metadata and controls

36 lines (20 loc) · 3.53 KB

We are often asked to recommend hardware configurations for servers and clusters to run the OpenQuake engine. Obviously this depends very much on the calculations one wishes to perform and the available budget, but here we attempt to provide some general advice, please remember "your mileage may vary".

A general rule is the more GB of RAM and the more GHz you have, the better it is. The OpenQuake Engine is designed with a suggested amount of 2GB of RAM per worker core for classical, 4GB of RAM for event based computations. In the controller it is best to have more RAM.

Single node configuration

Small to medium hazard calculations and small risk calculations can run on a laptop or an equivalent cloud server: 8GB of RAM and 4 cores with several GB of disk space. Using >= 7.2k RPM disks or solid-state drives (SSD) will improve the overall performance.

More serious calculations would be better handled by a single server with a hardware RAID support: our "hope" server is a Dell® PowerEdge™ R420 with 12 cores (2 x Intel® Xeon™ E5-2430) 64GB of RAM and 4x2TB disks in a RAID 10 configuration and a hardware RAID controller (Dell® PERC H710). It is used now primarily to host databases but for a little while it was the best machine we had in Pavia and was used to run calculations too.

Large scale/multi-node configuration

Our cluster master node "wilson" is a Dell® PowerEdge™ R720 with 16 Xeon cores, 128GB of RAM RAID arrays and 10Gbit/s networking. This sort of machine would be able to handle some pretty large calculations as a single server but can also be used as the master node if you find you need to add more machines to form a cluster; so this might be a good starting point if it is compatible with your budget.

For our largest calculations on a continental or global scale we use a cluster composed of "wilson" (see above) acting as a "master" and two clusters composed by 5 worker nodes (Dell® PowerEdge™ M915 blades) each with 4x 16 cores AMD® Opteron™ and 128GB of RAM for the first and 4 Dell® PowerEdge™ M630 blades (2x 20t/10c Intel® Xeon™ E5-2640v4, 128GB RAM each) for the second cluster. Worker nodes do not need much disk since the data is stored in the master filesystem.

Network is made with a link aggregation between two 10 gigabit connections; however, up to a couple hundred celery workers a single gigabit connection is enough.

Windows is not supported for large scale deployments.

Cloud

The OpenQuake Engine can be deployed in the Cloud a virtual machine (using standard Ubuntu or RHEL/CentOS binary packages) or using Docker containers. Both single node and multi-node configuration are supported.

Deployment can considered stateless (an "always-on" configuration isn't required) as soon as outputs have been exported via the REST API or via the CLI.

Our users reported to have successfully deployed the OpenQuake Engine on Amazon AWS, Google GCE, Microsoft Azure, OpenStack IaaS and many other providers or internal platforms.


All product and company names are trademarks™ or registered trademarks© of their respective holders.

Getting help

If you need help or have questions/comments/feedback for us, you can: