Skip to content
This repository was archived by the owner on Mar 13, 2023. It is now read-only.
This repository was archived by the owner on Mar 13, 2023. It is now read-only.

parallelCluster Manager 3.2.0 failed on PERSISTENT_2 Lustre creation #274

@cyberchip-wang

Description

@cyberchip-wang

parallelCluster Manager 3.2.0 created with https://www.hpcworkshops.com/03-deploy-pcm/01-deploy-pcm.html does not propagate PerUnitStorageThroughput to the final Cluster Configuration. This caused the "Dry Run" failed. The workaround is adding a line manually to the Cluster Configuration template file: PerUnitStorageThroughput: 125

Screenshots:
Storage Properties
Dry run error

URL for AWS ParallelCluster Manager:
https://k0fymmi0ei.execute-api.us-east-2.amazonaws.com/home

The Lustre section in the Cluster Configuration:
SharedStorage:

Name: FsxLustre0
StorageType: FsxLustre
MountDir: /shared
FsxLustreSettings:
StorageCapacity: 1200
DeploymentType: PERSISTENT_2
DataCompressionType: LZ4
Dry Run error:

Invalid cluster configuration.
ValidationErrors:
FsxPersistentOptionsValidator: Per unit storage throughput must be specified when deployment type is PERSISTENT_2.

The issue can be reproduced by following the instructions from hpcworkshops:

Deploy the Pcluster Manager stack:
https://www.hpcworkshops.com/03-deploy-pcm/01-deploy-pcm.html
Create HPC Cluster:
https://www.hpcworkshops.com/06-fsx-for-lustre/01-create-cluster.html
Create FSx Lustre:
https://www.hpcworkshops.com/06-fsx-for-lustre/02-create-cluster-fsx.html

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions