From b3602bde03e599779aa57c944adc15e2b8f87b01 Mon Sep 17 00:00:00 2001
From: bcumming
Date: Thu, 27 Mar 2025 10:51:43 +0100
Subject: [PATCH 1/3] use all caps for platform acronyms

---
 docs/clusters/santis.md     | 4 ++--
 docs/platforms/cwp/index.md | 2 +-
 docs/platforms/mlp/index.md | 8 ++++----
 3 files changed, 7 insertions(+), 7 deletions(-)

diff --git a/docs/clusters/santis.md b/docs/clusters/santis.md
index 44c7a006..962537a7 100644
--- a/docs/clusters/santis.md
+++ b/docs/clusters/santis.md
@@ -1,7 +1,7 @@
 [](){#ref-cluster-santis}
 # Santis

-Santis is an Alps cluster that provides GPU accelerators and file systems designed to meet the needs of climate and weather models for the [CWp][ref-platform-cwp].
+Santis is an Alps cluster that provides GPU accelerators and file systems designed to meet the needs of climate and weather models for the [CWP][ref-platform-cwp].

 ## Cluster specification

@@ -19,7 +19,7 @@ You will be assigned to one of the four login nodes when you ssh onto the system

 ### Storage and file systems

-Santis uses the [CWp filesystems and storage policies][ref-cwp-storage].
+Santis uses the [CWP filesystems and storage policies][ref-cwp-storage].

 ## Getting started

diff --git a/docs/platforms/cwp/index.md b/docs/platforms/cwp/index.md
index cf40ff4e..bcfa9f77 100644
--- a/docs/platforms/cwp/index.md
+++ b/docs/platforms/cwp/index.md
@@ -30,7 +30,7 @@ Its name derives from the highest mountain Säntis in the Alpstein massif of Nor
 [](){#ref-cwp-storage}
 ## File systems and storage

-There are three main file systems mounted on the CWp system Santis.
+There are three main file systems mounted on the CWP system Santis.

 | type |mount | filesystem |
 | -- | -- | -- |
diff --git a/docs/platforms/mlp/index.md b/docs/platforms/mlp/index.md
index ddd2addc..b6aa88d6 100644
--- a/docs/platforms/mlp/index.md
+++ b/docs/platforms/mlp/index.md
@@ -1,20 +1,20 @@
 [](){#ref-platform-mlp}
 # Machine learning platform

-The Machine Learning Platform (MLp) provides compute, storage and expertise to the machine learning and AI community in Switzerlan, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/).
+The Machine Learning Platform (MLP) provides compute, storage and expertise to the machine learning and AI community in Switzerlan, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/).

 ## Getting started

 ### Getting access

-Project administrators (PIs and deputy PIs) of projects on the MLp can to invite users to join their project, before they can use the project's resources on Alps.
+Project administrators (PIs and deputy PIs) of projects on the MLP can to invite users to join their project, before they can use the project's resources on Alps.
 This is performed using the [project management tool][ref-account-waldur]

 Once invited to a project, you will receive an email, which you can need to create an account and configure [multi-factor authentication][ref-mfa] (MFA).

 ## Systems

-The main cluster provided by the MLp is Clariden, a large Grace-Hopper GPU system on Alps.
+The main cluster provided by the MLP is Clariden, a large Grace-Hopper GPU system on Alps.

 - :fontawesome-solid-mountain: [__Clariden__][ref-cluster-clariden]
@@ -31,7 +31,7 @@ The main cluster provided by the MLp is Clariden, a large Grace-Hopper GPU syste
 [](){#ref-mlp-storage}
 ## File Systems and Storage

-There are three main file systems mounted on the MLp clusters Clariden and Bristen.
+There are three main file systems mounted on the MLP clusters Clariden and Bristen.

 | type |mount | filesystem |
 | -- | -- | -- |

From 5d5b215b2b16efcc6ddbfbe7a7dfe9dc30774261 Mon Sep 17 00:00:00 2001
From: bcumming
Date: Thu, 27 Mar 2025 10:58:33 +0100
Subject: [PATCH 2/3] fixes that were too late in the PR review

---
 docs/clusters/santis.md     | 6 +++++-
 docs/guides/gb2025.md       | 1 +
 docs/platforms/cwp/index.md | 2 +-
 docs/platforms/mlp/index.md | 2 +-
 4 files changed, 8 insertions(+), 3 deletions(-)

diff --git a/docs/clusters/santis.md b/docs/clusters/santis.md
index 962537a7..485b10e1 100644
--- a/docs/clusters/santis.md
+++ b/docs/clusters/santis.md
@@ -7,7 +7,11 @@ Santis is an Alps cluster that provides GPU accelerators and file systems design

 ### Compute nodes

-Santis consists of around ??? [Grace-Hopper nodes][ref-alps-gh200-node].
+Santis consists of around 600 [Grace-Hopper nodes][ref-alps-gh200-node].
+
+!!! note
+    In late March 2025 Santis was temporarily expanded to 1233 nodes for [Gordon Bell and HPL runs][ref-gb2025].
+    The number of nodes can change when nodes are added or removed from other clusters on Alps.

 There are four login nodes, labelled `santis-ln00[1-4]`.

diff --git a/docs/guides/gb2025.md b/docs/guides/gb2025.md
index 7a1ea414..c82e857d 100644
--- a/docs/guides/gb2025.md
+++ b/docs/guides/gb2025.md
@@ -1,3 +1,4 @@
+[](){#ref-gb2025}
 # Gordon Bell and HPL runs 2025

 For Gordon Bell and HPL runs in March-April 2025, CSCS has created a reservation on Santis with 1333 nodes (12 cabinets).
diff --git a/docs/platforms/cwp/index.md b/docs/platforms/cwp/index.md
index bcfa9f77..b4657bf9 100644
--- a/docs/platforms/cwp/index.md
+++ b/docs/platforms/cwp/index.md
@@ -35,7 +35,7 @@ There are three main file systems mounted on the CWP system Santis.
 | type |mount | filesystem |
 | -- | -- | -- |
 | Home | /users/$USER | [VAST][ref-alps-vast] |
-| Scratch | `/capstor/scratch/cscs/$USER` | [Iopstor][ref-alps-capstor] |
+| Scratch | `/capstor/scratch/cscs/$USER` | [Capstor][ref-alps-capstor] |
 | Project | `/capstor/store/cscs/userlab/` | [Capstor][ref-alps-capstor] |

 ### Home
diff --git a/docs/platforms/mlp/index.md b/docs/platforms/mlp/index.md
index b6aa88d6..df70f410 100644
--- a/docs/platforms/mlp/index.md
+++ b/docs/platforms/mlp/index.md
@@ -1,7 +1,7 @@
 [](){#ref-platform-mlp}
 # Machine learning platform

-The Machine Learning Platform (MLP) provides compute, storage and expertise to the machine learning and AI community in Switzerlan, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/).
+The Machine Learning Platform (MLP) provides compute, storage and expertise to the machine learning and AI community in Switzerland, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/).

 ## Getting started

From a5c3673e2a392513ee121298e68849a03f1301ca Mon Sep 17 00:00:00 2001
From: bcumming
Date: Thu, 27 Mar 2025 11:11:53 +0100
Subject: [PATCH 3/3] fix number of nodes in the santis description table

---
 docs/clusters/santis.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/clusters/santis.md b/docs/clusters/santis.md
index 485b10e1..58630875 100644
--- a/docs/clusters/santis.md
+++ b/docs/clusters/santis.md
@@ -18,8 +18,8 @@ There are four login nodes, labelled `santis-ln00[1-4]`.
 You will be assigned to one of the four login nodes when you ssh onto the system, from where you can edit files, compile applications and start simulation jobs.

 | node type | number of nodes | total CPU sockets | total GPUs |
-|-----------|--------| ----------------- | ---------- |
-| [gh200][ref-alps-gh200-node] | 1,200 | 4,800 | 4,800 |
+|-----------|-----------------| ----------------- | ---------- |
+| [gh200][ref-alps-gh200-node] | 600 | 2,400 | 2,400 |

 ### Storage and file systems