Skip to content

Conversation

@bcumming
Copy link
Member

Take the MLp and clariden docs as a template, and create CWp and santis docs.

@bcumming bcumming requested review from RMeli and msimberg as code owners March 27, 2025 08:19
@github-actions
Copy link

preview available: https://docs.tds.cscs.ch/61

@bcumming bcumming merged commit 6c24cfb into eth-cscs:main Mar 27, 2025
1 check passed
[](){#ref-cluster-santis}
# Santis

Santis is an Alps cluster that provides GPU accelerators and file systems designed to meet the needs of climate and weather models for the [CWp][ref-platform-cwp].
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Mostly a question for myself to know: is CWp/MLp/etc. the way we capitalize this, not CWP/MLP/etc.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I just asked, and it was XYp, but it was changed to XYP.
I will make the change.


### Compute nodes

Santis consists of around ??? [Grace-Hopper nodes][ref-alps-gh200-node].
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Santis consists of around ??? [Grace-Hopper nodes][ref-alps-gh200-node].
Santis consists of around 600 [Grace-Hopper nodes][ref-alps-gh200-node].

? Not sure what the official number is supposed to be. The table below says 1200, but I think the real number is lower than that at least.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have added a temporary link to the Gordon Bell guide.


| node type | number of nodes | total CPU sockets | total GPUs |
|-----------|--------| ----------------- | ---------- |
| [gh200][ref-alps-gh200-node] | 1,200 | 4,800 | 4,800 |
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

See comment above.

| type |mount | filesystem |
| -- | -- | -- |
| Home | /users/$USER | [VAST][ref-alps-vast] |
| Scratch | `/capstor/scratch/cscs/$USER` | [Iopstor][ref-alps-capstor] |
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
| Scratch | `/capstor/scratch/cscs/$USER` | [Iopstor][ref-alps-capstor] |
| Scratch | `/capstor/scratch/cscs/$USER` | [Iopstor][ref-alps-iopstor] |

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry, the change should have been [Iopstor] to [Capstor]

* who are the users (help answer the question "is this the platform that I am on")
* who are the partners (SwissAI, etc)
* how to get apply to access MLp (if that is a thing)
The Machine Learning Platform (MLp) provides compute, storage and expertise to the machine learning and AI community in Switzerlan, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
The Machine Learning Platform (MLp) provides compute, storage and expertise to the machine learning and AI community in Switzerlan, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/).
The Machine Learning Platform (MLp) provides compute, storage and expertise to the machine learning and AI community in Switzerland, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/).

@msimberg
Copy link
Collaborator

I was too slow. Do you want me to make a PR with the suggested changes or do you want to change it?

@bcumming
Copy link
Member Author

I will fix it, thanks!

@bcumming bcumming deleted the improve-platforms branch June 30, 2025 07:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants