-
Notifications
You must be signed in to change notification settings - Fork 41
add platform docs for CWp #61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
preview available: https://docs.tds.cscs.ch/61 |
| [](){#ref-cluster-santis} | ||
| # Santis | ||
|
|
||
| Santis is an Alps cluster that provides GPU accelerators and file systems designed to meet the needs of climate and weather models for the [CWp][ref-platform-cwp]. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mostly a question for myself to know: is CWp/MLp/etc. the way we capitalize this, not CWP/MLP/etc.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I just asked, and it was XYp, but it was changed to XYP.
I will make the change.
|
|
||
| ### Compute nodes | ||
|
|
||
| Santis consists of around ??? [Grace-Hopper nodes][ref-alps-gh200-node]. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| Santis consists of around ??? [Grace-Hopper nodes][ref-alps-gh200-node]. | |
| Santis consists of around 600 [Grace-Hopper nodes][ref-alps-gh200-node]. |
? Not sure what the official number is supposed to be. The table below says 1200, but I think the real number is lower than that at least.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have added a temporary link to the Gordon Bell guide.
|
|
||
| | node type | number of nodes | total CPU sockets | total GPUs | | ||
| |-----------|--------| ----------------- | ---------- | | ||
| | [gh200][ref-alps-gh200-node] | 1,200 | 4,800 | 4,800 | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See comment above.
| | type |mount | filesystem | | ||
| | -- | -- | -- | | ||
| | Home | /users/$USER | [VAST][ref-alps-vast] | | ||
| | Scratch | `/capstor/scratch/cscs/$USER` | [Iopstor][ref-alps-capstor] | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| | Scratch | `/capstor/scratch/cscs/$USER` | [Iopstor][ref-alps-capstor] | | |
| | Scratch | `/capstor/scratch/cscs/$USER` | [Iopstor][ref-alps-iopstor] | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, the change should have been [Iopstor] to [Capstor]
| * who are the users (help answer the question "is this the platform that I am on") | ||
| * who are the partners (SwissAI, etc) | ||
| * how to get apply to access MLp (if that is a thing) | ||
| The Machine Learning Platform (MLp) provides compute, storage and expertise to the machine learning and AI community in Switzerlan, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| The Machine Learning Platform (MLp) provides compute, storage and expertise to the machine learning and AI community in Switzerlan, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/). | |
| The Machine Learning Platform (MLp) provides compute, storage and expertise to the machine learning and AI community in Switzerland, with the main user being the [Swiss AI Initiative](https://www.swiss-ai.org/). |
|
I was too slow. Do you want me to make a PR with the suggested changes or do you want to change it? |
|
I will fix it, thanks! |
Take the MLp and clariden docs as a template, and create CWp and santis docs.