Welcome! This onboarding document helps to create an Azure confidential VM with NVIDIA H100 Tensor Core GPU in Confidential Computing mode.
Please note that any associated materials or documentation below are strictly confidential and subject to obligations in the Non-Disclosure Agreement (NDA) signed between Microsoft and your organization.
Through this onboarding process, you can:
- Deploy a secure boot enabled Azure confidential virtual machine.
- Attach One (1) NVIDIA H100 PCIe Tensor Core GPU in Confidential Computing mode.
- Perform in-guest platform attestation to retrieve raw hardware evidence.
- Perform local GPU attestation.
- Run AI Workload in HW based Trusted Execution Environments (TEE).
- Give us feedback and request features to help shape this VM SKU.
- Next-generation CPUs: AMD 4th Gen EPYC processors with SEV-SNP technology to meet CPU performance for AI training/inference.
- AI state-of-the-art GPUs: NVIDIA H100 Tensor Core GPUs with 94GB of High Bandwidth Memory 3 (HBM3).
- Trusted Execution Environment (TEE) that spans confidential VM on the CPU and attached GPU, enabling secure offload of data, models, and computation to the GPU.
- VM memory encryption using hardware-generated encryption keys.
- Encrypted communication over PCIe between confidential VM and GPU.
- Attestation: Ability for CPU and GPU to generate remotely verifiable attestation reports capturing CPU and GPU security critical hardware and firmware configuration.
- Azure subscription and a contributor or administrator role to the subscription
- Quota for the NCC H100 v5 VM SKU
- Install Azure CLI
The following four major steps are provided to help deploy your first Confidential GPU VM and to run a sample workload. The first step sets up the Confidential GPU environment and create the VM. The second step is optional, only required if you choose the customer managed key option (read more about key management options here: Azure Key Management). The third step performs attestation verification and ensures the CGPU mode has been turned on successfully. The last step helps run a sample workload to verify and complete the setup. These steps are only required the first time you are deploying your VM.
- Create CGPU VM
- [Optional] Create Customer Managed Key
- Attestation Verification
- Workload Running
Please make sure to follow all steps exactly as detailed. If you run into issues, please check the FAQ page to check if there is more information or reach out using the contact information at the bottom of this document
We support the options to create confidential GPUs with Windows and Linux hosts, as well as with customer (CMK) and platform (PMK) managed keys. We recommend for first time users to try out the PMK option since it's simpler to onboard. If you want to use a one step auto-onboarding script, you can chose between the following instruction options:
If you prefer to go through the steps manually, you can follow these instructions:
- NVIDIA certified VMI-based provisioning with the GPU driver, CUDA, ML tools, and a customized local verifier already pre-installed
- In-guest attestation evidence appraised by Microsoft Azure Attestation Service (MAA)
This offer is currently available in the East US 2 and West Europe regions. We plan to expand to more regions in a phased manner during upcoming semesters.
For any questions, please check the FAQ page here: Frequently Asked Questions
For additional help, please open a support ticket through the Azure portal. Thanks for your cooperation and help!