5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

for many environments, I/O intensive workloads can suffer significant performance degradation. Each I/O operation - inbound or outbound - must be intercepted and processed by the VI adding significant platform resource overhead.

SR-IOV provides tools to reduce these platform resources overheads. The benefits of SR-IOV are:

• The ability to eliminate VI involvement in main data movement actions - DMA, Memory space access, interrupt processing, etc. Elimination of VI interception and processing of each I/O operation can provide significant

application and platform performance improvements.

• Standardized method to control SR-IOV resource configuration and management through Single Root PCI Manager (SR-PCIM).

。 Due to a variety of implementation options - system firmware, VI, operating system, I/O drivers, etc. - SR-PCIM implementation is outside the scope of this specification.

• The ability to reduce the hardware requirements and associated cost with provisioning potentially a significant number of I/O Functions within a device.

• The ability to integrate SR-IOV with other I/O virtualization technologies such as Address Translation Services (ATS), Address Translation and Protection Table (ATPT) technologies, and interrupt remapping technologies to create robust, complete I/O virtualization solutions.

[Figure 9-3](#bookmark1)illustrates an example SR-IOV capable platform.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAATgAAAAqCAYAAAAnFkdOAAAQvUlEQVR4nO3da3Abx2EH8P/uvQ8HgCLAN0FABCOH1Nu2Etey3JnWSWzl0Q9J2zR9ZPJoZzKZJk0/9EPTZppJ0zRNvmTaL6mn/Za2aZK6k9iS6zh1/ZBjR7FoxbIoW4IECJT4AkQCONz7dvsBoCIrli3JlChL+5vZ2b29o7BLgf+5OxzuCIS3nWK+QADIALRuUS9oa1fSTwjRFVUxZUk2qSTplBCDUmoQQnRCiA5A55yrADTOucY5lznnFAABQDg4WW2f7+NAt03BOTh/zTarGCGEgxAA4KulswgOgJ3vwy+3IYTEhJCAEOITQnwAAQCPc+4xxlzGmcsZ9+I4doMgaMdx7ALwuyW4oO1faX+pUmZX+38mrA/y5psIa6UbTAaA1OuUNIAUpTStKEqvqqkZWZJ7CSE9nPMUYywZM5YMg8CIosjgnBNZlmNJkmJJlpkiy0xVVa4oClc1lSuKylVVhaqq0DSNaJpGdF2n3TbVdF3SNE3SNI2ubqeoCrRuW1U1XNi/2pZlBZQSEEJAKQVIp03QrQk6NaXd5QvXd9ZxzsE5wDjrtjm6QXh+HeccjLOL+jlixhD4AYIgQBh26vPlEv2e50W+7zPP8+IgCLjnecz3fR74Pvc7BWEQIggCEoQhCcOAhEFIoyiiURTROI6lKIokRVECWZZdWZHbEpVsSmkTQCOO4+U4js+5rluL43gFQBNAo1v/SilVytF1f/PdokTAXaFuSKUA9AMYWK0ppX2XDKY4ToRhmAjDUJckiWm6HpqGESUsi1mWxVOpFEmlUjTd06Ok0yklmUxRy7JgJS1YloVkMgnLSp5ftiwLmqat6+/hVsMYg+M4sG0bdsuGbbfQarUuWO70NZvNqNFohI2VRtRsNnmz2eRt26btdpt6nqd4nqfKshzJsuzIiuxIVLIJIU1CyPmg9H2/HoZhHcBityx0y1KpUg7W9RfxNiMCDkAxX5AAZNAJrPOhpWrqsKEbeUmShhljA2EY9vq+n6YSZalUyu/tzcT9/f10YHBA7u/v15PJJLGsJC4Mp9VgSiaTSCQSUFV1XecqrC/OOVzXhW3baDVbsO1fDUnbtrG8vBwtzC/4CwsLUW1piSyvLCtO29FlWfZUTV2RJXkJwHwYhlXP86pRFM2jE4KLF9R2qVLm6znf9XZTB1wxXzABbASQBzBIKR3UdX1UVdUcIWQojuP+MAw3+L5vGoYR9GzoCTOZLBsY6JcGBgbV/oEBNZPNIJvNIpPJIJvtQyabgWEY6zwz4VbEGMPKygrq9TpqSzXU6zXUazXU63U+NzfvLczPh0uLi/zc8rLcajY1zjk0TVuRZfkclaSFKAzPuK5bCcNwDp0QPAvgJICzN+v5xbd1wBXzBRnAKDohNq4oyoRlWVs45xO+749EUWRms1lnZHSUDw0PyQMDg1pfX5+0Glqd4MpiQ+8GyLK8zrMRhLXlOA5qtRrqtTpqtSXU63XUazUsLCz483PzwZkzZ9jc2bOq4ziqbhhLqqJUgjA85rTbM4yxEoBTAE6WKuXl9Z7L1bqhA657visLYBzARlmWi6ZpbpYkaVMYRWOu4/SmUil/eHg4HJ8oKuPj4+ZoLoexsTGM5nLo6+vrnAgXBOGSPM/DmdkzqJ4+jepsFacrlah04oRbKVf4/Py8zsGZoRtzkiSddF33qOu6r6Cz53cKQLlUKXvrPIVLuiECrpgv6AA2A9humuadmqZNxXE87rrugKIofGh4yB8by5PixISZz49Jo7kccrkchoaHxcl2QbiGOOdoNBqYrVZRrVZRPX0a5VNlr1Qq+bPVqlSv1w1VVW1N02Y55yfa7faLYRgeAvAigOp6nwO87gFXzBf6AGynlO5MpVL3xCze4bne8ODgoLN12zZp6/ZtifxYHrmxHEZzOSSTyes9REEQLhNjDAsLC6iePo3ZahUnTpTiF6en28dmZhTf92Ga5iu+7//UcZznARwGcPR6fhJ8zQKue3g5AWCnqql3JszE3b7vb+GcG8Vi0d15++3Glq1b1XdOTmLiHRNiT0wQbjL1Wg0zMzOYOXoU09PT9stHjvDFhUXDNM1ZAIdardYBxtg0gEOlSrlxLcawZgHXvdRiG6X019Pp9F7Xde8yDEPaum1rvGPnTmtyaopMTU1heGQEhNwQR8aCIFxnnufh+KuvYmZmBkdeeimYPnTIK5VKhqEbs2EYPuY4zuMAni5Vygtr8XpXnTTdPbStqqp+KJFI7G077Z2ZTDbcfc9u5dfuvlvftWsXhkdG1mKMgiDcxIIgwJGXjuDnB3+Gp558qnn4xRd1SZbPEYInWs3WwwD2X+0nuVcUcMV8QQGwxzTN36GUfljTdeOBBx5Q7t69W71j153IZDJXMwZBEITzGGN49ZVXcPDgQTz+2I/tgwd/ppqG+ZJt29+JouihUqVcvtx/600Drrundk8qlfqC7/v3j42NRR/40AcT973nPXTTbbeJw01BEK4p13XxzNNP49F9+92fPP44oZTO+77/r77vf7tUKS++0c9eMp2K+YIly/If6obxF0nL6vvUH386cf/evRgcHFz7GQiCIFyGOI4xfegQvvvv/+Ht37ePqKq6r9ls/gOA51/vkpRfCbhivqCZpvmlOI7/7N13vZt/8tOfTty9e7e4YFYQhBvKysoKvv+f3+P/8uCDrud7s61m61OlSvmZC7d5TcAV84U9iUTi3+7YdWfmy1/5ipHL5a7viAVBEK4QYwz79+3Dl774V27M2PfsVutzq5ednA+47Vu2fkOWpM9+9WtfM973wP3rN1pBEISr0Gw08Xdf/Vv/kR897Liue2+pUj5CAGBy06bPZDLZb/7wkYfNDRs2rPc4BUEQrtp/P/QQ/vovv7jkuu5WqZgvpCVZ+vEPHvovY2hoaL3HJgiC8Ja8c3IS9XpdO3H8eB8FcO+WzVuCwsaN6z0uQRCENbH3/Xvldrv9CQpgpV6vi49IBUG4adh2GwBAARyo1WorP/je99d3RIIgCGvA93x84+tfd1RV/by03Fjhlpl49MAzz3x8vDiuTrzjHes9PkEQhKvSajbx+T/9nHfs2LGfuK775xIALDdWasmE9b9PPfnk3hd+/oL87rvuUizLWu+xCoIgXLbHHv0f/NHv/4E7Ozv7fbvV+nipUg4vvtBXN03zy4SQz33ms5/Vfvf3PkrEZSOCINyoOOeYPjSNf/zWt5xDL7yw3G63P1aqlJ9aXf+630Ut5gubU6nU3/i+/8H33v8+/olPflLftn379Ru1IAjCG3BdFz/64Q/x4Lf/2V5aXGwHQfDNIAj+6eLnQ7zhrUCK+UJWVdU/kWX5C8PDw9qHf/sj1m/edx8ZLxav7egFQRAu4vs+nnv2p3h0/37/kYcf5oqiPN9oNP4ewGOXeuzhZd3rqHu33vusZPJjcRT9Vk9Pj/T+D3xAf8/975N37NgBSZLWch6CIAgAOl+o/78nnsC+hx9pP3vggKLr+quO43wnCILvlirlU2/281d8M7divkAB3KHr+kcUVfkoZ7z/jl13Rnv27LHufNe7MDk5KQJPEISr0mg08PODB/Gz55+Pnn7qaadcPqWbhnmg2Wx+hzH2oze7/9vF3vLdKov5whiAPVYy+V5C8BuBH/Rv277d23Pvvcld79pFpjZvRiKReKsvIwjCTYZzjrNnzmB6ehrPPftT/8AzzwQLCwuqaZq/aLfb+8MwfAKd+7y5V/saa3473mK+kAVwTyKRuE+W5fscxxnv7e31J6em+M7bb09ObZ7COycnMTg4KO4GLAi3iCAIcOL4CcwcPYqXjxwJp6ennRPHjxuEEFdV1UO2be+PouhJANOlSjlcq9e95glTzBdkAJsA7DAMY5dhGHc7jjMpSZK66bZN/vYdO8zNW7bIU1NTGC8WoSjKtR6SIAjX0PLyMo7NzODoy0fxi8OH2784fDien59P6IZ+VqLStG3bz0ZRdAjA4Ss95LxS67YLVcwXBgHskGX5dsuydncfAN0/NDTkFMY3komJCWNsLC/ncjnkxnIYGRmFpotnpwrCeuOcY3l5GdXTp1GtVjFbncXJkyX31MmTwamTpxTHdaSEmTgehuFztm0/h85T7l+++BKO6+GGOkYs5gsJAJMAximl4wkrsVmRlduCMMy7jpOxLCsYHhkONm4cl8eLxcTYWA6juU4ZGBgQH24IwhpxHAezs7OYrVZRPX0alcrpsHTihFspl7G4tGgAiAzdOEspPek4zsu+778K4BSAVwGUL3XZxvV2QwXcG+leqjICYBzARkVRJhKJxBZCyITneSNhGFqZTMYdzeVYsVhUCxsL+mguh7GxMYyMjqKnp0ec8xOEriAIsLiwiGp1dS+syo8fP94unyrHc3Nzque6imEaS7Ikl4MgmHFd9xhj7CSAkwBOlSrllfWew+W4af7ii/mCAaAAYCOAccMwbjMMYyqO43HXcwfjKFYty/I29PZG2WyWDwwMyENDQ3qmLytls1lkMhlks33IZDPo7e0V5wKFtxXOOdrtNur1Omq1Guq1Ouq1JdRqNSwsLPrzc3PB0uIiq9frtNFoqL7vK7quNzRNqzLGjtu2/XIcxyfQ2Qs7BWDuRtkLeytumoB7M8V8QQfQB2AAQD+AAUrpgKZpo5qmjVFKh6Io6g/DcIPv+wnDMMJUOh1ks1nW19dHB4cG1YGBAS1zQRhm+zpt0zTXd3LCTYkxhuXlZdTrddS7oVWrLaFer/O5s3Pu/Px8VFta4ufOnZNbrZYGgGu63pAlqU4ImY/j+KzrupUwDOcALABYvKA+dzME2Ju5ZQLuSnQPh3txQRgC6FcUZdgwjDFJkkY45wNhGGZ8309TicJKWH4ikYitpMWTyRTS6RRNpdNyOp1WUqmUbFkWLCuJZDIJK2lhdXm1bZqmOIS+iYRhiHa7Dbtlw7ZbsG0brVbromUbzUYjWGmshM1GI261WrzVbKHdbtOWbcuu4+iyLLuqpi7LklwDMBeGYdVxnCpjbB6vDayFUqXcXtdJ34DEX9RbVMwXCAALQBZA6vWKJEk9hmH0SZLUK0nSBs55mnOeillsRWFkxnFshmGoaJoW6LoeJqwESyQslkqlkEqlaDqdltI9aSWVSimroWgaJlRVhaqqUFTlfLtTNKiaCkV5bb+iKCJE0dkzCoIAgR8gCPxOOww79fn+zrqw2+/7Ptp2ezWc+MrKStBsNKOVlZW41Wpx226Rtt2mbceRfM9T4jiWFEXxZFl2JElqS5LUIoQ0OecrURydC/ygHoZhnTHWANB8ndIAsLSW14TdisS7/QbRvV4wiUuEJDpBuUHTtIyiKBlKaZIQogPQOecqAI1zrjHGVA6uspgpjDGZMaZwzpUoiiTGGJVlOZYkicmyHMuyzBRVYYqiclVRuKIoXNVUaJoGTdOIqmlE+2Whuq5LqqpSQggooYRQCkIIoZQQQggIIQQgIISAUgpCgG5/N1g7NeccAAdjDJxzcI5uzc+v45zzOGa828fBORhnPI4Z9z2PeZ4X+77PfN/nvu/zbghx3/cRhgEJ/ABRFNEgCEgYhTQKIxrHsdQtRJblmEo0ooSGlNKQSlJECAJKaEAICQghPiEkAOAD8Bhjbszi5csIptXivN6T1oXrSwTcLaT7PWINgNqtLy6X06+i875ZLfSiZQJ0cg8ApZRSAGS1JpRQzjhjjMUAOGOMAeCccwaAX1AuXl4tMTqhs1qCi5Yvpz8U4XNr+H9d5oqkcLMAZQAAAABJRU5ErkJggg==)

SI

SI

SI

SI

SI

SI

Virtualization Intermediary

SR-PCIM

|  |
| --- |
| Processor |
| Memory |

|  |  |  |
| --- | --- | --- |
| Translation Agent (TA) |  | Address Translation and Protection Table (ATPT) |
|  |

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
|  | | Root Complex (RC) |  |  |
|  | Root Port | Root Port |
|  | (RP) |  | (RP) |  |

|  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- |
| |  | | --- | | ATC |  |  | | --- | | PF0 |  |  |  |  | | --- | --- | --- | | VF0, 1 |  | VF0, n |   PCIe Device |

Switch

|  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- |
| |  | | --- | | ATC |  |  | | --- | | PF0 |  |  |  |  | | --- | --- | --- | | VF0, 1 |  | VF0, n |   PCIe Device |

|  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- |
| |  | | --- | | ATC |  |  | | --- | | PF0 |  |  |  |  | | --- | --- | --- | | VF0, 1 |  | VF0, n |   PCIe Device |

A-0624A

Figure 9-3 Generic Platform Configuration with SR-IOV and IOVEnablers

The SR-IOV generic platform configuration is composed of the following additional functional elements:

• SR-PCIM - Software responsible for the configuration of the [SR-IOV Extended Capability](#bookmark2), management of

Physical Functionsand Virtual Functions, and processing of associated error events and overall device controls such as power management and hot-plug services.

• Optional Translation Agent (TA) - ATA is hardware or a combination of hardware and software responsible for translating an address within a PCIe transaction into the associated platform physical address. ATA may

contain an Address Translation Cache (ATC) to accelerate translation table access. ATA may also support

Address Translation Services (ATS) which enables a PCIe Function to obtain address translations a priori to DMA access to the associated memory. See[Chapter 10f](#bookmark3)or more details on ATS benefits and operation.

• Optional Address Translation and Protection Table (ATPT) - An ATPT contains the set of address translations accessed by a TA to process PCIe requests - DMA Read, DMA Write, or interrupt requests. See Address

Translation Services ([Chapter 10)](#bookmark4) for additional details.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

。 In PCIe, interrupts are treated as memory write operations. Through the combination of a Requester Identifier and the address contained within a PCIe transaction, an interrupt can be routed to any

target (e.g., a processor core) transparent to the associated I/O Function.

。 DMA Read and Write requests are translated through a combination of the Routing ID and the address contained within a PCIe transaction.

• Optional Address Translation Cache (ATC) - An ATC can exist in two locations within a platform -within theTA which can be integrated within or sit above an RC - or within a PCIe Device. Within an RC, the ATC enables

accelerated translation look ups to occur. Within a Device, the ATC is populated through ATS technology. PCIe transactions that indicate they contain translated addresses may bypass the platform’s ATC in order to improve performance without compromising the benefits associated with ATPT technology. See Address Translation

Services ([Chapter 10)](#bookmark5) for additional details.

• Optional Access Control Services (ACS) - ACS defines a set of control points within a PCI Express topology to determine whether a TLP should be routed normally, blocked, or redirected. In a system that supports SR-IOV, ACS may be used to prevent device Functions assigned to the VI or different SIs from communicating with one another or a peer device. Redirection may permit a Translation Agent to translate Upstream memory TLP

addresses before a peer-to-peer forwarding decision is made. Selective blocking may be provided by the optional ACS P2P Egress Control. ACS is subject to interaction with ATS. See [Section 9.3.7.6f](#bookmark6)or additional details.

• Physical Function (PF) - A PF is a PCIe Function that supports the [SR-IOV Extended Capability](#bookmark7)and is accessible to an SR-PCIM, a VI, or an SI.

• Virtual Function (VF) - A VF is a “light-weight” PCIe Function that is directly accessible by an SI.

。 Minimally, resources associated with the main data movement of the Function are available to the SI.

Configuration resources should be restricted to a trusted software component such as a VI or SR-PCIM.

。 A VF can be serially shared by different SI, (i.e., a VF can be assigned to one SI and then reset and assigned to another SI.)

。 A VF can be optionally migrated from one PF to another PF. The migration process itself is outside the scope of this specification but is facilitated through configuration controls defined within this

specification.

• All VFs associated with a PF must be the same device type as the PF, (e.g., the same network device type or the same storage device type.)

To compare and contrast a PCIe Device with a PCIe SR-IOV capable device, examine the following set of figures.[Figure](#bookmark8) [9-4](#bookmark9)illustrates an example PCIe-compliant Device.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| |  | | --- | | ATC 1 |   Function 0   |  |  | | --- | --- | | |  | | --- | | Physical Resources1 | |   Configuration  Resources  Internal Routing   |  | | --- | | ATC2 |   Function 1   |  |  | | --- | --- | | |  | | --- | | Physical Resources2 | |   PCIe Port  PCIe Device   |  | | --- | | ATC3 |   Function 2   |  |  | | --- | --- | | |  | | --- | | Physical Resources3 | | |

A-0625

Figure 9-4 Example Multi-Function Device

This figure illustrates an example multi-Function PCIe Device with the following characteristics:

• The PCIe Device shares a common PCIe Link. The Link and PCIe functionality shared by all Functions is managed through Function 0.

◦ While this figure illustrates only three Functions, with the use of the Alternative Routing Identifier (ARI) capability, a PCIe Device can support up to 256 Functions.

◦ All Functions use a single Bus Number captured through the PCI enumeration process.

• In this example, each PCIe Function supports the ATS capability and therefore has an associated ATC to manage ATS obtained translated addresses.

• Each PCIe Function has a set of unique physical resources including a separate configuration space and BAR.

• Each PCIe Function can be assigned to an SI. To prevent one SI from impacting another, all PCIe configuration operations should be intercepted and processed by the VI.

As this figure illustrates, the hardware resources scale with the number of Functions provisioned. Depending upon the complexity and size of the device, the incremental cost per Function will vary. To reduce the incremental hardware cost, a device can be constructed using SR-IOV to support a single PF and multiple VFs as illustrated in [Figure 9-5 .](#bookmark10)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| PF 0   |  | | --- | | ATC |  |  |  |  | | --- | --- | --- | |  | Configuration Resources |  | | |  | | --- | | Physical Resources | | | |   VF 0,1   |  |  | | --- | --- | | |  | | --- | | Physical Resources | |   Internal Routing  PCIe Port  VF 0,2   |  |  | | --- | --- | | |  | | --- | | Physical Resources | |   PCIe SR-IOV Capable Device  VF 0,N   |  |  | | --- | --- | | |  | | --- | | Physical Resources | | |

A-0626A

Figure 9-5 ExampleSR-IOV Single PF Capable Device

The example in [Figure 9-5](#bookmark10)illustrates a single PF with NVFs. Key observations to note:

• The PF is a PCIe-compliant.

◦ Initially and after a conventional reset, a PCIe Function that supports the SR-IOV capabilities defined in this specificationshall have the SR-IOV capabilities disabled.

◦ To discover the page sizes supported by a PF and its associated VF, the Supported Page Sizes

configuration field is read. For additional information on how this field can be used to align PF or VF Memory space apertures on a system page boundary, see [Section 9.2.1.1.1 .](#bookmark11)

• PF nomenclature **PFM** designates the PF at Function number M.

• VF nomenclature **VFM,N**designates the Nth VF associated with PF M. VFs are numbered starting with 1 so the first VF associated with PF M is VF M,1.

• Each VF shares a number of common configuration space fields with the PF; (i.e., where the fields are

applicable to all VF and controlled through a single PF. Sharing reduces the hardware resource requirements to implement each VF.)

◦ AVF uses the same configuration mechanisms and header types as a PF.

◦ All VFs associated with a given PF share a VF BAR set (see[Section 9.3.3.14](#bookmark12)) and share a VF Memory

Space Enable (MSE) bit in the SR-IOV extended capability (see [Section 9.3.3.3.4)](#bookmark13) that controls access to the VF Memory space. That is, if the VF MSE bit is Clear, the memory mapped space allocated for all VFs is disabled.

◦ The InitialVFs and TotalVFs fields (see[Section 9.3.3.5](#bookmark14)and [Section 9.3.3.6)](#bookmark15) are used to discover the maximum number of VFs that can be associated with a PF.

◦ If the Device does not support VF migration, TotalVFs and InitialVFs shall contain the same value. If the Device supports VF migration, when TotalVFs is read, the PF must return the number of VFs that can be assigned to the PF. For such a Device, when InitialVFs is read, the PF must return the initial number of VFs assigned to the PF.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

• Each Function, PF, and VF is assigned a unique Routing ID. The Routing ID for each PF is constructed as per

Section 2.2.4.2. The Routing ID for each VF is determined using the Routing ID of its associated PF and fields in that PF’s[SR-IOV Extended Capability.](#bookmark16)

• All PCIe and SR-IOV configuration access is assumed to be through a trusted software component such as a VI or an SR-PCIM.

• Each VF contains a non-shared set of physical resources required to deliver Function-specific services, (e.g., resources such as work queues, data buffers, etc.) These resources can be directly accessed by an SI without requiring VI or SR-PCIM intervention.

• One or more VF may be assigned to each SI. Assignment policies are outside the scope of this specification.

• While this example illustrates a single ATC within the PF, the presence of any ATC is optional. In addition, this specification does not preclude an implementation from supporting an ATC per VF within the Device.

• Internal routing is implementation specific.

• While many potential usage models exist regarding PF operation, a common usage model is to use the PF to bootstrap the device or platform strictly under the control of a VI. Once the[SR-IOV Extended Capability](#bookmark17)is

configured enabling VF to be assigned to individualSI, the PF takes on a more supervisory role. For example, the PF can be used to manage device-specific functionality such as internal resource allocation to each VF, VF arbitration to shared resources such as the PCIe Link or the Function-specific Link (e.g., a network or storage Link), etc. These policy, management, and resource allocation operations are outside the scope of this

specification.

Another example usage model is illustrated in [Figure 9-6 .](#bookmark18) In this example, the device supports multiple PFs each with its own set of VFs.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| |  | | --- | | VF 0,1 |  |  | | --- | | Physical Resources |  |  |  |  | | --- | --- | --- | |  | Configuration Resources |  | | |  | | --- | | Physical Resources | | | |  |  | | --- | | ATC |   PF 0  Internal Routing  VF 0,2   |  |  | | --- | --- | | |  | | --- | | Physical Resources | |   Internal Routing   |  |  |  | | --- | --- | --- | |  | Configuration Resources |  | | PF M Physical  Resources | | |  |  | | --- | |  |   PCIe Port   |  | | --- | | ATC |   VF 0,3   |  |  | | --- | --- | | |  | | --- | | Physical Resources | |   VF M,1   |  |  | | --- | --- | | |  | | --- | | Physical Resources | |   VF M,2   |  |  | | --- | --- | | |  | | --- | | Physical Resources | |   Internal Routing  PCIe SR-IOV Capable Device  VF M,3   |  |  | | --- | --- | | |  | | --- | | Physical Resources | | |

A-0627

Figure 9-6 ExampleSR-IOV Multi-PF Capable Device

Key observations to note:

• Each PF can be assigned zero or more VFs. The number of VFs per PF is not required to be identical for all PFs within the device.

• The ARI Extended Capabilityenables Functions to be assigned to Function Groups and defineshow Function Grouparbitration can be configured. PFs and VFs can be assigned to Function Groups and take advantage of the associated arbitration capabilities. Within each Function Group, though, arbitration remains

implementation specific.

• Internal routing between PFs and VFs is implementation specific.

• For some usage models, all PFs may be the same device type; (e.g., all PFs deliver the same network device or all deliver the same storage device functionality.) For other usage models, each PF may represent a different device type; (e.g., in [Figure 9-6 ,](#bookmark18) one PF might represent a network device while another represents an

encryption device.)

◦ In situations where there is a usage model dependency between device types, such as for each VF

that is a network device type, each SI also requires a VF that is an encryption device type. The [SR-IOV](#bookmark19) [Extended Capability](#bookmark20) provides a method to indicate these dependencies. The policies used to

construct these dependencies as well as assign dependent sets of VF to a given SI are outside the scope of this specification.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

As seen in the prior example, the number of PF and VF can vary based on usage model requirements. To support a wide range of options, an SR-IOV Device can support the following number and mix of PF and VF:

• Using the Alternative Routing Identifier (ARI) capability, a device may support up to 256 PFs. Function Number assignment is implementation specific and may be sparse throughout the 256 Function Number space.

• A PF can only be associated with the Device’s captured Bus Number as illustrated in [Figure 9-7 .](#bookmark21)

• SR-IOV Devices may consume more than one Bus Number. A VF can be associated with any Bus Number within the Device’s Bus Number range -the captured Bus Number plus any additional Bus Numbers configured by

software. See [Section 9.2.1.2f](#bookmark22)or details.

。 Use of multiple Bus Numbers enables a device to support a very large number of VFs - up to the size of the Routing ID space minus the bits used to identify intervening busses.

。 If software does not configure sufficient additional Bus Numbers, then the VFs implemented for the additional Bus Numbers may not be visible.

**IMPLEMENTATION NOTE**

Function Co-location

The ARI Extended Capabilityenables a Device to support up to 256 Functions - Functions, PFs, or VFs in any

combination - associated with the captured Bus Number. If a usage model does not require more than 256

Functions, implementations are strongly encouraged to co-locate all Functions, PFs, and VFs within the captured Bus Number and not require additional Bus Numbers to access VFs.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACoCAYAAADdE69lAAAAOklEQVRYhe3MMREAIAwEwQSTtEijRWVQADGw1/7ORzRlrFPPdc8c3QMAAAAAAAAAAAAAAAAAAADw6QKWKwRO6L9RAwAAAABJRU5ErkJggg==)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| |  |  |  | | --- | --- | --- | |  |  |  | | |  | | --- | | Physical Resources | | | |  |  |  | | --- | --- | | |  | | --- | | Physical Resources | |  |  |  |  | | --- | --- | --- | |  | Configuration Resources |  | | |  | | --- | | Physical Resources | | | |  |  |  | | --- | --- | | |  | | --- | | Physical Resources | |  |  |  | | --- | --- | | |  | | --- | | Physical Resources | |  |  |  | | --- | --- | | |  | | --- | | Physical Resources | | |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAEUAAAA8CAYAAAAwoHcgAAAEwklEQVR4nOWbX0hbVxjAb7JaZx4qbA/upTsnOYkBI6SrAaFWpRq36job6rSWxvoQEVZdxtQXpU+ujsUWVjJHQd3GqMqgmtI9RAcr2JeOxI0V3UMX6WYyMEMSmqy5uYk33ruHxWHrn9z8uTn3Xn9PgcDl+34c7nfP+b4jI7IEAaj51PZZRdvFi4Fsn5UNwWDw6JXL5ledPyw4cMZBIABlF86bHiUSCYYVANN3pigEoA6rFI1S1ex2uSK4ZWyTSCSYC+dNjxCAMixCEICFvR9cXcct4mXcLldEo1Q1Y5Gi15UPrq+vk7gl7MWHV3v8CMDCvApBAJbcsNkEKYRlWdbv95N6XflgXqWcqa6ZpihqE3fyB3HDZiMRgCV5EYIAPDk3OxvFnXQqKIrarKupncmHEFl7a9uvDCOICpwSx9xcFAFYwasUNVS2rywvC6YEp4JhGLa9te0xbyUaAagY+LgvgDvRdFlZXo6oobKdFykV+hMjgUBA8O+SvRjo6w8gABU5FYIAPD5mt4tSCMuybDAYjFboT4zkVMrb9fX34/E4jTu5bBiz26MIwOM5EYIArJp3OkW7SraJx+P0WWPD/VwIkXeaO57gTihXLDjnSQTg6aykaJHasupZFU0J5kKnueN3BKA801Vy7NrQUAh3Erlm1bMa0SJ1V0ZSKg2Gz8OhMIU7CT64NjQUQgAeS3eVqCfHJyQphGVZ9p9wmKo0GG6lJaW56d0faZrewh08n0yOT1AIQA3XVWJ8uLgo2LOSXEHT9NZ7TU0PuAg50m3pWsMdcL54uLhIIgAbDpRSVqq1+nw+SZXgVHRburwIwCP7rZLXrg8PP8cdZL7x+XyRslLtRztdvLL9o1yns395+/ZbBQUFe1uTKMXFxUdDoWenvX/8Of4sHKL+/wMBqJu+MyXZEpwKkiRj1aeqJncKkbWYhNPlw0Wyu1hOEMR/Xb6fl5YO1ct1LxKJBNNiMv2EAJQR1t7ev3AHJBTcLhepUarq5IoiRWY7RglSpFAQBEHI5bN37/b/trJC4g4INyzLEiPDn6wyDPNAdL0cvrjncLzYI0IAnrzncIj+yDFTYlRss66m9rtdy6eupnYmRsUE3R/mi5ujoyQC8I1dUhCAJTdHRyW/O36Z5ITC0L4vG72ufNDv9x8qMdae3oNnWRCAhdaeXj/uQPPFktsd0ShVppSlSaNUNS+53ZL/wn3hCzYVh2UvNDO1Y6/DBQSgbmZKurvmXbtirlSfqpogSTKGOwE+uD48/BwB+HraUqR6Euf1enedtKVFWanW6vV6JfXS7bZ0re17JstxtUjqdD95em/MWMgOMZLoA9E0vXWusTF1n4cr5xobRd8x/GpikkIAqnMmRey95Yx6x1yoNBhuiXUKIaMpAy6IdV4lq3kULmiR2uLxeERVojvNHU8ynlziAgJQfuWyWTQzcAvO+SgCsIo3ITvEVC045wV/dBmPx+l36o3f8y5km7PGBsHP1Y7Zv4giAN/MmxShT2DzMmHNBSHP6vMyi88FBKBioK9fcLc6krc2LuVdyDZqqLwkpPs/vN/v4UKyu/hYKN3FXN0Ey2pq6al3jSUIosMxO3empfX9v7MNJhs2NjaKvv36m4Kn3rVfsn3Wv43hq87VD3a8AAAAAElFTkSuQmCC)![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAEUAAAA9CAYAAAD7/KSFAAAFaklEQVR4nO2b/U9TVxjHLwVRo0bNFqPZsvOUB4rtqCkRUYIokU0UHRvqCBM04kSZoqghSNSpxBk0UUOcoJgwg/6AIKJkuMmckqgZLoH4MtnE4aiQJpMKYtvbW3tv2/2yGqbUlr6dtuzzB9w830+enuf2nHsY5n/eIMjVB8gkkV/VXKj7MioqSu2OgpxFr9eP2Z6/dezPV6/OffxEKVArBAlMLt6zV2PxEbq6unQySeQmakIYhmHiZ885qdVqDbRlDKZ47z4tEphMRQgSkJ6pquJoS3gdrVZriJ895yQNIUGffZJ6SxAEE20JQ3GmqopDAlKvSokQhy1t+aWFpR3eFoIgmNNSP72FBFweJA6BBEI3bshV0Q5uj9stLboIcdhSr0iRS2WFKpXKZ7tkMBs35KqQQKhHhSCBKYdKDupoh3UUlUrFyqWyQo9KSZybcJbjuJe0ww6HQyUHWSQwxSNCkICirva8nnbI4cJx3MvEuQlnPSEkKH35ijaz2Uw7o1PU1Z7XIwGFW6WEgzj97p07frOWvI7ZbLakL1/R5rYRjQTGbs/fqqYdzFXu3b2rCwdxulukRMtnFD9Tq/1uLRmK7flb1UhgrKtd8t6x0tKAEGKxWCzP1Gp9tHxGsUtSFiYl1RsMBp52GHdyrLRUjwTed7ZL4i43NgZMl1gxGAz8wqSkemeEiLK+WPk77QCe4nJjox4JxA1LSiSGr+l4+NBvR7AjrFqZ+QcSEDnaJRN27ih6TrtoT9PR0aGLxPBsh6TEzow5MjAw4HM7ap5g546i50hggr0uCTtVUTEihFgsFsvAwAAXOzPm6FulLFm0+ArP8wLtYr3JqYoKDgmE2eqSBc3XrvvF5pE74XleSEle1DSUkJB12Wv/ol0gLZqvXWeRQNJ/pMgkkZuUSmVAj2B7rMte24UEQqxdMrl47z4t7aJoo1QqdTJJZB7DMEzwhzJZadmJ8pmhoaEhb12FA5xJkyaF9vX1zetRKitEWauyFo8fP3407aJ8gbwtm0NEIlGCqOFSw32e5020C/IFas/VMGaz+R6DBD44UVYecP+Gh0t/f78+RhF96JWhGEV0SX9//4gWU1hQ0IcExr2SggTGFRYU9NEujBbtD9p14SDOeuP3FA7izPYH7SPyXSUzI+O3IXf6kUBQZkbGfdoFepuGi5f0SCDW5uqLBGIaLl4aMWuLgTMYF8xPrLU7lhbMT6wxcAYj7YK9wdHDR/RIYJpdKUhg2tHDRwK+W3qf9rIKuXyPXSFWFHL57t6nvQG9jbAlL+8pEhjjsBQkMDp/8+a/aRfuKdpa23QR4rBlDguxEiEOS2ttbQ24EW0ymcyfpy371anDdiQQtCIt7bbJZPLP7y9scK66Wo8E5MMWMkiM/Fx1dcAsuizLvpwXH3/aaSFW5sXHf8eyrF990mWLkm8O6JDAuy5LQQLvHNi/3+/Xlp7uHjZqunSby0KsRE2Xbuvp7vFrMbk5Od1IYJTbpCCBUbk5Od20gznLrZs3WSSQ7DYhg8Qk37xxw+9e6ARBMKWmLGl2uxArqSlLmnme98lLCrY4XVnJIQGJx6QgAcnpykq/OWvWaDRc3KzYbz0mxErcrNhjGo3GL8Ts2b37BRKY6HEpSGDi17t2vaAd2B6POzt10gjJeo8LsSKNkKzv/LPTp0d09urVj5BAsNekIIHgNatWP6Id3BY/NTWxSCDBa0IGiUloutLkcyPaaDTyixcmN3pdiJVFH33caDQafeo72/LjZXokQKhJQQKk/HiZz0yif0/5SlzN5fKthhhF9MHvf/xhzdSpU3lXn+UKgiAE7ywqEtXXXcDHT5QszVoYJDDKa7c67eCu+4L/AFpzDPBsa9pSAAAAAElFTkSuQmCC)

|  |
| --- |
| Bus N  Configuration Resources |

|  |
| --- |
| Bus N+1, N+2, ... |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAHMAAABjCAYAAAColT99AAAIiUlEQVR4nO3da1BT6RkH8H/OIWZDEgeUgjWIC4pSZrE4gNWClHptdtCOCFgvI4uW1RW1Dh0Ng5fqUMVlLY7W1XXs4KqrY0e0WGfHcbxQqR3vWGupSllHxLKKwELIPSehH4rVXQWSkORJyPv7xgSSnOd/3uQ8h/e8RwTPSyovL/9Vbm5uA8Fre0R1dXX49OnTz5jN5grq9+JOorS0tHtdfmDJkiVtAOTUBXcbjuMW3blzR0tdaE9obW3Vh4aGllLX3F3kubm5bdRF9qTt27frAURSF97lQkJCPm5padFTF9iTzGazEBcXd5a69q727rZt2wzUxaVw+vRpHYA06gBcJi4u7kuz2WyhLiwVlUr1FQCeOgdX+EllZaWOuqCU6urqtBKJ5CPqIPqLV6lU9dTF9AYrV67UAAiiDsRpEolk2YMHD/yiFemLRqMxKJXKvdSZOCtoxYoVGuoiepPdu3cbAMRQB+MwpVL5aUdHh18ewfZEEARrYmJiNXU2jhq7a9cuFuRbXLp0SQvgfeqA7JaQkPAXQRCs1IXzVhkZGU8BDKLOyR6q8+fP+3Ur0peGhgadQqFYSx1UX8Rz5sx5Sl0sX7Bu3TotgFDqwHokk8l+/fjxYzYq7aDT6UxRUVGHqTPryffWrl3LekoHlJeX6wHEUwf3hsjIyM91Op2JukC+xGazdaWkpNwCIKLO73U/PHDgAGtFnHDt2jUtz/NZ1AG+JEpOTr5ptVpt1IXxVQsXLnwBQEodJHiez7x69Sr7ruyH58+f64KDg7dQZ/nO/Pnzm6mLMRBs3rxZD0BJlmRQUNBvnj17xloRFzAajZaxY8eepMpSuWnTJr+a0+Nux48f1wOY5PEkx4wZU2E0Gs3UBRhopk6d+i8AnCez/NGxY8fYqHSDe/fuacVi8QeeCpKbMmXKP6k3eiDLy8v7BoDC7UmKxeKcu3fvslbEjdra2gxhYWG/c3eW8qVLl35DvbH+oLS0VA8gym1JhoaGftLa2spO23mAxWIRxo0bd86RfBw5aoosKChYOWTIkHcc3AcYJwQEBPAlJSUpAKa4/Mnj4uLOms1mgXqP9Tfp6emPAAS4Msufnjlzhp3pIVBfX6+VSqUrXRUkr1KpvqLeKH+2evXqTgDB/U5SIpF8VFdXx1oRQhqNxhgREfFZX1n1dQAUnJeX93F0dLSs33sF4zSFQiFRq9U5AGKdfpLw8PB9Go3GSL1nMl1dgiDYJkyY8Df0MsWkt5EZo1arP1AoFBKn9wbGZXieF5WWlsbzPJ/u8B8nJiZWs1np3iczM7MJDs6Gf//ixYusFfFCjY2NOrlcrrY3yEEZGRn/oX7TTM8KCwt1AML6TFIul69raGhgo9KL6fV686hRo45+N7vvHgCF5ufnb4qIiAi0dxgznieVSsUbN26cA2B8j78UFRV1mM1K9w02m61r8uTJNXitVXl9ZMavX78+KzAw0CeuG/R3IpEIZWVlYziOy37jseTk5Ns2G5uU7msWL17cAuDV1yLHcdnXr19n5199UHNzs37o0KG/Bf63apR0wYIFF1atWuW7a9T4MZlMJjaZTElVVVWHOQBBmZmZA2IpMH+VnZ1tBRDBAfi6uLj4riAINuo3xTinqKioHcCNlz+P2blzJ5uo5YMuXLigA/Czb6U7fPjw37NFmHyLIAjWhISEyy8z/H+f2dTUtKGwsNDigU8FxkX27t1rvn379odvfVAikXx4//591qL4gI6ODqNSqfy0t7D5mTNn1lG/UaZv+fn5di15OvnUqVPsCi8v9vDhQ61EIllm12dxbGzsGZPJxCY8eymVSvVvOLBMeMTWrVvZka0Xqqys1AFItTdIAEBISEjJixcv2MetF+m+tcaXDgXZTZaTk9NKvQHMKyUlJXoA7zoTJjiOW3Dr1i3WqniBlpYWfUhIyHanguwmSk1N/Tv7Pye93NzcVgD9vrIg8ciRI+y7k1BNTY2W47hF/Q0SADB69OjjBoOBLRNDJC0t7R9w4cqXw4qKitj0SwJHjx7VA0hyVZAAgMGDBxc9ffqUBepBBoPBHB0d/UeXBtlNkp2d/TX1BvqTDRs26AB83x1hguf5n1dXV7PR6QFNTU26oKCgDW4Jspto4sSJVwVBYL2Km82bN+8ZALdfTvnevn372HlbN7py5YqW5/k57g4SADBixIgDnZ2d7IpqN7BarbZJkyZdgwcX4R+yZs2aTuoNH4j2799vAPCep4IEAEil0tX19fXsYMiFtFqtceTIkX/waJDdAmbNmvWYugADSUFBQSeAoRRhAsC0s2fPstHpAo8ePdLKZLI1VEECAOLj4y9YLBY2xaSfZs+e3QBATBomgNE7duxgrUo/nDt3TgdgOnWQAICwsLCdbW1tLFAnWCwW6/jx4y9SZ/g6xbJly9qpC+OLysrKDACiqQP8FrFYvLS2tpZNMXFAe3u7YdiwYbuos3sbbtq0aQ+oC+RLli9f3g5gMHVwPfnxiRMn2BQTO9TW1mrFYvEvqQPrVUxMzJ+MRqOFuljebsaMGQ/g4TsLOSN8y5YtbHT24uTJkzoAydRB2SU4OLi4ubmZnRl6C5PJZImNjf0zdUaOCFy0aFELdeG8UXFxsR5ABHVADuE4bt6NGzdYq/Ka7jV7tlJn4wxRSkpKDZsN/8obq2n5mPEHDx5kB0NdXV03b97Uchz3C+pA+iUqKuoLvV7v1ytl2my2rtTU1Dvw4FQQdwlTq9V+fWR76NAhPYAE6iBcQi6Xq588eeKXgfa0arMvGzR37ly/XPfd7vXUfQnP8+lVVVV+1ao0NjZq5XJ5IXXt3UGUlJT0V3+6V0pWVpbD9yDxJT/Ys2ePX8xIuHz5spbn+dnUBXer8PDwzwb6/cXsuW+XO1D0PUGFhYXRJSUl7QSv7REVFRXSrKysdgBPPPm6/wVnbggnv/0rTwAAAABJRU5ErkJggg==)

|  |  |
| --- | --- |
| |  | | --- | | Physical Resources | |

|  |  |
| --- | --- |
| |  | | --- | | Physical Resources | |

PCIe SR-IOV

Capable Device

Internal Routing

Internal Routing

Internal Routing

|  |
| --- |
| ATC |

|  |
| --- |
| ATC |

PCIe Port

VF 255,2

VF 255,3

VF 255,1

VF 0,1

VF 0,2

VF 0,3

PF 255

PF 0

A-0628

Figure 9-7 ExampleSR-IOV Device with Multiple Bus Numbers

In this last example,[Figure 9-8 ,](#bookmark23) a device implementation may mix any number of Functions, PFs, and VFs.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| |  |  |  | | --- | --- | --- | |  |  |  | | |  | | --- | | Physical Resources | | | |  |  |  |  | | --- | --- | --- | |  | Configuration Resources |  | | Physical Resources | | |  |  |  | | --- | --- | | |  | | --- | | Physical Resources | |  |  |  | | --- | --- | | |  | | --- | | Physical Resources | |  |  |  | | --- | --- | | |  | | --- | | Physical Resources | |  |  |  |  | | --- | --- | --- | |  | Configuration Resources |  | | |  | | --- | | Physical Resources | | | |  |  |  | | --- | --- | | |  | | --- | | Physical Resources | | |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAEYAAAA9CAYAAAAQyx+GAAAFxElEQVR4nO2ae0xTVxzHT4EyNjccmuHcYs4ppxVxdBQF6pCH74mCYmCaKcOJi0wWeSpxIJsxTNmYm4ljEnVmzunE59xDx6KZuhVMZAxQAkNhKtQIIn3Q3va2t+3+mDVoKNT2lnNb9/n39p6c7+eee87p71wA/mdQeGw0EoyFGVXHjmaHSSS9bLTnDPv2fvXS1tLSlPabN1qIdgRD5F/8fpHSwhEYhjElJy36A0PEykN3GGlExA6VUqUjLWQgtTW1GgzRQmJSMETCvbv3cEqKlbWZmXIMkS8RMYkJCWeNRqOJtITBkMvlGnHI5MIRl4Ihmnv+t/Na0gKG4uNtZRoMUeBISvFZs/qdm6SDDwdFUfSMmNhvRkzM5InBObdu3dKQDm4PR6uOUBgiiculYIjGlm7Z0k86sL2YzWbL0pTUOpcv37HR0/dqtVo96cCPw1/19RqRIOgNl0nBEIUePPAtJ5fn4cjLzrmLIXraFVJ4KcnJlxiGMZMO6Qh3e3q04eJXN7MuRiQISq67fNktJlxb7PjscwpD9DJrUjBET63Leu8O6WDOotfrjXNmzjrOmpiwV0KLu7u7Ob2Zs5cfT/1AYYimOS0FQzR+e3m5R0ixkvbm8mYMkZdTYmbFxR/W6/QG0mHYpLWlRROMhW87M1oiT544QZEO4go2bihUYIiedUQKb/myZU2kA7gKpVJJRU6ZWm4rv7etC2MDAlZUVFamBwYGkqlpuBg/Pz++xWIOb7nafEihUirtuglDNGpDQcE90k/V1RgMBiZh3utn7LYZIQnf1tfX55Fzy6OcO3tWiyGaac9ogRU7v3gipFhZtXJlO4bI5rQCAABg/py5PxkMBiPpzo4k/3R0aEJEE9cONVriqn+p9qjNnL1sLvlAjSEKGEyK96r09GukO0iK/v5+XXSUdJfVx4P36sXAwDWVe3anBIwZ45HL83D4+vr6+PB9Qq80NB5XqJT/nahiiEaXFBerSD810jAMY1qcmHTxga3XIqN2qtVqt6zMsU2NTKbFEC3wxhD55ebn75dOm/YcqWHMJSZMmMC/fu260AsAYGlsbDCQ7hBXoGna2Nra0gUAAEAkCFpSV1fn1mVLtnio/Ikh4qUuWXLJZDK5ZaGbLQYWzL0BAEChUgJap780bty4jFCx2IfQKCZOcVGR8kpTU6pCpWQeuuCOh2lsUf9nvUYkCEq1unjojxPfy1tG03R2bFzcE7XJM5vNltx12U23b9/OV6hslGZCJ4XkusuBPVtUHT6swxCFDWkPQ8R3h0882IKiKDp+esx+u4YWhmjuxQsXnoh/2ds+2qrBEL1g93uXtGDBOa5+RsYWXZ2dmtBJIQWD5bdZtTIZjDX+/qMzw6dM8djle2NhYc/fra3LFSql+bFulEZE7FCruPWpKlvUyGQaDNF8h4xiiPw3FXHn42a2YBjGtHhh4gWHpFgJxsLVbW1tHrV8f71vnw5DFOyUGAyRV/qKtFbSYdhCrVbroqOkFU5JGSAn+szp0x5xpPLhphI1huh5VsQAAMC82bNP0TTt1scqHe3tmhDRxEx78g59yDQQs0XG5/OzoqRSvsN2CVOQl9fZ0d6RoVApLaw2PDVMUtrb2+uWO+Jfq6u1GKJ4VoVYwRA9k5+T20s65ONy//D+Z5dIsSJEgmWNDQ1utXzvqviSwhAhl4rBEPGWpqTWm83uUQXt6+ujIiThZS6VMkBO+LEjR91i+S5cv/4ehmjUiIgBAIAZMbEHdDodTTr4UDRfbdYIkSDNkXz2L9eP3sjzkjGMad30mBjOlkHzcrLbujq7smyWK12FOGRyoVwu5+REfOrk9xSGKHJkjdwHQ+SblfmunLSER9Hr9IZZ8TOqiEixIhIEJdbW1HJq1Gwv/1SLIRpPVAyGiJectOh3hmE4UQbt6e7RSsTiEmdzOV22bL95wwIAyPju4KHZaelv3XG2PWf5pKxM0K/u3zX8L4fmX2GaSTzkgRpnAAAAAElFTkSuQmCC)

|  |
| --- |
| Bus N  Configuration Resources |

|  |
| --- |
| Bus N+1, N+2, ... |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAHMAAABkCAYAAAC1kA/FAAAIqklEQVR4nO3da1AT6xkH8D+7WXNCkMKBGos4oqg1R0UFBZTioYaq1NEiZRgvjJc6Sh0tVauVadWOzJlqvVStWj3qIBUvcyijnFqVMuqh3hXrqRfqrYq3yqUKBjcJuWzoh4q1KhJCsk9C3t8MX5gdkn3+87L7bJ7dAPJTA1ASvK7cgqnfgNuFhYVt1uv1hqamJqmj/kiSZM/Ozm4A8DF1vd2p78aNG01NPqChoaExPDz8c+qCu010dPRXVqtVoi60XLZs2WIC8Al13d1hbGlpqYG6wHKy2Wz22NjYswD8qIvvSkJqaupj6uJSKCsrM/A8P546AJdRq9WLKisrfWpVvik9Pf0pgE7UObhC6OLFi0XqglJ6/PixISAgIIc6iHaLiIjYLYqimbqg1HJycgwANNR5tEfUzp07jdSF9ARGo9EcGRm5jzoQZ/mNGDHikiRJdupCeor8/HwjgGjqYNqM5/kfnjt3zqePlW+z2+1NiYmJX8PLWpWPJk+eXEtdPE9UXl4uchw3iToghwUFBf2qurraZ1uR1kybNu0ZAH/qnBwRtnz5cnbS8wG1tbWGkJCQz6iDalXfvn3/2NjYaKEumKfLzc01AuhOndeHxO3fv5+tSgeYzWarVqv9kjqwlnCjRo26QV0kb1JUVGQE8B3q4N4hCML0q1evslakjUaPHn0LAEed35sCZs2aVUddGG9UUVEhCoIwizrA17p06bLm+fPn7FjppKysrBcAAqlzBICeq1ev9olREHepr683de3adSN1kBg4cOAxi8Vioy6It1u/fr0JQB/KLL97+PBhdqXHBaxWqzRkyJATVEHyKSkp96iL0JGUlJQYAHxP9iSVSuXcO3fusFbExSZMmPAQgELOLIPnz5/fQL3jHdH9+/dFlUr1U9mSDA8P39bQ0NBIveMd1cKFC18CCJEjS+3mzZtZK+JGoig29ujRY5fbkxw2bNhpm83mM1PpVLZv324CMMCdWX7/5MmTrBWRgSRJ9vj4+Atw04hJp7S0tH9R76QvOXPmjMjzfKrLkwwICPj5w4cP2aqUWUZGRjVcfD9rl6VLl7KeksDTp08NgYGBv3QkJN6RjXr16rW9sLAwShAEh7ZnXKdz586CwWCIP3369G4AYnv/3pC8vDzWihAymUyWPn36fNHeIP0SEhL+ZrezoXRqBQUFRgDDnE6S47iMixcvsmOlh0hKSroGJ1sVVWZm5r+pd4D5nytXrogcx01tc5LBwcG5NTU1bBTEw8yYMeM5/vv4HYeFr1y5kgXpgZ49e2YMDQ1d/b7Q3ttqaLXa/IKCAq1CofCoEUAG8Pf3FyRJGnrixIm9APStbT+isLCQrUoPZrFYrP379/9za0FyycnJN6nfLNO6Q4cOGQCMbDFJQRB+dOPGDdaKeImxY8feRQuHysA5c+a8oH6DjONu3bolKpXKrOYAX6eq0WhWFxcXx6tUKlmHiRjnhYaGdqqqqvq0vLz89wDMzb+PXLt2Lbv+6oX0er0pLCxsM/BqZQ4aNOiLvLy83hzHsVbEyyiVSoVCoYgqKSkp8gOgO3r06J9SUlK84h575l2SJNnj4uJOc2q1OiQmJoatSC/G8zw3dOjQrgDgP3369OfU//sZ5127dk0UBGE6AIDjuCmXL19m/aWX0ul0FXjj7mu/kSNH/p19CO19Dhw4YAQQ//a/3pg9e/awa7JepLGx0dKvX7+i9x5Ie/fufcBkMvn840S9xYoVK4wAur0+EXozzLq6urNNTU0/0el0ggtOshg3qqmpMWZmZq41m82HW9woMDDwF0+ePGHDzh5uypQptQBUrYWuzMjIqKJ+s0zLzp8/L/I8n+7QEuZ5/genTp1irYoHkiTJnpCQUI42TOn5xcXFnbfZbKxX8TC7du0yARjkaJDNBmzbto19kuJBDAaDuWfPnn9oa5AAgO7du+98+fIlu93dQyxZskQE8E2nwgTw8YIFC15S7wTT1PTgwQNRrVb/zNkgAQAqlSr73r177GSIWFpa2mMA7e7/FePHj39AvTO+7Pjx4wYAKe0NslnysWPH2IUEAjabTYqJifmrq4IEAAwePPi41WplDzyU2aZNm0wAvu3SMAH0XrduHWtVZKTX603dunXb6uogAQAajWZjfX09C1Qm8+bNawAQ5JYwAQRmZWWxQWkZ3L59W1QqlT92V5AAAEEQZlVUVLBWxc1SUlL+CQcfINIeXHJy8i3qne3IiouLDQA+dXeQzRKKiorYiIkbWCwW24ABA47KFSQAQKvVfmk2m63UO9/RrFq1ygQgQtYwAXTPzc1lq9OFXt3i/hu5gwQAhISEfFZbW8uuDLnIzJkz2/zwCVfyz8zMfEZdhI7g1WNhMqmCBABwHDfp0qVLrFVpp6SkpOvwgK8v9ktMTPyaTcM7b9++fUYAsdRBNovOz89nJ0NOePWQw0LqAP9PZGTkPqPRyKbh22jZsmUGAN+izu9tmpycHHZm2wZVVVWGoKCgFdTBvVdAQEDOo0ePWKAOmjRpUg2Aj6hza0mn9PT0p9RF8gZnz54VeZ5Pow7sg3ieH19WVsZalQ+QJMk+fPjwi/CAVqQ1frGxsWfYNHzLduzYYQIwkDooR32ydetWNpHwHqIomiMiInZTB9Qm4eHhn7Mvf3vXokWLRACh1Pm0VXB2djabhn9DZWWlqFarF1IH4xSVSjX/7t277GToldTU1EdwwVQ6FcW4ceMqqYvoCUpLSw0AxlAH0l66I0eO+PSFBKvVKkVHR39FHYRLREVF/cWXp+E3bNhgAtCXOgdX6bVmzRqf/FTlxYsXprCwsN9RB+BSGo3mt3V1dT7Xe86dO1cP4BvU9Xe1zrNnz66nLq6cbt68KSqVyjnUhXcLQRBmXr9+3WdalTFjxtyBDFPpVDidTvcP6iLL4eDBg0YAiXIWl+Kq/bC9e/emTZ069QnBa8uiurpaPXHiRMWFCxd+Lefr/gc6fUfQna/fVAAAAABJRU5ErkJggg==)

|  |  |
| --- | --- |
| |  | | --- | | Physical Resources | |

|  |  |
| --- | --- |
| |  | | --- | | Physical Resources | |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAEUAAAA8CAYAAAAwoHcgAAAFxUlEQVR4nOWba0xTVxzAT6E0LhE2jcmiUc8ppxXL7EQeVm1lsBpdUVCYBI2bzqFRwpBlGz7ifCJK2HxlmXPTbeIjmBAT9UPZB3A4EQbKHMTAgiIWLYvY2Vt6b28f9172YXbBR6HYx72F39f2np7/7/57zu3//CsCPoIhkm3bvv2j3PXrun0dy1cOlJTgH0+c/KrTcN/E60TS09JqXC4X2y8AKIpyJKvVP/MqBEO08LerVym+ZQzkfEWFDUP0Nl9CIjauX2/gW8LzsCzLZWdmNWGIREGXMmO64tMH3Q9IviW8jOabzaRcGv1+UIVgiCaUFBcLUoibwoKCRxiiMUGTkqxW/0RRlIPvwAfjcW8vFadU7gyKEAyR8nxFhY3voL3h0NcHbRiiSYEWIsrOzGpiWZbjO2BvsNvtTm1KamVApcil0VnNN5sFvZY8z+WLl2wYIlVAhGCIxhQWFDziO8hXYdWKlbcxRGF+lxKnVO583NsrqAc1b2lvayNjsGyNX4VgiCYdPngoJBZXT2wpKnqCIRrrNynalNRKu93u5DswXzCbzbak+IQyvwjBEKkuX7wU0lni5vixYzYMkdRXIWGrVqy8zXcw/sLpdDK6hYuqfJISg2Wr29vaQmoLHoqa6moKQ5T6qlkydmvR5id8BxEI1q5Z04khCh+2lKT4hDKCIEbEWvI8XffukQr5tDxPsb/UFoZIWlC4qVyt0bw2bJshwBvjxklMj03JDwyG42YLYffqosXv6X5xOp0M33c0kFitVrtaNec7r4RgiFKv1NSE5JPrcCk/dYrGECmGEiJet/bjTr4nGywYhmGXpWfUDVq6jJ0Wk9/V1TWituChaKhvIDFEiz1lybg9u3Zb+Z4kH+Rt2GDEEElekKKZO++41Wq18z1BPjA+fEgpFbGb3S7Cn2ZJ7BdFRccSk5KCV+gVEJFRURFWKzmv6+7dk2YLQQEMkSgzY+l1hmFCosQYKGiadqRo5p8BAIDwCePHpx88cviTKVOnvvidGkWIxeLwyMhIeXPTDT3I35hn5PsuCQWO4/pzlmffCJs8efKozpCBiEQiMHHSxEgwY7qisLu7e1Q9m3iiqbGRlEujl4bRNP3tvj17+e3nEAAsy/aX7j/QynHcZQAAABiiBVdra0fF7x1PnDtzlsYQvfWMqSU6XbVQmm+CDUmSds2cuSfcLv6vp7Aupj4q6vUNs+LjxcFKWaFQVlpqr7t2bbHZQtAvvKhKTDzcZ7HQfN+5YGIwGMjYaTGbBnp4pvI2JkJS39fXt+ldrXbUPO4Xffb533fv3PnQbCE4j2+KwbLcjo6OUbFF1/5aS2GIFgxpDkMUtnrVB3/xPeFA43K5mCU6XbXXKYUhmlel14/ISr6bE9//QGOIZF5LAQCARdoFFx0Oh4vvyQcCC2GhZyckHvEUu+cDIY6rk0gk+bNVqohh2QwBivfupW40NmaYLYRj2BcnzIzbZzKZRtTXqKOjg4zBstzB4h60u4cgiP2lJfttw7YpYIp373nIMIxvreoyJM1pbWkZEVt0lV5vwxCpfTaLIRLlLM/+g+NCu1rpcDhcC7XaSz4LGSBm1oXKypBeW745etSGIZriNykAAJA6P/ksTdMh2eJlMpmohJlx+7yN1esejXCRqI5l2QK1RhNyW/SuL3f0/XnrVpbZQrj8PrhSEbvFaDSGVDGqtaWFlCHpCr/LcIMhkuRvzOvhO1BveVqdvxXw//7IpdFLGuobQiJbLlRW2jBE8QEVAsB/W/Sy9IxrQj9RpGnamTo/+VzAhQwQozhdXi7oCl1ZaSmFIXozaFIAAECtmiPYLoWenh5SqYjd+qqxDb9t8ikSsfi6zUYXvpOSIrgTxm2bt/zT3t6eY7YQbNA/XIidT783NFByaXRG0GW4wRCFC6lHjmEYLjNj6XVft2Cfzng6DfdZcOVKbpVev0OXlmb0ZSx/cPb0adja0pLXabjf78s4/wJkXLNWiwYVWgAAAABJRU5ErkJggg==)

PCIe SR-IOV

Capable Device

Internal Routing

Internal Routing

Internal Routing

|  |
| --- |
| ATC |

|  |
| --- |
| ATC |

|  |
| --- |
| ATC |

PCIe Port

Function 0

VF 15,3

VF 15,2

VF 15,1

VF 1,3

VF 1,2

VF 1,1

PF 15

PF 1

A-0629A

Figure 9-8 ExampleSR-IOV Device with a Mixture of Function Types

Key observations to note:

• Each Device must contain a Function 0. Function 0 may be a PF (i.e., it may include the SR-IOV extended capability).

• Any mix of Functions can be associated with the captured Bus Number.

◦ Non-VFs can only be associated with the captured Bus Number.

• If the ARI Extended Capabilityis supported, Functions can be assigned to Function Groups. The assignment policy is outside the scope of this specification. If the ARI Extended Capabilityis not supported, Functions can still use the Function arbitration capabilities as defined in Section 6.3.3.4 .

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**9.1.1 PCI Technologies Interoperability**

Establishing clear interoperability requirements is critical to the success of any technology. To this end, the PCI-SIG I/O virtualization specifications are organized to maximize the interoperability potential for compliant implementations.

Conceptually, this is viewed as a set of concentric circles that define how functionality is layered to build an IOV capable component as illustrated in [Figure 9-9 .](#bookmark24)

![](data:image/jpeg;base64,/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/2wBDAQkJCQwLDBgNDRgyIRwhMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjL/wAARCAGxAZEDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD3+iiigAooooAKKKKACiiigAooooAKKKKACiiigAoopKAFopjSIilnYKo6knAFY0/irTEcx20kl7KONlohk592Hyj8SKANykJwK5l9b1q4P+jWEFqn966l3uP+Apx/49Vd4tUuf+PrWrjB6pbIsI/Plv1oA6wyKoyTge9UbjXtJtSRPqVpGR2My5/nXNnQ7CQ5uIXuj63UrTf+hE1bgs7W3GIbaGIeiRhf5UAXj4u0XOEu2l/64wvJ/wCgqaRvFdiMbbfUHz/ds5P6ioaTHtQBN/wlVt/z4an/AOApo/4Su1HWx1ID/r1aoqKAJh4s00DMiXsf+9Zy8fktPXxboTYB1GKMntLmM/8AjwFVaDyMEZHpQBs2+pWV2M213BMP+mcgb+VWQwNcjPpOnXJzNYWsh9WhUn88VGukRQnNpc3tr7Q3Lhf++SSv6UAdpRXJpNrtt/qtSjuVH8F1CM/99Jj+RqePxJeQYF/pUuO8lo4lUfgcN+QNAHS0Vm2WvaXqD+Xb3iGXvE+Ucf8AAWwf0rRDA9KAFooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigApCQOpqjqWr2elRq1zKFZ+EjUFnkPoqjk/hWDPqOr6rwn/ErtT6Yadh+qp+p+lAG7qOtWGlhRdXKrI33YlBeR/ooyT+VY8ut6vfZFnaJZQn/AJa3XzOR7Ip4/E/hUNpp1rZlmhj/AHj/AH5XYs7/AFY8mrVAGe2lJcuH1Gea/fri4b5PwjGFH5VeREjQIiqigYCqMAfhT65+bWrmDxlDpbJH9ilgB34O4SncQM9MEIaAN/ilz71xMviu92X0ga1ggS9hhglkUkCJzgs3Iz0J+mK1dH1me81Sa2+1Wt9apFv+1WyFVR842E5IJxzx6UAdDmkyPWud1HxDNY3+oQLFHIIo7cQLnG6SVmXk+nAp8t5q+nXdpDeTWs63jGFHjiKeVLtJXILHKnB9DTA38+9LXPRa5cXGmaaY0Rb66n8mRCOEKE+acewVsfUVJr+q3OnS2qpJHa20m7zbuWFpFjIxgEAjGcnk8cUgN2iuft/EJSC1jmQXlzcCRojYYdJEQj5s5+XqOCfaph4nsXiiaCO4nkkVmMKR/OgU7W3A4xg8Y65oA2qTNZB8R2TbDbpPcoYkmd4I9wjRvuk9/wABk1jar4nu4Ni2pG3zrgST/Z9wRYhnG3cMn1Pp2oA7GishdegMwhWG4m27FlljiykbMAQD37j1xnmmv4igSfyTa3Ss4fyWePaJSgJIGTkdDjIGaANmkxXMWHitJra1muo3SS4tYpVtkjyzO5IAU55zg8egyaut4mtFMUYt7o3MkrQ/ZhH+8Vwu7B5wOOc5xTsBp3Vla3ibbm3imA6b1Bx9PSoYra/sDnTdSlVR/wAsLrM0Z/E/MPz/AAqTT76HUrNLqAMEbIw64ZSCQQR6gg1apAJF4oe2O3V7J7Yf8/EP72H8SBuX8Rj3regure6hWa3njliYZV42DA/iKwaotpiRTNcWEz2NwTkvDja/+8h4b8s+9AHZZormYPEdxYkJrNuFj/5/LcEx/Vl6p9eR710UM8VxCk0MiSRuMq6MCCPY0ASUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRWfqmsWmlQK87MXc7Y4YxueRvRR3oAuySLFGzuwVFGSxOABXNXPiC51ImPRVVYDwb6Vcqf+ua/wAX16fWqksN3rLiXVsLADlLBGyg9C5/jPt0Hv1q+AAAAMYoAqWmmw2sjzZea5f79xM26Rvx7D2GBVyiigAooooAKw9V8O/2lNdTLdtBLLFEkbqmTEyMWDDnnr0rcooA59vC8e/93cbEEttIF2ZwIe3XvVmTQLeTUprhtjW1ygFxaumUdwflf2OOD68ela9FO4WMKbwxayyXmJDFFPDDHGka4MJjLFWU/Ujt2qWPSLya+trnUr5LgWpLQxxQ+WN5BG5uTk4J9BzWxRRcGZFroKW2u3GpCZmWQEpCV4jZsb2B99q/r61Nf2N5NcxXNlffZ5EUoySIZI3B9VyOeOua0MiigDk00DULLWLJ7O4UMVuZLidoMx73KcBARgfLxz271I/gyLMUyzQS3QWQSvdWwlVy7biwXIwc9OenFdR+Boz7GkBgxeHZrNj/AGffi3WSGOKbEAydgwGTBAU4J7Ee1RT+FBNDJH9sI3vdNny8/wCuUr69s/j7V0dHBoAxYdDuLWd/smoeTBM8ckyiLLllVVO1s8AhRng1StfCJgvLed7yNzDI7F/IxJKGDD53yckZ7AdOldRRQBysHg+SEWj/ANoAz2cMccDiHgbC2Cw3c5ViD09auweHmTUodQmuxJcidp5Nse1WzH5YAGTgAY7mt2ii4FLTNP8A7NszbiTzP3kkmcY+85bH4ZxV2iigAooooATGaoCxmsZWuNJnFtIxy8LDMMh917H3GPfNaFFAFnTPEMV1MtndxGzviOInbIk90bow/UdxWyDmuWu7SC9gMNxGHQnIz1B7EHqD7imW2rXmiYj1B5LvT+10BmSEf9NAPvD/AGhz6jvQB1tFRwzRzxLLE6vG4DKynII9QakoAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAoorB1rW5IpjpumhZL5ly7tylup/ib1Povf6UASaxrospFs7NBc6hIMrFnCoP77nsv6noKybWxMc7Xd3Kbm+kGHmYYwP7qj+Ffb880+yso7ONsM0ksh3SzOcvK3qx/p0A4FWqACiiigAooooAKKKKACis/UNasNMIW4nHmt92FBvdvoo5/HpWFceINTu8i1hSxiP8cuJJf++R8o/EmgDq3kSJC8jqijqWOAKx5/FOlxsVglku3H8Nshf/AMe+7+tc3JaLcOJL2WW8kHObhtwB9l+6PwFWAAoAAAA6AU7AXpPE1/L/AMeunRxL2a5m5/75UH+dVJNR1qbO/UUiHpBbgfq2aZRRYCNhdSHMup37+wnKD8lxURsYWPzvcSf79w7fzNWaKAKh0uxY5a1jY+rc0o02zUYSAIP9hiv8jVqimFiutoEP7u5vI/8AdupB/wCzVMkmoRY8nVr1cdnZZP8A0IGnUUATx6zrcPWa0uR6SRGM/mpI/SrcXimRMC70ydfV7dxKv5cN+lZtFIDpLPXtMvn2Q3ieb/zzkyj/APfLYNaWa4WaCG4TZNEki+jqDRAbywx9gvpYlH/LKQ+bH+R5H4EUWA7qiuat/FLxYXU7QxjvPb5kT8R94fkR71v211b3kCzW00c0bdGRgRSAmooooAKKKKACkpaKAM+FbrQ5Wn01DLasd01iDj6tH2U/7PQ+xrqNP1G11O0S5tZQ8bcHjBUjqCOxHpWNVGWC4srs6jpmBcH/AF0DHCXC+/o3o34HigDsqKoaXqlvqtoJ4CRg7ZI3GHjYdVYdjV+gAooooAKKKKACiiigAooooAKKKKACiiigAoorI1zWP7OgSK3QS305KwRE8E92b0Udz9B3oAh1zWZYJBpun7W1CVc7iMrAn99v6DufYGqFlZx2cGxCzMxLSSOctIx6sx7n/I4ptjZfZEdnkM1xK2+eZhzI39B2A7CrdABRRRQAUUUUAFFJmuf1PxHtle00tUnnU7ZJm5iiPof7zew/HFAGrf6naaZCJbuZYwThF6s59FUck/Subu9Z1LUsrDnT7Y9xgzOPryF/DJ+lU0tz57XE8r3F033ppOuPQDoo9hU1OwEMFrDbBvLQBm5ZySWY+pJ5P41NRWZquovYtborQxLKSGnnzsTHY4xyfqKYGnRWHLq91GLNXaziM7yAzM+6MhRkFcHv6E1b0q/kvmuY38p/JYKJYDlHyM8e478mgDRorGm1iZb+ayhiRpvOWKLcSBym4lvp7UT6lfWbTQSxQyTiEzxMgIVwpG4EE5B59aANmiqP9oebd2UNuFdZ4zM7f3UxwfxJH61V1DVJrfUfsqSWtuPLDK9zkCUkkbVOR0x+tAGxRWe2rRQeVHcxyLcPF5rRRqZNo7nI7Cj+29P2q6SNJGVDmRELKgPQsR0oA0KKpPqtqLgwguSG2FwhKBsZwW6ZrMfxBKZUCeWIhbCd5WicqcnAx6CkK50FFUV1a0aYpufGWHmFDsJXqA3TI5/Kq11r8EFncTJFMzxRCVUeNl3qTjIyOlMZr0VnDWIVMvmE5EixpGsbFySu7GO5xQ2t2QEe0yu8m7aiRMWyv3gRjgigDRqAW5in+0Wkr2tweskRxu/3l6N+Ip8E8d1bxzwtujkXcp9RUlAGhZ+J3t8R6vEsa9BdRAmP/gQ6p+o9xXSRyLKiujKyMMhlOQR7VxWARg9KZavdaVIZNOYeWTl7Vz+7b/d/uH3HHqKVgO6orO0vWbbVUYR7o7iMfvYJOHT/ABHuOK0aQBRRRQAUUUUAUJ4bi0vBqWmgfagAJYScLcIP4T6N6N278V0umalBqlklzbElSSGVhhkYdVYdiKyKoP52k3p1OzRnRsfa7df+Wqj+NR/fH6jj0wAdnRUNrdQ3ttHcW8iyQyKGR1OQwPepqACiiigAooooAKKKKACiiigAoopCQO9AFTUr+HTLGS7uCRHGM4AyWPYAdyTwBXNWUE8k8mo34H2yccqDkQp2jB9u57n8KJrk67qn2jOdPs3K247SyDhpPoOg98n0q9QAUUUUAFFFFABTJJEijaSRgqKMszHAA9zSTTR28LzTSKkaKWZmOAAO9cbf38mvSbmDR6cpBjhYYM2OjOPT0X8T7MCXUNYn1kmK0Z4NOPDSj5XnHt/dX36n2qvHEkMaxxoqIowFUYAp9FABRRRTAKp3lvdSPHJazopUENFKu5HB9cdCKuUUAY1roXkzwSyPEdsskrxrHhMsoGFGeBxn8alh0WJDLDIQ9mX8yGLJHlE/eAwenoO3NalFAGRLorteTXcU6pN5ySwkrkLhNhDc8gip7exna++23skbyiMxxpGpCqCcnryScCtCigDN0zSjp7zM0vmBjtiGMbIwSQv5k0t3aXsk7tDNA8MigNDcxllUjuMH8xWjzRQBz0emXtnfQQWki+XHZeUZZYyQTu7YPb0qGLwjHbRmKJ4ZEdFV2nQkgjqQAQOfQ10/NLk0gMqLTrm3lkSC4jS2klMrAplxnqoPTGarDw/ILIwC4XJtFt92zuG3ZrdopiMmPTL2CB7WG9WK3zIUZU+cFsn1xwTn8Kpnw5K6z7poUMtsYCUViSSQdxJOT06V0VFAzGGj3IuTeCeL7T5wlA2HZ/q9hHXPvT7XSJIL5Lt51aQ+a0gC4BZyOnsMVrUUgKun2pstPhtmYOY1xuAxmrVFFMAooooAhmg8x0lR3huIzmOZD8yH+o9QeDW9o+vm4mSx1ALFen7jL9yf3X0Pqv8AOseop4I7iPy5BkZyCDggjoQexFAHd0Vzei65Ks0enak+ZW4guO03+yfR/wCf6V0dSAtFFFABRRRQBSs7k6DqIRjjTLuTn0t5SevsrH8j9eOvBBrmLiCO5geCZA8UilWU9CDU/h2/kRpNJvJC1xbruikbrNF0DH/aHQ/ge9AHQ0UUUAFFFFABRRRQAUUUUAFc/wCJb2QrFpVrIUuLsHe69Yoh95vYnoPc57VtXVxFa20s8zhIokLux6AAZNcnp4luXm1O5UrPdkMFbrHGPuJ+XJ92NAFuGGO3hSGJAkaKFVR0AHQVJRRQAUUUUAFIzBFLMQABkknoKWuS16/OpXL6ZA3+iRHF06/8tG/55j2H8X5etAFbUNQbXpwRkaZG2Y1PHnsP4yP7voPx9KKTAAwOB6UtUAUUUUAFFFFABRRRQAUUWcN3qspi0u2Nzg4aYnbEh927/QZNdTYeBISBJq90923/ADxjzHEPwBy34nHtSuByC3KSzeTAslxN/wA84EMjD646fjWpb+HdfvMEWUVoh73Uo3f98pn+Yr0O1sraxhWG1gigjXokaBR+QqxRcDiIvAt04H2rWGX/AGbaALj8WLVdj8B6aP8AW3moyk9cz7P/AEECuqopAc1/wgmh90vD9b2b/wCKpD4E0XHy/bVPqL2U/wA2rpqKAOQl8BWoB8jU9QiP+0yOP1WqM/grVYsm11G2nHZZ4jGfzUn+Vd7RQB5bc6ZrFgCbvS5ig6y2xEy/kPm/8dqnFcQzkiORWZfvL0K/UdR+NeukcVm6l4f0vVh/plpG7jpIPldfow5H507gec0Vt6h4Nv7El9NuftkXXyLghZB9H6H8fzrAWX9+1vLHJBcp96GVdrj8O49xkUXAkooopgFFFFABRRRQBHPBHcRGKVcqfQ4IPYg9j71s6DrMjuNNv3zdKuYpj/y3Uf8Asw7j8fplVDcQefGAHaORW3xyL95GHQigDvKKyNC1c6lbvHOFS9gwsyDofRh/sn/63atepAKKKKACqOowTMsV1aYF7at5kOeN3HKH2Ycfke1XqQjNAGxpuoQ6lYQXcGfLlXOD1U9wfcHIP0q5XJ6VN/ZWvNbMcWmoEvH6JOBlh/wIDP1B9a6sc0ALRRRQAUUUUAFFFMlkWKJpHYKqgsxPYCgDm/Esxvb220ZCfLbFxdY/55qflX/gTfopp46VQ0xnu/P1SUESXz+YAf4YxxGP++cH6sa0KACiiigAooqK4njtreSeZwkUal3Y9gOtAGV4h1R7K3S2tWH225ysZxny1/ic/TPHqSK56CBLeBYowdq+pyT7n3NIkst7cy6jcArJPwiH/lnGPur/AFPuTU1MAooopgFFFFABRRTMyzXMdpaRGe7l+5EDjjuxPZR60AJLMkW0EMzu22ONBuZ29FA6mui0jwZNehbjWvkjPK2KN/6MYdfoOPXNbXh/wxBpH+kzsLnUHGHnIwFH91B2H6nvXQAYpXAjggjt4lihjSONRhUQYAH0qWiikAUUUUAFFFFABRRRQAUUUUAFFFFAB+FZ2raJY61AIryANt5SQHa6H1VhyK0aKAPMdX0S/wBALSSk3enjpcqvzxj/AKaAdv8AaHHqBVMMrKGUggjII716wVDAggEHqDXC6/4Uk09pL7R4i9v96ayXt6tH/wDE9+2O7AwqKZFNHPEssTBkboRT6YBRRRQAUUUUARM81ncx6haqWnh4ZB/y1Tuv17j3FdpaXcN9aRXNu2+KVQymuQqxoN4dO1M2LnFteMXh/wBiXqy/RgCfqD60gOuopB0paQBRRRQBU1C0N5ZvGrbJQQ8Un9x1OVP4ECt3RdSGqaVBcldshysif3HBww/Ag1mHkVBpMp0/xFJbk4g1BfNT2mUYYfiuD/wE0AdXRSUtABRRRQAVz3iqdnsotMQ4e+l8psdowNzn8hj/AIEK6GuTuJPtvia6m6x2aC2T/eOGc/8AoA/A0AWFAVQAMAcACloooAKKKKACuX8T3X2m5h0pD8nE9z/ug/Kv4kZ+i+9dJcTR29tJPKwWONSzE9gBk1w1s0lwZb2cETXT+awP8I6Kv4LgfXNNAT0UUUwCiiigAooqOaZIImlfO1ewGSfYDuaAFYytNFb20Rmupm2xR5xk+pPYDua9A8O+HotDtmLN515NzPORgsfQeijsKp+EtAbT4Wvr1B/aFwo3Dr5KdkH9fU/QV04GBU3AOlLRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFIeRS0UAcN4q8ONayy6vpsRZT813boPvf9NFH94dx3+o551HSWNZI2DIwypB4Ir1o9K848R6P/Yd99qgXGnXT4ZR0glP8lb9D9aaAzqKKKYBRRRQAVFcw/aICgYo4wyOOqMOQfwNS0UAdLoupf2ppkVwRtlGUmT+644I/wAPYitGuQ0W5+w66YScQ3w6dhKo/qv/AKCK68c1IBRRRQAVQ1aOQ2f2i3BNxauLiIDqSvO38Rlfxq/QelAG5Z3Md5aRXMTbo5UDqR3BGanrnvCs3k211pp62cxEYP8Azyb5l/AZK/8AAa6GgAooooAhup0trWWeQ7Y40LsfQAZrktGR10yOWUYmuC1xIPRnO4j8M4/CtbxbJ/xJPsufmvJUthj0Y/N/46GqAdBxigBaKKKAEpaKKAOd8V3GbaDTlPN3J+8HpGvLfmdo/GsunahN9s8Q3k2cpbgWyfUcv+pA/wCA02mgCiiimAlLRRQAla3hXSv7W1Y30ozaWL4QHpJN6+4UfqfasaXzW2QW4BuZ3EUIP949/oOSfYV6hpOmw6RpkFjAPkiXGT1Y9yfcnJ/GkwLlLRRSAKSlooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKAEpaKKAEqvf2UGo2ctpcoHhlUq6nuDVmigDyV7abT72fTrli01ucByP8AWIfuv+I4PuDS11XjnTM2cesRL+9s8iXH8UJ+9+XDfgfWuVByMjpTAKSlopgFFFFAEF2kjW5aH/XxESxH/bU5H6jH412tjdx39hBdxf6uZA4/EVyNanhSfbDd2B/5d5d6D/Yfkf8Aj24fhSYHRUlLRSAKSlooArWz/ZPE9s/RL2FoHP8AtL86fp5ldZXF6y5gslvF62cqXHHop+b/AMd3V2asGAIOQeaAFooooA5nXnM+u6XbfwxJLcsPcAIv/obflTqrzt53iq/ftBBDCPYnc5/mtWKAFooooAKiuJktraWeQ4SNC7fQDNS1i+KpNvh+eIHDXDJAP+BMAf0zQBzVgHNlHJJ/rZcyyf7zncf1NWaMUVQBRRRQAUUU13WONnbhVBJPtQBteD7EXmvTXzgmKxXy4/8Arq4yT+CkD/gRr0GsDwfYmz8M2vmLtnnBuJfXc53Y/AED8K36kAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKAI5okmiaKRQyOCrKehBGDXk/2VtOubnTZMlrSQxqT3TGUP/fJH4g165XBeNbT7PrtpeqPluYjA5/2lyy/oW/KhAYdFFFUAUUUUAFSaXL9m8SW56LcxNC3uw+Zf5P+dR1XupPs4guun2adJT9Aw3foTQB31FFFSAUUUUARzwpcQSQSDKSKUYexGK0PDNw914csJJTmUQqkn+8vyn9QapVJ4UbbbX9t/wA8byTA9mw4/wDQqAOgooooA4+0Pmajq0x/ivGXPsqqv9DV2s/SMtazSH/lpdTv+crVoUAFFFFABXN+Kn3SaZb+s7Sn6KhH82FdJXKeIm3a/Zp/ctZG/wC+mX/4mmgKlFFFMAooooAKinhN15NovW5mSHj0ZgD/AOO5qWreiRfaPFelR8ERvJMQf9lCB+rCgD0uNQqBVGFHAFPooqQCiiigAooooAKTNc9rGsX9rrNnp1ktmDPBJM0l07KBtKjAx/vfpVODxZc3UEMEFrC2oy3ctqoEhMJ8sZaTdjJXGO2c8UAdaDmlrmLnW9W02P8A06yh3G6ghSWKQ+XIsjbSQDyCPQ8dOal8Ra5PpE+nQwm0Q3crIZbuQoibULdR3OMUAdFRWDZa/FHBG+pXtgfOkKRyWrl48gZwWPAOM9aV/F+jJszcSnfCLgbbeRv3WSN5wvA4PJoA3aKxpPFejR3M0DXZ3QoJJGETlUUjIJYDHI6c80o8UaV5MsrTSx+VtLJLbyI53HC4QqGOTwMCgDYorEbxXpKRqxlnyd+YxbSF1C43Fl25AGRyR3o8Ra6dJ8PtqVsIpMtGFMrFUw7quSewGc0AbdFctp3ioNcXsd9JZPDaxJI1zZSmWP5iRsPGd3A4Gc5FaA8U6UYXfzZgySiExNbyCTeRkAIV3HIyenagDZorIk8T6VHBDL50rCYMURIJGfCnDEoF3DB65FaVvcw3dvHcW8iyRSKHR1PDA9DQBLRRRQAUUUUAFc145tvM8NS3AHzWkiXA9grDd/46Wrpap6tbC90e9tSMieB4yPqpH9aAPMqKgs5TNZQSHq0ak/XFT1QBRRRQAVDdw/aLKeH/AJ6Rsv5jFTUUAdVpVz9s0mzuT1lhRz+IBq5WN4VbPhu0Xr5YaL/vliv9K2akAooooAKTw8dmu6vD0DLBN+YZT/6AKWodLYp4rlHaSxB/75c//FUAdTRTc+9FAHGaDzols395S35kmtKs3w//AMgCx/64itKgAooooAK5HWiG8SuO6Wkf6u/+FddXH6v/AMjPcZ/59YcfTdJTQENFFFMAooooAK1fCSh/FwJx+7spCPxdBWVWt4O/5G6X/rxP/oxaTA9DooopAFFFIelAASB3pNw9ax/E4ZtBuY47xLOSTaiTOxUZLABcjkZ+7ketcm1/N4c+1iLT3sb4wK0du9yZraUeYql8/eB+YenXvQB02p+HbfVtes728ht7i3t4JI/KmQN8zFSCM8cbT+dO1PQ2kFhNpfkWtxYOWhUpiMqwwykDoCO47gVi6j4p1TSriXTpYobi8MkAilggYqFk39U3Ekjy24BGcipLXxJq97NbaeLeO2u5ZpVE9xbsqlEVW3CMtkE7gME9jRYLFm60TV9STffXsAcXUEqQRA+XGsb7jyeSx9fpxVjxFo13qlxptxaCykazld2jvFJRwyFe3cZzXMa7eX99fR2RltEZms1neEuyyZlcFR8wwPl6deozWjpWsX9w6abp62ls+66kaSZXddscxQADdnJ6k549KEDLV74av9Y0n+y79dPtLN5d8y2KEFgBkAZHB3YJPoMVU/s7Xv7dkiU2rFtKS3lneNghO9xlcdwDkr70up+LdQtd8sDWkqW6wmZIYZJVy2CcyghV4PHU9D3rOn1++0uS/vLordxw6rMI0wylFW334HzY9uRjqaQPQ1v+EJ/4lWpaf9pHl3C24hY5yPKVQN2MdSvamp4PnMN2zwaeJpY0jVGaWZWAbccs7ZGe2OnXmm6l4l1nSIWScWM9xLafaYTGjKqEOilWG4kj94MEY6GtjTr/AFH+3LnTb9raTbAk6SQRsmNxYFSCxz0609QMV/B+otbqrTW8hV3MSvNLutgQANkoO8gYJIPBz2xWvrOhXeo+FY9LW4jluV8kmWdflkKMrHcB67f1qG81nV4vFttYxaYz2jwyMT5qDdhkG71GMnjvmreq6lfLq9npdg1vFJPFJM006FwAu0YCgjJJYd+1AGNJ4W1WZ7i5WexsJ3hWFY7FGRGG8Md565IBUEDIyaavg+7VL3f9hdbiWKQQu0jAbVIP7wncG5BDD8qLbxNrGpylLX7BB5Vm08plRmDOsjoQMMMKdmc9s96z012+u7u7vXKtay/2a0Vu4YeV5rc8hvr254oBl+TwXeN9kne5jurmKKWEpNNKqqjPuUB1O47cY+bqPSur0ewXS9KtrFQgEMYX5AQue+ASTj8a5qLxLqXlWuoOtqbK7u3to4AjeZHjeFYtnB+5yMDGfao7LxTqqWdnd37aeI7zT5LtdqugiZQpwxycg7uwz9aAZ25YAdaAQe9ed3XiDUbpZbO5ONkllMki2727ENcBSu1mJI468Zz0r0Je1AD6KKKACkYcUtI3SgDx+yQR23lAYEbumPoxH9KsVDB/y8en2mfH08xqmqgCiiigAooooA2vCnGjuv8AduZgP++yf61uVg+EsnTLnP8Az+S/zreqQCiiigAqCyJHi61HZrKb8SHi/wATU9V7T/kbbL/rzuP/AEOKgDqce9FJRQBxug8aJar/AHVK/kSK0qz9I4tJI+nl3M8eB7SsK0KACiiigArkdaAXxM5zy1pH+jv/AI111cp4hXZ4gtW7SWrj/vl1/wDiqaAqUUUUwCiiigArV8IsE8XDP8dlIB+Dof61lVc0GUQeLdLc/wDLTzYM/VN3/slID02iiikAUhpaKAILq1gvLZ7e5iSWGQbWRxkEe9YN94N0ybSrmytYkga4CK8jZkO1WB28nOOOldLSUAYlxoui2OkXQmtkFqB50xYkn5RkNuJzkAcc8Vk2w0m/8Nxz2Wi3EyCdmMAYLNHJyGJYsDn1571p+K7e81HSk0yzUj7ZKsU0u3cscXViRkZyBj8aqaJYajpWtalbXBE9rdqLmOaKLy0WTG11xk9cKffmlrqBn2moaM+l2+pDw9cwwt5KWgITMrBj5aqA3BBJPOOtaNnZ6Vq3m2lxo8tpLbuZDHJwf3hJJDKxBBOcjP1FU4dPuIvAOl2lzpUly0Qj8+BH2yxgfxJgj5gcHqO9XPDS36Xl2rNqDaaETyDqI/e7+dwB6lcY+9znNMDPuzoEtzdZ0Cee0hkW2uJ41HlhlwB8gYE7eBnbxj2rbaw0H+0SXW1+1yybyjPyz7Nudueu046dK53ULe/ivL2TT7LU7PVXmLRG2bdaT9NruD8o4A3cA9cZqncWQvdY19INNkk1Y3sJgvFTKwkRxknd/Djkn1zjmgLHWQaN4eBubKGG2ZyqiWLzNzKoOQMZyAD26VZludOhuLq9i8qa7ijEcqxyLvCg5AOSAOSetcNpegaxFdIT5sWpKJ/OuRaKgcsGA3S7suCSpAxxgdKE0aQeFjaRaBcx6t9hMd3cHgyyZXOWz+9JIJz29s4oA7CDxLaXesT2EELSNbyiCSUOmA5XdjG7P6dfpVi8i0bWI2Nw8EwtidzrLgxHHOSDleOtcdeeHLwXl5Np1gIbiXUzIkyoF4NsQGJ9N5P45qQ6esejNFY+GpY5Gihjuy6kByHG4lAR5pA3EnPPTJzQCL19YeGl1KyvZprZ7OaNdOtooj8u7czdQenYitU22hT7NQu4ILd5CsamWQLu8tsoODg4IyK5C30GZWDS6Qz20OsQ3CILRYx5ZjCsyxgnA3dR14zU9no8kMcH9saNNfWwgmSKERCTypTKxzt7FlK4btjqKAOsbT9AtdSM7rax3TscBpMfM3UhScZPqBk1bbQdMe3ht2tImhhha3RCOBGQAV+nA/KuEHhe9lsNSk1CxNxqKadbJbysNzeaqnOxv7wOOa9JhzsUNnO0ZoDqZkXhjSInLrZpvIQF2ZmYhG3Lkk5OCMitcDAxmlooAKKKKACkbpS1V1O4Fnpd3dE4EMLyH8ATQB5RZOJLbzQeJHeT/vpif61YqvYxmKwt0PVY1B+uKsVQBRRRQAUUUUAbXhTnR3b+9czH/wAfI/pW5WN4VXHh21f/AJ6b5P8Avpyf61s1IBRRRQAVXsgW8XWp7LZTZ/F4v8DViodMBk8Vvj/lnY/h8z//AGNAHTUUYNFAHJWYMd9qsJ/gvXI+jBW/9mq7VaVfJ8U6jH0E0UM49zhkP/oK1YoAWiiigArmvFSbLjS7j/pq8JPsyZ/mgrpaxPFce7QpJgObeSOf8FYZ/QmgDEoooqgCiiigAqOSf7JLbXv/AD7Txyn/AHQw3f8AjpNSUyWNZoXib7rqVP0NAHrSncuQcinVi+E743/hqzlkOZkTyZf99DtP54z+NbVSAUUUUAFFFFACYFG0elLRQAmBRtHpS0UAJtHpUUdrBC8rxxIjytukZRgucAZPrwAKmooATaPSk2L6U6igBu0elLtHpS0UAJtHpRtHpS0UAJtHpQBjpS0UAFFFFABRRRQAVznje48rwvcw5+a6ZbYD1DsAf/Hc10dcJ43uvO1awsFPywK1zIPc5VP/AGf8qAMGiiiqAKKKKACormUW9pNMekaMx/AZqWq17H9ohS1HW5lSH8GYA/pmgDsdHtjZ6LZWx+9FAin6hRmr1IKWpAKKKKACk8Pjfr+rS9kjgiH1AZj/AOhiin+FV3Ralc/89r1wD6hAE/mpoA6GiiigDmNcXyfEGnXHQTRS27H34df0V6fU3i1NukR3gHNncRz/APAQdrf+Os1Q0ALRRRQAVDd263dnNbOMpKjIfoRipqQ0AcDYO72UXmf6xB5cn+8vyt+oNWaW8g+x67fW+PklIuY/o3Df+PA/nSVQBRRRQAUUUUAbngu9+zavdac5wl0v2iL/AHwArj8tp/Ou9ryKSSa3eG8tRm5tpBLGP72Oq/iCR+NepadfQ6lYQXlu26KZA6n2NSwLdFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQA13EaF2ICjkk9hXlEt4dU1C71M5xcyZjz2jXhPzA3f8Crr/G+pmDS102FiLi+OwkdViH32/L5fqwrj1UIoVRgAYAHamgFooopgFFFFABT9Oi+0+I7NOq26vO31xtX/ANCP5UytLwrBvN9fkf62XyYz/sJkf+hFvypAdJRRRSAKKKKAGSyLDE8jnCIpZj6AVd8LQNB4bsQ6lZJIvNcH+8/zH9TWHrgMumtaocPdutsv/AyFJ/AEn8K7NEEahVGFAwAKAHUUUUAV762S8sZ7aUZSaNkb6EYrldHmkm0yASnM0YMMv++h2t+oNdjgVyTx/YfEd/bdI7kC7iHuflcfmAf+BUAW6KKKACiiigDnPFUHlra6kB/qJPKlP/TN8DP4Nt/Wsyuvu7aK8tJraYZjlQow9iMVxNqZEV7e4P8ApFu5il9yOh/EYP400BPRRRTAKKKKACtrwhqv9nag2lTNi3umMlsT/DJ1ZPx+8PfdWLUc8Qmj27mVgQyOpwyMOQw9waQHrtFc/wCF9f8A7YtGhuSq39vhZ0HG70cex/Q5Fb9IBaKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKguriO1t3nmkEcUalnYngAdTU3Fef8AizWv7VuzpVu2bO3fNy46SuORGPYHk++B60AZFxeSatqM+qTAr5vyQof4Ih90fU/eP19qSiiqAKKKKACiiigCC7meG2Zo13SthIl/vOxwo/Miuy02yXTtNt7NDkRIFz6nufxPNcxpNt9v15MjMNiPMb0Mh4UfgMn8RXZUmAUUUUgCiiigCrEn2zxNYw9UtUe5f2Y/In83P4V1tc74WiE/27Uzz9pm8uI/9M4/lH5tvP410VABRRRQAVzvimLyorbVVHNnL+8x3ib5W/L5W/4DXRVDcwx3NtJBKoaORSjKe4PBoAwQcilrP0ppIoZLCdibiyfyHJ6sAMo34qQfrmtCgAooooAK5XxJa/Y7+LUkGIpsQXHs38Df+y/iK6qoLy0ivrOa1nG6KVCjD2NAHIUVDCJoJZbK5Obm3O1jj76/wv8AiP1zU1UAUUUUAFFFFACLJcWt3Fe2ThLqH7pP3XXujex/TrXouha5b65ZCaEFJVO2aFvvRN6H+h6GvO6SGW5sb1L6wkEd0gwQ33JV/usPT36j86QHrVFY2g+IbXW4CEBhuo8edbufmQ+vuvoRxWzSAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACkPSgnAzXGeIvFjeZJp2jOrTj5ZroDKw+oX1f9B39KAJPFXiR42fStMkxdkfv516W6+g/2z2Hbqe1cnDEkESxRrtRRgCkhhSFNqZ5JJZjksT1JPc1JTAKKKKYBRRRQAVFcTrbQPKwJC9FHVj2A9ycCpal0ez/tPVxMwzaWTZ9nmxwPoo5+pHpSA3tB05tO0xElx9plJlnI7ue30HAH0rTpKWkAUUUUAFUdVnlhsWW3P+kzEQQf77HAP4Zz9BV2oNPi/tHxIH6wacufYzMOn/AVP/j9AHQ6fZx6fYQWcIxHDGqLn0AxVqiigAooooAKPwoooA5jX4fsGq2+qqMQzbbW59sn923/AH0dv/AvapK276zhv7Ge1nXdFMhRh7GuW02SZUlsrps3do3lyH++P4X/ABGD9cjtQBeooooAKKKKAMDxJprzRpqNqha6tgdyL1lj7r9R1Hv9axYpUmhSWNgyOMqR3FdxXH6xYf2PeG6iXGn3D5kHaGQ9/wDdb9D9aaAiooopgFFFFABRRRQAwxsJo54ZXguYjmOaM4Zf8R6g8V1Wj+NEylrrYW3mJ2pcrxDIff8AuH2PHoa5ikZVdSrAMpGCCOCKQHrCsG6cj1p1eV6bqOo6JgafOGtx/wAuk5JT6Keqfy9q63TvG+m3LLFfb9PnPG2f7jH/AGXHB/HB9qQHT0U1XR1DKwKnoQetLnNAC0UUUAFFFFABRRRQAUUhIHeq17qFnp9uZ7y6igjH8UjBaALVVb7UbPTLVrq9nSCFerOcc+g9T7VyWo+OZJQY9Gti+f8Al5uVKIPdV+8347R71zUolu7oXd9cPdXI+68nRP8AdXov4c/WnYDV1fxNe61uhtPNstPPBY8TTD/2Rf1+lZUcUcMaxxIERRgKBwKfRTAKKKKACiiigAooqKedLeEyPkgcAKMliegA7k0AJN50ssVnagG6nOEz0Qd3PsB+uB3rstPsYtNsYrSEHZGMZPVj1LH3J5rO0DSXs43u7sD7dcAbwDnyl6hB9O/qa26TAKKKKQBRRSUAVr+7SxspbhlLFB8qDq7E4VR7kkD8a2NA05tO0qOOUhrlyZbhh/FI3J/AdB7AVj2MP9reIBkZtNOO5vR5yOB/wEHP1I9K6wADpQAtFFFABRRRQAUUUUAJXOeJLQ2sketQKSYF2XKr1eHOc+5U/N9Nw710lNZQwIIBHoaAOcRxIqurBlYZBB6inVQWA6JqJ01sizmy9k3Ze7Rfh1Htx2q/QAUUUUAFRzQx3ELwzIskbgqyMMhgeoNSUUAcPdWcmi3a2srM9pIcW0zHOP8Apmx9R2Pf606uvu7SC9tZLe5jEkMgwymuOurefRrlbe6ZpLdztguT/F6K/o3v3+tMB1FFFMAooooAKKKKACkYBlKsAVPUHpS0UANthPYHdp15PZ4/gif93/3wcr+lbNt4w1u2AWeK0vVHfmFv03D+VZFFAHVQ+PbY4F1pt9B6sqrIP/HTn9KuJ440FwN148X/AF1gkX+a1xNFKwHer4u0B1DLqttg+rYpsnjLw/H97VICT2XLfyFcJRRYDs5PHWiL/qpLmc9hHayc/iQB+tUJ/HsjZFno8xPZrmVYx/46WP6VzdFFgL914i1694N5FZof4bWPLf8AfTZ/QCsr7NGZvPlLzz/89p3Mj/mc4/CpqKYBRRRQAUUUUAFFFFABRRTJZY4ImllcJGoyzHtQASSpDG0kjBUUZZieAK0tB0l55k1S+jK45tYGHKA/xsP7x7DsD6mo9H0aS+lS/wBRiKQqd1vbOOSezuPX0HbqeenVUgDFLRRSAKKKKACqeo3TwQpHbqHu53EUCHu57n2AyT7CrTyLEjO7BUUZLE4AHrSeHbNr6dtauEIDqUs0YcpET94jsW4PsMD1oA19I02PStNhtUYuVBZ3PWRzyzH3JJNX6AMCigAooooAKKKKACiiigAooooAoatpcWrWLW0jFDndHIv3o3HRh7g1z1jcyu0treII723IWZR0Po6/7J7fiO1dhWHrukS3YjvbLamoW4OzPAkU9Y29j2PY80AV6Wq1lex3tuJIwysCVeNxho2HVWHYirNABRRRQAlRXNtDeW7wXESyRSDDIw4IqaigDib+wuNDYs5efTv4Zjy0Ps/qP9r8/WkBDAEEEEZBHeu2IDAggEHgg1zGoeHJbRmuNIC7CcvZscKfdD/Cfbp9KdwKNFRQ3KTF0AZJYziSKQbXQ+4qWmAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUVDG097cNa6fF58ynDsTiOL/AHm9fYc0ALPOluoL5LMdqIoyzn0A7mtbSdAkkmS+1RRvU7obbOVjPYt/eb9B29avaToMOnN9olf7ResMNOwxtHog/hH6+prXpXAQUtFFIAooooAKKKo3D3F7eLpdg+24dd00w5FvGf4v9487R+PQUANS3Ovai1oB/wAS62b/AElu0r9REPYdW/AdzXZAAAADgVWsLCDTrKO0t49kUYwozkn3J7k9c1aoAKKKKACiiigAooooAKKKKACiiigApCAaWigDnNb0maK5OrabHuuAALiAcC4Qeno47Hv0PtBaXcN7bLPA25G45GCCOCCOxByCPauqIzXNavo01tdPqelJulbm4tc4Wceo7B/fvjB9QAOoqC0vIb2ATQtkZwwIwysOqkdiPSp6ACiiigAooooAztT0Wz1RVaZGSdBhJ4ztkT6H09jxXNXdjqOlZNxGbq2H/LxAnzD/AH06j6jP4V21FAHDRTRTxiSGRZEPRlORT637/wAOWF7K1witbXTdZ4DtLf7w6N+IrEuNK1ex5MSX0I/ig+WT8UPB/A/hTuBHRUEd5BJJ5W8pMOsUgKOPwPNT0wCiiigAooooAKKKKACiiigAooooAKKQkKpZiAB3NQx3Jun2WMEt4/fyVyo+rHCj86AJ6hluY4pViAaSdvuwxLudvwH8+laNv4cv7rBv7lbaP/njbHLH6uRx+A/GugsdLs9NjKWkCR7uWbqzH1Ynk/jSuBz1p4evL0h9Rc20H/PvC3zt/vOOn0X866a2tbezt0t7aFIoUGFRBgCpgMUUgCiiigAooooAKKKo3N3K1ythYRiW+cZwfuRL/fc9h6Dqe3qABbu6mNxHYWCLJfTDKhvuxr/ff2Hp3PHuN/SNJh0mz8pC0krtvmmf70rnqx/w7Dim6Po8OlwMQ7TXMp3T3D/ekb+gHYdq06ACiiigAooooAKKKKACiiigAooooAKKKKACiiigApKWigDn9X0ORp21HTGSK+x+8RuI7gDs3ofRuvrkVRs75LrfGyPDcxcTQScPGff1HoRwa66srVdDh1PZKGaC8i/1VzH95fY9iPY8UAUO1LVAXc9ldLZ6rGsMzHbFMv8Aqpj/ALJ7H/ZPPpmr9ABRRRQAUUUUAFFFFAFa6sLS+j8u7t4Z07CRA2PzrIl8J2w5sru5tD/dD+Yn5PnH4EV0FFAHIy6HrMH+rezulHqWib+o/lVV4tRh/wBfpN2o/vR7ZR/46Sf0ruKKYHANfQR/60TQ+vmwOn8xSDUrE/8AL5b/APfwV6BUb28Mn34Y2+qg0XA4X7dZ/wDP3B/38WkN/ZAZN5bj/totdv8AYLP/AJ9IP+/Y/wAKBY2gORawA+0Y/wAKLgcMdSsR/wAvURPorbv5VIlz5v8AqLa8m/3LZ8fmRiu7WNE+6ir9BinUXA4qOy1ef/V6W0Y/vXEyKPyBY/pVuLw3qUxBub+GBe628e4/99Nx+ldVRRcDFg8LaXEweeN7yQfxXTlx+C/dH4CthEWNQiKqqBgKowBTqKQBRRRQAUUUUAFFFFABSUyeeK3heWeRIo0GWdzgAfWq1raX2v4ZfNstNP8AGRtmnH+yDyi+55PbHWgBjXFzqNy9jpQUyIcT3LDMcHt/tP8A7P5479HpWk22k25jg3M7ndLK5y8rerH/ADjoKsWdnb2NsltbQpFCgwqKOBVigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAgurS3vLd4LmFJonGGRxkGuauNJ1DR8tY+ZfWI/wCXd2zNEP8AYY/eHsefc9K6yigDk7O+t75GaCTJQ7XQjDIfRlPIP1qzV3U9As9RkExDwXajC3MJ2uPbPcexyKxZl1XSf+Py3N7bj/l5tUO4D/aj6/iufoKALtFQWt3BeRebbzJKnqpzg+ntU9ABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRSVTuNTt7eZbcFprphlbeFd8h/AdB7nAoAu1Rl1DdcmzsYWvLwdYozxH7u3RR9efQGp4dG1PU8NqEpsbY/8ALvA2ZGH+1J/D9F/OugsdPtdOtlgtIEhiH8KDHPqfU0AZFh4cLTpd6vIt1cKdyRKMQwn/AGV7n/aP4YroAAOlLRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUhGaWigDIv8Aw5YX0puArW12f+Xm3Oxz9ezfQ5rKlsda08/di1OEd48RTD/gJ+VvzH0rrKKAONi1e0km+zu7QXH/ADxuFMb/AIA9fwzV4VuXdlbX0JhureKeI9UkQMP1rGk8KQxnOnXl1ZeiK/mR/wDfD5A/DFADaKgk0/XrQZ2Wd8g7xsYXP4HI/UVXbUmt+L3T76192hMi/mm4UAX6KpQ6vp1wdsV7bs393zACPwPNXAwIyKAFoozRQAUUUUAFFFJmgBaKY8scS7pHVF9WYCqTa5pobYl0sz/3IAZW/JQaANCiqST6hc82mj3TA9JLgrCv5E7v/Hasx6JrNzzc39vaL3S1j8xv++34/wDHaAFkkSJC8jqiDksxwB+NUU1T7WdmmW0183TfEMRj6ucD8s1sQeFdMikWW4je9mU5D3bmXB9QDwPwArbVQqhQAAOgHagDmItB1C+OdRvhbx/8+9kSD+Mh5/ICtux0uy0yIxWdvHCpOW2ryx9SepPuau0UAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRRQAmM0YFLRQBWuNPs7sYubSCYf9NIw386zW8J6MeUshAf8Ap3kaL/0EituigDAPhS1U5hvNRi+l0zD/AMezTP8AhGbgD5Nd1EfVYT/OOuiooA54eHbzH/Ibuf8AvzF/8TQfDl2f+Y7dj/dhh/qhroaKAOeHhhyf3ms6i49MxL/6CgNPXwlp5/1st/N6772T+QYVvUUAZEPhjRIWDDTLZmH8Uqb2/NsmtOOCKFAkUaIo6BVwKkooATA9KXFFFABRRRQAUUUUAFFFFABRRRQAUUUUAFFFFABRRSZAoAWikyKNw9RQAtFJuHqKAQelAC0UUUAFFN3j1H50m8eo/OgB9FN3j1H50bx6j86AHUUzePUfnS7x6j86AHUU3ePUfnRvHqPzoAdRTd49R+dG8eo/OgB1FM3j1H50u8eo/OgB1FN3j1H50bx6j86AHUU3ePUfnRvHqPzoAdRTd49R+dJvHqPzoAfRTd49R+dG8eo/OgB1FN3j1H50bx6j86AHUU3ePUfnRvHqPzoAdRTd49R+dG8eo/OgB1FN3j1H50bx6j86AHUU3ePUfnRvHqPzoAdRTd49R+dG8eo/OgB1FM3j1H50u8eo/OgB1FN3j1H50bx6j86AHUU3ePUfnRvHqPzoAdRTd49R+dJvHqPzoAfRTd49R+dG8eo/OgB1FN3j1H50bx6j86AHVw3i7Xr2z1WKOxa5IsYxd3CwQs4kG7AjYgHAKiQ89wK7fePUfnVZbW2SWeRYow8+PNYAZfAwM+vFAHO6prYvTYWlm91DHdXMaPcLGyAxlSw2ORg5wBketc0+t6u8t3D5901vpkMkpnjmVGdVmddxGCJCFQDBwDz6ivQn0zT5NOXT3tYDaKAqwlRtAHTA7YqF9B0iVLdH0+1ZLcYiUoMIM5wPbPNAHI67q19pMz39pfzTwzQ3BVmdWjEioSsYQfdK7Tk/UGuh8PvPDq+oae13NdQxQwSpJM25gzhtwz6fKDjtmtBdG0tL2S8WytxcSgh5Ngyc9fz7+tS6fp1hpcbR2NtDboxywjGMnpQBeopu8eo/OigD5UooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKAP//Z)

Figure 9-9 I/O Virtualization Interoperability

Key observations to note:

• At their core, I/O virtualization extensions build upon this specification. All IOV implementations must be

compliant with the [PCIe-1.1] or later versions. Where applicable, the IOV specifications note relevant deltas between these versions.

。 None of the IOV specifications touch the physical layer.

。 SR-IOV does not touch the data Link or transaction layers specified in this specification. 。 The [MR-IOV] does not touch the transaction layer specified in this specification.

。 All I/O virtualization capabilities are communicated through new PCI Express capabilities implemented in PCI Express extended configuration space.

。 I/O virtualization specifications have no impact on PCI or PCI-X specifications.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

。 A hierarchy can be composed of a mix of PCI Express and PCI Express-to-PCI/PCI-X Bridges.

▪ A PCI Express-to-PCI/PCI-X Bridge and PCI/PCI-X Devices can be serially shared by multiple SIs.

• ATS defines optional functionality applicable to any Function. ATS can be supported in SR-IOV components.

• To implement an SR-IOV Device, SR-IOV requires the Device to be fully compliant with the [PCIe]. 。 A hierarchy can be composed of a mix of SR-IOV and non-SR-IOV components. For example, a

hierarchy may contain any mix of SR-IOV and non-SR-IOV Endpoint Devices.

**9.2 SR-IOV Initialization and Resource Allocation**

**9.2.1 SR-IOV Resource Discovery**

The following sections describe how software determines that a Device is SR-IOV capable and subsequently identifies VF resources through Virtual Function Configuration Space.

[**9.2.1.1**](9.2.1.1) **Configuring SR-IOV Capabilities**

This section describes the fields that must be configured before enabling a PF’s IOV Capabilities. The VFs are enabled by Setting the PF’s[VF Enable](#bookmark25) bit (see [Section 9.3.3.3.1)](#bookmark26) in the SR-IOV extended capability.

The [NumVFs](#bookmark27)field (see [Section 9.3.3.7)](#bookmark28) defines the number of VFs that are enabled when [VF Enable](#bookmark29)is Set in the associated PF.

**9.2.1.1.1 Configuring the**[**VF BAR**](#bookmark31)**Mechanisms**

This section describeshow the [VF BARs](#bookmark32)are configured to map memory space. VFs do not support I/O Space and thus [VF](#bookmark33) [BARs](#bookmark34)shall not indicate I/OSpace.

The [System Page Size](#bookmark35)field (see [Section 9.3.3.13](#bookmark36)) defines the page size the system will use to map the VF’s PCIe memory addresses when the PF’s IOV Capabilities are enabled. The[System Page Size](#bookmark37)field is used by the PF to align the Memory space aperture defined by each [VF BAR](#bookmark38)to a system page boundary. The value chosen for the [System Page Size](#bookmark39)must be one of the Supported Page Sizes (see[Section 9.3.3.12)](#bookmark40) in the SR-IOV extended capability.

The behavior of[VF BARs](#bookmark41)is the same as the normal PCI Memory Space BARs (seeSection 7.5.1.2.1), except that a [VF BAR](#bookmark42) describes the aperture for each VF, whereas a PCI BAR describes the aperture for a single Function. The attributes for

some of the bits in the [VF BARs](#bookmark43)are affected by the [VF Resizable BAR Extended Capability](#bookmark44) (see [Section 9.3.7.5)](#bookmark45) if it is implemented.

• The behavior described in Section 7.5.1.2.1for determining the memory aperture of a Function’s BAR applies

to each [VF BAR](#bookmark46). That is, the size of the memory aperture required for each [VF BAR](#bookmark47)can be determined by writing all “1”s and then reading the [VF BAR.](#bookmark48) The results read back must be interpreted as described in Section

7.5.1.2.1 .

• The behavior for assigning the starting memory space address of each BAR associated with the first VF is also as described in Section 7.5.1.2.1. That is, the address written into each [VF BAR](#bookmark49)is used by the Device for the starting address of the first VF.

• The difference between[VF BARs](#bookmark50)and BARs described in Section 7.5.1.2.1is that for each [VF BAR,](#bookmark51) the memory space associated with the second and higher VFs is derived from the starting address of the first VF and the

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

memory space aperture. For any given VFv, the starting address of its Memory space aperture for any implemented BARb) is calculated according to the following formula:

BARb VFv starting address =[VF BAR](#bookmark52)b + (v - 1) x ([VF BAR](#bookmark53)baperture size)

where [VF BAR](#bookmark54)b aperture size is the size of[VF BAR](#bookmark55)b as determined by the usual BAR probing algorithm as described in [Section 9.3.3.14 .](#bookmark56)

VF memory space is not enabled until both[VF Enable](#bookmark57)and [VF MSE](#bookmark58)have been Set (see[Section 9.3.3.3.1](#bookmark59)and

[Section 9.3.3.3.4](#bookmark60)). Note that changing [System Page Size](#bookmark61) (see [Section 9.3.3.13)](#bookmark62) may affect the[VF BAR](#bookmark63)aperture size.

[Figure 9-10](#bookmark64)shows an example of the PF and VF Memory space apertures.

PCIe Memory Space

PF Config Space

|  |  |
| --- | --- |
|  |  |
| BAR0 (RW) | |
|  |  |
| VF BAR0 (RW) | |
|  |  |

(

|  |
| --- |
|  |
| PF BAR0  Memory  Space Aperture |
|  |
| VF 1 BAR0 Memory Space Aperture |
| VF 2 BAR0 Memory Space Aperture |
|  |
| VF N BAR0 Memory Space Aperture |
|  |

A-0631

Figure 9-10 BAR Space Example for Single BAR Device

[**9.2.1.2**](9.2.1.2) **VF Discovery**

The [First VF Offset](#bookmark65)and [VF Stride](#bookmark66)fields in the SR-IOV extended capability are 16-bit Routing ID offsets. These offsets are used to compute the Routing IDs for the VFs with the following restrictions:

• The value in [NumVFs](#bookmark67)in a PF ([Section 9.3.3.7)](#bookmark68) may affect the values in [First VF Offset](#bookmark69) [(Section 9.3.3.9 )](#bookmark70) and [VF](#bookmark71) [Stride](#bookmark72) [(Section 9.3.3.10 )](#bookmark73) of that PF.

• The value in[ARI Capable Hierarchy](#bookmark74) [(Section 9.3.3.3.5 )](#bookmark75) in the lowest-numbered PF of the Device (for example PF0) may affect the values in [First VF Offset](#bookmark76)and [VF Stride](#bookmark77)in all PFs of the Device.

• [NumVFs](#bookmark78)of a PF may only be changed when[VF Enable](#bookmark79) [(Section 9.3.3.3.1 )](#bookmark80) of that PF is Clear.

• [ARI Capable Hierarchy](#bookmark81) [(Section 9.3.3.3.5 ) may only be changed](#bookmark82) when [VF Enable](#bookmark83)is Clear in all PFs of a Device.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**IMPLEMENTATION NOTE**

NumVFs and ARI Capable Hierarchy

After configuring [NumVFs](#bookmark84)and [ARI Capable Hierarchy](#bookmark85)where applicable, software may read [First VF Offset](#bookmark86)and [VF](#bookmark87) [Stride](#bookmark88)to determine how many Bus Numbers would be consumed by the PF’s VFs. The additional Bus Numbers, if any, are not actually used until [VF Enable](#bookmark89)is Set.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAOUlEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAAAwG/gAknGBCzHRqDrAAAAAElFTkSuQmCC)

[Table 9-1](#bookmark90)describes the algorithm used to determine the Routing ID associated with each VF.

Table 9-1 VF Routing ID Algorithm

|  |  |
| --- | --- |
| VF Number | VF Routing ID |
| VF 1 | (PF Routing ID +[First VF Offset)](#bookmark91) Modulo 216 |
| VF 2 | (PF Routing ID +[First VF Offset](#bookmark92)+[VF Stride)](#bookmark93) Modulo 216 |
| VF 3 | (PF Routing ID +[First VF Offset](#bookmark94)+ 2 \*[VF Stride)](#bookmark95) Modulo 216 |
| … | … |
| VF N | (PF Routing ID +[First VF Offset](#bookmark96)+ (N-1) \*[VF Stride)](#bookmark97) Modulo 216 |
| … | … |
| VF [NumVFs](#bookmark98)(last one) | (PF Routing ID +[First VF Offset](#bookmark99)+ ([NumVFs](#bookmark100)-1) \*[VF Stride)](#bookmark101) Modulo 216 |

All arithmetic used in this Routing ID computation is 16-bit unsigned dropping all carries.

All VFs and PFs must have distinct Routing IDs. The Routing ID of any PF or VF must not overlap with the Routing ID of any other PF or VF given any valid setting of[NumVFs](#bookmark102)across all PFs of a device.

[VF Stride](#bookmark103)and [First VF Offset](#bookmark104)are constants. Except as stated earlier in this section, their values may not be affected by settings in this or other Functions of the Device.

VFs may reside on different Bus Number(s) than the associated PF. This can occur if, for example,[First VF Offset](#bookmark105) has the value 0100h. A VF shall not be located on a Bus Number that is numerically smaller than its associated PF. A VF that is located on the same Bus Number as its associated PFshall not be located on a Device Number that is numerically

smaller than the PF158 .

VFs of an SR-IOV RCiEP Device are associated with the same Root Complex Event Collector (if any) as their PF. Such VFs are not reported in the Root Complex Event Collector Endpoint Association Extended Capabilityof the Root Complex Event Collector.

As perSection 2.2.6.2 , SR-IOV capable Devices that are associated with an Upstream Port capture the Bus Number from any Type 0 Configuration Write Request. SR-IOV capable Devices do not capture the Bus Number from any Type 1

Configuration Write Requests. SR-IOV capable RCiEPs use an implementation specific mechanism to assign their Bus Numbers.

Switch processing of Bus Numbers is unchanged from base PCIe. In base PCIe, the switch sends all Configuration

Requests in the range Secondary Bus Number (inclusive) to Subordinate Bus Number (inclusive) to the Device. Type 1 Configuration Requests addressing the Secondary Bus Number are converted into Type 0 Configuration Requests while

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAKElEQVRYhe3OAQkAQAgEMO1f6pr9pxBBtgSrAgAAGNZJ3nYCAAC47QOcDgNBkGYMzwAAAABJRU5ErkJggg==)

158. SR-IOV Devices immediately below a Downstream Port always have a Device Number of 0 and thus always satisfy this condition.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Type 1 Configuration Requests addressing Bus Numbers between Secondary Bus Number (exclusive) and Subordinate Bus Number (inclusive) are forwarded on to the Device as Type 1 Configuration Requests.

Note: Bus Numbers are a constrained resource. Devices are strongly encouraged to avoid leaving “holes” in their Bus Number usage to avoid wasting Bus Numbers.

All PFs must be located on the Device’s captured Bus Number.

**IMPLEMENTATION NOTE**

VFs Spanning Multiple Bus Numbers

As an example, consider an SR-IOV Device that supports a single PF. Initially, only PF 0 is visible. Software Sets[ARI](#bookmark106) [Capable Hierarchy](#bookmark107). From the[SR-IOV Extended Capability](#bookmark108)it determines:[InitialVFs](#bookmark109)is 600,[First VF Offset](#bookmark110)is 1 and [VF](#bookmark111) [Stride](#bookmark112) is 1.

• If software sets[NumVFs](#bookmark113)in the range [0 … 255], then the Device uses a single Bus Number.

• If software sets[NumVFs](#bookmark114)in the range [256 … 511], then the Device uses two Bus Numbers.

• If software sets[NumVFs](#bookmark115)in the range [512 … 600], then the Device uses three Bus Numbers.

PF 0 and VF 0,1 through VF 0,255 are always on the first (captured) Bus Number. VF 0,256 through VF 0,511 are always on the second Bus Number (captured Bus Number plus 1). VF 0,512 through VF 0,600 are always on the third Bus Number (captured Bus Number plus 2).

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAEmCAYAAABf3tF5AAAAQ0lEQVRoge3KoQEAIAzAsI0nsZyG5cphsJO41DYZ61R07ZmjnS8AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAADgH7hDHQVMOTaRbwAAAABJRU5ErkJggg==)

Software should configure Switch Secondary and Subordinate Bus Number fields to route enough Bus Numbers to the Device. If sufficient Bus Numbers are not available, software should reduce a Device’s Bus Number requirements by not enabling SR-IOV and/or reducing[NumVFs](#bookmark116)for some or all PFs of the Device prior to enabling SR-IOV.

After [VF Enable](#bookmark117)is Set in some PF n, the Device must Enable VF n,1 through VF n,m (inclusive) where m is the smaller of [InitialVFs](#bookmark118)and [NumVFs](#bookmark119). A Device receiving a Type 0 Configuration Request targeting an Enabled VF located on the

captured Bus Number must process the Request normally. A Device receiving a Type 1 Configuration Request targeting an Enabled VF not located on the captured Bus Number must process the Request normally. A Device receiving a Type 1 Configuration Request targeting the Device's captured Bus Number must follow the rules for handling Unsupported

Requests. Additionally, if[VF MSE](#bookmark120)is Set, each Enabled VF must respond to PCIe Memory transactions addressing the memory space associated with that VF.

Functions that are not enabled (i.e., Functions for VFs above m) do not exist in the PCI Express fabric. As perSection 2.3.1 , addressing Functions that do not exist will result in Unsupported Request (UR) being returned. This includes Functions on additional Bus Numbers.

**IMPLEMENTATION NOTE**

Multi-Function Devices with PFs and Switch Functions

SR-IOV devices may consume multiple bus numbers. Additional bus numbers beyond the first one are

consecutive and immediately follow the first bus number assigned to the device. If an SR-IOV device also contains PCI-PCI Bridges (with Type 1 Configuration Space Headers), the SR-IOV usage must be accounted for when

programming the Secondary Bus Number for those Bridges. Software should determine the last Bus Number used by VFs first and then configure any co-located Bridges to use Bus Numbers above that value.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAC5CAYAAAAVmX5bAAAAO0lEQVRYhe3KoREAIAwAsZYlsYyGZUow2B4GmbefjLF2VM2erZw3AAAAAAAAAAAAAAAAAAAAAAB+gWcHv3AEcDup6oUAAAAASUVORK5CYII=)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**9.2.1.3**](9.2.1.3) **Function Dependency Lists**

PCI Devices may have vendor-specific dependencies between Functions. For example, Functions 0 and 1 might provide different mechanisms for controlling the same underlying hardware. In such situations, the Device programming model might require that these dependent Functions be assigned to SIs as a set.

Function Dependency Lists are used to describe these dependencies (or to indicate that there are no Function dependencies). Software should assign PFs and VFs to SIs such that the dependencies are satisfied.

See [Section 9.3.3.8f](#bookmark122)or details.

[**9.2.1.4**](9.2.1.4) **Interrupt Resource Allocation**

PFs and VFs support either MSI, MSI-X interrupts, or both if interrupt resources are allocated. VFs shall not implement INTx. Interrupts are described in [Section 9.5 .](#bookmark123)

**9.2.2 SR-IOV Reset Mechanisms**

This section describeshow PCI-base-defined-reset mechanisms affect Devices that support SR-IOV. It also describes the mechanisms used to reset a single VF and a single PF with its associated VFs.

[**9.2.2.1**](9.2.2.1) **SR-IOV Conventional Reset**

A Conventional Reset to a Device that supports SR-IOV shall cause all Functions (including both PFs and VFs) to be reset to their original, power-on state as per the rules in Section 6.6.1 .[Section 9.3 describes](#bookmark124) the behavior for the fields

defined.

Note: Conventional Reset clears[VF Enable](#bookmark125)in the PF. Thus, VFs no longer exist after a Conventional Reset.

[**9.2.2.2**](9.2.2.2) **FLR That Targets a VF**

VFs must support Function Level Reset (FLR).

Note: Software may use FLR to reset a VF. FLR to a VF affects a VF’s state but does not affect its existence in PCI

Configuration Space or PCI Bus address space. The VFs BARn values (see[Section 9.3.3.14](#bookmark126)) and [VF MSE](#bookmark127) (see [Section](#bookmark128) [9.3.3.3.4](#bookmark129)) in the PF’s SR-IOV extended capability, and the VF Resizable BAR capability values (see[Section 9.3.7.5)](#bookmark130) are unaffected by FLRs issued to the VF.

[**9.2.2.3**](9.2.2.3) **FLR That Targets a PF**

PFs must support FLR.

FLR to a PF resets the PF state as well as the SR-IOV extended capability including [VF Enable](#bookmark131)which means that VFs no longer exist.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**9.2.3 IOV Re-initialization and Reallocation**

If[VF Enable](#bookmark132)is Cleared after having been Set, all of the VFs associated with the PF no longer exist and must no longer

issue PCIe transactions or respond to Configuration Space or Memory Space accesses. VFs must not retain any state after [VF Enable](#bookmark133) has been Cleared (including sticky bits).

**9.2.4 VF Migration**

VF Migration is an optional feature in [MR-IOV]. Devices that do not support the **MR-IOV Extended Capability** do not support VF Migration functionality.

In Multi-Root systems, VFs can be migrated between Virtual Hierarchies.

For VF Migration to occur, both[VF Migration Capable](#bookmark135)and [VF Migration Enable](#bookmark136) must be Set.[VF Migration Capable](#bookmark137)

indicates to SR-PCIM that VF Migration Hardware is present and that Multi-Root PCI Manager (MR-PCIM) has enabled its use. Hardware support for VF Migration is optional. SR-PCIM support for VF Migration is also optional.

[VF Migration Enable](#bookmark138)indicates to Device hardware and to MR-PCIM that SR-PCIM has also enabled VF Migration. Supporting VF Migration places the following requirements on SR-PCIM:

• The need to determine if a VF is Active, Dormant, or Inactive. A VF that is Active is available for use by the Single Root (SR). A VF that is Dormant can be configured by SR but will not issue transactions. A VF that is Inactive is not available for use by SR.

• The need to participate in **Migrate In** operations. A [Migrate In](#bookmark139)operation is used to offer a VF to SR. A [Migrate In](#bookmark139)

operation is initiated by MR-PCIM. SR-PCIM can accept a [Migrate In](#bookmark139)operation and move a VF to the [Active.Available](#bookmark141)state.

• The need to participate in **Migrate Out** operations. A [Migrate Out](#bookmark140)operation is used to request the graceful removal of an Active VF from use by SR. SR-PCIM can accept the[Migrate Out](#bookmark140)and move the VF to the

[Inactive.Unavailable](#bookmark142)state.

• The need to handle **Migrate In Retractions**. This is when an offer of a VF to SR is retracted by MR-PCIM and the VF moves back to the [Inactive.Unavailable](#bookmark143)state.

[**9.2.4.1**](9.2.4.1) **Initial VF State**

This section describes the initial values in the[VF Migration State Array](#bookmark145) (see [Section 9.3.3.15.1)](#bookmark146).

If[InitialVFs](#bookmark147) [(Section 9.3.3.5 )](#bookmark148) is non-zero, VF1 through VF InitialVFs are in the [Active.Available](#bookmark149)state. If[TotalVFs](#bookmark150) [(Section](#bookmark151) [9.3.3.6](#bookmark152)) is greater than[InitialVFs,](#bookmark153) VF InitialVFs+1 through VF TotalVFs are in the [Inactive.Unavailable](#bookmark154)state. If[VF Migration](#bookmark155) [Enable](#bookmark156) [(Section 9.3.3.3.2 )](#bookmark157) is Clear, VFs above[InitialVFs](#bookmark158)are not used.

If[InitialVFs](#bookmark159) is 0, no VFs are in the [Active.Available](#bookmark160)state. If[TotalVFs](#bookmark161)equals [InitialVFs](#bookmark162)all VFs are in the [Active.Available](#bookmark163)

state. If[TotalVFs](#bookmark164) is 0, no VFs are associated with this PF and there is no [VF Migration State Array.](#bookmark165) [Figure 9-11](#bookmark166)describes this initial VF State.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

VF 1

Active.Available VFs

VF 2

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAGUAAAAVCAYAAABfXiAOAAABHElEQVRoge3YPUoEMQDF8f8sKQUr60RTWIoH8AIWHkHs7L2ApbcRLCzcdRXL1UZRBMFgggfQraxEixlR1k9YxgTm/WCYwCTw4BEmMxVTuLi8eplmvXytlzuAfKZSCqRSCqRSCqRSCmS8dUvAAnAUUhznDiRggBtgF7DeulOg31yjkOJzznBdVQF469aAvYlnY+CYuqBBSPF2crG+U9pRvQ28dfvA6g9z74ABdUnDkOKjSmnHxxf96Je588A6sAlseOtmWkvVcQbAWzcHbH0z55p6dxwAJyHFp3/K1lmmue8As834ATikLqEfUrzPEazLjLduGVgEtql3xJlOXXkZ4DykuJI7iLzrhRR1giqMfrMUSKUUSKUUSKWI/MUrIEVHHAd3T7sAAAAASUVORK5CYII=)![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAA8AAAAECAYAAABREWWJAAAAHUlEQVQImWNkYGD4z4AKGKE0NnEUMSYGCsDAaQYAj/4DB2nCKKkAAAAASUVORK5CYII=)

InitialVFs

VF n

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAFYAAAAsCAYAAAD/709QAAAAgElEQVRoge3asQ2AMBAEQUyVIPqvAzfBCtmaqeC10QU/rvt5Dz53/n3AroSNCBsRNiJsRNiIsBFhI8JGhI0IGxE2ImxE2IiwEWEjwkaEjQgLwHKGv4KGVRARNiJsRNiIsBFhI8JGhI0IGxE2ImxE2IiwEWEjwkaEjQgbETYibGQCkj0EZUQqhQsAAAAASUVORK5CYII=)

VF n+1

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAA8AAAAECAYAAABREWWJAAAAIklEQVQImWP8////fwYkwMjIyMjAwMCATRxdjImBAjBwmgFDxAwEpe/MeAAAAABJRU5ErkJggg==)Inactive.Unavailable VFs

TotalVFs

VF m

A-0632

Figure 9-11 Initial [VF Migration State Array](#bookmark167)

[**9.2.4.2**](9.2.4.2)[**VF Migration State**](#bookmark168) **Transitions**

VF migration follows the state diagram shown in [Figure 9-12 .](#bookmark169) The state values shown are contained in the VF State array entry associated with the VF. State transitions indicated by solid lines are initiated by MR-PCIM. State transitions

indicated by dotted and dashed lines are initiated by SR-PCIM.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- |
| VF Inactive (not usable by SR)  SR: Set  Inactive .Unavailable  00b  VF Enable and VF > InitialVFs  MR: Request Migrate In  SR: Set VF  Migration Status | | | | | SR: Complete Migrate Out | VF Active (fully usable by SR)  10b  VF Enable and VF ≤ InitialVFs  11b  Active  .MigrateOut  MR: Retract Migrate Out  SR: Set VF  Migration Status  MR: Request Migrate Out  SR: Set VF  Migration Status  SR: Set  Active  .Available |
| SR: Clear VF Enable  SR: Clear VF  SR: Clear VF Enable  VF Disabled  Enable  SR: Clear VF Enable |
|  |  |  |  |  |
| MR: Retract Migrate In  SR: Set VF  Migration Status | | | | |
| Dormant .MigrateIn  01b | | | | |
| SR: Activate  SR: Deactivate |
| VF Dormant (responds to SR) | | | | |

A-0633

Figure 9-12 [VF Migration State](#bookmark170)Diagram

SR-PCIM initiates a state transition by writing a new value to the[VF Migration State Array.](#bookmark171) Devices ignore write

transactions and no state transitions occur when SR-PCIM attempts to initiate any state transition other than in [Table 9-2](#bookmark172)

.

Table 9-2 SR-IOV VF Migration State Table

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
| Current [VF Enable](#bookmark173) | Current VF State | Written [VF Enable](#bookmark174) | Written VF State | Meaning |
| 1 | Dormant.MigrateIn | 1 | [Active.Available](#bookmark175) | SR Activate VF. |
| 1 | [Active.Available](#bookmark176) | 1 | Dormant.MigrateIn | SR Deactivate VF. |
| 1 | Active.MigrateOut | 1 | [Inactive.Unavailable](#bookmark177) | SR Complete[Migrate Out.](#bookmark140) |
| 1 | Any | 0 | Any | SR Disables all VFs. |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
| Current [VF Enable](#bookmark178) | Current VF State | Written [VF Enable](#bookmark179) | Written VF State | Meaning |
|  |
| 0 | Any | 1 | Any | SR Enables all VFs.  VFs transition to either [Active.Available](#bookmark180)or  [Inactive.Unavailable](#bookmark181) based on the VF number and [InitialVFs.](#bookmark182) |

**IMPLEMENTATION NOTE**

Software State Migration Change Detection

SR-PCIM will typically need to verify that a state change took effect by re-reading [VF Migration State](#bookmark183)after writing it.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACHCAYAAAAr6RiGAAAANklEQVRYhe3KoREAIAwAsZYlsYyGZUow2B4DkLefjLF2VM2erZw3AAAAAAAAAAAAAACAH8CzA1LpBAzVw1v7AAAAAElFTkSuQmCC)

VFs in states[Inactive.Unavailable](#bookmark184)are not usable by software in any way. Requests targeting an Inactive VF receive

Unsupported Request (UR). Within 100 ms of transitioning to the Inactive or Dormant states, the Device must ensure that no new transactions will be issued using the indicated Routing ID.

MR-PCIM initiates state transitions by using a different data structure as specified in [MR-IOV]. The effects of such

transitions are visible in the[VF Migration State Array](#bookmark185)and in the [VF Migration Status](#bookmark186) bit. All state transitions initiated by MR-PCIM cause the [VF Migration Status](#bookmark187) bit to be Set.

This migration state machine exists for every VF that supports VF Migration. Migration state machines are not affected by Function Dependency Lists (see[Section 9.2.1.3](#bookmark121)and [Section 9.3.3.8)](#bookmark188).

[VF Migration State](#bookmark189)does not affect Function state. If VF state needs to be reset as part of a [Migrate Out](#bookmark140)and/or [Migrate In](#bookmark139) operation, SR-PCIM must issue an FLR to accomplish this. VF behavior when VF Migration occurs without an FLR is

undefined.

**IMPLEMENTATION NOTE**

FLR and VF Migration

VF Migration from system A to system B will typically involve one FLR in system A before completing the

MigrateOut operation and a second FLR in system B after accepting the MigrateIn operation but before using the VF. System A uses the first FLR to ensure that none of its data leaks out. System B uses the second FLR to ensure that it starts with a clean slate.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACnCAYAAAAsRR2wAAAAN0lEQVRYhe3KoQEAIAzAsI0nsZyG5cphsOOC1DYZ61R07ZmjnS8AAAAAAAAAAAAAAAAAAADg0wWWCgRONTYlZAAAAABJRU5ErkJggg==)

**9.3 Configuration**

**9.3.1 SR-IOV Configuration Overview**

This section provides SR-IOV-added requirements for implementing PFs and VFs.

PFs are discoverable in configuration space, as with all Functions. PFs contain the SR-IOV Extended Capability described in [Section 9.3.3 .](#bookmark190) PFs are used to discover, configure, and manage the VFs associated with the PF and for other things

described in this specification.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**9.3.2 Configuration Space**

PFs that support SR-IOV shall implement the[SR-IOV Extended Capability](#bookmark191)as defined in the following sections. VFs shall implement configuration space fields and capabilities as defined in the following sections.

**9.3.3 SR-IOV Extended Capability**

The [SR-IOV Extended Capability](#bookmark191)defined here is a PCIe extended capability that must be implemented in each PF that supports SR-IOV. This Capability is used to describe and control a PF’s SR-IOV Capabilities.

ForMulti-Function Devices, each PF that supports SR-IOV shall provide the Capability structure defined in this section. This Capability structure may be present in any Function with a Type 0 Configuration Space Header. This Capability must

not be present in Functions with a Type 1 Configuration Space Header. [Figure 9-13](#bookmark192)shows the SR-IOV Extended Capability structure.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAJEAAAAjCAYAAACU2uhdAAACTElEQVR4nO3csUvjcBjG8ScxadoECQS73p9wg9Nth0txPsqtXW7wD/A2h3CT4ORyLoKTi1Np4eggpQGLYKFLBUFROKRmMEFNY6+1pneT+1t/B6XwfObv8EKe0iwtfN//5Pt+GUTvpAP4CKA070NocenzPoAWH0dEyjgiUsYRkTKOiJRxRKSMIyJlBgAkSWIB8OZ8Cy2eCYCBtrq6+m19ff2nbdvmvC+ixdLv98O9vb3PGoDvcRz/8DwvP++jaLG0Wq3h2traF74T0btpmvYX4Is1/QccESnjiEgZR0TKOCJSxhGRMo6IlHFEpIwjImUcESnjiEgZR0TKOCJSxhGRMo6IlM08osPDQxwdHYn73d1dNJtNcb+1tYVeryfuNzY2EIahuC+XZ/vbgUqlIm6vrq6wubkp7k9PT7G9vS3u6/U69vf3xf3BwQGq1aq439nZwcnJibh/M/OILi4ucHl5Ke673S5ub2/FfbvdRhRF4v74+BjPz8/ivlariVsAaDQa4vbx8RFBEIj7MAxxdnYm7m9ubmb6gJ2fn+P6+lrcdzod3N3difs3/DojZRwRKeOISJkOANPplGOimWVZtgQAGoAPpVLpl2maop8MpWlqapoGx3Emkj5JkpxpmtNCofAq6R8eHizHcV5zuVwm6eM4zruu+2IYxlTS39/fF4rF4h9JCwBRFBVWVlZE/WQy0QeDQc7zvJGkH41GS+Px2HBddyzph8OhkWWZvry8/CLp0zQ1dV2HbduiZ/X09GRZlpXl83nRs0rT9HcQBF//ASrFuO8QC0qlAAAAAElFTkSuQmCC)

31 20

Next Capability Offset

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- |
| 31  30  29  28  27  26  25  24 | 23  22  21  20  19  18  17  16 | | 15  14  13  12  11  10  9  8 | | 7  6  5  4  3  2  1  0 |
| PCI Express Extended Capability Header | | | | | |
| SR-IOV Capabilities Register (RO) | | | | | |
| SR-IOV Status Register | | | SR-IOV Control Register (RW) | | |
| TotalVFs (RO) | | | InitialVFs (RO) | | |
| RsvdP | Fcn Dep Link (RO) | | NumVFs (RW) | | |
| VF Stride (RO) | | | FirstVF Offset (RO) | | |
| VF Device ID (RO) | | | RsvdP | | |
|  | | Supported Page Sizes (RO) | |  | |
| System Page Size (RW) | | | | | |
| VF BAR0 (RW) | | | | | |
| VF BAR1 (RW) | | | | | |
| VF BAR2 (RW) | | | | | |
| VF BAR3 (RW) | | | | | |
| VF BAR4 (RW) | | | | | |
| VF BAR5 (RW) | | | | | |
| VF Migration State Array Offset (RO) | | | | | |

Figure 9-13 [SR-IOV Extended Capability](#bookmark191)

Byte Offset +000h

+004h +008h +00Ch +010h +014h +018h +01Ch +020h +024h +028h +02Ch +030h +034h +038h +03Ch

[**9.3.3.1**](9.3.3.1) **SR-IOV Extended Capability Header (Offset 00h)**

[Table 9-3](#bookmark194)defines the [SR-IOV Extended Capability header](#bookmark193). The Capability ID for the [SR-IOV Extended Capability](#bookmark191)is 0010h.

|  |
| --- |
| 19 16 |
|  |

|  |
| --- |
| 15 0 |
|  |

PCI Express Extended Capability ID Capability Version

Figure 9-14 SR-IOV Extended Capability Header

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Table 9-3 SR-IOV Extended Capability Header

|  |  |  |
| --- | --- | --- |
| Bit  Location | Register Description | Attributes |
| 15:0 | **PCI Express Extended Capability ID** - This field is a PCI-SIG defined ID number that indicates the nature and format of the Extended Capability.  The Extended Capability ID for the [SR-IOV Extended Capability](#bookmark191)is 0010h. | RO |
| 19:16 | **Capability Version** - This field is a PCI-SIG defined version number that indicates the version of the Capability structure present.  Must be 1h for this version of the specification. | RO |
| 31:20 | **Next Capability Offset** - This field contains the offset to the next PCI Express Capability structure or 000h if no other items exist in the linked list of Capabilities.  For Extended Capabilities implemented in Configuration Space, this offset is relative to the beginning of PCI-compatible Configuration Space and thus must always be either 000h (for terminating list of  Capabilities) or greater than 0FFh. | RO |

[**9.3.3.2**](9.3.3.2) **SR-IOV Capabilities Register (04h)**

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAA0AAAAZCAYAAADqrKTxAAAAZ0lEQVQ4jWPU1NTMNzExaWRkZPzLQBgwXr9+fRWDnZ3d2f8kgMTExI9MRNqADP4zkaiBgYGBgWFU06imUU0jRhPLz58/n585c+Y7MzPzH0KK//37x/Tp06cXjAwMDAIMDAz2JFh0DgDM6kffcCD9nAAAAABJRU5ErkJggg==)[Table 9-4](#bookmark196)defines the layout of the SR-IOV Capabilities field.

0

2

3

20

1

|  |
| --- |
| 31 21 |
|  |

|  |
| --- |
| RsvdP |

VF Migration Capable

ARI Capable Hierarchy Preserved

VF 10-Bit Tag Requester Supported

VF Migration Interrupt Message Number

Figure 9-15 SR-IOV Capabilities Register Table 9-4 SR-IOV Capabilities Register

|  |  |  |
| --- | --- | --- |
| Bit Location | Register Description | Attributes |
| 0 | **VF Migration Capable** - If Set, the PF is Migration Capable and operating under a Migration Capable MR-PCIM. | RO |
| 1 | **ARI Capable Hierarchy Preserved**  PCI Express Endpoint:  If Set, the [ARI Capable Hierarchy](#bookmark199) bit is preserved across certain power state transitions. RCiEP:  Not applicable - it is strongly recommended that this bit be hardwired to 0b. | RO |
| 2 | **VF 10-Bit Tag Requester Supported**- If Set, all VFs must support 10-Bit Tag Requester capability. If Clear, VFs must not support 10-Bit Tag Requester capability.  If the10-Bit Tag Requester Supported bit in the PF’sDevice Capabilities 2 Register is Clear, this bit must be Clear. | HwInit |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAIklEQVRYhe3BAQ0AAAgDoNvUeqbUHG5ApWcDAAAAAADw2QHZqwIKU+HaKwAAAABJRU5ErkJggg==)

Page 1123

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |
| --- | --- | --- |
| Bit Location | Register Description | Attributes |
| 31:21 | **VF Migration Interrupt Message Number** - Indicates the MSI/MSI-X vector used for migration interrupts. The value in this field is undefined if[VF Migration Capable](#bookmark137) is Clear. | RO |

**9.3.3.2.1**[**VF Migration Capable**](#bookmark137)

[VF Migration Capable](#bookmark137)is Set to indicate that the PF supports VF Migration. If Clear, the PF does not support VF Migration.

VF Migration support is an optional feature of [MR-IOV]. If the PF does not implement hardware for VF Migration, this bit shall be implemented as ROvalue 0b. If the PF implements hardware for VF Migration, this bit is controlled by

mechanisms defined in [MR-IOV] and indicates the presence of support for VF Migration.

Devices that implement SR-IOV only shall implement this field as ROvalue zero. PFs that support VF Migration must implement MSI or MSI-X interrupts (or both).

**9.3.3.2.2**[**ARI Capable Hierarchy Preserved**](#bookmark198)

[ARI Capable Hierarchy Preserved](#bookmark198)is Set to indicate that the PF preserves the[ARI Capable Hierarchy](#bookmark203) bit across certain power state transitions (see [Section 9.3.3.3.5](#bookmark204)). Components must either Set this bit or Set the No\_Soft\_Reset bit (see [Section 9.6.2](#bookmark205)). It is recommended that components set this bit even if they also set No\_Soft\_Reset.

[ARI Capable Hierarchy Preserved](#bookmark198)is only present in the lowest-numbered PF of a Device (for example PF0).[ARI Capable](#bookmark198) [Hierarchy Preserved](#bookmark198)is Read Only Zero in other PFs of a Device.

[ARI Capable Hierarchy Preserved](#bookmark198)does not apply to RCiEPs, and its value is undefined (see [Section 9.3.3.3)](#bookmark206).

**9.3.3.2.3**[**VF 10-Bit Tag Requester Supported**](#bookmark200)

If a PF supports 10-Bit Tag Requester capability, its associated VFs are permitted to support 10-Bit Tag Requester

capability as well, but this is optional. Especially for usage models where the bulk of the traffic is spread across several VFs concurrently, it may not be necessary for individual VFs to use 10-Bit Tags so they can support >256 outstanding Non-Posted Requests each.

For a given PF, it is required that either all or none of its associated VFs support 10-Bit Tag Requester capability. This avoids unnecessary implementation and management complexity. See [Section 9.3.5.9 .](#bookmark208)

VFs that support 10-Bit Tag Requester capability have additional requirements beyond other Function types in order to simplify error handling and reduce the possibility of 10-Bit Tag related errors with one VF impacting other traffic.

• If the [VF 10-Bit Tag Requester Enable](#bookmark209) bit in the [SR-IOV Control register](#bookmark210)is Set, then each VF must use 10-Bit Tags for all Non-Posted Requests that it generates.

• For each outstanding 10-Bit Tag Request, if the VF receives a Completion that matches the outstanding Request other than Tag[9:8] being 00b, the VF must prevent that Request from (eventually) generating a Completion

Timeout error, and instead handle the error via a device-specific mechanism that avoids data corruption.

It is strongly recommended that software not configure Unexpected Completion errors to be handled as Uncorrectable Errors. This avoids them triggering System Errors or hardware error containment mechanisms like Downstream Port Containment (DPC).

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAOUlEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAAAwG/gAknGBCzHRqDrAAAAAElFTkSuQmCC)

**IMPLEMENTATION NOTE**

NoVF 10-Bit Tag Completer Supported Bit

There is no VF 10-Bit Tag Completer Supported bit. If a PF supports 10-Bit Tag Completer capability, then all of its associated VFs are required support 10-Bit Tag Completer capability as stated in [Section 9.3.5.9 .](#bookmark211) This helps avoid the complexity of PCIe hierarchies where some Completers support 10-Bit Tag capability and some do not.

**9.3.3.2.4**[**VF Migration Interrupt Message Number**](#bookmark201)

[VF Migration Interrupt Message Number](#bookmark201)must contain the MSI or MSI-X vector number used for the interrupt message generated in association with certain VF migration events.

For MSI,[VF Migration Interrupt Message Number](#bookmark201) must indicate the MSI message number [0 .. 31] used to reference

certain SR events described in this section. The Function is required to update this field so that it is correct if the number of MSI Messages assigned to the Function changes when software writes to the Multiple Message Enable field in the

Message Control Register for MSI.

For MSI-X,[VF Migration Interrupt Message Number](#bookmark201) must indicate the MSI-X Table entry [0 .. 2047] used to generate the interrupt message.

If both MSI and MSI-X are implemented, they are permitted to use different vectors, though software must enable only one mechanism at a time. If both MSI and MSI-X interrupts are enabled, this field is undefined.

If[VF Migration Capable](#bookmark137) is Clear, this field is undefined.

**IMPLEMENTATION NOTE**

Migration and MSI

If a PF implements MSI and software sets Multiple Message Enable to a valueless than Multiple Message Capable, some sharing of MSI vectors may be occurring. This can create complicated software structures since a single

vector might need to be directed to both SR-PCIM and the PF Device Driver.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAOUlEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAAAwG/gAknGBCzHRqDrAAAAAElFTkSuQmCC)

[**9.3.3.3**](9.3.3.3) **SR-IOV Control Register (Offset 08h)**

[Table 9-5](#bookmark213)defines the layout of the SR-IOV Control fields.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 15 6 | 5 | 4 | 3 | 2 | 1 | | 0 |
| RsvdP |  |  |  |  |  |  | |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAANCAYAAACUwi84AAAATklEQVQYldXPIQqAQBhE4c+Ni1ZPZvUY9v+MZm9g1axFYYur1QdThgfDEJLQeyChxVITqnwSDqxvYoUfvGhA6JCLfhN2Id8TE+Yi49UPJ5W1Dw5jLAI1AAAAAElFTkSuQmCC)VF Enable

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABcAAAACCAYAAACzMUeIAAAAK0lEQVQImWNkaGDgZWBg4GJAgC8MDQxfqSHOxMDAUMrAwHAJCcdCJSkWBwD95Revxv7J/wAAAABJRU5ErkJggg==) VF Migration Enable

VF Migration Interrupt Enable  VF MSE

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAEcAAAACCAYAAAAKJcHAAAAALUlEQVQokWNkaGDgZWBg4GJAgC8MDQxfR8UZvjAxMDCUMjAwXELCsVDJES8OADG9Se8USZcaAAAAAElFTkSuQmCC) ARI Capable Hierarchy

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAFcAAAACCAYAAAAti0AoAAAAMUlEQVQokWNkaGDgZWBg4GJAgC8MDQxfR8UpFv/LxMDAUMrAwHAJCcdCFY+KUya+CQC7j1uB6pYSowAAAABJRU5ErkJggg==) VF 10-Bit Tag Requester Enable

Figure 9-16 SR-IOV Control Register Table 9-5 SR-IOV Control Register

|  |  |  |
| --- | --- | --- |
| Bit Location | Register Description | Attributes |
| 0 | **VF Enable** - Enables/Disables VFs. Default value is 0b. | RW |
| 1 | **VF Migration Enable** - Enables/Disables VF Migration Support. Default value is 0b. See [Section](#bookmark215)  [9.3.3.3.2 .](#bookmark216) | RWor RO  (see  description) |
| 2 | **VF Migration Interrupt Enable** - Enables/Disables VF Migration State Change Interrupt. Default value is 0b. | RW |
| 3 | **VF MSE** - Memory Space Enable for Virtual Functions. Default value is 0b. | RW |
| 4 | **ARI Capable Hierarchy** - PCI Express Endpoint:  This bit must be RWin the lowest-numbered PF of the Device and hardwired to 0b in all other PFs. If the value of this bit is 1b, the Device is permitted to locate VFs in Function Numbers 8 to 255 of the captured Bus Number. Otherwise, the Device must locate VFs as if it were a non-ARI Device.  This bit is not affected by FLR of any PF or VF. Default value is 0b.  RCiEP:  Not applicable -this bit must be hardwired to 0b.  Within the Root Complex, VFs are always permitted to be assigned to any Function Number allowed by [First VF Offset](#bookmark218)and [VF Stride](#bookmark219) rules (see [Section 9.3.3.9](#bookmark220)and [Section 9.3.3.10)](#bookmark221). | RWor RO  (see  description) |
| 5 | **VF 10-Bit Tag Requester Enable** - If Set, all VFs must use 10-Bit Tags for all Non-Posted Requests they generate. If Clear, VFs must not use 10-Bit Tags for Non-Posted Requests they generate. See [Section](#bookmark207) [9.3.3.2.3 .](#bookmark207)  If software changes the value of this bit while any VFs have outstanding Non-Posted Requests, the result is undefined.  If the [VF 10-Bit Tag Requester Supported](#bookmark200) bit in the [SR-IOV Capabilities register](#bookmark195) is Clear, this bit is  permitted to be hardwired to 0b. Default value is 0b. | RWor RO |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAIklEQVRYhe3BAQ0AAAgDoNvUeqbUHG5ApWcDAAAAAADw2QHZqwIKU+HaKwAAAABJRU5ErkJggg==)

Page 1126

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**9.3.3.3.1**[**VF Enable**](#bookmark25)

[VF Enable](#bookmark25) manages the assignment of VFs to the associated PF. If[VF Enable](#bookmark25)is Set, the VFs associated with the PF are

accessible in the PCI Express fabric. When Set, VFs respond to and may issue PCI Express transactions following the rules for PCI Express Endpoint Functions.

If[VF Enable](#bookmark25)is Clear, VFs are disabled and not visible in the PCI Express fabric; requests to these VFs shall receive UR and these VFs shall not issue PCI Express transactions.

To allow components to perform internal initialization, after changing the [VF Enable](#bookmark25) bit from Cleared to Set, the system is not permitted to issue Requests to the VFs which are enabled by that [VF Enable](#bookmark25) bit until one of the following is true:

• At least 100 ms has passed

• An FRS Message has been received from the PF with a Reason Code of[VF Enable](#bookmark25)d

• At least[VF Enable](#bookmark25)Time has passed.[VF Enable](#bookmark25)Time is either (1) the Reset Time value in the Readiness Time Reporting capability associated with the VF, or (2) a value determined by system software / firmware159 .

The Root Complex and/or system software must allow at least 1.0 s after Setting the [VF Enable](#bookmark25) bit, before it may

determine that a VF which fails to return a Successful Completion Status for a valid Configuration Request is broken.

After Setting the [VF Enable](#bookmark25) bit, the VFs enabled by that [VF Enable](#bookmark25) bit are permitted to return a CRS status to

configuration requests up to the 1.0 s limit, if they are not ready to provide a Successful Completion Status for a valid

Configuration Request. After a PF transmits an FRS Message with a Reason Code of[VF Enable](#bookmark25)d, no VF associated with that PF is permitted to return CRS without an intervening VF disable or other valid reset condition. After returning a

Successful Completion to any Request, no VF is permitted to return CRS without an intervening VF disable or other valid reset condition.

Since VFs don't have an MSE bit (MSE in VFs is controlled by the [VF MSE](#bookmark217) bit in the [SR-IOV Extended Capability](#bookmark191)in the PF), it's possible for software to issue a Memory Request before the VF is ready to handle it. Therefore, Memory Requests

must not be issued to a VF until at least one of the following conditions has been met:

• The VF has responded successfully (without returning CRS) to a Configuration Request.

• After issuing an FLR to the VF, at least one of the following is true: 。 At least 1.0 s has passed since the FLR was issued.

。 The VF supports FRS and,after the FLR was issued, an FRS Message has been received from the VF with a Reason Code of FLR Completed.

。 At least FLR Time has passed since the FLR was issued. FLR Time is either (1) the FLR Time value in the Readiness Time Reporting capability associated with the VF or (2) a value determined by system software / firmware153.

• After Setting [VF Enable](#bookmark25) in a PF, at least one of the following is true: 。 At least 1.0 s has passed since [VF Enable](#bookmark25)was Set.

。 The PF supports FRS and, after [VF Enable](#bookmark25)was Set, an FRS Message has been received from the PF with a Reason Code of[VF Enable](#bookmark25)d.

。 At least[VF Enable](#bookmark25)Time has passed since [VF Enable](#bookmark25)was Set.[VF Enable](#bookmark25)Time is either (1) the Reset Time value in the Readiness Time Reporting capability associated with the VF or (2) a value

determined by system software / firmware160 .

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAKElEQVRYhe3OAQkAQAgEMO1f6pr9pxBBtgSrAgAAGNZJ3nYCAAC47QOcDgNBkGYMzwAAAABJRU5ErkJggg==)

159. For example, ACPI tables.

160. For example, ACPI tables.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

The VF is permitted to silently drop Memory Requests after an FLR has been issued to the VF or [VF Enable](#bookmark214) has been Set in the associated PF's[SR-IOV Extended Capability](#bookmark191) until the VF responds successfully (without returning CRS) to any

Request.

Clearing [VF Enable](#bookmark214)effectively destroys the VFs. Setting [VF Enable](#bookmark214)effectively creates VFs. Setting [VF Enable](#bookmark214)after it has previously been Cleared shall result in a new set of VFs. If the PF is in the D0 power state, the new VFs are in the

D0uninitializedstate. If the PF is in a lower power state behavior is undefined (see Sections 9.6.1 and 9.6.2).

When Clearing [VF Enable,](#bookmark214) a PF that supports FRS shall send an FRS Message with FRS Reason VF Disabled to indicate when this operation is complete. The PF is not permitted to send this Message if there are outstanding Non-Posted Requests issued by the PF or any of the VFs associated with the PF. The FRS Message may only be sent after these

Requests have completed (or timed out).

After [VF Enable](#bookmark214)is Cleared no field in the [SR-IOV Extended Capability](#bookmark191)or the [VF Migration State Array](#bookmark222) (see [Section](#bookmark223)

[9.3.3.15.1 )](#bookmark224) may be accessed until either:

• At least 1.0 s has elapsed after [VF Enable](#bookmark29)was Cleared.

• The PF supports FRS and after [VF Enable](#bookmark29)was Cleared, an FRS Message has been received from the PF with a Reason Code of VF Disabled.

[Section 9.3.3.7 NumVFs,](#bookmark225)[Section 9.3.3.5 InitialVFs,](#bookmark226)[Section 9.3.3.6 TotalVFs,](#bookmark227)[Section 9.3.3.9 First VF Offset,](#bookmark228)[Section 9.3.3.13](#bookmark229) [System Page Size,](#bookmark230) and [Section 9.3.3.14](#bookmark231)VF BARx describe additional semantics associated with this field.

**9.3.3.3.2**[**VF Migration Enable**](#bookmark156)

[VF Migration Enable](#bookmark156) must be Set to allow VF Migration on this PF. See [MR-IOV] for details.

If[VF Migration Capable](#bookmark137) is Set, this bit is RWand the default value is 0b. Otherwise, this bit is ROZero. This field is ROwhen [VF Enable](#bookmark29)is Set.

**9.3.3.3.3**[**VF Migration Interrupt Enable**](#bookmark217)

An MSI or MSI-X interrupt message must be sent everytime the logical AND of the following conditions transitions from FALSE to TRUE:

• The VF’s Bus Master Enable bit is Set.

• The associated vector is unmasked.

• The [VF Migration Interrupt Enable](#bookmark217) bit is Set.

• The [VF Migration Status](#bookmark232) bit is Set.

The MSI or MSI-X vector used is indicated by the [VF Migration Interrupt Message Number](#bookmark201)field. If[VF Migration Capable](#bookmark137) is Clear, this field is undefined.

Note: VF Migration events are described in [Section 9.2.4 .](#bookmark134)

**9.3.3.3.4**[**VF MSE**](#bookmark217)**(Memory Space Enable)**

[VF MSE](#bookmark217)controls memory space enable for all Active VFs associated with this PF, as with the Memory Space Enable bit in a Function’s PCI Command register. The default value for this bit is 0b.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

When [VF Enable](#bookmark174)is Set, VF memory space will respond only when [VF MSE](#bookmark217)is Set. VFs shall follow the same error reporting rules as defined in the [PCIe] if an attempt is made to access a Virtual Function’s memory space when [VF Enable](#bookmark174)is Set and [VF MSE](#bookmark217)is Clear.

**IMPLEMENTATION NOTE**

VF MSE and VF Enable

VF memory space will respond with Unsupported Request when[VF Enable](#bookmark174)is Clear. Thus,[VF MSE](#bookmark217)is “don’t care” when [VF Enable](#bookmark174)is Clear; however, software may choose to Set[VF MSE](#bookmark217)after programming the VF BARn registers, prior to Setting [VF Enable.](#bookmark174)

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAOUlEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAAAwG/gAknGBCzHRqDrAAAAAElFTkSuQmCC)

**9.3.3.3.5**[**ARI Capable Hierarchy**](#bookmark203)

For Devices associated with an Upstream Port,[ARI Capable Hierarchy](#bookmark203)is a hint to the Device that ARI has been enabled in the Root Port or Switch Downstream Port immediately above the Device. Software should set this bit to match the ARI Forwarding Enable bit in the Root Port or Switch Downstream Port immediately above the Device.

[ARI Capable Hierarchy](#bookmark203)is only present in the lowest-numbered PF of a Device (for example PF0) and affects all PFs of the Device.[ARI Capable Hierarchy](#bookmark203)is Read Only Zero in other PFs of a Device.

A Device may use the setting of[ARI Capable Hierarchy](#bookmark203)to determine the values for [First VF Offset](#bookmark233) (see [Section 9.3.3.9)](#bookmark234)

and [VF Stride](#bookmark235) (see [Section 9.3.3.10](#bookmark236)). The effect of changing[ARI Capable Hierarchy](#bookmark203)is undefined if[VF Enable](#bookmark174)is Set in any PF. This bit must be set to its default value upon Conventional Reset. This bit is not affected by FLR of any PF or VF. If

either [ARI Capable Hierarchy Preserved](#bookmark198)is Set (see[Section 9.3.3.2.2](#bookmark202)) or No\_Soft\_Reset is Set (see[Section 9.6.2](#bookmark237)), a power state transition of this PF from D3HottoD0does not affect the value of this bit (see[Section 9.6.2)](#bookmark238).

[ARI Capable Hierarchy](#bookmark203)does not apply to RCiEPs.

**IMPLEMENTATION NOTE**

ARI Capable Hierarchy

For a Device associated with an Upstream Port, that Device has no way of knowing whether ARI has been enabled. If ARI is enabled, the Device can conserve Bus Numbers by assigning VFs to Function Numbers greater than 7 on the captured Bus Number. ARI is defined in Section 6.13 .

Since RCiEPs are not associated with an Upstream Port, ARI does not apply, and VFs may be assigned to any Function Number within the Root Complex permitted by [First VF Offset](#bookmark239)and [VF Stride](#bookmark240) (see [Section 9.3.3.8](#bookmark241)and [Section 9.3.3.9)](#bookmark242).

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAADTCAYAAAC89lJnAAAAPUlEQVRYhe3KoREAIAwAsZYlsYyGZUow2B4WkbefjLF2VM2erZw3AAAAAAAAAAAAAAAAAAAAAAAAAPgZPDsA8gSknSIPwAAAAABJRU5ErkJggg==)

[**9.3.3.4**](9.3.3.4) **SR-IOV Status Register (Offset 0Ah)**

[Table 9-6](#bookmark243): defines the layout of the SR-IOV Status field.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |
| --- | --- |
| 15 1 | 0 |
| RsvdZ |  |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABEAAAAGCAYAAAAlvnXZAAAAfElEQVQYlZXOoQ3CUBCA4e9hWAFVVUslCYIJWAPDDiScqy+WEBYiDIBkAEDiMK148ELg5OW+P5eECW7C078TKlxHWOEirIXxj7gWjjihSv1yhi2maHEofhZqbLDEDp1wT29H5ViOux4/BpZHPmMNzpiX8PdIHltgX8LDvAA3aCHnGHFrWAAAAABJRU5ErkJggg==)![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAAMCAYAAABfnvydAAAAYUlEQVQYlY3NsQmDABCG0RcR7C0CWUIkoNNYZpDbKRtYW1s5hpaCjYWNl3xwzc+DI3TC6KbivCoDaQV2bL9gUqiEZ/aixTcDaX+BDfMdeIDQoL7sk7AKr/IcBvQX8MGC9wFOPw1uc7FltAAAAABJRU5ErkJggg==)VF Migration Status

Figure 9-17 SR-IOV Status

Table 9-6 SR-IOV Status

|  |  |  |
| --- | --- | --- |
| Bit Location | Register Description | Attributes |
| 0 | **VF Migration Status** - Indicates a VF Migration In or Migration Out Request has been issued by MR-PCIM. To determine the cause of the event, software may scan the VF State Array. Default value is 0b. | RW1C |

**9.3.3.4.1**[**VF Migration Status**](#bookmark244)

[VF Migration Status](#bookmark244)is Set when[VF Enable,](#bookmark214)[VF Migration Capable,](#bookmark137) and [VF Migration Enable](#bookmark156)are Set and certain state changes occur in a[VF Migration State Array](#bookmark245)entry (see[Section 9.2.4](#bookmark134)and [Section 9.3.3.15.1f](#bookmark246)or details).

This setting of[VF Migration Status](#bookmark232)is not affected by the value of[VF Migration Interrupt Enable](#bookmark217)or by any controls for MSI or MSI-X.

[**9.3.3.5**](9.3.3.5) **InitialVFs (Offset 0Ch)**

[InitialVFs](#bookmark226)indicates to SR-PCIM the number of VFs that are initially associated with the PF. The minimum value of[InitialVFs](#bookmark226)is 0.

For Devices operating in Single-Root mode, this field is HwInitand must contain the same value as [TotalVFs.](#bookmark248)

For Devices operating in Multi-Root mode, the value of this field may be changed by MR-PCIM when [VF Enable](#bookmark29)is Clear. Note: The mechanism used by MR-PCIM to affect this field is described in [MR-IOV].

If[VF Migration Enable](#bookmark156)is Set and[VF Enable](#bookmark29)is Cleared and then Set, the value of[InitialVFs](#bookmark226) may change. This is necessary since some VFs may have been migrated to other PFs and may no longer be available to this PF.

[**9.3.3.6**](9.3.3.6) **TotalVFs (Offset 0Eh)**

[TotalVFs](#bookmark248)indicates the maximum number of VFs that could be associated with the PF. The minimum value of[TotalVFs](#bookmark248)is 0.

For Devices operating in Single-Root mode, this field is HwInitand must contain the same value as [InitialVFs.](#bookmark226) For Devices operating in Multi-Root mode, the value of this field may be changed by MR-PCIM.

Note: The mechanism used by MR-PCIM to affect this field is described in [MR-IOV].

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**9.3.3.7**](9.3.3.7) **NumVFs (Offset 10h)**

[NumVFs](#bookmark250)controls the number of VFs that are visible. SR-PCIM sets[NumVFs](#bookmark250)as part of the process of creating VFs. This number of VFs shall be visible in the PCI Express fabric after both[NumVFs](#bookmark250)is set to a valid value and [VF Enable](#bookmark214)is Set. A visible VF has a Function number reserved for it, but might not exist. If the VF exists, it shall respond to PCI Express

transactions targeting them, and shall follow all rules defined by this specification. A VF exists if either:

• [VF Migration Capable](#bookmark197)is Clear and the VF number is less than or equal to[TotalVFs.](#bookmark249)

• [VF Migration Capable](#bookmark197)is Set and the associated VF is in either the [Active.Available](#bookmark251)or [Dormant.MigrateIn](#bookmark252)state (see [Section 9.2.4)](#bookmark134)

The results are undefined if[NumVFs](#bookmark250)is set to a value greater than[TotalVFs.](#bookmark164)

[NumVFs](#bookmark250) may only be written while [VF Enable](#bookmark174)is Clear. If[NumVFs](#bookmark250)is written when [VF Enable](#bookmark174)is Set, the results are undefined.

The initial value of[NumVFs](#bookmark250)is undefined.

[**9.3.3.8**](9.3.3.8) **Function Dependency Link (Offset 12h)**

The programming model for a Device may have vendor-specific dependencies between sets of Functions. The [Function](#bookmark241) [Dependency Link](#bookmark241)field is used to describe these dependencies.

This field describes dependencies between PFs. VF dependencies are the same as the dependencies of their associated PFs.

If a PF is independent from other PFs of a Device, this fieldshall contain its own Function Number.

If a PF is dependent on other PFs of a Device, this fieldshall contain the Function Number of the next PF in the same

Function Dependency List. The last PF in a Function Dependency List shall contain the Function Number of the first PF in the Function Dependency List.

If PFp and PFq are in the same Function Dependency List, then any SI that is assigned VFp,n shall also be assigned to VFq,n.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAN3CAYAAAAPkxsCAAAAfElEQVR4nO3KoQEAIAzAsI0nsZyG5cphsDPo1DYZ61R07ZmjnS8AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAADgA1yp9QnuABDWEQAAAABJRU5ErkJggg==)

**IMPLEMENTATION NOTE**

Function Dependency Link Example

Consider the following scenario:

|  |  |  |  |
| --- | --- | --- | --- |
| SR-IOV Field | PF 0 | PF 1 | PF 2 |
| [Function Dependency Link](#bookmark188) | 1 | 0 | 2 |
| [NumVFs](#bookmark119) | 4 | 4 | 6 |
| [First VF Offset](#bookmark253) | 4 | 4 | 4 |
| [VF Stride](#bookmark254) | 3 | 3 | 3 |

|  |  |  |
| --- | --- | --- |
| Function Number | Description | Independent |
| 0 | PF 0 | No |
| 1 | PF 1 | No |
| 2 | PF 2 | Yes |
| 3 | Function not present |  |
| 4 | VF 0,1 (aka PF 0 VF 1) | No |
| 5 | VF 1,1 (aka PF 1 VF 1) | No |
| 6 | VF 2,1 (aka PF 2 VF 1) | Yes |
| 7 | VF 0,2 | No |
| 8 | VF 1,2 | No |
| 9 | VF 2,2 | Yes |
| 10 | VF 0,3 | No |
| 11 | VF 1,3 | No |
| 12 | VF 2,3 | Yes |
| 13 | VF 0,4 | No |
| 14 | VF 1,4 | No |
| 15 | VF 2,4 | Yes |
| 16 to 17 | Functions not present |  |
| 18 | VF 2,5 | Yes |
| 19 to 20 | Functions not present |  |
| 21 | VF 2,6 | Yes |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |
| --- | --- | --- |
| Function Number | Description | Independent |
| 22 to 255 | Functions not present |  |

In this example, Functions 4 and 5 must be assigned to the same SI. Similarly, Functions 7 and 8, 10 and 11, and 13 and 14 must be assigned together. If PFs are assigned to SIs, Functions 0 and 1 must be assigned together as well. Functions 2, 6, 9, 12, 15, 18, and 21 are independent and may be assigned to SIs in any fashion.

All PFs in a Function Dependency List shall have the same values for the [InitialVFs,](#bookmark247)[TotalVFs,](#bookmark248) and [VF Migration](#bookmark137) [Capable](#bookmark137)fields.

SR-PCIM shall ensure that all PFs in a Function Dependency List have the same values for the [NumVFs,](#bookmark225)[VF Enable,](#bookmark29) and [VF Migration Enable](#bookmark156)fields before any VF in that Function Dependency List is assigned to an SI.

VF Migration and VF Mapping operations occur independently for every VF. SR-PCIM shall not assign a VF to an SI until it can assign all dependent VFs. For example, using the scenario above, if both VF 0,2 (Function 7) and VF 1,2 (Function 8) are in the [Inactive.Unavailable](#bookmark255)or [Dormant.MigrateIn](#bookmark256)states, SR-PCIM shall not assign either VF to an SI until both VFs reach the [Active.Available](#bookmark257)state.

Similarly, SR-PCIM shall not remove a VF from an SI until it can remove all dependent VFs. For example, using the

scenario above, both VF 0,2 and VF 1,2 shall be removed from an SI only when they both reach the

[Active.MigrateOut](#bookmark258)state. SR-PCIM shall not transition the Functions to[Inactive.Unavailable](#bookmark259) until the SI has stopped using all dependent Functions.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAGMCAYAAADnWePxAAAAUUlEQVRoge3MoREAIAwEwYQmsZSGpcpQAROD3Le38xHNMtapZ90zR/cAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAF/ABW4OBhZjqCtvAAAAAElFTkSuQmCC)

[**9.3.3.9**](9.3.3.9) **First VF Offset (Offset 14h)**

[First VF Offset](#bookmark220)is a constant and defines the Routing ID offset of the first VF that is associated with the PF that contains

this Capability structure. The first VF’s 16-bit Routing ID is calculated by adding the contents of this field to the Routing ID of the PF containing this field ignoring any carry, using unsigned, 16-bit arithmetic.

A VF shall not be located on a Bus Number that is numerically smaller than its associated PF.

This field may change value when the lowest-numbered PF’s[ARI Capable Hierarchy](#bookmark203)value changes or when this PF’s [NumVFs](#bookmark225)value changes.

Note:[First VF Offset](#bookmark220)is unused if[NumVFs](#bookmark225)is 0. If[NumVFs](#bookmark225)is greater than 0,[First VF Offset](#bookmark220) must not be zero.

[**9.3.3.10**](9.3.3.10) **VF Stride (Offset 16h)**

[VF Stride](#bookmark221)defines the Routing ID offset from one VF to the next one for all VFs associated with the PF that contains this Capability structure. The next VF’s 16-bit Routing ID is calculated by adding the contents of this field to the Routing ID of the current VF, ignoring any carry, using unsigned 16-bit arithmetic.

This field may change value when the lowest-numbered PF’s[ARI Capable Hierarchy](#bookmark203)value changes or when this PF’s [NumVFs](#bookmark225)value changes.

Note:[VF Stride](#bookmark221)is unused if[NumVFs](#bookmark225)is 0 or 1. If[NumVFs](#bookmark225)is greater than 1,[VF Stride](#bookmark221) must not be zero.

[**9.3.3.11**](9.3.3.11) **VF Device ID (Offset 1Ah)**

This field contains the Device ID that should be presented for every VF to the SI.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[VF Device ID](#bookmark260)may be different from the PF Device ID. A [VF Device ID](#bookmark260)must be managed by the vendor. The vendor must ensure that the chosen [VF Device ID](#bookmark260)does not result in the use of an incompatible device driver.

[**9.3.3.12**](9.3.3.12) **Supported Page Sizes (Offset 1Ch)**

This field indicates the page sizes supported by the PF. This PF supports a page size of 2 n +12 if bit n is Set. For example, if bit 0 is Set, the PF supports 4-KB page sizes. PFs are required to support 4-KB, 8-KB, 64-KB, 256-KB, 1-MB, and 4-MB page sizes. All other page sizes are optional.

The page size describes the minimum alignment requirements for VF BAR resources as described in [Section 9.3.3.13 .](#bookmark261)

**IMPLEMENTATION NOTE**

Non-pre-fetch Address Space

Non-pre-fetch address space is limited to addresses below 4 GB. Pre-fetch address space in 32-bit systems is also limited. Vendors are strongly encouraged to utilize the [System Page Size](#bookmark262)feature to conserve address space while also supporting systems with larger pages.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAOUlEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAAAwG/gAknGBCzHRqDrAAAAAElFTkSuQmCC)

[**9.3.3.13**](9.3.3.13) **System Page Size (Offset 20h)**

This field defines the page size the system will use to map the VFs’ memory addresses. Software must set the value of the [System Page Size](#bookmark262)to one of the page sizes set in the [Supported Page Sizes](#bookmark40)field (see [Section 9.3.3.12](#bookmark40)). As with[Supported](#bookmark40) [Page Sizes](#bookmark40), if bit n is Set in[System Page Size](#bookmark262), the VFs associated with this PF are required to support a page size of 2 n +12. For example, if bit 1 is Set, the system is using an 8-KB page size. The results are undefined if[System Page Size](#bookmark262)is zero.

The results are undefined if more than one bit is set in [System Page Size.](#bookmark262) The results are undefined if a bit is Set in [System Page Size](#bookmark262)that is not Set in [Supported Page Sizes.](#bookmark40)

When [System Page Size](#bookmark262)is set, the VF associated with this PF is required to align all BAR resources on a [System Page Size](#bookmark262) boundary. Each VF BARn or VF BARn pair (see [Section 9.3.3.14)](#bookmark263) shall be aligned on a [System Page Size](#bookmark262) boundary. Each VF BARn or VF BARn pair defining a non-zero address space shall be sized to consume an integer multiple of[System Page](#bookmark262) [Size](#bookmark262) bytes. All data structures requiring page size alignment within a VF shall be aligned on a [System Page Size](#bookmark262) boundary.

[VF Enable](#bookmark29) must be zero when [System Page Size](#bookmark262)is written. The results are undefined if[System Page Size](#bookmark262)is written when [VF Enable](#bookmark29)is Set.

Default value is 0000 0001h (i.e., 4 KB).

[**9.3.3.14**](9.3.3.14) **VF BAR0 (Offset 24h), VF BAR1 (Offset 28h), VF BAR2 (Offset 2Ch), VF BAR3 (Offset**

**30h), VF BAR4 (Offset 34h), VF BAR5 (Offset 38h)**

These fields must define the VF’s Base Address Registers (BARs). These fields behave as normal PCI BARs, as described in Section 7.5.1. They can be sized by writing all 1s and reading back the contents of the BARs as described in Section

7.5.1.2.1 , complying with the low order bits that define the BAR type fields.

These fields may have their attributes affected by the [VF Resizable BAR Extended Capability](#bookmark264) (see [Section 9.3.7.5)](#bookmark265) if it is implemented.

The amount of address space decoded by each BARshall be an integral multiple of[System Page Size.](#bookmark262)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Each VF BARn, when “sized” by writing 1s and reading back the contents, describes the amount of address space

consumed and alignment required by a single Virtual Function, per BAR. When written with an actual address value, and [VF Enable](#bookmark214)and [VF MSE](#bookmark217)are Set, the BAR maps [NumVFs](#bookmark250) BARs. In other words, the base address is the address of the first VF BARn associated with this PF and all subsequent VF BARn address ranges follow as described below.

VF BARs shall only support 32-bit and 64-bit memory space. PCI I/O Space is not supported in VFs. Bit 0 of any

implemented VF BARx must beRO0b except for a VF BARx used to map the upper 32 bits of a 64-bit memory VF BAR pair. The alignment requirement and size read is for a single VF, but when [VF Enable](#bookmark174)is Set and[VF MSE](#bookmark217)is Set, the BAR contains the base address for all ([NumVFs)](#bookmark250) VF BARn.

The algorithm to determine the amount of address space mapped by a VF BARn differs from the standard BAR algorithm as follows:

1. Resize the BAR via the [VF Resizable BAR Extended Capability](#bookmark266) (see [Section 9.3.7.5)](#bookmark267) if it is implemented.

2. After reading the low order bits to determine if the BAR is a 32-bit BAR or 64-bit BAR pair, determine the size and alignment requirements by writing all 1s to VF BARn (or VF BARn and VF BARn+1 for a 64-bit BAR pair) and reading back the contents of the BAR or BAR pair. Convert the bit mask returned by the read(s) to a size and alignment value as described in Section 7.5.1.2.1 . This value is the size and alignment for a single VF.

3. Multiply the value from step 2 by the value set in [NumVFs](#bookmark28)to determine the total amount of space the BAR or BAR pair will map after [VF Enable](#bookmark29)and [VF MSE](#bookmark217)are Set.

For each VF BARn field, n corresponds to one of the VFs BAR spaces.[Table 9-7](#bookmark268)shows the relationship between n and a Function’s BAR.

Table 9-7 BAR Offsets

|  |  |
| --- | --- |
| n | BAR Offset in a Type 0 Header |
| 0 | 10h |
| 1 | 14h |
| 2 | 18h |
| 3 | 1Ch |
| 4 | 20h |
| 5 | 24h |

The contents of all VF BARn registers are indeterminate after [System Page Size](#bookmark39)is changed.

[**9.3.3.15**](9.3.3.15) **VF Migration State Array Offset (Offset 3Ch)**

If[VF Migration Capable](#bookmark137) [(Section 9.3.3.2.1 ) is Set and](#bookmark201)[TotalVFs](#bookmark164) [(Section 9.3.3.6 ) is not zero](#bookmark164), this register shall contain a PF BAR relative pointer to the[VF Migration State Array](#bookmark270). This register isROZero if[VF Migration Capable](#bookmark137)is Clear. The [VF](#bookmark271)

[Migration State Array](#bookmark272)is defined in [Section 9.3.3.15.1](#bookmark273). The layout of the [VF Migration State Array Offset](#bookmark269)is defined in [Table](#bookmark274) [9-8 .](#bookmark275)

2 0

31 3

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |
| --- | --- |
| VF Migration State Offset |  |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABQAAAALCAYAAAB/Ca1DAAAAj0lEQVQoka3OOw5BURSG0XWRKCQSE9DoPOZgDAqNwiBUGtmdOShEpdOpVMYi6BCdRKO5kjuA/Zdfzl45dWForOPsIWE1zDDJwP5g6tLBBlaZYA09dDPBOaaZYOrSwUJooY9P2S7CWxhV3l2FlzBEUbab8BQGlY/dG/hiWzle4IR9pS1xxA7NsgUO2KBdtvUP6TMYFZPqEacAAAAASUVORK5CYII=)VF Migration State BIR

Figure 9-18 VF Migration State Array Offset Table 9-8 VF Migration State Array Offset

|  |  |  |
| --- | --- | --- |
| Bit Location | Register Description | Attributes |
| 2:0 | **VF Migration State BIR** - Indicates which one of a Function’s Base Address registers, located beginning at 10h in Configuration Space, is used to map the Function’s [VF Migration State Array](#bookmark277)into Memory  Space.  **BIR Value BAR**  **0** BAR0 10h  **1** BAR1 14h  **2** BAR2 18h  **3** BAR3 1Ch  **4** BAR4 20h  **5** BAR5 24h  **6** Reserved  **7** Reserved  For a 64-bit BAR, the [VF Migration State BIR](#bookmark276)indicates the lower DW. This field is undefined if[TotalVFs](#bookmark249) is 0. | RO |
| 31:3 | **VF Migration State Offset** - Used as an offset from the address contained by one of the Function’s Base Address registers to point to the base of the [VF Migration State Array.](#bookmark278) The lower three [VF Migration State](#bookmark276) [BIR](#bookmark276) bits are masked off (set to zero) by software to form a 32-bit QW-aligned offset.  This field is undefined if[TotalVFs](#bookmark249) is 0. | RO |

If a BAR that maps address space for the [VF Migration State Array](#bookmark279)also maps other usable address space not associated with VF Migration, locations used in the other address space must not share any naturally aligned 8-KB address range with one where the [VF Migration State Array](#bookmark280) resides.

**9.3.3.15.1 VF Migration State Array**

The [VF Migration State Array](#bookmark280)is located using the [VF Migration State Array Offset](#bookmark269) register of the[SR-IOV Extended](#bookmark191) [Capability](#bookmark191) block.

The [VF Migration State Array](#bookmark273) has a [VF Migration State Entry](#bookmark281)for each VF. The total size of the [VF Migration State array](#bookmark273)is [NumVFs](#bookmark28) bytes. The [VF Migration State Array](#bookmark273)shall not exist if[TotalVFs](#bookmark164)is 0.[Table 9-9](#bookmark282)defines the layout of a [VF Migration](#bookmark273) [State Array](#bookmark273)entry.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |
| --- | --- | --- |
| 7 2 | | 1 0 |
| RsvdP |  | |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABAAAAANCAYAAACgu+4kAAAAVElEQVQoke3RoQ2AQBAF0cdJApbuKAN/NVEKmg6wIAkokpMXFskka0ZMsvlkSTZ4SUKHNRII8UngwhYNBfhXCK/QgKxHW/hddlT483lhwlLcWOnnGyavGAVdXeqeAAAAAElFTkSuQmCC)VF Migration State

Figure 9-19 VF Migration State Entry Table 9-9 **VF Migration State Entry**

|  |  |  |
| --- | --- | --- |
| Bit Location | Register Description | Attributes |
| 1:0 | **VF Migration State** - State of the associated VF. This field is undefined when [VF Enable](#bookmark214)is Clear.  The initial values of the[VF Migration State Array](#bookmark280)are described in [Section 9.2.4.1 .](#bookmark144) | RW |

[VF Migration State](#bookmark170)contains the values shown in [Table 9-10 .](#bookmark283)

Table 9-10 VF Migration State Descriptions

|  |  |  |
| --- | --- | --- |
| VF State | VF Exists | Description |
| 00b | No | **Inactive.Unavailable** - VF does not exist to SR nor is it being migrated in or out. |
| 01b | No | **Dormant.MigrateIn** - VF is available for use by SR. VF exists but cannot initiate transactions. |
| 10b | Yes | **Active.MigrateOut** - SR has been requested to relinquish use of the VF. |
| 11b | Yes | **Active.Available** - Fully functional. Could be assigned to an SI. |

The initial values of the[VF Migration State Array](#bookmark171)are described in[Section 9.2.4.1](#bookmark144). The [VF Migration State Array](#bookmark171)is returned to a configuration as described in [Section 9.2.4.1](#bookmark144)within 1.0 s after [VF Enable](#bookmark174)is Cleared.

Software initiates a state transition by writing a new state value to the entry.

Valid state transitions are listed in [Table 9-11](#bookmark286)and [Table 9-12](#bookmark287)and shown in [Figure 9-12](#bookmark169). Only changes in [Table 9-11](#bookmark288)can be requested by SR. Changes in [Table 9-12](#bookmark289)are initiated by MR-PCIM using mechanisms described in the [MR-IOV] and have SR visible effects described here. Any transition issued by SR-PCIM and not listed in [Table 9-11](#bookmark290)is ignored and does not change the VF State.

Table 9-11 SR-PCIMInitiated VF Migration State Transitions

|  |  |  |  |
| --- | --- | --- | --- |
| Current State | New State | Change Initiated By | SR Visible Effects of Change |
| [Dormant.MigrateIn](#bookmark284) | [Active.Available](#bookmark176) | SR-PCIM | **VF Activate**  VF now exists. |
|  |
| [Active.Available](#bookmark176) | [Dormant.MigrateIn](#bookmark284) | SR-PCIM | **VF Deactivate**  VF no longer exists. |
| [Active.MigrateOut](#bookmark285) | [Inactive.Unavailable](#bookmark177) | SR-PCIM | **VF Migrate Out Complete** |
|  |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |
| --- | --- | --- | --- |
| Current State | New State | Change Initiated By | SR Visible Effects of Change |
|  |  |  | VF no longer exists. |

Table 9-12 MR-PCIMInitiated VF Migration State Transitions

|  |  |  |  |
| --- | --- | --- | --- |
| Current State | New State | Change Initiated By | SR Visible Effects of Change |
| [Active.Available](#bookmark176) | [Active.MigrateOut](#bookmark285) | MR-PCIM | **VF Migrate Out Request**  VF continues to exist. Sets[VF Migration Status.](#bookmark244) |
| [Inactive.Unavailable](#bookmark177) | [Dormant.MigrateIn](#bookmark284) | MR-PCIM | **VF Migrate In Request**  VF remains non-existent. Sets[VF Migration Status.](#bookmark244) |
|  |
| [Dormant.MigrateIn](#bookmark284) | [Inactive.Unavailable](#bookmark177) | MR-PCIM | **VF Migrate In Retract**  VF remains non-existent. Sets[VF Migration Status.](#bookmark244) |
|  |
| [Active.MigrateOut](#bookmark285) | [Active.Available](#bookmark176) | MR-PCIM | **VF Migrate Out Retract**  VF continues to exist. Sets[VF Migration Status.](#bookmark244) |

**9.3.4 PF/VF Configuration Space Header**

This section defines the requirements on PF and VF configuration space fields.

The register definitions listed throughout this chapter establish the mapping between existing PCI-SIG specifications and the PF/VF definitions for a PCIe SR-IOV-capable device.

[**9.3.4.1**](9.3.4.1) **PF/VF Type 0 Configuration Space Header**

[Figure 9-20](#bookmark291)details allocation for register fields of PCI Express Type 0 Configuration Space Header.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

31

0

|  |  |  |  |
| --- | --- | --- | --- |
| Device ID | | Vendor ID | |
| Status | | Command | |
| Class Code | | | Revision ID |
| BIST | Header Type | Master Latency Timer | Cache Line Size |
| Base Address Registers | | | |
| Cardbus CIS Pointer | | | |
| Subsystem ID | | Subsystem Vendor ID | |
| Expansion ROM Base Address | | | |
| Reserved | | | Capabilities Pointer |
| Reserved | | | |
| Max\_Lat | Min\_Gnt | Interrupt Pin | Interrupt Line |

Byte Offset

00h

04h

08h

0Ch

10h

14h

18h

1Ch

20h

24h

28h

2Ch

30h

34h

38h

3Ch

OM14316

Figure 9-20 PF/VF Type 0 Configuration Space Header

The PCI Express-specific interpretation of registers specific to PCI Express Type 0 Configuration Space Header is defined in this section.

Error Reporting fields are described in more detail in [Section 9.4 .](#bookmark292)

**9.3.4.1.1 Vendor ID Register Changes (Offset 00h)**

This Read Only register identifies the manufacturer of the Device.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

This field in all VFs returns FFFFh when read. VI software should return the Vendor ID value from the associated PF as the Vendor ID value for the VF.

**9.3.4.1.2 Device ID Register Changes (Offset 02h)**

This Read Only register identifies the particular Device.

This field in all VFs returns FFFFh when read. VI software should return the VF Device ID (see [Section 9.3.3.11)](#bookmark260) value from the associated PF as the Device ID for the VF.

**IMPLEMENTATION NOTE**

Legacy PCI Probing Software

Returning FFFFh for Device ID and Vendor ID values allows some legacy software to ignore VFs. See Section

7.5.1.1.1 .

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACGCAYAAADgtcsjAAAANElEQVRYhe3KoQEAIAzAsI0nsZyG5cphsLuA1DYZ61R07ZmjnS8AAAAAAAAAAAAAAIAfwAXNTgQMupkb1QAAAABJRU5ErkJggg==)

**9.3.4.1.3 Command Register Changes (Offset 04h)**

PF and VF functionality is defined inSection 7.5.1.1.3except where noted in [Table 9-13 .](#bookmark294) For VF fields marked RsvdP, the PF setting applies to the VF.

Table 9-13 Command Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 0 | **I/O Space Enable** - Does not apply to VFs. Must be hardwired to 0b for VFs. | Base | 0b |
| 1 | **Memory Space Enable** - Does not apply to VFs. Must be hardwired to 0b for VFs. VF Memory Space is controlled by the VF MSE bit in the VF Control register. | Base | 0b |
| 2 | **Bus Master Enable** - Transactions for a VF that has its Bus Master Enable Set must not be blocked by transactions for VFs that have their Bus Master Enable Cleared. | Base | Base |
| 6 | Parity Error Response | Base | RsvdP |
| 8 | SERR# Enable | Base | RsvdP |
| 10 | **Interrupt Disable** - Does not apply to VFs. | Base | 0b |

**9.3.4.1.4 Status Register Changes (Offset 06h)**

PF and VF functionality is defined inSection 7.5.1.1.4except where noted in [Table 9-14 .](#bookmark295)

Table 9-14 Status Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 3 | **Interrupt Status** - Does not apply to VFs. | Base | 0b |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**9.3.4.1.5 Revision ID Register Changes (Offset 08h)**

This register specifies a device specific revision identifier.

The value reported in the VF may be different than the value reported in the PF.

**9.3.4.1.6 Class Code Register Changes (Offset 09h)**

The Class Code register isROand is used to identify the generic function of the device and, in some cases, a specific register level programming interface. The field in the PF and associated VFs must return the same value when read.

**9.3.4.1.7 Cache Line Size Register Changes (Offset 0Ch)**

This field is implemented by PCI Express devices as a read-write field for legacy compatibility purposes but has no effect on any PCI Express device behavior.Physical Functionscontinue to implement this field as RW. ForVirtual Functions,

this field is ROZero.

**9.3.4.1.8 Latency Timer Register Changes (Offset 0Dh)**

This field does not apply to PCI Express. This register must be ROZero.

**9.3.4.1.9 Header Type Register Changes (Offset 0Eh)**

This byte identifies the layout of the second part of the predefined header (beginning at byte 10h in Configuration Space) and also whether or not the device contains multiple Functions. Bit 7 in this register is used to identify a Multi-Function Device. For an SR-IOV device, bit 7 in this register is only Set if there are multiple Functions. VFs do not affect the value of bit 7. Bits 6 through 0 identify the layout of the second part of the predefined header. For VFs, this register must be RO

Zero.

**9.3.4.1.10 BIST Register Changes (Offset 0Fh)**

VFs shall not support BIST and must define this field as RO Zero.

If VF Enable is turned on in any PF of a Device, then software must not invoke BIST in any Function associated with that Device.

**9.3.4.1.11 Base Address Registers Register Changes (Offset 10h, 14h, … 24h)**

For VFs, the values in these registers are RO Zero.

Note: See also [Section 9.2.1.1.1](#bookmark11)and [Section 9.3.3.14 .](#bookmark126)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**9.3.4.1.12Cardbus CIS Pointer RegisterChanges (Offset 28h)**

For VFs, this register is not used and shall be RO Zero.

**9.3.4.1.13Subsystem Vendor ID RegisterChanges (Offset 2Ch)**

This Read Only field identifies the manufacturer of the subsystem. The field in the PF and associated VFs must return the same value when read.

**9.3.4.1.14Subsystem ID RegisterChanges (Offset 2Eh)**

This Read Only field identifies the particular subsystem. The Device may have a different value in the PF and the VF.

**9.3.4.1.15 Expansion ROM Base Address Register Register Changes (Offset 30h)**

The Expansion ROM Base Address Registeris permitted to be implemented in PFs. TheExpansion ROM Base Address Registeris RO Zero in VFs. The VI may choose to provide access to the PFExpansion ROM Base Address Registerfor VFs via emulation.

The PF is not permitted to implement Expansion ROM address decoder sharing (seeSection 7.5.1.2.4).

**9.3.4.1.16 Capabilities Pointer Register Changes (Offset 34h)**

No differences from the description in Section 7.5.1.1.11 .

**9.3.4.1.17Interrupt Line RegisterChanges (Offset 3Ch)**

This field does not apply to VFs. This field must be RO Zero.

**9.3.4.1.18Interrupt Pin RegisterChanges (Offset 3Dh)**

This field does not apply to VFs. This field must be RO Zero.

**9.3.4.1.19Min\_Gnt Register/Max\_Lat RegisterChanges (Offset 3Eh/3Fh)**

These registers do not apply to PCI Express. They must be RO Zero.

**9.3.5 PCI Express Capability Changes**

The PCI Express Capability (see Section 7.5.3) is used for identification of a PCI Express device and indicates support for PCI Express features.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Figure 7-21details allocation of register fields in the PCI Express Capability. PFs and VFs are required to implement this capability subject to the exceptions and additional requirements described below.

[**9.3.5.1**](9.3.5.1) **PCI Express Capabilities Register Changes (Offset 00h)**

The PCI Express Capabilities Registeris described in Section 7.5.3.2and the functionality described there applies to both the PF and VF.

[**9.3.5.2**](9.3.5.2) **PCI Express Capabilities Register Changes (Offset 02h)**

The PCI Express Capabilities register identifies PCI Express device type and associated capabilities. The PF and VF functionality is defined in Section 7.5.3.2 .

[**9.3.5.3**](9.3.5.3) **Device Capabilities Register Changes (Offset 04h)**

The Device Capabilities Register identifies PCI Express device specific capabilities.Figure 7-24details allocation of register fields in theDevice Capabilities Register;[Table 9-15 pro](#bookmark297)vides the respective bit definitions.

PF and VF functionality is defined inSection 7.5.3.3except where noted in [Table 9-15 .](#bookmark298)

Table 9-15 Device Capabilities Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 4:3 | Phantom Functions Supported - When the VF Enable is Set, use of Phantom Function  numbers by this PF and associated VFs is not permitted and this field must return 00b when read. | Base | 00b |
| 25:18 | Captured Slot Power Limit Value | Base | undefined |
| 27:26 | Captured Slot Power Limit Scale | Base | undefined |
| 28 | Function Level Reset Capability - Required for PFs and for VFs. | 1b | 1b |

[**9.3.5.4**](9.3.5.4) **Device Control Register Changes (Offset 08h)**

The Device Control Registercontrols PCI Express device specific parameters.Figure 7-25details allocation of register fields in the Device Control register;[Table 9-16](#bookmark300)provides the respective bit definitions.

PF and VF functionality is defined inSection 7.5.3.4except where noted in [Table 9-16 .](#bookmark301) For VF fields markedRsvdP, the PF setting applies to the VF.

Table 9-16 Device Control Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 0 | Correctable Error Reporting Enable  See [Section 9.4f](#bookmark302)or details on error reporting. | Base | RsvdP |
| 1 | Non-Fatal Error Reporting Enable | Base | RsvdP |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
|  | See [Section 9.4f](#bookmark303)or details on error reporting. |  |  |
| 2 | Fatal Error Reporting Enable  See [Section 9.4f](#bookmark304)or details on error reporting. | Base | RsvdP |
| 3 | Unsupported Request Reporting Enable  See [Section 9.4f](#bookmark305)or details on error reporting. | Base | RsvdP |
| 4 | Enable Relaxed Ordering | Base | RsvdP |
| 7:5 | Max\_Payload\_Size | Base | RsvdP |
| 8 | Extended Tag Field Enable | Base | RsvdP |
| 9 | Phantom Functions Enable | Base | RsvdP |
| 10 | Aux Power PM Enable | Base | RsvdP |
| 11 | Enable No Snoop | Base | RsvdP |
| 14:12 | Max\_Read\_Request\_Size | Base | RsvdP |
| 15 | Initiate Function Level Reset - Required for PFs and for VFs.  Note: Setting Initiate Function Level Reset in a PF resets VF Enable which means that VFs no longer exist after the FLR is complete. | Base | Base |

[**9.3.5.5**](9.3.5.5) **Device Status Register Changes (Offset 0Ah)**

The Device Status register provides information about PCI Express device specific parameters.Figure 7-26details allocation of register fields in the Device Status register;[Table 9-17](#bookmark307)provides the respective bit definitions.

PF and VF functionality is defined inSection 7.5.3.5except where noted in [Table 9-17 .](#bookmark308)

Table 9-17 Device Status Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 4 | AUX Power Detected | Base | 0b |
| 6 | Emergency Power Reduction Detected | Base | 0b |

[**9.3.5.6**](9.3.5.6) **Link Capabilities Register Changes (Offset 0Ch)**

The Link Capabilities register identifies PCI Express Link specific capabilities.Figure 7-27details allocation of register fields in the Link Capabilities register.

PF and VF functionality is defined in Section 7.5.3.6 .

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**9.3.5.7**](9.3.5.7) **Link Control Register Changes (Offset 10h)**

The Link Control Registercontrols PCI Express Link specific parameters.Figure 7-28details allocation of register fields in the Link Control register.

PF and VF functionality is defined inSection 7.5.3.7except where noted in [Table 9-18 .](#bookmark309) For VF fields markedRsvdP, the PF setting applies to the VF.

Table 9-18 Link Control Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 1:0 | Active State Power Management (ASPM) Control | Base | RsvdP |
| 3 | Read Completion Boundary(RCB Must be hardwired to 0b for VFs. | Base | RsvdP |
| 6 | Common Clock Configuration | Base | RsvdP |
| 7 | Extended Synch | Base | RsvdP |
| 8 | Enable Clock Power Management | Base | RsvdP |
| 9 | Hardware Autonomous Width Disable | Base | RsvdP |

[**9.3.5.8**](9.3.5.8) **Link Status Register Changes (Offset 12h)**

The Link Status Register provides information about PCI Express Link specific parameters.Figure 7-29details allocation of register fields in the Link Status register.

PF functionality is defined inSection 7.5.3.8. For the VF, all fields in this register areRsvdZand the PF setting applies to the VF.

[**9.3.5.9**](9.3.5.9) **Device Capabilities 2 Register Changes (Offset 24h)**

PF and VF functionality is defined inSection 7.5.3.15except as noted in [Table 9-19 .](#bookmark211)

Table 9-19 Device Capabilities 2 Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 3:0 | **Completion Timeout Ranges Supported**  VF value must be identical to PF value. | Base | Base |
| 4 | **Completion Timeout Disable Supported**  VF value must be identical to PF value. | Base | Base |
| 6 | **AtomicOp Routing Supported**  Not applicable to Endpoints. | RsvdP | RsvdP |
| 7 | **32-bit AtomicOp Completer Supported** | Base | Base |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
|  | VF value must be identical to PF value. |  |  |
| 8 | **64-bit AtomicOp Completer Supported**  VF value must be identical to PF value. | Base | Base |
| 9 | **128-bit CAS Completer Supported**  VF value must be identical to PF value. | Base | Base |
| 16 | **10-Bit Tag Completer Supported**  VF value must be identical to PF value. | Base | Base |
| 17 | **10-Bit Tag Requester Supported**  VF value must equal the value of the VF 10-Bit Tag Requester Supported bit in the SR-IOV Capabilities register. See [Section 9.3.3.2.3](#bookmark207) | Base | Base |
| 25:24 | **Emergency Power Reduction Supported**  VF value must be identical to PF value. | Base | Base |
| 26 | **Emergency Power Reduction Initialization Required** VF value must be identical to PF value**.** | Base | Base |

[**9.3.5.10**](9.3.5.10) **Device Control 2 Register Changes (Offset 28h)**

PF and VF functionality is defined inSection 7.5.3.16except as noted in [Table 9-20 .](#bookmark310)

Table 9-20 Device Control 2 Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 3:0 | **Completion Timeout Value**  The PF value applies to all associated VFs. | Base | RsvdP |
| 4 | **Completion Timeout Disable**  The PF value applies to all associated VFs. | Base | RsvdP |
| 6 | **AtomicOp Requester Enable**  The PF value applies to all associated VFs. | Base | RsvdP |
| 8 | **IDO Request Enable**  The PF value applies to all associated VFs. | Base | RsvdP |
| 9 | **IDO Completion Enable**  The PF value applies to all associated VFs. | Base | RsvdP |
| 11 | **Emergency Power Reduction Request**  This bit is only present in one Function associated with an Upstream Port. That Function can never be a VF. | Base | RsvdP |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAIklEQVRYhe3BAQ0AAAgDoNvUeqbUHG5ApWcDAAAAAADw2QHZqwIKU+HaKwAAAABJRU5ErkJggg==)

Page 1146

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 12 | **10-Bit Tag Requester Enable**  The value in the VF 10-Bit Tag Requester Enable bit in the SR-IOV Control register applies to all associated VFs. | Base | RsvdP |

[**9.3.5.11**](9.3.5.11) **Device Status 2 Register Changes (Offset 2Ah)**

PF and VF functionality is defined in Section 7.5.3.17 .

[**9.3.5.12**](9.3.5.12) **Link Capabilities 2 Register Changes (Offset 2Ch)**

PF and VF functionality is defined in Section 7.5.3.18 .

[**9.3.5.13**](9.3.5.13) **Link Control 2 Register Changes (Offset 30h)**

PF and VF functionality is defined in Section 7.5.3.19 .

[**9.3.5.14**](9.3.5.14) **Link Status 2 Register Changes (Offset 32h)**

PF and VF functionality is defined inSection 7.5.3.20except where noted in [Table 9-21 .](#bookmark311) The VF fields markedRsvdZ use the value of the associated PF.

Table 9-21 Link Status 2 Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 0 | Current De-emphasis Level | Base | RsvdZ |

**9.3.6 PCI Standard Capabilities**

SR-IOV usage of PCI Standard Capabilities is described in [Table 9-22 .](#bookmark312) Items marked n/a are not applicable to PFs or VFs.

Table 9-22 SR-IOV Usage of PCI Standard Capabilities

|  |  |  |  |
| --- | --- | --- | --- |
| Capability ID | Description | PF Attributes | VF Attributes |
| 00h | Null Capability | Base | Base |
| 01h | PCI Power Management Interface | Base | Optional. See [Section 9.6 .](#bookmark313) |
| 02h | AGP | n/a | n/a |
| 03h | VPD | Base | Optional. See [Section 9.3.6.1 .](#bookmark314) |
| 04h | Slot Identification | n/a | n/a |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |
| --- | --- | --- | --- |
| Capability ID | Description | PF Attributes | VF Attributes |
| 05h | MSI | Base | See [Section 9.5.1.1 .](#bookmark315) |
| 06h | CompactPCI Hot Swap | n/a | n/a |
| 07h | PCI-X | n/a | n/a |
| 08h | HyperTransport | n/a | n/a |
| 09h | Vendor-specific | Base | Base |
| 0Ah | Debug Port | Base | Base |
| 0Bh | CompactPCI Central Resource Control | n/a | n/a |
| 0Ch | PCI Hot Plug | Base | n/a |
| 0Dh | PCI Bridge Subsystem ID | n/a | n/a |
| 0Eh | AGP 8x | n/a | n/a |
| 0Fh | Secure Device | n/a | n/a |
| 10h | PCI Express | Base | See [Section 9.3.5 .](#bookmark296) |
| 11h | MSI-X | See [Section 9.5.1.2](#bookmark316)and [Section](#bookmark317)  [9.5.1.3 .](#bookmark318) | See [Section 9.5.1.2](#bookmark319)and [Section](#bookmark320)  [9.5.1.3 .](#bookmark321) |
| 12h | Serial ATA Data/Index Configuration | Base | n/a |
| 13h | Advanced Features | n/a | n/a |
| 14h | Enhanced Allocation | Base | Must not implement. |
| 15h | Flattening Portal Bridge (FPB) | n/a | n/a |

[**9.3.6.1**](9.3.6.1) **VPD Capability**

The VPD Capability is optional in PCI. It remains optional in SR-IOV.

VFs and PFs that implement the VPD Capability must ensure that there can be no “data leakage” between VFs and/or PFs via the VPD Capability.

**9.3.7 PCI Express Extended Capabilities Changes**

SR-IOV usage of PCI Express Extended Capabilities is described in [Table 9-23 .](#bookmark322) Items marked n/a are not applicable to PFs or VFs (e.g., for Capabilities only present in Root Complexes or only present in Function 0).

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Table 9-23 SR-IOV Usage of PCI Express Extended Capabilities

|  |  |  |  |
| --- | --- | --- | --- |
| Extended  Capability  ID | Description | PF Attributes | VF Attributes |
| 0000h | Null Capability | Base | Base |
| 0001h | Advanced Error Reporting Extended Capability (AER) | Base | See [Section 9.4.2 .](#bookmark323) |
| 0002h | Virtual Channel Extended Capability (02h) | Base | Must not implement. See [Section 9.3.7.1 .](#bookmark324) |
| 0003h | Device Serial Number Extended Capability | Base | See [Section 9.3.7.2](#bookmark325) |
| 0004h | Power Budgeting Extended Capability | Base | Must not implement. See [Section 9.3.7.3](#bookmark326) |
| 0005h | Root Complex Link Declaration Extended Capability | n/a | n/a |
| 0006h | Root Complex Internal Link Control Extended Capability | n/a | n/a |
| 0007h | Root Complex Event Collector Endpoint Association Extended Capability | n/a | n/a |
| 0008h | Multi-Function Virtual Channel Extended Capability | Base | Must not implement. See [Section 9.3.7.1 .](#bookmark327) |
| 0009h | Virtual Channel Extended Capability (09h) | Base | Must not implement. See [Section 9.3.7.1 .](#bookmark328) |
| 000Ah | RCRB Header Extended Capability | n/a | n/a |
| 000Bh | Vendor-specific Extended Capability | Base | Base |
| 000Ch | Configuration Access Correlation Extended Capability | n/a | n/a |
| 000Dh | ACS Extended Capability | See [Section 9.3.7.6 .](#bookmark329) | See [Section 9.3.7.6 .](#bookmark330) |
| 000Eh | ARI Extended Capability (ARI) | See [Section 9.3.7.7 .](#bookmark331) | See [Section 9.3.7.7 .](#bookmark332) |
| 000Fh | ATS Extended Capability | See [Section 9.3.7.8 .](#bookmark333) | See [Section 9.3.7.8 .](#bookmark334) |
| 0010h | [SR-IOV Extended Capability](#bookmark190) | See [Section 9.3.3 .](#bookmark190) | Must not implement. See [Section 9.3.3](#bookmark190) |
| 0011h | [MR-IOV Extended Capability](#bookmark134) (MR-IOV) | Must not implement. See [Section 9.3.7.9 .](#bookmark335) | Must not implement. See [Section 9.3.7.9 .](#bookmark336) |
| 0012h | Multicast Extended Capability | See [Section 9.3.7.10 .](#bookmark337) | See [Section 9.3.7.10 .](#bookmark338) |
| 0013h | Page Request Extended Capability (PRI) | See [Section 9.3.7.11 .](#bookmark339) | See [Section 9.3.7.11](#bookmark340) |
| 0014h | Reserved for AMD | Base | Base |
| 0015h | Resizable BAR Extended Capability | Base | Must not implement. See [Section 9.3.7.4](#bookmark341) |
| 0016h | Dynamic Power Allocation Extended Capability (DPA) | See [Section 9.3.7.12 .](#bookmark342) | Must not implement. See [Section 9.3.7.12 .](#bookmark343) |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAIklEQVRYhe3BAQ0AAAgDoNvUeqbUHG5ApWcDAAAAAADw2QHZqwIKU+HaKwAAAABJRU5ErkJggg==)

Page 1149

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |
| --- | --- | --- | --- |
| Extended  Capability  ID | Description | PF Attributes | VF Attributes |
| 0017h | TPH Requester Extended Capability (TPH) | See [Section 9.3.7.13 .](#bookmark344) | See [Section 9.3.7.13](#bookmark345) |
| 0018h | LTR Extended Capability | Base | Must not implement. LTR is controlled using Function 0 which is never a VF. |
| 0019h | Secondary PCI Express Extended Capability | Base | Must not implement. 8.0 GT/s is controlled using Function 0 which is never a VF. |
| 001Ah | PMUX Extended Capability | Base | Must not implement. PMUX is controlled using Function 0 which is never a VF. |
| 001Bh | PASID Extended Capability | Base | See [Section 9.3.7.14](#bookmark346) |
| 001Ch | LN Requester Extended Capability (LNR) | Base | Base |
| 001Dh | DPC Extended Capability | n/a | n/a. |
| 001Eh | L1 PM Substates Extended Capability | Base | Must not implement. L1 PM Substates is  controlled using Function 0 which is never a VF. |
| 001Fh | Precision Time Management Extended Capability (PTM) | Base | Must not implement. PTM controls the Port and must not be implemented in a VF. |
| 0020h | PCI Express over M-PHY Extended Capability (M-PCIe) | Base | Must not implement. M-PHY is controlled using Function 0 which is never a VF. |
| 0021h | FRS Queueing Extended Capability | n/a | n/a |
| 0022h | Readiness Time Reporting Extended Capability | Base | See [Section 9.3.7.15](#bookmark347) |
| 0023h | Designated vendor-specific Extended Capability | Base | Base |
| 0024h | [VF Resizable BAR Extended Capability](#bookmark348) | See [Section 9.3.7.5](#bookmark349) | Not Implemented. See [Section 9.3.7.5](#bookmark350) |
| 0025h | Data Link Feature Extended Capability | Base | Must not implement. |
| 0026h | Physical Layer 16.0 GT/s Extended Capability | Base | Must not implement. |
| 0027h | Lane Margining at the Receiver Extended Capability | Base | Must not implement. |
| 0028h | Hierarchy ID Extended Capability | Base | Base |
| 0029h | Native PCIe Enclosure Management Extended Capability (NPEM) | Base | Must not implement |

[**9.3.7.1**](9.3.7.1) **Virtual Channel/MFVC**

VFs use the Virtual Channels of the associated PFs. As such, VFs do not contain any Virtual Channel Capabilities. VFs shall not contain the MFVC Capability. This Capability is only present in Function 0 which shall never be a VF.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**9.3.7.2**](9.3.7.2) **Device Serial Number**

The Device Serial Number Extended Capability may be present in PFs. If a PF contains the capability, its value applies to all associated VFs. VFs are permitted but not recommended to implement this capability. VFs that implement this

capability must return the same Device Serial Number value as that reported by their associated PF.

[**9.3.7.3**](9.3.7.3) **Power Budgeting**

The Power Budgeting Extended Capability may be present in PFs, but VFs must not implement it. If a PF contains the capability, it must report values that cover all associated VFs.

[**9.3.7.4**](9.3.7.4) **Resizable BAR**

The Resizable BAR Extended Capability may be present in PFs. Since VFs do not implement standard BARs the capability must not be present in a VF. The PF’s Resizable BAR settings do not affect any settings in the [SR-IOV Extended Capability.](#bookmark191)

[**9.3.7.5**](9.3.7.5) **VFResizable BAR Extended Capability**

The [VF Resizable BAR Extended Capability](#bookmark266)is permitted to be implemented only in PFs that implement at least one VF BAR, and affects the size and base of a PF’s VF BARs. Since VFs do not implement the BARs themselves the capability must not be present in a VF. A PF may implement both a[VF Resizable BAR Extended Capability](#bookmark266)and a Resizable BAR capability, as each capability operates independently.

The [VF Resizable BAR Extended Capability](#bookmark266)is an optional capability that permits PFs to be able to have their VF’s BARs resized. The[VF Resizable BAR Extended Capability](#bookmark266) permits hardware to communicate the resource sizes that are

acceptable for operation via the [VF Resizable BAR Extended Capability](#bookmark266)and Control registers and system software to communicate the optimal size back to the hardware via the[VF BAR Size](#bookmark351)field of the [VF Resizable BAR Control register.](#bookmark352)

Hardware immediately reflects the size inference in the read-only bits of the appropriate VF BAR. The size inferred is the greater of the values decoded from the System Page Size and[VF BAR Size](#bookmark353)fields. Hardware must Clear any bits that

change from read-write to read-only, so that subsequent reads return zero. Software must clear the [VF MSE](#bookmark217) bit in the

SR-IOV Control register before writing the[VF BAR Size](#bookmark354)field. After writing the [VF BAR Size](#bookmark355)field, the contents of the

corresponding VF BAR are undefined. To ensure that it contains a valid address after resizing the VF BAR, system software must reprogram the VF BAR, and Set the[VF MSE](#bookmark217) bit (unless the resource is not allocated).

The [VF Resizable BAR Extended Capability](#bookmark45)and Control registers are permitted to indicate the ability to operate at 4 GB or greater only if the associated VF BAR is a 64-bit BAR.

It is strongly recommended that a Function not advertise any supported[VF BAR size](#bookmark356)values in this capability that are larger than the space it would effectively utilize if allocated.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**IMPLEMENTATION NOTE**

Using the Capability During Resource Allocation

System software uses this capability in a similar way to the Resizable BAR capability. System software must first configure the System Page Size register (see[Section 9.2.1.1.1)](#bookmark30). Potential usable memory aperture sizes are

reported by the PF, and read, from the [VF Resizable BAR Extended Capability](#bookmark45)and Control registers. It is intended that the software allocate the largest of the reported sizes that it can, since allocating less address space than the largest reported size can result in lower performance. Software then writes the size to the [VF Resizable BAR](#bookmark357)

[Control register](#bookmark358)for the appropriate VF BAR for the Function. Following this, the base address is written to the VF BAR.

For interoperability reasons, it is possible that hardware will set the default size of the VF BAR to a low size; a size lower than the largest reported in the[VF Resizable BAR Capability Register.](#bookmark359) Software that does not use this

capability to size resources will likely result in sub-optimal resource allocation, where the resources are smaller than desirable, or not allocatable because there is no room for them. It is recommended that system software responsible for allocating resources in a resource constrained environment distribute the limited address space to all memory-mapped hardware, including system RAM, appropriately.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAFICAYAAABtIL9TAAAASklEQVRoge3MoREAIAwEwYQmsZSGpcpQAROL2Le38xHNMtapZ90zR/cAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAfAguXYUFjlrvhtEAAAAASUVORK5CYII=)

The [VF Resizable BAR Extended Capability](#bookmark45)structure defines a PCI Express Extended Capability which is located in PCI Express Extended Configuration Space, that is, above the first 256 bytes, and is shown below in [Figure 9-21 .](#bookmark360) This

structure allows PFs with this capability to be identified and controlled. A Capability register and a Control register are implemented for each VF BAR that is resizable. Since a maximum of 6 VF BARs may be implemented by any PF, the VF Resizable BAR Capability structure can range from 12 bytes long (for a single VF BAR) to 52 bytes long (for all 6 VF BARs).

|  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- |
| 31  30  29  28  27  26  25  24 | 23  22  21  20  19  18  17  16 | | 15  14  13  12  11  10  9  8 | | 7  6  5  4  3  2  1  0 |
| PCI Express Extended Capability Header | | | | | |
| VF Resizable BAR Capability Register (0) | | | | | |
|  | | VF Resizable BAR Control Register (0) | |  | |
| VF Resizable BAR Capability Register (1) | | | | | |
|  | | VF Resizable BAR Control Register (1) | |  | |
| ... | | | | | |

Figure 9-21 [VF Resizable BAR Extended Capability](#bookmark45)

Byte Offset +000h

+004h +008h +00Ch +010h +014h

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAJEAAAAjCAYAAACU2uhdAAAB7UlEQVR4nO3bsaraYBjG8ef74imV0kUcrF0dzu7uJF2Ku3dgcRQvIIObUKFuDl6Bo+AduIq4dhCRYFAJIip6ol+ns7/py0ECz2/+Rz7IA4lD4Pv+T9/3WyD6TxbAdwCvzz4IpZd99gEo/TgiUuOISI0jIjWOiNQ4IlLjiEgtAwCLxeIbgOqTz0LpswcwM8Vi8Vev1/udz+cfzz4RpctsNru22+0fmSAIvlarVZPL5b48+1CULp7nAUCe70SkxhGRGkdEahwRqXFEpMYRkRpHRGocEalxRKTGEZEaR0RqHBGpcUSkxhGRGkdEahwRqSUeUbfbRb/fF/etVguj0Ujc1+t1TKdTcV+pVLBcLsV9qVQStwBQLpfF7Xw+R61WE/eTyQSNRkPcD4dD+L4v7judDgaDgbhvNpsYj8fi/l0m6QWHwwHX61Xc7/d7nE4ncR+GYaLfD4IAcRyL+9VqJW4BYL1ei9vb7YbNZiPuL5cLttutuD8ej4iiSNxHUYRsNivud7sdzuezuH/HxxmpcUSkxhGRmgWAOI4TvxsRxXH8AgAGwOdCofAHgOiTocfjkQHgrLV3aW+MccYYaf9ijLkbY0TfwSXt7/f7J8/zbpI2ae+cs845z1r7Juw955yx1or+GSTtP/peOef+hmHY+Qdoz7OfaaGMiQAAAABJRU5ErkJggg==)

31 20

Next Capability Offset

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**9.3.7.5.1 VFResizable BAR Extended Capability Header (Offset 00h)**

|  |
| --- |
| 19 16 |
|  |

|  |
| --- |
| 15 0 |
|  |

PCI Express Extended Capability ID Capability Version

Figure 9-22 VFResizable BAR Extended Capability Header Table 9-24 VFResizable BAR Extended Capability Header

|  |  |  |
| --- | --- | --- |
| Bit Location | Register Description | Attributes |
| 15:0 | **PCI Express Extended Capability ID** - This field is a PCI-SIG defined ID number that indicates the nature and format of the extended capability.  PCI Express Extended Capability ID for the [VF Resizable BAR Extended Capability](#bookmark45)is 0024h. | RO |
| 19:16 | **Capability Version** - This field is a PCI-SIG defined version number that indicates the version of the capability structure present.  Must be 1h for this version of the specification. | RO |
| 31:20 | **Next Capability Offset** - This field contains the offset to the next PCI Express Extended Capability structure or 000h if no other items exist in the linked list of capabilities | RO |

**9.3.7.5.2 VFResizable BAR Capability Register (Offset 04h)**

The [VF Resizable BAR Capability Register](#bookmark362)field descriptions are the same as the definitions in the Resizable BAR

Capability Register in Table 7-116. Where those descriptions say ‘BAR’, this register’s description is for ‘VF BAR’ Where those descriptions say ‘Function’, this register’s description is for ‘PF’ Otherwise the field descriptions, the number of

bits, their positions, and their attributes are the same. ConsequentlyFigure 7-145similarly allocates the register fields in this register.

**9.3.7.5.3 VFResizable BAR Control Register (Offset 08h)**

The [VF Resizable BAR Control register](#bookmark357) bits 31:16 follow the same definitions as the Resizable BAR Control register in Table 7-117 .

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- |
| 31 16 | 15 14 | 13 8 | 7 5 | 4 3 | 2 0 |
|  |  | VF BAR Size |  |  |  |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAABAAAAAJCAYAAAA7KqwyAAAAhUlEQVQokaXOIQoCURCA4W/XKAYPYDJZBRFMey/LK2LZZBCDh9kTiCA2i2vXoIIWsYwXeE77BuZnOpK1yk3jImNKDNHLOf4FWjxzA39PR7JVuWq0OYESA3RzPyhxwj03UEhGqMMbnLEMr3BFCtd4Yx5eFJI+ZrE44oFp+IAXJuE9PhiHd1+xpxs2qnejUwAAAABJRU5ErkJggg==)VF BAR Index

RsvdP

Number of VF Resizable BARs

RsvdP

identical to the Resizable BAR Control Register

Figure 9-23 VFResizable BAR Control Register Table 9-25 VFResizable BAR Control Register

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | Register Description | | Attributes |
| 2:0 | **VF BAR Index**- This encoded value points to the beginning of this particular VF BAR located in the [SR-IOV](#bookmark191) [Extended Capability.](#bookmark191)  **0** VF BAR located at offset 24h  **1** VF BAR located at offset 28h  **2** VF BAR located at offset 2Ch  **3** VF BAR located at offset 30h  **4** VF BAR located at offset 34h  **5** VF BAR located at offset 38h  **others** All other encodings are reserved.  For a 64-bit Base Address register, the [VF BAR Index](#bookmark363) indicates the lower DWORD.  This value indicates which VF BAR supports a negotiable size. | | RO |
| 7:5 | **Number of VFResizable BARs** - Indicates the total number of resizable VF BARs in the capability structure for the Function. See [Figure 9-21 .](#bookmark361)  The value of this field must be in the range of 01h to 06h. The field is valid in [VF Resizable BAR Control](#bookmark358) [register](#bookmark358) (0) (at offset 08h),and is RsvdPfor all others. | | RO/RsvdP |
| 13:8 | **VF BAR Size** - This is an encoded value. | | RW |
| **0**  **1**  **2**  **3**    **43** | 1 MB (220 bytes)  2 MB (221 bytes)  4 MB (222 bytes)  8 MB (223 bytes)  …  8 EB (263 bytes) |
| The default value of this field is equal to the default size of the address space that the VF BAR resource is requesting via the VF BAR’s read-only bits.  Software must only write values that correspond to those indicated as supported in the VF Resizable  BAR Capability and Control registers. Writing an unsupported value will produce undefined results.  When this register field is programmed, the value is immediately reflected in the size of the resource, as encoded in the number of read-only bits in the VF BAR. | |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAIklEQVRYhe3BAQ0AAAgDoNvUeqbUHG5ApWcDAAAAAADw2QHZqwIKU+HaKwAAAABJRU5ErkJggg==)

Page 1154

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |
| --- | --- | --- |
| Bit Location | Register Description | Attributes |
| 31:16 | These bits are **identical to theResizable BAR Control Register** bits [31:16] defined in Figure 7-146 . Where those descriptions say ‘BAR’, this register’s description is for ‘VF BAR’ Where those descriptions say ‘Function’, this register’s description is for ‘PF’ | See Figure 7-146 |

[**9.3.7.6**](9.3.7.6) **Access Control Services (ACS) Extended Capability Changes**

ACS is an optional extended capability. If an SR-IOV Capable Device other than one in a Root Complex implements internal peer-to-peer transactions, ACS is required with additional requirements described below.

PF and VF functionality is defined inSection 7.7.8.2except where noted in [Table 9-26 .](#bookmark364)

All Functions in SR-IOV Capable Devices (Devices that implement at least one PF) other than RCiEPsthat support peer-to-peer transactions within the Device shall implement ACS Egress Control.

Implementation of ACS in RCiEPsis permitted but not required. It is explicitly permitted that, within a single Root Complex, some RCiEPsimplement ACS and some do not. It is strongly recommended that Root Complex

implementations ensure that all accesses originating from RCiEPs (PFs and VFs) without ACS capability are first

subjected to processing by the Translation Agent (TA) in the Root Complex before further decoding and processing. The details of such Root Complex handling are outside the scope of this specification.

Table 9-26 ACS Capability Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 5 | **ACSP2P Egress Control**- Required to be 1b for Functions that are notRCiEPsif peer-to-peer transactions within the Device are supported. | Base | Base |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAJnCAYAAACqMhItAAAAZklEQVR4nO3MoREAIAwEwYQmsZSGpcpQAROD3Le38xHNMtapZ90zR/cAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAL/ABYtQB8yt68P2AAAAAElFTkSuQmCC)

**IMPLEMENTATION NOTE**

Access Control Services in Systems that Support Direct Assignment of Functions

General-purpose VIs typically have separate address spaces for each SI and for the VI itself. If such a VI also

supports direct assignment of a Function to an SI, Untranslated Memory Request transactions issued by directly assigned Functions are under the complete control of software operating within the associated SI and typically

reference the address space associated with that SI. In contrast, Memory Request transactions issued by the Host (MMIO requests) and by Functions that are not directly assigned are under the control of the VI and typically

reference one or more system address spaces (e.g., the PCIe physical address space, the address space associated with the VI, or the address space associated with some designated SI). General-purpose VIs are not expected to

establish a dependency between these various address spaces. Consequently, these address spaces may freely overlap which could lead to unintended routing of TLPs by Switches. For example, Upstream Memory Request

TLPs originated by a directly assigned Function and intended for main memory could instead be routed to a Downstream Port if the address in the Request falls within the MMIO address region associated with that

Downstream Port. Such unintended routing poses a threat to SI and/or VI stability and integrity.

To guard against this concern, vendors are strongly recommended to implement ACS in platforms that support general purpose VIs with direct assignment of Functions. Such support should include:

• In Switches or Root Complexes located below theTA, the level of ACS support should follow the

guidelines established in this document for Downstream Switch Ports that implement an ACS Extended Capability structure.

Note: Components located above theTA only see Translated Memory Requests, consequently this concern does not apply to those components.

• In SR-IOV devices that are capable of peer-to-peer transactions, ACS support is required.

• In Multi-Function Devicesthat are capable of peer-to-peer transactions, vendors are strongly recommended to implement ACS with ACS P2P Egress Control.

Additionally, platform vendors should test for the presence of ACS and enable it in Root Complexes and Switches on the path from theTA to a Function prior to directly assigning that Function. If the Function is peer-to-peer

capable, ACS should be enabled in the Function as well.

[**9.3.7.7**](9.3.7.7)**Alternative Routing ID Interpretation Extended Capability (ARI) Changes**

ARI is not applicable toRCiEPs; all other SR-IOV Capable Devices (Devices that include at least one PF) shall implement the ARI Extended Capabilityin each Function. PF and VF functionality is defined inSection 7.8.7.2except where noted in [Table 9-27 .](#bookmark365)

Table 9-27 ARI Capability RegisterChanges

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 0 | **MFVC Function Groups Capability (M)** - Additional requirements described below. | Base | Base |
| 1 | **ACS Function Groups Capability (A)** - Additional requirements described below. | Base | Base |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 15:8 | **Next Function Number** - VFs are located using First VF Offset (see[Section 9.3.3.10)](#bookmark254) and VF Stride (see [Section 9.3.3.11)](#bookmark260). | Base | Undefined |

Any Device that implements the MFVC Capability with the optional Function Arbitration Table and consumes more than one Bus Numbershall implement the MVFC Function Groups Enable (M) capability bit as 1b in Function 0.

Any Device that implements the ACS Capability with the optional Egress Control Vector and consumes more than one Bus Numbershall implement the ACS Function Groups Enable (A) capability bit as 1b in Function 0.

[**9.3.7.8**](9.3.7.8) **Address Translation Services Extended Capability Changes (ATS)**

ATS support is optional in SR-IOV devices. If a VF implements an ATS capability, its associated PF must implement an ATS capability. The ATS Capabilities in VFs and their associated PFs may be enabled independently.

PF and VF functionality is defined in Section 10.5.1except where noted in [Table 9-28](#bookmark366)and [Table 9-29 .](#bookmark367) Attributes shown as ATS are the same as specified in Address Translation Services ([Chapter 10)](#bookmark368).

Table 9-28 ATS Capability Register

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From ATS | PF  Attributes | VF  Attributes |
| 4:0 | **Invalidate Queue Depth**- Hardwired to 0 for VFs. Depth of shared PF input queue. | ATS | RO |

Table 9-29 ATS Control Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From ATS | PF  Attributes | VF  Attributes |
| 4:0 | **Smallest Translation Unit (STU)** - Hardwired to 0 for VFs. PF value applies to all VFs. | ATS | RO |

ATS behavior is specified in Address Translation Services ([Chapter 10](#bookmark369)). ATS requests target the Function being

invalidated. ATS responses will have a Routing ID field matching the targeted Function. Each Function is required to

implement sufficient queuing to ensure it can hold the maximum number of outstanding Invalidation Requests from a TA (using either input or output queuing).

However, all VFs associated with a PF share a single input queue in the PF. To implement Invalidation flow control, theTA must ensure that the total number of outstanding Invalidate Requests to the shared PF queue (targeted to the PF and its associated VFs) does not exceed the value in the PF Invalidate Queue Depth field.

[**9.3.7.9**](9.3.7.9) **MR-IOV Changes**

PFs and VFs shall not contain an MR-IOV Capability.

If present, the MR-IOV Capability is contained in other Functions of the Component. See [MR-IOV] for details.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**9.3.7.10**](9.3.7.10) **Multicast Changes**

Multicast support is optional in SR-IOV devices. If a VF implements a Multicast capability, its associated PF must

implement a Multicast capability. PF and VF functionality is defined inSection 7.9.11except where noted in [Table 9-30 ,](#bookmark370) [Table 9-31 ,](#bookmark371) and [Table 9-32 .](#bookmark372)

Table 9-30 Multicast Capability Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 5:0 | **MC\_Max\_Group** - PF value applies to all VFs. | Base | RsvdP |
| 13:8 | **MC\_Window\_Size\_Requested**- PF value applies to all VFs. | Base | RsvdP |
| 15 | **MC\_ECRC\_Regeneration\_Supported**- Not applicable for Endpoints. | Base | RsvdP |

Table 9-31 Multicast Control Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit | PF and VF Register Differences From Base | PF | VF |
| Location |  | Attributes | Attributes |
| 5:0 | **MC\_Num\_Group** - PF value applies to all VFs. | Base | RsvdP |

Table 9-32 Multicast Base Address Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF Attributes | VF Attributes |
| 5:0 | **MC\_Index\_Position**- PF value applies to all VFs. | Base | RsvdP |
| 63:12 | **MC\_Base\_Address** - PF value applies to all VFs. | Base | RsvdP |

[**9.3.7.11**](9.3.7.11) **Page Request Interface Changes (PRI)**

Page Request Interface functionality is defined in Address Translation Services ([Chapter 10)](#bookmark373).

A PF is permitted to support the Page Request Interface. The Page Request Interface of the PF is also used by the

associated VFs. The PF is permitted to implement the Page Request Interface capability and the VFs shall not implement it.

Even though the Page Request Interface is shared between PFs and VFs, it sends the requesting Function’s ID (PF or VF) in the Requester ID field of the Page Request Message and expects the requesting Function’s ID in the Destination Device ID field of the resulting PRG Response Message.

[**9.3.7.12**](9.3.7.12) **Dynamic Power Allocation Changes (DPA)**

VFs may not implement the Dynamic Power Allocation Capability.

Power allocation for VFs is managed using the PF’s DPA Capability, if implemented.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**9.3.7.13**](9.3.7.13) **TPH Requester Changes (TPH)**

The TPH Requester Extended Capability may be present in VFs, PFs, both or neither. If any VF associated with a given PF contains the capability, all VFs associated with that PF must also support TPH.

All TPH Requester Extended Capabilityfields in PFs and VFs operate as defined in Section 7.9.13 .

For fields in the TPH Requester Capability Register (offset 04h), a PF and its associated VFs may have different values, but all VFs associated with the same PF must have the same values in all fields.

[**9.3.7.14**](9.3.7.14) **PASID Changes**

An Endpoint device is permitted to support PASID. The PASID configuration of the single function (Function or PF)

representing the device is also used by all VFs in the device. A PF is permitted to implement the PASID capability, but VFs must not implement it.

Even though the PASID configuration is shared between Functions, PFs and VFs, the device sends the requesting Function’s ID (Function, PF or VF) in the Requester ID field of the TLP containing PASID.

[**9.3.7.15**](9.3.7.15) **Readiness Time Reporting Extended Capability Changes**

The Readiness Time Reporting Extended Capability may be present in VFs, PFs, both or neither. If any VF associated with a given PF contains the capability, all VFs associated with that same PF must also support Readiness Time Reporting.

The Reset Time field contains the time required following setting of VF Enable (see [Section 9.6.1)](#bookmark374). The DL\_Up Time field is RsvdP.

All VFs associated with the same PFshall report the same time values.

**9.4 SR-IOV Error Handling**

SR-IOV devices utilize the Error Reporting mechanism defined in Section 6.2 . Errors that are defined as non-function-specific are only logged in the PF.

Section 6.2defines two error reporting paradigms: the baseline Capability and the Advanced Error Reporting Capability. The baseline error reporting capabilities are required of all PCI Express devices and define the minimum error reporting requirements. The Advanced Error Reporting Capability is optional and is defined for more robust error reporting and is implemented with a specific PCI Express Capability structure.

**9.4.1 Baseline Error Reporting**

All SR-IOV devices must support the baseline error reporting capabilities, with some modifications to account for the goal of reduced cost and complexity of implementation.

These control bits are only meaningful in the PF. VFs shall use the error reporting control bits in the associated PF when making decisions on generating error Messages.

These following fields areRsvdPin VFs:

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

• Command register (see[Section 9.3.4.1.3)](#bookmark293) 。SERR# Enable

。 Parity Error Response

• Device Control register (see[Section 9.3.5.4)](#bookmark299) 。 Correctable Reporting Enable

。 Non-Fatal Reporting Enable 。 Fatal Reporting Enable

。 Unsupported Request (UR) Reporting Enable

Each VF shall implement a mechanism to provide error status independent of any other Function. This is necessary to provide SI isolation for errors that are Function-specific.

The following baseline error reporting status bits must be implemented in each VF:

• Status register (see[Section 9.3.4.1.4)](#bookmark295) 。 Master Data Parity Error

。 Signaled Target Abort 。 Received Target Abort 。 Received Master Abort 。 Signaled System Error 。 Detected Parity Error

• Device Status register (see[Section 9.3.5.5)](#bookmark306) 。 Correctable Error Detected

。 Non-Fatal Error Detected 。 Fatal Error Detected

。 Unsupported Request Detected

Each VF shall use its own Routing ID when signaling errors.

**9.4.2 Advanced Error Reporting**

The Advanced Error Reporting Capability is optional. If AER is not implemented in the PF, it must not be implemented in the associated VFs. If AER is implemented in the PF, it is optional in the VFs.

Section 6.2.4classifies errors as Function-specific and non-Function-specific. Each VF that implements the Advanced Error Reporting Capability must maintain its own error reporting status for function-specific errors.

[**9.4.2.1**](9.4.2.1) **VF Header Log**

A Device that implements AER in the VFs may share Header Log Registers among VFs associated with a single PF. See [Section 9.4.2.10f](#bookmark376)or details.

Header logging Registers for the PF are independent of its associated VFs and must be implemented with dedicated storage space.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

When implementing a reduced set of Header Log Registers, a Function may not have room to log a header associated with an error. In this case, the Functionshall update the Uncorrectable Error Status Register and Advanced Error

Capabilities and Control register as required by Section 6.2.4; however, when the Header Log Register is read, it shall return all 1s to indicate an overflow condition and no header was logged.

[**9.4.2.2**](9.4.2.2) **Advanced Error Reporting Capability Changes**

Figure 7-122describes the AER extended capability structure.

[**9.4.2.3**](9.4.2.3) **Advanced Error Reporting Extended Capability Header Changes (Offset 00h)**

This register contains the PCI Express Extended Capability ID, Capability Version, and Next Capability Offset. These fields are unchanged and are described in Section 7.8.4.1 .

[**9.4.2.4**](9.4.2.4) **Uncorrectable Error Status Register Changes (Offset 04h)**

The Uncorrectable Error Status register indicates error detection status of individual errors. Errors that are defined as non-Function-specific are logged in the PF. Only Function-specific errors are logged in the VFs.

PF and VF functionality is defined inSection 7.8.4.2except where noted in [Table 9-33 .](#bookmark377)

Table 9-33 Uncorrectable Error Status Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit  Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 4 | Data Link Protocol Error Status | Base | 0b |
| 5 | Surprise Down Error Status | Base | 0b |
| 13 | Flow Control Protocol Error Status | Base | 0b |
| 17 | Receiver Overflow Status | Base | 0b |
| 18 | Malformed TLP Status | Base | 0b |
| 19 | ECRC Error Status | Base | 0b |

[**9.4.2.5**](9.4.2.5) **Uncorrectable Error Mask Register Changes (Offset 08h)**

PF and VF functionality is defined inSection 7.8.4.3except where noted in [Table 9-34 .](#bookmark378) For VF fields markedRsvdP, the PF setting applies to the VF. For VF fields marked 0b, the error is not applicable to a VF.

Table 9-34 Uncorrectable Error Mask Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit  Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 4 | Data Link Protocol Error Mask | Base | 0b |
| 5 | Surprise Down Error Mask | Base | 0b |
| 12 | Poisoned TLP Received Mask | Base | RsvdP |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |
| --- | --- | --- | --- |
| Bit  Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 13 | Flow Control Protocol Error Mask | Base | 0b |
| 14 | Completion Timeout Mask | Base | RsvdP |
| 15 | Completer Abort Mask | Base | RsvdP |
| 16 | Unexpected Completion Mask | Base | RsvdP |
| 17 | Receiver Overflow Mask | Base | 0b |
| 18 | Malformed TLP Mask | Base | 0b |
| 19 | ECRC Error Mask | Base | 0b |
| 20 | Unsupported Request Error Mask | Base | RsvdP |
| 21 | ACS Violation Mask | Base | RsvdP |

[**9.4.2.6**](9.4.2.6) **Uncorrectable Error Severity Register Changes (Offset 0Ch)**

PF and VF functionality is defined inSection 7.8.4.4except where noted in [Table 9-35 .](#bookmark379) For VF fields markedRsvdP, the PF setting applies to the VF. For VF fields marked 0b, the error is not applicable to a VF.

Table 9-35 Uncorrectable Error Severity Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit  Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 4 | Data Link Protocol Error Severity | Base | 0b |
| 5 | Surprise Down Error Severity | Base | 0b |
| 12 | Poisoned TLP Received Severity | Base | RsvdP |
| 13 | Flow Control Protocol Error Severity | Base | 0b |
| 14 | Completion Timeout Error Severity | Base | RsvdP |
| 15 | Completer Abort Error Severity | Base | RsvdP |
| 16 | Unexpected Completion Error Severity | Base | RsvdP |
| 17 | Receiver Overflow Severity | Base | 0b |
| 18 | Malformed TLP Severity | Base | 0b |
| 19 | ECRC Error Severity | Base | 0b |
| 20 | Unsupported Request Error Severity | Base | RsvdP |
| 21 | ACS Violation Severity | Base | RsvdP |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**9.4.2.7**](9.4.2.7) **Correctable Error Status Register Changes (Offset 10h)**

The Correctable Error Status register indicates error detection status of individual correctable errors. Errors that are defined as non-Function-specific are logged in the PF. Only Function-specific errors are logged in the VFs.

PF and VF functionality is defined inSection 7.8.4.5except where noted in [Table 9-36 .](#bookmark380)

Table 9-36 Correctable Error Status Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit  Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 0 | Receiver Error Status | Base | 0b |
| 6 | Bad TLP Status | Base | 0b |
| 7 | Bad DLLP Status | Base | 0b |
| 8 | REPLAY\_NUM Rollover Status | Base | 0b |
| 12 | Replay Timer Timeout Status | Base | 0b |
| 15 | Header Log Overflow Status  If the VF implements Header Log sharing (see[Section 9.4.2.1](#bookmark375)), this bit is hardwired to 0b. | Base | Base / 0b |

[**9.4.2.8**](9.4.2.8) **Correctable Error Mask Register Changes (Offset 14h)**

PF and VF functionality is defined inSection 7.8.4.6except where noted in [Table 9-37 .](#bookmark381) For VF fields markedRsvdP, the PF setting applies to the VF.

Table 9-37 Correctable Error Mask Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit  Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 0 | Receiver Error Mask | Base | RsvdP |
| 6 | Bad TLP Mask | Base | RsvdP |
| 7 | Bad DLLP Mask | Base | RsvdP |
| 8 | REPLAY\_NUM Rollover Mask | Base | RsvdP |
| 12 | Replay Timer Timeout Mask | Base | RsvdP |
| 13 | Advisory Non-Fatal Error Mask | Base | RsvdP |
| 15 | Header Log Overflow Mask- If the VF implements Header Log sharing (see[Section 9.4.2.1](#bookmark375)), this bit is RsvdP. | Base | Base / RsvdP |

[**9.4.2.9**](9.4.2.9) **Advanced Error Capabilities and Control Register Changes (Offset 18h)**

PF and VF functionality is defined inSection 7.8.4.7except where noted in [Table 9-38 .](#bookmark382) For VF fields markedRsvdP, the PF setting applies to the VF.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Table 9-38 Advanced Error Capabilities and Control Register Changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit  Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 6 | ECRC Generation Enable | Base | RsvdP |
| 8 | ECRC Check Enable | Base | RsvdP |
| 9 | Multiple Header Recording Capable- If the VF implements Header Log sharing (see[Section](#bookmark375)  [9.4.2.1](#bookmark375)), this bit is hardwired to 0b. | Base | Base / 0b |
| 10 | Multiple Header Recording Enable- If the VF implements Header Log sharing (see[Section](#bookmark375)  [9.4.2.1](#bookmark375)), this bit is RsvdP. | Base | Base / RsvdP |
| 11 | TLP Prefix Log Present- If the VF implements Header Log Sharing (see[Section 9.4.2.1](#bookmark375)), this bit is 0b when the Header Log contains all 1s due to an overflow condition. | Base | Base (see description) |

[**9.4.2.10**](9.4.2.10) **Header Log Register Changes (Offset 1Ch)**

The Header Log register captures the header for the TLP corresponding to a detected error. See also [Section 9.4.2.1 .](#bookmark375)

A Device that implements AER in the VFs may share Header Log Registers among VFs associated with a single PF. A shared header log must have storage for at least one header.

Header logging Registers for the PF is independent of its associated VFs and must be implemented with dedicated storage space.

When an error is detected in a VF, the error shall be logged as specified inSection 6.2. If a shared set of Header Log

Registers is implemented, a VF may not have room to log a header. In this case, the VF shall update its Uncorrectable

Error Status Register and Advanced Error Capabilities and Control register as required in Section 6.2 ; however, when that VF’s Header Log Register is read, it shall return all 1s to indicate an overflow condition.

The VF’s header log entry shall be locked and remain valid while that VF’s First Error Pointer is valid. As defined in

Section 6.2, the First Error Pointer register is valid when the corresponding bit of the Uncorrectable Error Status register is Set. While the header log entry is locked, additional errors shall not overwrite the locked entry for this or any other VF. When a header entry is unlocked, it shall be available to record a new error for any VF sharing the header logs.

PF and VF functionality is defined inSection 7.8.4.8 , except where noted in [Table 9-39 .](#bookmark383)

Table 9-39 Header Log Register changes

|  |  |  |  |
| --- | --- | --- | --- |
| Bit | PF and VF Register Differences From Base | PF | VF |
| Location |  | Attributes | Attributes |
| 127:0 | Header of TLP associated with error (additional requirements are described above) | Base | Base |

[**9.4.2.11**](9.4.2.11) **Root Error Command Register Changes (Offset 2Ch)**

This register is not applicable to Devices.

[**9.4.2.12**](9.4.2.12) **Root Error Status Register Changes (Offset 30h)**

This register is not applicable to Devices.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**9.4.2.13**](9.4.2.13) **Error Source Identification Register Changes (Offset 34h)**

This register is not applicable to Devices.

[**9.4.2.14**](9.4.2.14) **TLP Prefix Log Register Changes (Offset 38h)**

For PFs and VFs, if End-End TLP Prefixes are supported, this register is implemented.

For a VF, if a shared set of Header Log registers is implemented ([Section 9.4.2.1](#bookmark375)), this register’s contents are undefined when the Header Log contains all 1s due to an overflow condition.

**9.5 SR-IOV Interrupts**

SR-IOV capable devices utilize the same Interrupt signaling mechanisms defined in Section 6.1 .

**9.5.1 Interrupt Mechanisms**

There are three methods of signaling interrupts:

• INTx • MSI

• MSI-X

PFs may implement INTx. VFs must not implement INTx. PFs and VFs shall implement MSI or MSI-X or both if interrupt resources are requested. Each PF and VF must implement its own unique interrupt capabilities.

[**9.5.1.1**](9.5.1.1) **MSI Interrupts**

MSI Capability and PF and VF functionality are defined inSection 7.7except where noted in [Table 9-40 .](#bookmark384)

Table 9-40 MSI Capability: Message Control

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF Attributes | VF Attributes |
| 8 | Per-Vector Masking Capable | 1b | 1b |

[**9.5.1.2**](9.5.1.2) **MSI-X Interrupts**

The MSI-X Capability is defined in Section 7.7and depicted in [Figure 9-24 .](#bookmark385)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

31 16 15 8 7 3 2 0

|  |  |  |  |
| --- | --- | --- | --- |
| Message Control | Next Pointer | Capability ID | |
| Table Offset | | | Table BIR |
| PBA Offset | | | PBA BIR |

Figure 9-24 MSI-X Capability

CP + 00h CP + 04h CP + 08h

A-0383

PF and VF functionality is identical to that of a Function as defined in Section 7.7.2 .

Note that for VFs the Table Offset and PBA Offset values are relative to the VF’s Memory address space.

[**9.5.1.3**](9.5.1.3) **Address Range Isolation**

If a BAR that maps address space for the MSI-X Tableor MSI-X PBAalso maps other usable address space that is not

associated with MSI-X structures, locations (e.g., for CSRs) used in the other address space must not share any naturally aligned System Page Size address range with one where either MSI-X structure resides. The MSI-X Tableand MSI-X PBA are permitted to co-reside within a naturally aligned System Page Size address range, though they must not overlap with each other.

**9.6 SR-IOV Power Management**

This section defines the PCI Express SR-IOV power management capabilities and protocols.

The Power Management Capability is required for PFs as described inChapter 5 . For VFs, the Power Management Capability is optional.

**9.6.1 VF Device Power Management States**

If a VF does not implement the Power Management Capability, then the VF behaves as if it had been programmed into the equivalent power state of its associated PF.

If a VF implements the Power Management Capability, the functionality is defined inSection 7.5except as noted in [Section 9.6.4 .](#bookmark386)

If a VF implements the Power Management Capability, the Device behavior is undefined if the PF is placed in a lower

power state than the VF. Software should avoid this situation by placing all VFs in lower power state before lowering their associated PF’s power state.

A VF in the D0state is in the D0activestate when the VF has completed its internal initialization and either the VF's Bus Master Enable bit is Set (see[Section 9.3.4.1.3](#bookmark293)) or the VF MSE bit in the SR-IOV Control (see [Section 9.3.3.3)](#bookmark212) Extended Capability is Set. The VF’s internal initialization must have completed whenany of the following conditions have

occurred:

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

• The VF has responded successfully (without returning CRS) to a Configuration Request.

• After issuing an FLR to the VF, one of the following is true: 。 At least 1.0 s has passed since the FLR was issued.

。 The VF supports Function Readiness Status and, after the FLR was issued, an FRS Message from the VF with Reason Code FLR Completed has been received.

。 At least FLR time has passed since the FLR was issued. FLR Time is either (1) the FLR Time value in the Readiness Time Reporting capability associated with the VF or (2) a value determined by system

software / firmware161 .

• After Setting VF Enable in a PF, at least one of the following is true: 。 At least 1.0 s has passed since VF Enable was Set.

。 The PF supports Function Readiness Status and, after VF Enable was Set, an FRS Message from the PF with Reason Code VF Enabled has been received.

• After transitioning a VF from D3HottoD0, at least one of the following is true:

。 At least 10 ms has passed since the request to enterD0was issued.

。 The VF supports Function Readiness Status and, after the request to enterD0was issued, an FRS Message from the VF with Reason Code D3HottoD0Transition Completed has been received.

。 At leastD3Hot to D0 Time has passed since the request to enterD0was issued.D3Hot to D0 Timeis either (1) the D3Hot to D0 Timein the Readiness Time Reporting capability associated with the VF or

(2) a value determined by system software / firmware162 .

**9.6.2 PF Device Power Management States**

The PF’s power management state (D-state) has global impact on its associated VFs. If a VF does not implement the Power Management Capability, then it behaves as if it is in an equivalent power state of its associated PF.

If a VF implements the Power Management Capability, the Device behavior is undefined if the PF is placed in a lower

power state than the VF. Software should avoid this situation by placing all VFs in lower power state before lowering their associated PF’s power state.

When the PF is placed into the D3Hotstate:

• If the No\_Soft\_Reset bit is Clear then the PF performs an internal reset on the D3HottoD0transition and all its configuration state returns to the default values.

Note: Resetting the PF resets VF Enable which means that VFs no longer exist and any VF specific context is lost after the D3HottoD0transition is complete.

• If the No\_Soft\_Reset bit is Set then the internal reset does not occur. The SR-IOV extended capability retains state, and associated VFs remain enabled.

When the PF is placed into the D3Coldstate VFs no longer exist, any VF specific context is lost and PME events can only be initiated by the PF.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAKElEQVRYhe3OAQkAQAgEMO1f6pr9pxBBtgSrAgAAGNZJ3nYCAAC47QOcDgNBkGYMzwAAAABJRU5ErkJggg==)

161. For example, ACPI tables.

162. For example, ACPI tables.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**IMPLEMENTATION NOTE**

No\_Soft\_Reset Strongly Recommended

It is strongly recommended that the No\_Soft\_Reset bit be Set in all Functions of a Multi-Function Device. This recommendation applies to PFs.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACHCAYAAAAr6RiGAAAAN0lEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAD8AC7NbwQM8kSoswAAAABJRU5ErkJggg==)

**9.6.3 Link Power Management State**

VF Power State does not affect Link Power State.

Link Power State is controlled solely by the setting in the PFs regardless of the VF’s D-state.

**9.6.4 VF Power Management Capability**

The following tables list the requirements for PF and VFs Power Management Capability.

PF and VF functionality is defined inSection 7.5except where noted in [Table 9-41](#bookmark386)and [Table 9-42 .](#bookmark387)

Table 9-41 SR-IOV Power Management Control/Status (PMCSR)

|  |  |  |  |
| --- | --- | --- | --- |
| Bit  Location | PF and VF Register Differences From Base | PF  Attributes | VF  Attributes |
| 14:13 | Data\_Scale | Base | 00b |
| 12:9 | Data\_Select | Base | 0000b |
| 3 | **No\_Soft\_Reset** -If a VF implements the Power Management Capability, the VF's value of this field must be identical to the associated PF's value. | Base | Base |

Table 9-42 SR-IOV Power Management Data Register

|  |  |  |  |
| --- | --- | --- | --- |
| Bit Location | PF and VF Register Differences From Base | PF Attributes | VF Attributes |
| 7:0 | Data | Base | 00000000b |

**9.6.5 VF EmergencyPower Reduction State**

If the Emergency Power Reduction Supported field in a VF is non-zero, that VF enters and exits the Emergency Power

Reduction State at the same time as the associated PF. Software can use the Emergency Power Reduction Detected bit in the PF to emulate the corresponding bit in the VF.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**ATS Specification**

**10.1 ATS Architectural Overview**

**10.**

Most contemporary system architectures make provisions for translating addresses from DMA (bus mastering) I/O Functions. In many implementations, it has been common

practice to assume that the physical address space seen by the CPU and by an I/O

Function is equivalent. While in others, this is not the case. The address programmed into

an I/O Function is a “handle” that is processed by the Root Complex (RC). The result of this processing is often a

translation to a physical memory address within the central complex. Typically, the processing includes access rights checking to insure that the DMA Function is allowed to access the referenced memory location(s).

The purposes for having DMA address translation vary and include:

• Limiting the destructiveness of a “broken” or miss-programmed DMA I/O Function

• Providing for scatter/gather

• Ability to redirect message-signaled interrupts (e.g., MSI or MSI-X) to different address ranges without requiring coordination with the underlying I/O Function

• Address space conversion (32-bit I/O Function to larger system address space)

• Virtualization support

Irrespective of the motivation, the presence of DMA address translation in the host system has certain performance implications for DMA accesses.

Depending on the implementation, DMA access time can be significantly lengthened due to the time required to resolve the actual physical address. If an implementation requires access to a main-memory-resident translation table, the

access time can be significantly longer than the time for an untranslated access. Additionally, if each transaction

requires multiple memory accesses (e.g., for a table walk), then the memory transaction rate (i.e., overhead) associated with DMA can be high.

To mitigate these impacts, designs often include address translation caches in the entity that performs the address

translation. In a CPU, the address translation cache is most commonly referred to as a translation look-aside buffer

(TLB). For an I/OTA, the term address translation cache or ATC is used to differentiate it from the translation cache used by the CPU.

While there are some similarities between TLB and ATC, there are important differences. ATLB serves the needs of a CPU that is nominally running one thread at a time. The ATC, however, is generally processing requests from multiple I/O

Functions, each of which can be considered a separate thread. This difference makes sizing an ATC difficult depending upon cost models and expected technology reuse across a wide range of system configurations.

The mechanisms described in this specification allow an I/O Device to participate in the translation process and provide an ATC for its own memory accesses. The benefits of having an ATC within a Device include:

• Ability to alleviate TA resource pressure by distributing address translation caching responsibility (reduced probability of “thrashing” within theTA)

• Enable ATC Devices to have less performance dependency on a system’s ATC size

• Potential to ensure optimal access latency by sending pretranslated requests to central complex

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

This specification will provide the interoperability that allows PCIe Devices to be used in conjunction with a TA, but the TA and its Address Translation and Protection Table (ATPT) are treated as implementation-specific and are outside the scope of this specification. While it may be possible to implement ATS within other PCIe Components, this specification is confined to PCIe Devices and PCIe Root Complex Integrated Endpoints (RCiEPs).

[Figure 10-1](#bookmark388)illustrates an example platform with a TA and ATPT, along with a set of PCIe Devices and RC Integrated Endpoints with integrated ATC. ATA and an ATPT are implementation-specific and can be distinct or integrated

components within a given system design.

|  |
| --- |
| Translation Agent (TA) |

|  |
| --- |
| Root Port (RP) |

|  |
| --- |
| Root Port (RP) |

Switch

|  |
| --- |
| Memory |

|  |
| --- |
| Address Translation and Protection Table (ATPT) |

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
| Root Complex (RC)   |  |  | | --- | --- | | RC Integrated Endpoint |  | | ATC | |  | |

|  |  |
| --- | --- |
| PCIe Device |  |
| ATC |
|  |

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
|  |  | | PCIe Device   |  | | --- | | ATC | |
| PCIe Device | |  |
| ATC |
|  |

A-0588

Figure 10-1 Example Illustrating a Platform with TA, ATPT, and ATC Elements

**10.1.1 Address Translation Services (ATS) Overview**

The ATS chapter provides a new set of TLP and associated semantics. ATS uses a request-completion protocol between a Device163 and a Root Complex (RC) to provide translation services. In addition, a new AT field is defined within the

Memory Read and Memory Write TLP. The new AT field enables an RC to determine whether a given request has been translated or not via the ATS protocol.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAKElEQVRYhe3OAQkAQAgEMO1f6pr9pxBBtgSrAgAAGNZJ3nYCAAC47QOcDgNBkGYMzwAAAABJRU5ErkJggg==)

163. All references within this chapter to a Device apply equally to a PCIe Device or a Root Complex Integrated Endpoint. ATS does not delineate between these two types in terms of requirements,semantics, configuration, error handling, etc. From a software perspective, an ATS-capable Root Complex Integrated Endpoint must behave the same as an ATS-capable non-integrated Device.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[Figure 10-2](#bookmark389)illustrates the basic flow of an ATS Translation Request operation.

|  |
| --- |
| Translation Agent (TA) |

|  |
| --- |
| Memory |

|  |
| --- |
| Address Translation and Protection Table (ATPT) |

|  |
| --- |
| Root Complex (RC) |

|  |
| --- |
| Root Port (RP) |

ATS

ATS

Request

Completion

|  |  |
| --- | --- |
| PCIe Device |  |
| ATC |
|  |

A-0589

Figure 10-2 Example ATS Translation Request/Completion Exchange

In this example, a Function-specific work request is received by a single-Function PCIe Device. The Function determines through an implementation-specific method that caching a translation within its ATC would be beneficial. There are a number of considerations a Function or software can use in making such a determination; for example:

• Memory address ranges that will be frequently accessed over an extended period of time or whose associated buffer content is subject to a significant update rate

• Memory address ranges, such as work and completion queue structures, data buffers for low-latency

communications, graphics frame buffers, host memory that is used to cache Function-specific content, and so forth

Given the variability in designs and access patterns, there is no single criteria that can be applied.

The Function generates an ATS Translation Request which is sent upstream through the PCIe hierarchy to the RC which then forwards it to theTA. An ATS Translation Request uses the same routing and ordering rules as defined in this

specification. Further, multiple ATS Translation Requests can be outstanding at any given time; i.e., one may pipeline multiple requests on one or more TC. Each TC represents a unique ordering domain and defines the domain that must be used by the associated ATS Translation Completion.

Upon receipt of an ATS Translation Request, theTA performs the following basic steps:

1. Validates that the Function has been configured to issue ATS Translation Requests.

2. Determines whether the Function may access the memory indicated by the ATS Translation Request and has the associated access rights.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

3. Determines whether a translation can be provided to the Function. If yes, theTA issues a translation to the Function.

a. ATS is required to support a variety of page sizes to accommodate a range of ATPT and processor implementations.

i. Page sizes are required to be a power of two and naturally aligned.

ii. The minimum supported page size is 4096 bytes. ATS capable components are required to support this minimum page size.

b. A Function must be informed of the minimum translation or invalidate size it will be required to support to provide the Function an opportunity to optimize its resource utilization. The smallest minimum translation size must be 4096 bytes.

4. TheTA communicates the success or failure of the request to the RC which generates an ATS Translation Completion and transmits via a Response TLP through a RP to the Function.

a. An RC is required to generate at least one ATS Translation Completion per ATS Translation Request; i.e., there is minimally a 1:1 correspondence independent of the success or failure of the request.

i. A successful translation can result in one or two ATS Translation Completion TLPs per request. The Translation Completion indicates the range of translation covered.

ii. An RC may pipeline multiple ATS Translation Completions; i.e., an RC may return multiple ATS Translation Completions and theseATS Translation Completions may be in any order relative to ATS Translation Requests.

iii. The RC is required to transmit the ATS Translation Completion using the same TC (Traffic Class) as the corresponding ATS Translation Request.

b. The requested address may not be valid. The RC is required to issue a Translation Completion indicating that the requested address is not accessible.

When the Function receives the ATS Translation Completion and either updates its ATC to reflect the translation or notes that a translation does not exist. The Function proceeds with processing its work request and generates subsequent

requests using either a translated address or an untranslated address based on the results of the Completion.

a. Similar to Read Completions, a Function is required to allocate resource space for each completion(s) without causing backpressure on the PCIe Link.

b. A Function is required to discard Translation Completions that might be “stale” Stale Translation Completions can occur for a variety of reasons.

As one can surmise, ATS Translation Request and Translation Completion processing is conceptually similar and, in many respects, identical to PCIe Read Request and Read Completion processing. This is intentional to reduce design complexity and to simplify integration of ATS into existing and new PCIe-based solutions. Keeping this in mind, ATS requires the following:

• ATS capable components must interoperate with [PCIe-1.1] compliant components.

• ATS is enabled through a new Capability and associated configuration structure. To enable ATS, software must detect this Capability and enable the Function to issue ATSTLP. If a Function is not enabled, the Function is

required not to issue ATS Translation Requests and is required to issue all DMA Read and Write Requests with the TLPAT field set to “untranslated”

• ATSTLPs are routed using either address-based or Requester ID (RID) routing.

• ATSTLPs are required to use the same ordering rules as specified in this specification.

• ATSTLPs are required to flow unmodified through [PCIe-1.1] compliant Switches.

• A Function is permitted to intermix translated and untranslated requests.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

• ATS transactions are required not to rely upon the address field of a memory request to communicate additional information beyond its current use as defined by the PCI-SIG.

**IMPLEMENTATION NOTE**

Address Range Overlap

It is likely that the untranslated and translated address range will overlap, perhaps in their entirety. This is not a requirement of ATS but may be an implementation constraint on theTA so that memory requests will be properly routed.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAOUlEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAAAwG/gAknGBCzHRqDrAAAAAElFTkSuQmCC)

In contrast to the prior example,[Figure 10-3](#bookmark390)illustrates an exampleMulti-Function Device. In this example Device, there

![](data:image/jpeg;base64,/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/2wBDAQkJCQwLDBgNDRgyIRwhMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjL/wAARCAACAEcDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwDqPCf/ACJuh/8AYPt//Ra1sUUUAFFFFABRRRQAUUUUAFFFFAH/2Q==)are three Functions. Key points to note in [Figure 10-3 are:](#bookmark391)

• Each ATC is associated with a single Function. Each ATS-capable Function must be able to source and sink at least one of each ATS Translation Request or Translation Completion type.

• Each ATC is configured and accessed on a per Function basis. A Multi-Function Deviceis not required to implement ATS on every Function.

• If the ATC implementation shares resources among a set of Functions, then the logical behavior is required to be consistent with fully independent ATC implementations.

|  |
| --- |
| PCIe Port |

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
| Function 1 | |  | | --- | | ATC 1 | | |  | | --- | | Physical Resources1 | |

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
| Function 2 | |  | | --- | | ATC2 | | |  | | --- | | Physical Resources2 | |

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
| Function 3 | |  | | --- | | ATC3 | | |  | | --- | | Physical Resources3 | |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAI0AAAB6CAYAAACCydAEAAAOt0lEQVR4nO2deVgURxbAe24OEcQIqxiroTmCSFTUwIbLE8Q1cVmNJm42JiFqosOhbjwAMYokGjziiQgecTUm0RiNu9GYL9GYkGU1HhCQKAyZFuQS6G6Z+9w/NmZF5Rp6pnqY+v05XfX6ff1qXvereq+Kh3EIAuDep7/8lyJ4+HATbF24xMSY2Cdu375dKyPlZti6cAoC4Px5L//tphnxCJcvXboX4Os3A7aNOEcQ4f9G5a1KBWwDcRXpW4vqCIBLYNuJMxAA75+xKp2BbRguU1dXpxwZMmIVbFthGIYJYCuAYRgWGOD/fn7B3mckEokQti5cxc3NTaRQKP5YXVm1n2JoBWx9oEIAPGBfQaEa9j/ZHlCr1bqJMbFHYdsMOs9Nm/aNXq83wjaIvXDis89UBMDHwLYbNAiAx3134YIStiHsCZPJZH7xhdnXCYDzYNvP5hAAFy18Yz4J2wj2yM+lpQp/3PclWLaD9iHsM3hwSn5hwXR3d3cxLB3sFS9vb3HN7dsTGuvrd1IMrYetj00gAD4wJzu7DfY/1p5paWlRjRk5KgeG/aB4mhEhITt25u0eLRKJUIhtIc7OziK9Xj/25o2Kf1AMzcDWx6oQAA/96PBhFGKzgFar1cdPmvwFbJtaFQLgvFmJicVGo9EE+4H3Fc5+eUZJADzalnbk2/RmfP6MlenpI/h8vuOFi1YiPmGqS1R0dCEBcJvZ0mY3IgAumZqQsGfM2LGutrqno5CZleUjFApft9X9bPYhPGzo0JUFB/ZPdnV1Fdnqno6C50BPcUND4/i6mprdFENrYevDCgTAB2/OzUUzv1bkHsOow8eO3WYLe9rE04weNapw2/YdI4RCISdW1fsiEolEyOcLRpaXln5KMXQrbH16BQHwcZ+fOKGC/U90BPR6vfG5adO+gW3zXkEAnDd3zpxS2A/TkfjuwgUlAfA4a9rVqtETj8ebm5652s+a90C0JyY21mXylCkFBMCtFnBYbdAQAHdNnPmX7SEjQlCIbWMyslYPlEgki60l32ofpoSv77r8woIoZ2dnFGLbGHd3dzFNU1Fk9a97KYZWsy3fKp6GADh4LSkpzdPT09ka8hFds2TZMtEQH5+N1pBtFU8TPm7ckU1btwQKBAKbLlMg/o9IJBI6OUmGX79y9QuKoZtg69MpBMCjvzr7FZrI4wBGo9E0KzGxmO3UUFY9AQFwQUxszL64+DgXNuUiLIPP5/N+WyBmtTqT1dfTH7y85uft3TvL09MTpXByhCFDhogrb1VOaGm6u4NiaCMbMlnzNATA3WfNfiGX8CdQiM0xMtdkufXr1+/vbMljzdM8FRiYm7cXVUlyEVdXV5FKpYqQ3ao8wEZ1JiuehgB44II3F853c3NzYkMegn0WS5NFAMc/YEMWK54mOjLyeM6G9/xQRh53EQqFAnd3d7+fiv9zjmLoOqjKEACP//7iRRRi2wlz58wp7W0I3qvXEwFw0ZT4uIKo6GgUYtsJ6Zmr/Xg83tzeyOjV68ln8ODU/ILC6f3d+6P1JTvBy8tLXFtbM76xrn6XpdWZFnsaAuBPvPTXudlDnxyKvIydsTI93XnAgAGZlva32NOEjgjZtXP37lEikQilcNoZzs7OIoPBMPbmjRuHLanOtMjTEAAPXSSVznFxcUEzv3bK/IULhAEBAbss6dtjL0EAnBcWFnZ6zdq1w3g8Hgqx7RSBQMD38vYe+u/vf/iBYmiyJ3177Gn4fH7iyoyM4WhOxv6Ji49ziY6JKSQA3iPn0aNBQwDcKeFP0/LCxoSh9aU+QmZW1hCxWJzUkz49GmFg2JOrCvbtn4iqJPsOnp6e4sbGxvF3bne/OrPbnoYA+JBX5r26apDXIBRi9zHeXrFC5OXltb677bvtacaEhe3bun3bcFQl2feQSCRCgUAQWlZSeoxi6Jau2nfL0xAAH5eckjJdIpGg11If5eVXXpGEhITkd6dtl16DADgv4o8RZ1dlZvj0XjUEV+Hz+bwnhz3pdfHb81cphq7qtG1XwoRC4cuZWVm+7KmH4CpR0dEuU+K6rs7sdNAQAHf9818Stz0VHIxCbAchY3WWp7Ozs7SzNp2+nvwJIju/sCDSyckJfcs4CP3d+4seqM5UPa5Nh56GADj+elJSioeHB6qSdDDSli4V+Qz1ye3oeoeeJiI8/Mj7mzcFoSpJx0MkEglcXFyCrv105TTF0I0PX3/sgCAAHrv078smoLQHx+WF2bOdwsLCCh+XGvrIoCEALogZH1s4ecoU9FpyYPh8Pu+3henEh6894kn+4OW1IL+gYOaAAQNQroyDM3jIYHFVVdWE5samnRRDG+7/3s7TEAD3mD1nTq6vnx8KsREYhmFY5uqsfm793ZY/+Fs7TxMcFLR5d/6ecahKEnEfV1dXkVqljqi6eesAxdBtGPaApyEAHrTgzYWvu7m5oWN8Ee1YJF0s9PXz+70683dPExMV9RmqkkQ8DqFQKPBw9/C7XFz8NcXQd/gYhmEEwBOWr1o5Ds3JIDri+T/PcA6PCN9PAJzPxzAMi586ddezkZEouQrRKRmrs3z5fP7zfAzDMIlEgibxEF0iFosxkUj0v9fRP0+fTr1y5YoStlIIbpOTnX1Hq9We5GMYhplMplMbcnLKTCaTGbZiCG5y7uxX6u8vXkySkXKjAMMwjGJoTKvWFHt7e78+IjQUzdEg2qHT6Ywpi6XnLl+7ugHDHpinkZHyst27dh9RqVR945ApBGsU5O/VV1VV/b5tfrsPYBFfUKTValOjY2LQuhMCwzAMa21tVS9LS/ug/OYvJ+7/1m5eRkbKW44e+SijtqYWfRQjMAzDsPdy3lXTNN3u0PhHJvPUavWunOx1Xda+IPo+ZT//rDx54kSyjJS3cyKPzM9QDG2iWlrLR4eFzQQAoNxgB8VsNmNpySkVd+7ckVIM3e7aY5cNZKT869yNG4sNBoPJJhoiOMfJzz9XX750KUlGyh+ZhulwJtio0//o7u6xcHTYaBSCOxgajUafJk3+/GppyY7HXe9wgVJGyqsK8vPz2+7d01hPPQQX2bVjh54kySUdXe90VbupqSkr9/330byNA9HQ0KD6x4eHcmSkvKGjNp0uVFIMrb3b0NgcPzVhkudAdLKKI5C+YmVLeXn5nM5ObOkyf0an0+1fv25dLbuqIbjI5UuXlGfPnHlTRso7fbt0mRJBMbRZwdy7FhT01Ev+Af4oBO+jGI1Gc2pyypWG+voVD4fYD9OtTD0ZKS/6YMuWr3U6naHr1gh75JOjH2tLrl+f/7gQ+2G6nXxlNpqKxGLJomfCn0Hepo+hVCq1acnJh0vKy/Z1p323c4JlpLzmwL59W1tbW1k/5xkBl62bN+vr6+tXdLd9jxLJaZrOeS/n3cduP4GwT0iSVB098lGGjJS3drdPj3KDKYbWN9U33J44aWKCl7c3CsH7AG8vXVZXVVX1N4qhu71k1OOSFbPZ/ElO9vpKsxllhto7352/oDr/7bdJMlLeowCnx1UIFENjaqXy8jAw7JXg4GD0UWynGAwGY4pU+v1/rvy0tqd9LSqOk5Hyqzu3bT+p0WgsOmQKAZ+D+w/of6moeMuSvhbXO/Ex3g8mkyn52chI5G3sDIZhNEtSU/eUVdw4akl/iwcNxdDKpvoG04zExGfd3NzQwLEj1q9dp7x86dLzFEPrLOnfq9pthUKx5d3s9T0+mQwBj1u3bimPHzu2VEbK2yyV0atyXIqhjVRLS2V4RMQMn6FDkbexA5akpFaTJLmAYmiLw99e7xJhMplOb3xvw3Wj0YhicI5z5ssv1T8WFSXJSHmv0nh7XfhPMTSm02iKBw0alPT0yKdRaihH0Wq1hhSp9MxP169t6q0sVvajkZHyG3vy8g4plUqU5cdR9u7Zo6+WVSezIYu1LUZEAsGParU6JXZ8LFpe4Bgtzc3qpWlLNt24dfMUG/JYGzQUQ6sb6+oV059/LsbDwwMNHA6xOj2DKS0pmfngtq69gdXt0rRabd76tevusikT0TtKS0qUX5w6JZWRctZSWljdAYtiaBNDUWUjR46ahft2fmYQwvqYzWYsVZpcXldXl9pVCid0/jQ14ZxerzeYEVA5/ukxFQHw0Wzb1yp77Zn0hiLXfv3eHDNmDArBIaHRaHSpUunxaz+X7mZbtlW2gJWR8urCvQV5DMOg6kxIbP9gm76mpmaZNWRbbd/glubmNbkbNqJBA4G6ujrl4UOHsmWkvMka8q22FSzF0Lqm+oamuKlTJw8cOBCF4DZk5dvLm3+pqHixsyrJ3mDVHcoNBsOH2e+svW3NeyDaU/zvYuXX584tlJFyi9IeuoNVN52mGNqsbGu76h8QMDcgMBCF4FbGaDSaUqXJlxsbGtI5H2J3RdykSSc0Go0edgja1/nw4EE1AfBga9vTNtvbm8xFQqFgcXhEBPI2VkKhUGiWpKQeLCkvO2jte9nk1BUZKa89uP9AbktzMyq0sxJbcjfpGxoa0mHrwSoEwJ2Xpqbdhe3C+yK/VlcrggMCF8G2sVXwx31nX792TQH7Ifc1kl59TUYA3GYn6dj0UDCz2XwsJ3v9TTOqzmSNb7/5RnXh/Pk3ZKTcKnMynIAA+KhPP/5EDfvf2RfQ6XSGhLj4M7BtahPGR0UfUqvVWtgP3d7Jz8tTEQD3tbX9oJwoJ+DxfzAYjMmRUVFoecFCGIZRL0lN21X2S8WnsHWxGaHBw5ffqa1Vwv632isr315OEQDvB9uONoUAuPithQvvwH749kjFjRuKIMJ/HmwbQoEA+LQfi4qQt+khf33xpTIC4I57HPaM6c9dNBgMRtiGsBe+OHlKRQA8HLbdoEIA/KkPDx5EIXg30Gg0uknjJxyHbTNOEBkekdfW1oYGThds3bxFRQDcB7a9OAEB8AHvrM66B9soXOZuU5NyVGjoGti24hTBAYFv/VpdjdalOiA1ObmRALgTbDtxCgLggtfmzZPBNg4XuXb16r0AX7+ZsG10H87UJclIuRG7gMVUVVZp/AP80TGID7B2zTuDTCZTJWw97vNfIa2PyBGHnewAAAAASUVORK5CYII=)

|  |
| --- |
| Configuration Resources |

Internal Routing

PCIe Device

A-0592

Figure 10-3 Example Multi-Function Devicewith ATC per Function

Independent of the number of Functions within a Device, the following are required:

• A Function is required not to issue any TLP with the AT field set unless the address within the TLP was obtained through the ATS Translation Request and Translation Completion protocol.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

• Each ATC is required to only be populated using the ATS protocol; i.e., each entry within the ATC must be filled via an ATS Translation Completion in response to the Function issuing an ATS Translation Request for a given address.

• Each ATC cannot be modified except through the ATS protocol. That is:

◦ Host system software cannot modify the ATC other than through the protocols defined in this

specification except to invalidate one or more translations in an ATC. A Device or Function reset

would be an example of an operation performed by software to change the contents of the ATC, but a reset is only allowed to invalidate entries not modify their contents.

◦ It must not be possible for host system software to use software executing on the Device to modify the ATC.

When a TA determines that a Function should no longer maintain a translation within its ATC, theTA initiates the ATS invalidation protocol. The invalidation protocol consists of a single Invalidation Request and one or more Invalidate Completions.

|  |
| --- |
| Memory |

|  |
| --- |
| Translation Agent (TA) |

|  |
| --- |
| Root Port (RP) |

|  |
| --- |
| Step 3. ATS Invalidate Completion [TC = 0, ITAGV = 1000b, CC = 1] |

|  |
| --- |
| Root Complex (RC) |

|  |
| --- |
| Address Translation and Protection Table (ATPT) |

|  |
| --- |
| Step 1. ATS Invalidate Request  [Untranslated Addr, TC = 0, ITAG = 3] |

PCIe Device ATC

|  |
| --- |
| Step 2. Flush matching ATC entries, drain or discard conflicting Reads. |

A-0590

Figure 10-4 Invalidation Protocol with a Single Invalidation Request and Completion

As[Figure 10-4](#bookmark392)illustrates, there are essentially three steps in the ATS Invalidation protocol:

1. The system software updates an entry in the tables used by theTA. After the table is changed, theTA

determines that a translation should be invalidated in an ATC and initiates an Invalidation Request TLP which is transmitted from the RP to the example single-Function Device. The Invalidate Request communicates an untranslated address range, the TC, and an RP unique tag which is used to correlate Invalidate Completions with the Invalidation Request.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

2. The Function receives the Invalidate Request and invalidates all matching ATC entries. A Function is not

required to immediately flush all pending requests upon receipt of an Invalidate Request. If transactions are in a queue waiting to be sent, it is not necessary for the Function to expunge requests from the queue even if

those transactions use an address that is being invalidated.

a. A Function is required not to indicate the invalidation has completed until all outstanding Read

Requests or Translation Requests that reference the associated translated address have been retired or nullified.

b. A Function is required to ensure that the Invalidate Completion indication to the RC will arrive at the RC after any previously posted writes that use the “stale” address.

3. When a Function has ascertained that all uses of the translated address are complete, it issues one or more ATS Invalidate Completions.

a. An Invalidate Completion is issued for each TC that may have referenced the range invalidated. These completions act as a flush mechanism to ensure the hierarchy is cleansed of any in-flight

transactions which may contain references to the translated address.

i. The number of Completions required is communicated within each Invalidate Completion. ATA or RC implementation can maintain a counter to ensure that all Invalidate

Completions are received before considering the translation to no longer be in use.

ii. If more than one Invalidation Complete is sent, the Invalidate Completion sent in each TC must be identical in the fields detailed in [Section 10.3.2 .](#bookmark393)

b. An Invalidate Completion contains the ITAG from Invalidate Request to enable the RC to correlate Invalidate Requests and Completions.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |
| --- |
| Memory |

|  |
| --- |
| Translation Agent (TA) |

|  |
| --- |
| Root Port (RP) |

|  |
| --- |
| Step 3. ATS Invalidate Completion [TC = 0, ITAGV = 1000b, CC = 2] |

|  |
| --- |
| Step 4. ATS Invalidate Completion [TC = 1, ITAGV = 1000b, CC = 2] |

|  |
| --- |
| Root Complex (RC) |

|  |
| --- |
| Address Translation and Protection Table (ATPT) |

|  |
| --- |
| Step 1. ATS Invalidate Request  [Untranslated Addr, TC = 0, ITAG = 3] |

|  |
| --- |
| Step 2. Flush matching ATC entries, drain or discard conflicting Reads. |

PCIe Device ATC

A-0591

Figure 10-5 Single Invalidate Request with Multiple Invalidate Completions

**10.1.2 Page Request Interface Extension**

ATS improves the behavior of DMA based data movement. An associated Page Request Interface (PRI) provides

additional advantages by allowing DMA operations to be initiated without requiring that all the data to be moved into or out of system memory be pinned.164 The overhead associated with pinning memory may be modest, but the negative

impact on system performance of removing large portions of memory from the pageable pool can be significant.

PRI is functionally independent of the other aspects of ATS. That is, a device that supports ATS need not support PRI, but PRI is dependent on ATS’s capabilities.

Intelligent I/O devices can be constructed to make good use of a more dynamic memory interface. Pinning will always

have the best performance characteristics from a device’s perspective-all the memory it wants to touch is guaranteed to be present. However, guaranteeing the residence of all the memory a device might touch can be problematic and force a sub-optimal level of device awareness on a host. Allowing a device to operate more independently (to page fault when it requires memory resources that are not present) provides a superior level of coupling between device and host.165

The mechanisms used to take advantage of a Page Request Interface are very device specific. As an example of a model in which such an interface could improve overall system performance, let us examine a high-speed LAN device. Such a

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAKElEQVRYhe3OAQkAQAgEMO1f6pr9pxBBtgSrAgAAGNZJ3nYCAAC47QOcDgNBkGYMzwAAAABJRU5ErkJggg==)

164. Locked in place so that it cannot be swapped out by the system’s dynamic paging mechanism.

165. The alternative is a private interface between a device and its driver that is used to communicate device state so that the driver can ensure the availability of pinned memory resources.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

device knows its burst rate and need only have as much physical buffer space available for inbound data as it can receive within some quantum. A vector of unpinned virtual memory pages could be made available to the device, that the

device then requests as needed to maintain its burst window. This minimizes the required memory footprint of the device and simplifies the interface with the host, both without negatively impacting performance.

The ability to page, begs the question of page table status flag management. Typical TAs associate flags (e.g., dirty and access indications) with each untranslated address. Without any additional hints about how to manage pages mapped to a Function, such TAs would need to conservatively assume that when they grant a Function permission to read or write a page, that Function will use the permission. Such writable pages would need to be marked as dirty before their translated addresses are made available to a Function.

This conservative dirty-on-write-permission-grant behavior is generally not a significant issue for Functions that do not support paging, where pages are pinned and the cost of saving a clean page to memory will seldom be paid. However, Functions that support the Page Request Interface could pay a significant penalty if all writable pages are treated as

dirty, since such Functions operate without pinning their accessible memory footprints and may issue speculative page requests for performance. The cost of saving clean pages (instead of just discarding them) in such systems can diminish the value of otherwise attractive paging techniques. This can cause significant performance issues and risk functional issues in circumstances where the backing store is unable to be written, such as a CD-ROM.

The No Write (NW) flag in Translation Requests indicates that a Function is willing to restrict its usage to only reading the page, independent of the access rights that would otherwise have been granted.

If a device chooses to request only read access by issuing a Translation Request with the NW flag Set and later determines that it needs to write to the page, then the device must issue a new Translation Request.

Upon receiving a Translation Request with the NW flag Clear, TAs are permitted to mark the associated pages dirty. It is strongly recommended that Functions not issue such Requests unless they have been given explicit write permission. An example of write permission is where the host issues a command to a Function to load data from a storage device and

write that data into memory.

**10.1.3 Process Address Space ID (PASID)**

Certain TLPs can optionally be associated with a Process Address Space ID (PASID). This value is conveyed using the

PASID TLP Prefix. The PASID TLP Prefixis defined in the Section 6.20 . The PASID TLP Prefixis permitted on:

• Memory Requests (including Untranslated AtomicOp Requests) with Untranslated Addresses

• Address Translation Requests

• Page Request Messages

• ATS Invalidation Requests

• PRG Response Messages

Usage of thePASID TLP Prefixfor Untranslated Memory Requests is defined in Section 6.20 . This section describes PASID TLP Prefixfor the remaining TLPs.

When a Request does not have a PASID TLP Prefix, the Untranslated Address represents an address space associated with the Requester ID.

When a Request has a PASID TLP Prefix, the Untranslated Address represents an address space associated with both the Requester ID and the PASID value.

When a Response has a PASID TLP Prefix, the PASID value reflects the address space associated the corresponding Request.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Each Function has an independent set of PASID values. The PASID field is 20 bits wide however the effective width is constrained by the lesser of the width supported by the Root Complex (TA) and the width supported by the Function (ATC). Unused upper bits of the PASID value must be 0b.

For Endpoints in systems where a Virtual Intermediary (VI) is present, Untranslated

Addresses with an associated PASID are typically used to represent Guest Virtual Addresses (GVA) and Untranslated

Addresses that are not associated with a PASID represent Guest Physical Addresses (GPA). TheTA could be designed so that the VI manages the tables used to perform translations from GPA to Translated Addresses while the individual Guest Operating Systems manage tables used to perform translations from GVA to GPA. When translating an address with an associated PASID, theTA performs both translations and returns the resulting Translated Address (i.e., GVA to GPA

followed by GPA to Translated Address). The intermediate GPA value is not visible to the ATC.

When an ATC invalidates a cached GPA mapping, it invalidates the GPA mapping and also invalidates all GVA mappings in the ATC. When the GPA invalidate completes, the VI can safely remove pages backing GPA memory range from a Guest

Operating System. The VI does not need to know which GVA mappings involved the GPA mapping.

**10.2 ATS Translation Services**

ATA does translations. An ATC can cache those translations. If an ATC is separated from theTA by PCIe, the memory request from an ATC will need to be able to indicate if the address in the transaction is translated or not. The

modifications to the memory transactions are described in this section, as are the transactions that are used to communicate translations between a remote ATC and a central TA.

**10.2.1 Memory Requests with Address Type**

A Function with an ATC can send memory read/write Requests that contain either translated or untranslated addresses. As shown in [Figure 10-6](#bookmark394)and [Figure 10-7](#bookmark395), the Address Type (AT) field is used to indicate the type of address that is

present in the request header.

Byte 0 → Byte 4 → Byte 8 → Byte 12 →

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| +0  7  6  5  4  3  2  1  0 | | +1  7  6  5  4  3  2  1  0 | | | | | | +2  7  6  5  4  3  2  1  0 | | | | | | +3  7  6  5  4  3  2  1  0 | | |
| Fmt  0  x  1 | Type  0  0  0  0  0 | T9 | TC | T8 | Attr | LN | TH | TD | EP | Attr | AT | Length | | | | |
| Requester ID | | | | | | | | Tag[7:0] | | | | | Last DW BE | | First DW BE | |
| Address[63:32] | | | | | | | | | | | | | | | | |
| Address[31:2] | | | | | | | | | | | | | | | | R |

Figure 10-6 Memory Request Header with 64-bit Address

Byte 0 → Byte 4 → Byte 8 →

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| +0  7  6  5  4  3  2  1  0 | | +1  7  6  5  4  3  2  1  0 | | | | | | +2  7  6  5  4  3  2  1  0 | | | | | +3  7  6  5  4  3  2  1  0 | | |
| Fmt  0  x  0 | Type  0  0  0  0  0 | T9 | TC | T8 | Attr | LN | TH | TD | EP | Attr | AT | Length | | | |
| Requester ID | | | | | | | | Tag[7:0] | | | | | Last DW BE | First DW BE | |
| Address[31:2] | | | | | | | | | | | | | | | R |

Figure 10-7 Memory Request Header with 32-bit Address

The AT field in the requests is a redefinition of a reserved field in earlier version of this specification. Functions that do not implement an ATC will continue to set the AT field to its defined reserved value (00b). Functions that implement an ATC will set the AT field as listed in [Table 10-1 .](#bookmark396)

Table 10-1 Address Type (AT) Field Encodings

|  |  |  |
| --- | --- | --- |
| AT[1:0] Coding | Mnemonic | Meaning |
| 00b | Untranslated | ATA may treat the address as either virtual or physical. |
| 01b | Translation Request | TheTA will return the translation of the address contained in the address field of the request as a read completion. This value only has meaning for an explicit Translation Request (see[Section 10.2.2)](#bookmark397). TheTA will signal an Unsupported Request (UR) if it receives a TLP with the AT field set to 01b in a Memory  Request other than Memory Read. |
| 10b | Translated | The address in the transaction has been translated by an ATC. If the Function associated with the SourceID is allowed to present physical addresses to the system memory, then theTA might not translate this  address. If the Function is not allowed to present physical addresses, then theTA may treat this as an UR. |
| 11b | Reserved | TheTA will signal an Unsupported Request (UR) if it receives a Memory Request TLP with the AT field set to 11b. |

The AT field is only defined for Memory Requests. The field remains reserved for other TLPs.

**10.2.2 Translation Requests**

A Translation Request has a format that is similar to that of a memory read. The AT field is used to differentiate a Translation Request from a normal memory read.

The request header for a Translation Request has the formats illustrated in [Figure 10-8](#bookmark398)and [Figure 10-9 .](#bookmark399)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Byte 0 → Byte 4 → Byte 8 → Byte 12 →

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| +0  7  6  5  4  3  2  1  0 | | +1  7  6  5  4  3  2  1 0 | | | | | | +2  7  6  5  4  3  2  1  0 | | | | | +3  7  6  5  4  3  2  1  0 | | |
| Fmt  0  0  0 | Type  0  0  0  0  0 | T9 | TC | T8 | Attr  R | LN  R | TH  R | TD | EP | Attr  x  R | AT  0  1 | Length  0  0 0  0  x  x x  x  x  0 | | | |
| Requester ID | | | | | | | | Tag[7:0] | | | | | Last DW BE  1  1  1  1 | First DW BE  1  1  1  1 | |
| Untranslated Address[63:32] | | | | | | | | | | | | | | | |
| Untranslated Address[31:12] | | | | | | | | | | | Reserved | | | | NW |

Figure 10-8 64-bit Translation Request Header

Byte 0 → Byte 4 → Byte 8 →

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| +0  7  6  5  4  3  2  1  0 | | +1  7  6  5  4  3  2  1  0 | | | | | | +2  7  6  5  4  3  2  1  0 | | | | | +3  7  6  5  4  3  2  1  0 | | |
| Fmt  0  0  1 | Type  0  0  0  0  0 | T9 | TC | T8 | Attr  R | LN  R | TH  R | TD | EP | Attr  x  R | AT  0  1 | Length  0  0 0  0  x  x x  x  x  0 | | | |
| Requester ID | | | | | | | | Tag[7:0] | | | | | Last DW BE  1  1  1  1 | First DW BE  1  1  1 1 | |
| Untranslated Address[31:12] | | | | | | | | | | | Reserved | | | | NW |

Figure 10-9 32-bit Translation Request Header

Translation Requests have the same completion timeout intervals as Read Requests.

[**10.2.2.1**](10.2.2.1) **Attribute Field**

For a Translation Request, the Relaxed Ordering (RO) bit is applicable and permitted to be Set, where it affects the

ordering of its associated Translation Completions. The remainder of the Attr field is Reserved. The Requester of a

Translation Request must not depend on theTA to guarantee any specific ordering relationship between Translation Completions and any other Requests or Completions. There are no ordering requirements for a Translation Request. A TA may reorder a Translation Request with respect to any other request.

**IMPLEMENTATION NOTE**

Translation Request Ordering

Because no ordering can be assumed between Translation Requests and other types of Requests, a Translation Request does not make an effective flushing/ordering primitive.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACHCAYAAAAr6RiGAAAAN0lEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAD8AC7NbwQM8kSoswAAAABJRU5ErkJggg==)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**10.2.2.2**](10.2.2.2) **Length Field**

The Length field is set to indicate how many translations may be returned in response to this request. Each translation is 8 bytes in length and represents one or more STUs (Smallest Translation Unit). The maximum setting for the Length field is the RCB. The Length field in a Translation Request must always indicate an even number of DWORDs. If Length is set to indicate a value greater than allowed, or if the least-significant bit of the Length field is non-zero, then theTA will treat the request as a Malformed Packet.

If the Length field has a value greater than two, then the Function is requesting translations for a range of memory greater than a single STU. The additional translations, if provided, are assumed to be for sequentially-increasing, equal-sized, STU-aligned regions, starting at the requested address.

[**10.2.2.3**](10.2.2.3) **Tag Field**

The Tag field has the same meaning as in a Memory Read Request.

[**10.2.2.4**](10.2.2.4) **Untranslated Address Field**

A Translation Request includes either a 32-bit or a 64-bit Untranslated Address field. This field indicates the address to be translated. TheTA will make decisions about the validity of the request, based on the address in the translation request. TheTA is permitted to return fewer translations than requested, but it will not return more.

When multiple translations are requested, theTA will not return a translation if the range of that translation does not overlap the implied range of the Translation Request (this would only apply to translations after the initial value). The implied range of the Translation Request is [2STU+12 \* (Length/2)] bytes.

The Untranslated Address field in the Translation Request is any address in the range of the first STU. Address bits 11:0 are not present in the Translation Request and are implied to be zero. If a Requester has Page Aligned Request Set (see Section 7.8.8.2), it must ensure that bits 11:2 are zero. If a Requester has Page Aligned Request Clear, it is permitted to supply any value for bits 11:2.166 TheTA must ignore bits 11:2 as well as any low-order bits not required to determine the translation.

For example, if using 64-bit addressing for a Function with the Page Aligned Request bit Set that is programmed with an STU of 1 (i.e., 8192-byte pages), bits 63:13 are significant, bit 12 is ignored by theTA and bits 11:0 are implied to be zero.

[**10.2.2.5**](10.2.2.5) **No Write (NW) Flag**

The No Write flag, when Set, indicates that the Function is requesting read-only access for this translation.167

TheTA may ignore the No Write Flag, however, if theTA responds with a translation marked as read-only then the

Function must not issue Memory Write transactions using that translation. In this case, the Function may issue another translation request with the No Write flag Clear, which may result in a new translation completion with or without the W (Write) bit Set.

Upon receiving a Translation Request with the NW flag Clear, TAs are permitted to mark the associated pages dirty. It is strongly recommended that Functions not issue such Requests unless they have been given explicit write permission.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAKElEQVRYhe3OAQkAQAgEMO1f6pr9pxBBtgSrAgAAGNZJ3nYCAAC47QOcDgNBkGYMzwAAAABJRU5ErkJggg==)

166. Note: The Page Aligned Request bit was added in Revision 1.1 of the ATS Specification.

167. Note: The No Write Flag was added in Revision 1.1 of the ATS Specification.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

[**10.2.2.6**](10.2.2.6) **PASID TLP Prefix on Translation Request**

If a Translation Request has a PASIDTLP Prefix, the Untranslated Address Field is an address within the process address space indicated by the PASID field.

If a Translation Request has a PASIDTLP Prefix with either the Privileged Mode Requestedor Execute Requested bit Set, these may be used in constructing the Translation Completion Data Entry.

The PASID Extended Capability indicates whether a Function supports and is enabled to send and receive TLPs with the PASIDTLP Prefix.

**10.2.3 Translation Completion**

ATranslation Completion (either a Cpl or a CplD) is sent by a TA for each Translation Request. This specification

describes the meaning of fields in Translation Completions. Fields not defined in this specification have the same

meanings proscribed for Read Completions in this specification. For a Translation Completion, the Relaxed Ordering

(RO) bit is applicable and permitted to be Set, if the corresponding Translation Request RO bit was set. The remainder of the Attr field is Reserved.

If theTA was not able to perform the requested translation, a completion with the format shown in [Figure 10-10](#bookmark400)is used.

Byte 0 → Byte 4 → Byte 8 →

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| +0  7  6  5  4  3  2  1  0 | | +1  7  6  5  4  3  2  1  0 | | | | | | +2  7  6  5  4  3  2  1  0 | | | | | | | | +3  7  6  5  4  3  2  1  0 | |
| Fmt  0  0  0 | Type  0  1  0  1  0 | T9 | TC | T8 | Attr  R | LN  R | TH  R | TD | EP | Attr  x  R | | | AT  R  R | | Length | | |
| Completer ID | | | | | | | | Cpl. Status | | | | BCM  0 | Byte Count | | | | |
| Requester ID | | | | | | | |  | | | Tag | | |  | | R | Lower Address |

Figure 10-10 Translation Completion with No Data

**IMPLEMENTATION NOTE**

Byte Count Field for Unsuccessful Translation Completions with No Data

Previous versions of this specification indicated the Byte Count and Lower Address field should be 0000 0000

0000b for Unsuccessful Translation Completions with No Data. It is strongly recommended that implementations do not depend on the Byte Count and Lower Address field being set to any particular value in Unsuccessful

Translation Completions with No Data.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAADFCAYAAABpebPhAAAAOklEQVRYhe3KoQEAIAzAsI0nsZyG5cphsLuA1DYZ61R07ZmjnS8AAAAAAAAAAAAAAAAAAAAAAIAPwQW0jwSKo9MTCQAAAABJRU5ErkJggg==)

The values and meaning for the Completion Status field are listed in [Table 10-2 .](#bookmark401)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Table 10-2 Translation Completion with No Data Status Codes

|  |  |  |
| --- | --- | --- |
| Value | Status | Meaning |
| 000b | Success | This Completion Status has a nominal meaning of “success” TheTA will not return this value in a Cpl. |
| 001b | Unsupported Request (UR) | Translation Requests from this Function are not supported by theTA. If a Function receives this  Completion code, it must disable its ATC and not send requests using translated addresses until the ATC is re-enabled. For transactions the Function may internally have in flight, the Function may either terminate or complete them. The mechanism a Function receiving this code uses to report this condition is outside the scope of this specification. TheTA detecting this error is a "Completer Sending a Completion with UR/ CA Status" and shall behave as defined in this specification. |
| 010b | CRS | This value is not allowed in any Completion to a request initiated by a PCI Express Function. If received by a Function, it shall be treated as a Malformed TLP. |
| 100b | Completer Abort (CA) | TheTA was not able to translate the address because of an error in theTA. This nominally causes an error to be reported to the device driver associated with the ATC. See AER in this specification. |
| All  others | Reserved | A Translation Completion with a Reserved Completion Status value is treated as if the Completion Status was Unsupported Request (001b). |

Note: Return values other than Success indicate an error.

Byte 0 → Byte 4 → Byte 8 →

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| +0  7  6  5  4  3  2  1  0 | | +1  7  6  5  4  3  2  1 0 | | | | | | +2  7  6  5  4  3  2  1  0 | | | | | | | | +3  7  6  5  4  3  2  1  0 | |
| Fmt  0  0  0 | Type  0  1  0  1  0 | T9 | TC | T8 | Attr  R | LN  R | TH  R | TD | EP | Attr  x R | | | AT  R  R | | Length | | |
| Completer ID | | | | | | | | Cpl. Status  0  0  0 | | | | BCM  0 | Byte Count | | | | |
| Requester ID | | | | | | | |  | | | Tag | | |  | | R | Lower Address |

Figure 10-11 Successful Translation Completion

Fields are set in accordance with Section 2.2.9and Section 2.3 .

Translation Completions must be sent using the same TC as the Translation Request. The Function is not required to verify that the same TC was used.

The Lower Address field will contain a value that will make the packet consistent with RCB semantics. If the result is

returned in a single packet, Lower Address is set to RCB minus Byte Count. If the results are returned in multiple packets, the first packet will have a Lower Address field of RCB minus (Total Completion Length \* 4) and subsequent packets will have a Lower Address field of 000 0000b. See [Section 10.2.4f](#bookmark402)or additional requirements for multiple packet completions.

If the Completion Status field is 000b, then the translation was successful and a data payload will follow the header. The contents of the data payload are shown in [Figure 10-12 .](#bookmark403)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| +0 | | | | | | | | +1 | | | | | | | | +2 | | | | | | | | +3 | | | | | | | |
| 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|  | Translated Address [63:32] | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |  |
| Translated Address [31:12] | | | | | | | | | | | | | | | | | | | | S | N | Reserved | | | | Global | Priv | Exe | U | W | R |

A-0583A

Figure 10-12 Translation Completion Data Entry Table 10-3 Translation Completion Data Fields

|  |  |
| --- | --- |
| Field | Meaning |
| S | **Size of translation**- This field is 0b if the translation applies to a 4096-byte range of memory. If this field is 1b, then the translation applies to a range of memory that is larger than 4096 bytes (see [Section 10.2.3.1 )](#bookmark404). |
| N | **Non-snooped accesses** - If this field is 1b, then the read and write requests that use this translation must Clear theNo Snoop bit in the Attribute field. If it is 0b, then the Function may use other means to determine ifNo Snoopshould be Set. |
| Reserved | These bits shall be ignored by the ATC. |
| Global | **Global Mapping** - If this bit is Set, the ATC is permitted to cache this mapping entry in all PASIDs. If Clear, the ATC is  permitted to cache this mapping entry only in the PASID associated with the requesting PASID. This bit may only be Set if the associated Translation Request had a PASIDTLP Prefix. |
| Exe | **Execute Permitted**- If this bit is Set, the requesting Function is permitted to execute code contained in the associated memory range.  This bit may be Set only if the associated Translation Request had a PASIDTLP Prefix with the Execute Requested bit Set. If this bit is Set, R must also be Set.  The Priv bit indicates the Privilege level associated with the Exe bit. If Priv is Set, the Exe bit indicates permissions associated with Privileged Mode entities in the Function. If Priv is Clear, the Exe bit indicates permissions associated with Non- Privileged Mode entities in the Function.  This value may be cached if R is Set. |
| Priv | **Privileged Mode Access** - If this bit is Set, R, W and Exe refer to permissions associated with Privileged Mode entities. If this bit is Clear, R, W and Exe refer to permissions associated with Non-Privileged Mode entities.  This bit may only be Set if the associated Translation Request contained a PASIDTLP Prefix with the Privileged Mode Requested bit Set.  This value must be cached any of the R, W or Exe values are cached. |
| U | **Untranslated access only**- When this field is Set, the indicated range may only be accessed using untranslated  addresses, and the Translated Address field of this Translation Completion Data Entry may not be used in a subsequent Read/Write Request with AT set to Translated. This value may be cached if R or W is Set. |

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAIklEQVRYhe3BAQ0AAAgDoNvUeqbUHG5ApWcDAAAAAADw2QHZqwIKU+HaKwAAAABJRU5ErkJggg==)

Page 1184

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |
| --- | --- | --- |
| Field | Meaning | |
| R,W | **Read, Write** - These two fields indicate the transaction types that are allowed for requests using the translation. The encodings are: | |
| **00b**  **01b**  **10b**  **11b** | Neither read nor write transactions are allowed. This translation is considered not to be valid. The  contents of the Translated Address, N, U, and Exe fields are undefined. A translation with this value must not be cached in the ATC (see[Section 10.2.3.5 )](#bookmark405).  Write Requests that target this range are allowed, but Read Requests are not unless they are zero-length reads.  Read Requests that target this range are allowed (including zero-length reads), but Write Requests are not.  Read and Write Requests that target this range are allowed. |
| The Priv bit indicates the Privilege level associated with R and W. If Priv is Set, R and W indicate permissions associated with Privileged Mode entities in the Function. If Priv is Clear, R and W indicate permissions associated with Non-  Privileged Mode entities in the Function. | |
| [**10.2.3.1**](10.2.3.1) **Translated Address Field**  If the R and W fields are both Clear, or if U is Set, then the Translated Address field may not be used by the Function for any purpose.  If either the R or W field is Set, and the U field is Clear, then the Translated Address field contains an address that can be used by the Function in a Memory Request with the AT field set to Translated and the Function may cache the Translated Address. When cached, the R and W fields must be stored with the same value as the Translation Completion entry. The address that is cached must be a subset of the address range indicated in the Translation Completion (the subset may include the entire range).  While the Translated Address is cached in the Function’s ATC, it shall not be possible for the Function to modify the entry other than to delete it. The entry must be deleted from the ATC when an Invalidation Request is received that has an  indicated range that overlaps any portion of the cached address.  A Function is not allowed to make an entry into its ATC unless the entry is in a Translation Completion and the E (Enable) field within the ATS Capability is Set. Entries in an ATC cache that are written before the E field is Set must not be used in Memory Request. They must either be invalidated when the E field is Set or ignored and not used.  [**10.2.3.2**](10.2.3.2) **Translation Range Size (S) Field**  If S is Set, then the translation applies to a range that is larger than 4096 bytes. If S = 1b, then bit 12 of the Translated  Address is used to indicate whether or not the range is larger than 8192 bytes. If bit 12 is 0b, then the range size is 8192 bytes, but it is larger than 8192 bytes if Set. If S = 1band bit 12 = 1b, then bit 13 is used to determine if the range is larger than 16384 bytes or not. If bit 13 is 0b, then the range size is 16384 bytes, but it is larger than 16384 bytes if Set.  Low-order address bits are consumed in sequence to indicate the size of the range associated with the translation. Note: This encoding method is also used to indicate the size of the memory range being invalidated.  Examples for different translation sizes are shown in [Table 10-4 .](#bookmark407) | | |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Table 10-4 Examples of Translation Size Using S Field

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Address Bits | | | | | | | | | | | | | | | | | | | | | S | Translation  Range Size  in Bytes |
| 63:32 \* | 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20 | 19 | 18 | 17 | 16 | 15 | 14 | 13 | 12 |
| x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | 0 | 4 K |
| x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | 0 | 1 | 8 K |
| x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | x | 0 | 1 | 1 | 16 K |
| x | x | x | x | x | x | x | x | x | x | x | x | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 M |
| x | x | x | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1G |
| x | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 4 G |

Note:

\* Upper address bits are used to indicate the size for ranges larger than 4 GB.

![](data:image/jpeg;base64,/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAgGBgcGBQgHBwcJCQgKDBQNDAsLDBkSEw8UHRofHh0aHBwgJC4nICIsIxwcKDcpLDAxNDQ0Hyc5PTgyPC4zNDL/2wBDAQkJCQwLDBgNDRgyIRwhMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjIyMjL/wAARCAACAnsDASIAAhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQAAAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWmp6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEAAwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSExBhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElKU1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD5/ooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooA//2Q==)

The size field is set to indicate the range size in multiples of 4096 bytes regardless of the setting of STU. For example, if STU is set to indicate that the minimum translation is 8192 bytes, then S should be Set on all translation returned in a Translation Completion and in all Invalidate Requests. If STU is set to indicate a 16384-byte minimum, then S and bit 12 would both be Set in all translation and invalidate ranges.

If S is Set and bits 63:12 are all 1b, then the behavior is undefined. If S is Set and bit 63 is 0b, and bits 62:12 are all 1b, then the request is to invalidate all translations.

If a Function receives a Translation Completion with a Translation Size field smaller than the Function’s programmed STU value, it shall treat the Translation Completion as if it had Completion Status UR.

[**10.2.3.3**](10.2.3.3) **Non-snooped (N) Field**

This field is Set to indicate that Read and Write Requests that target memory in the range of this translation must Clear the No SnoopAttribute bit in the Request header. When this field is 0b, the Function is allowed to Set theNo Snoop

Attribute bit in a Function-specific manner.

Note: When this field is Cleared, the Function is not allowed to SetNo Snoopin a Memory Request if the Enable No Snoopfield in the Device Control register is Cleared.

The N bit may be cached by the ATC if either R or W is Set.

When U is Set, the meaning of this field is undefined, and theTA may set this field to any value. A translation has a single value for the N field that is not affected by privilege level. An ATC is permitted to cache the N field without regard to the value of the Priv bit.

[**10.2.3.4**](10.2.3.4) **Untranslated Access Only (U) Field**

This field is Set when the Function is not allowed to access the implied range of memory using a translated address (the range is implied by the untranslated address in the Translation Request and the offset of the translation in the

Translation Completion). The Function may use untranslated addresses to access the range as long as the accesses are allowed by the R and W fields. The Function may cache this translation value if either R or W is Set. If the U field is Set, the Translated Address field in the translation is not necessarily a valid memory address and the Function may not use the value in a Read or Write Request with AT set to Translated.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

Note: One of the possible uses of this field is to avoid unnecessary invalidations. If a Function uses translated requests for some portions of memory, but not others, then the U field can be used on the portions for which translated requests are not used. When a translation changes if the U field is Set, then it will not necessarily be required that an Invalidate Request be sent to the Function. An example of this use is a Function with a ring buffer that is used for commands. The ring buffer may be allocated for a long period of time and have very high re-use (locality). For this reason, it is useful for the Function to use translated addresses in its memory request that target the command buffer. The same Function

might access data buffers that have poor locality and low reuse. Accesses to the data buffers might best be handled by using untranslated Requests. Setting the U field for the data buffer translations ensures that the Function will not

attempt to use a translated value to access the data buffer so, when the data buffer mappings are changed, no

Invalidation Request is required. When U is Set and either R or W is Set, the ATC is permitted to cache U, R, W Exe, and Priv, as well as the Translation Range Size (see[Section 10.2.3.2)](#bookmark406). An Invalidation Request is required if these values change.

[**10.2.3.5**](10.2.3.5) **Read (R) and Write (W) Fields**

These fields indicate if the returned translation value may be used in a read or write memory request. The ATC may not issue a non-zero read request using the translation value if the R field is Cleared. The ATC may not issue a write request using the translation value if the W field is Cleared. The ATC may not issue any type of request using the translation value if neither the R nor W fields are Set. If both R and W fields are Cleared, the range of the translation is still indicated, but the meaning of the other values in the translation is undefined.

Note: The range of a Translation entry is indicated even if R = W = 0b in order to allow a “hole” in the Translation

Completion. For example,if the Translation Request has a Length of six DWs, then up to three translations could be

included in the Translation Completion. The first and third translations may have Set R or W but the second could have R = W = 0b. To avoid ambiguity about the size of the indicated gap, the range of the gap is indicated in the Translation

Completion even if R = W = 0b.

The R = 0b, W = 0b state is used to indicate that the address field in the translation may not be used to form a translated address value for a subsequent request.

When the host changes the translation in theTA, to make the translation present, the host is not required to send an

invalidation indication to the ATC so that it will know of the change in state of the translation. Since the ATC may not be notified of changes of the translation, a translation value of R = W = 0b must not be cached.

If no table entry is found for the requested address, theTA will return a CplD with a single translation value with R = W = 0b.

Note: Implementations should not assume that receiving a translation response with the R or W bits Set (independent of the value of the Ubit) implies that a subsequent read or write request with the same **untranslated** address will succeed. Although it may be possible for a device and its controlling software to ensure this property, the method for doing so is outside the scope of this specification.

The Priv bit is used to qualify R and W. If Priv is Set, R and W indicate permissions granted to Privileged Mode entities in the Function. If Priv is Clear, R and W indicate permissions granted to Non-Privileged Mode entities in the Function. The R and W values for the two privilege levels are independent. The ATC must not assume any correlation between the

Privileged Mode and Non-Privileged Mode permissions associated with a translation.

[**10.2.3.6**](10.2.3.6) **Execute Permitted (Exe)**

If Exe is Set, the requesting Function is permitted to execute code in the implied range of memory. If Exe is Clear, the requesting Function is not permitted to execute code in the implied range of memory.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

The definition of what it means for a Function to execute code is outside the scope of this specification. Various system components may have different instruction sets. Behavior within the requesting Function when it attempts to execute code that is not permitted by this bit is outside the scope of this specification.

The Exe bit may only be Set theTA supports Execute permissions, the associated Translation Request had a PASIDTLP Prefix with an effective value of 1bfor the Execute Requested bit168 and R is Set in the Translation Completion Data Entry.

Otherwise, the Exe bit must be Clear. This value may be cached if R is Set.

The Priv bit is used to qualify the Exe bit. If Priv is Set, the Exe bit indicates permission granted to Privileged Mode

entities in the Function. If Priv is Clear, the Exe bit indicates permission granted to Non-Privileged Mode entities in the Function. The Exe bit values for the two privilege levels are independent. The ATC must not assume any correlation between the Privileged Mode and Non-Privileged Mode permissions associated with a translation.

Functions may optionally check that:

• If the Execute Requested bit is Clear in a Translation Request, the Exe bits in the associated Translation Completion Data Entries are also Clear.

• If Exe is Set, R is also Set.

If either optional check fails, the Functionshall signal Unexpected Completion (UC). These checks are independently optional.

[**10.2.3.7**](10.2.3.7) **Privileged Mode Access (Priv)**

If Priv is Set, R,W, and Exe refer to permissions granted to entities operating in Privileged Mode in the requesting

Function. If Priv is Clear, R,W, and Exe refer to permissions granted to entities operating in Non-Privileged Mode in the requesting Function.

The meaning of Privileged Mode and Non-Privileged Mode and what it means for an entity to be operating as in

Privileged Mode or in Non-Privileged Mode depends on the protection model of the system and is outside the scope of this specification.

Behavior is outside the scope of this specification when an entity in the requesting Function attempts to access memory that it is not permitted to access.

The Priv bit may only be Set if theTA supports Privileged Mode and the associated Translation Request had a PASIDTLP Prefix with an effective value of 1bfor the Privileged Mode Requested169 bit. Otherwise, the Priv bit must be Clear.

The Privileged and Non-Privileged Mode versions of R, W and Exe are independent. An ATC may cache either or both

versions of R, W and Exe. An ATC that receives a translation with R=W=0b for one privilege level may not assume anything about what it might receive for the other privilege level.

This value may be cached if R or W is Set. This value must be cached when the corresponding R,W, or Exe values are cached.

Note: Since the Priv bit is Set only when the requesting Function Sets thePrivileged Mode Requestedbit, Functions that never set that bit should always receive the Priv bit Clear and thus don’t need to cache it.

Functions may optionally check that when the Privileged Mode Requestedbit is Clear in a Translation Request, the Priv bits in the associated Translation Completion Data Entries are also Clear. If this optional check fails, the Functionshall signal Unexpected Completion (UC).

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAKElEQVRYhe3OAQkAQAgEMO1f6pr9pxBBtgSrAgAAGNZJ3nYCAAC47QOcDgNBkGYMzwAAAABJRU5ErkJggg==)

168. The effective value of the bit is 0b unless the bit in the PASIDTLP Prefix is 1band usage of the bit is enabled for the request.

169. The effective value of the bit is 0b unless the bit in the prefix is 1band usage of the bit is enabled for the request.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**IMPLEMENTATION NOTE**

Execute Permission and Privilege Mode Enforcement

The requesting Function determines whether a particular Memory Request needs Execute permission or is associated with a Privileged Mode or Non-Privileged Mode entity. The ATC implements the protection checks indicated by the Exe and Priv bits.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAOUlEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAAAwG/gAknGBCzHRqDrAAAAAElFTkSuQmCC)

[**10.2.3.8**](10.2.3.8) **Global Mapping (Global)**

If Global is Set, the requesting Function is permitted to create a Global Mapping entry in the ATC for this translation. If Global is Clear, the requesting Function is not permitted to create a Global Mapping entry in the ATC for this translation. Global Mapping entries apply to all PASIDs of the Function. They permit the ATC to reduce the number of translation

requests needed and to reduce the memory needed for caching the results.

A Function is permitted to ignore this bit and always create non-Global Mapping entries in the ATC. This could result in multiple translations being requested for the same Untranslated Address under different PASIDs.

Functions that use this bit must also have the Global Invalidate Supported bit Set (seeSection 10.5.1.2).

**10.2.4 Completions with Multiple Translations**

An ATC is allowed to request that theTA provide translations for a virtually contiguous range of addresses. It does this by setting the Length field in the Translation Request to a value that is two times the number of requested translations as long as the request size (Total Completion Length \* 4) is not larger than either Max\_Read\_Request\_Size or RCB.

If multiple translations are requested, theTA may return one or more translations as long as the number of translations does not exceed the number of requested translations. It is not an error for theTA to return fewer translations than

requested and no error indication is sent unless there is an error in accessing the data.

If the Translation Completion contains multiple translations, all translations must have the same indicated size. Also, successive translations must apply to the virtual address range that abuts the previous translation in the same

completion.

If a translation has both R = 0band W = 0b, theTA must still set the Size field and the lower bits of the Translated Address field used to encode the completion size to appropriate values.

Each translation in a Translation Completion will have some overlap with the implied memory range of the Translation Request (see[Section 10.2.2)](#bookmark397).

A successful Translation Completion must consist of one or two CplDs.

If a Translation Completion CplD has a Byte Count that is greater than four times the Length field, then additional CplDs are required to complete the transaction.

If a Translation Completion CplD has a Byte Count that is equal to four times the Length field, then the packet completes the request. For such a CplD, if the sum of Byte Count and Lower Address is not a multiple of RCB, then the CplD is the

last of a sequence. If no previous CplD for this request has been received, an error has occurred and all translation values should be discarded.

Note: There are multiple reasons that theTA may truncate the results of the completion. For example, the request might ask for a range of addresses, not all of which are defined. This could occur if the first translation is valid but located at the

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

end of a page of translations. TheTA, in looking up the next page of translations, may find that the page is not valid so the addresses are not valid. The range of addresses that are valid would be returned and no error indicated. When

truncating a Translation Completion theTA is not allowed to pad the response with invalid entries (R = 0b, W = 0b).

Note: There are multiple reasons that theTA may break a Translation Completion into multiple TLPs. As an example, if

the virtual address of the Translation Completion resolves to a table access that crosses an RCB boundary of the

memory system, the completion to theTA may be broken into multiple completions by the memory. Rather than require that theTA accumulate the results, it is allowed to send each portion of the Translation Completion to a Function when it is received from the system memory.

**10.3 ATS Invalidation**

ATS uses the messages shown in this section to maintain consistency between theTA and the ATC. This specification

assumes there is a single TA associated with each ATC. TheTA (in conjunction with its associated software) must ensure that the address translations cached in the ATC are not stale by issuing Invalidate Requests.

**10.3.1 Invalidate Request**

When a translation is changed in theTA and that translation might be contained within an ATC in a Function, theTA (in conjunction with its associated software) must send an Invalidate Request to the ATC to maintain proper synchronization between the ATPT and the ATC. An Invalidate Request is used to clear a specific subset of the address range from the ATC. Invalidate Requests are constrained to cover power of 2 multiple of 4096-byte pages.

The format of an Invalidate Request is shown in [Figure 10-13 .](#bookmark408)

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 7 6 | | 5 | +0 4  3 | |  |  | | --- | --- | | 2 | 1 | | 0 | 7 6 | | 5 | +1 4 3 | | 2 1 | | 0 | 7 6 5 | | | +2 4 3 | | 2 | 1 | 0 | 7 | |  |  | | --- | --- | | 6 | 5 | | +3 4  3 | |  |  | | --- | --- | | 2 | 1 | | 0 |
| 0 | Fmt 1 1 | | Type 1 0010 | | | R | TC | | | R | Attr R | R | | T D | E P | Attr R R | | R | | Length  00 0000 0010 | | | | | | |
|  | Requester ID | | | | | | | | | | | | | 0 | 0 | 0 | ITag | | | | | Message Code 0000 0001 | | | | |
| Device ID | | | | | | | | | | | | | | Reserved | | | | | | | | | | | | |
| Reserved | | | | | | | | | | | | | | | | | | | | | | | | | | |

A-0584A

Figure 10-13 Invalidate Request Message

The Invalidate Request is a MsgD transaction with 64 bits of data. Invalidate Request messages may be sent in any TC. The ITag field is constrained to the values 0 to 31 and is used by theTA to uniquely identify requests it issues. ATA must ensure that once an ITag is used, it is not reused until either released by the corresponding Invalidate Completions or by a vendor-specific timeout mechanism (see below).

TheTA may have a single pool of ITag values for all invalidates that it issues or it may have a pool for each Device ID or any other combination. A Device with multiple ATCs on different Functions must manage the ITags separately for each Requester ID.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

The address range specified in an Invalidate Request may span one or more STU 4096-byte pages. Invalidation ranges are required to be naturally aligned and may not be smaller than STU 4096-byte pages. Upon receiving an Invalidate Request with a range less than STU an ATC may either (1) signal an Unsupported Request or (2) round the range of the request up to a value greater than or equal to the STU.

**IMPLEMENTATION NOTE**

Invalidate Completion Timeout

Devices should respond to Invalidate Requests within 1 minute (+50% -0%).Having a bounded time permits an ATPT to implement Invalidate Completion Timeouts and reuse the associated ITag values. ATPT designs are

implementation-specific. As such, Invalidate Completion Timeouts and their associated error handling are outside the scope of this specification.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACoCAYAAADdE69lAAAAN0lEQVRYhe3KoQEAIAzAsI0nsZyG5cphsFPY1DYZ61R07ZmjnS8AAAAAAAAAAAAAAAAAAADgpwskJwROWh0xhAAAAABJRU5ErkJggg==)

The content of the payload is the untranslated address range to be invalidated. The payload format is shown in Remove Word {RD} tag used to help generate ToC/ToF/ToT.

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 7 6 | | 5 | +0 4I 3 | |  |  | | --- | --- | | 2 | 1 | | 0 | 7 | |  |  | | --- | --- | | 6 | 5 | | +1 4I 3 | |  |  | | --- | --- | | 2 | 1 | | 0 | 7 | |  |  | | --- | --- | | 6 | 5 | | +2 4 3 | | | 0 | 7 | |  |  | | --- | --- | | 6 | 5 | | +3 4  3 | 2 | 1 0 | |
|  | Untranslated Address [63:32] | | | | | | | | | | | | | | | | | | | | |  |
| Untranslated Address [31:12] | | | | | | | | | | | | | | S | Reserved | | | | | | |  |

|  |  |
| --- | --- |
| 2 | 1 |

▲

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAIAAAAUCAYAAACnOeyiAAAAKElEQVQImc3GoQEAIAgAsInRYPIGDvZgLRzB0mDAxIXAwQ6lRx4k1geP/gIPgjPEiAAAAABJRU5ErkJggg==)

Global Invalidate

Figure 10-14 Invalidate Request Message Body

The S field is used to indicate if the range being invalidated is greater than 4096 bytes. Its meaning is the same as for the Translation Completion (see [Section 10.2.3.1](#bookmark404)and [Section 10.2.3.2)](#bookmark406).

The Global Invalidate bit indicates that the Invalidation Request Message affects all PASID values (see[Section 10.3.8)](#bookmark409). This bit is Reserved unless the Invalidation Request has a PASIDTLP Prefix. The bit is ignored by the ATC if Global

Invalidate Supported bit is Clear (see[Section 10.3.8)](#bookmark410).

**10.3.2 Invalidate Completion**

When a Function completes an Invalidate operation, it will send one or more Invalidate Completion messages to theTA. These messages must be tagged with information extracted from the Invalidate Request to enable theTA to associate the Invalidate Completions with the Invalidate Request.

The format of the Invalidate Completion message is shown in [Figure 10-13 .](#bookmark408)

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 7 6 | | 5 | +0 4  3 | |  |  | | --- | --- | | 2 | 1 | | 0 | 7 6 | | 5 | +1  4 3 2 1 0 | | | | | 7 6 5 | | | +2 4 3 | | 2 1 0 | | | +3  7 6 5 4 3 2 1 0 | | | | | | | |
| 0 | Fmt 0 1 | | Type  1 0 0 1 0 | | | 0 | TC | | | 0 | Attr R | 0 | 0 | T D | E P | Attr R R | | R | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Requester ID | | | | | | | | | | | | | | Reserved | | | | | | | | Message Code 0000 0010 | | | | | | | |
| Device ID | | | | | | | | | | | | | | Reserved | | | | | | | | | | | | | CC | | |
| ITag Vector | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |

A-0586A

Figure 10-15 Invalidate Completion Message Format

The Invalidate Completion message is a Msg transaction routed by ID. The Requester ID field of the Invalidate

Completion message is set to the Requester ID of the Function containing the ATC. The Device ID field of the Invalidate Completion is set to the Requester ID of theTA. The ATC may derive the Requester ID of theTA from the Requester ID field of the corresponding Invalidate Request. Alternatively, since the ATC is only associated with a single TA, the ATC may sample and store the Requester ID from the first Invalidate Request following a Fundamental Reset or FLR.

Subsequent Invalidate Completion messages may use this value to set the Device ID field of Invalidate Completion messages.

The Completion Count (CC) field indicates the number of individual Invalidate Completion messages that must be sent for the associated Invalidate Request. Setting the CC field to 0 indicates that eight responses must be sent. TheTA is

responsible for collecting all the responses associated with a given Tag before considering the corresponding Invalidate Request to be complete.

Invalidate Completion messages may be sent on any TC, independent of the TC the originating Invalidate Request was received. This enables implementations to utilize the Invalidate Completion to push outstanding transactions to theTA to guarantee the required invalidation semantics are met. Implementations that utilize a single Upstream TC are

required to send a single Invalidate Completion in the utilized TC.

The ITag Vector field is used to indicate which Invalidate Request has been completed. Each of the 32 possible ITag field values from the Invalidation Request is represented by a single bit in the ITag Vector field. The least significant bit (bit 0; i.e., the right-most bit in the schematic representation of the Invalidate Completion message shown in [Figure 10-13)](#bookmark408) of the ITag Vector field corresponds to the ITag field value of 0. The most significant bit (bit 31) of the ITag Vector field

corresponds to the ITag field value of 31. Implementations are allowed to coalesce multiple Invalidate Completions by setting multiple ITag Vector bits in a single message provided the following conditions are met:

• The Invalidate Completions flow in the same TC.

• The Invalidate Completions have the same CC value.

• All fragments of an Invalidate Completion must have identical Request ID, CC, and ITag Vector fields.

ATA that receives an Invalidation Completion for an ITag that has no outstanding Invalidation Request shall report this

error using implementation specific mechanisms. One possible such mechanism is to report the Invalidation Completion as an Unexpected Completion (UC).

Functions that do not support ATS will treat an Invalidate Request as UR.

Functions supporting ATS are required to send an Invalidate Completion in response to a Invalidate Request

independent of whether the Bus Master Enable bit is Set or not. Note that the above conditions must be satisfied even when Bus Master Enable is Cleared. The method for a device to achieve this is implementation dependent.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**IMPLEMENTATION NOTE**

Bus Master Enable Change

When Bus Master Enable changes from Set to Clear, no further memory requests should be queued. It is possible

that queued write requests are present when BME is Cleared. These requests could block an Invalidate Completion. These requests must be either sent or dropped. This will ensure that all outstanding write transactions that are potentially dependent upon the outstanding invalidation are complete.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACoCAYAAADdE69lAAAAOklEQVRYhe3MMREAIAwEwQSTtEijRWVQADGw1/7ORzRlrFPPdc8c3QMAAAAAAAAAAAAAAAAAAADw6QKWKwRO6L9RAwAAAABJRU5ErkJggg==)

**10.3.3 Invalidate Completion Semantics**

Before an ATC can return an Invalidate Completion for a given Invalidate Request, it must ensure the following conditions are satisfied:

• All new requests initiated by the Function will not utilize stale address translations.

• All outstanding read requests utilizing translated address matching the invalidated range have either completed or been tagged to be discarded (method to discard is implementation specific).

• All outstanding posted writes utilizing a translated address matching the invalidated range have been pushed to theTA. The ATC is required to send a copy of the Invalidate Completion message in each TC in which a

posted write has been issued but not known to have been pushed to theTA. The CC field must be set to the same value in each copy of the Invalidate Completion message indicating number of copies sent. TheTA is responsible for collecting all sent responses before considering the invalidation to be complete.

**IMPLEMENTATION NOTE**

Implied TC Flushing

When making the decision as to which TC to send Invalidate Completions, an ATC may infer, in an implementation specific manner, that an issued posted write has been pushed to theTA. For example, a Function that has sent a read transaction to a destination above theTA and received its corresponding response may infer that any

preceding posted writes issued in the same TC have been pushed to theTA.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACnCAYAAAAsRR2wAAAAN0lEQVRYhe3KoQEAIAzAsI0nsZyG5cphsOOC1DYZ61R07ZmjnS8AAAAAAAAAAAAAAAAAAADg0wWWCgRONTYlZAAAAABJRU5ErkJggg==)

**10.3.4 Request Acceptance Rules**

In accord with the request acceptance rules enumerated in this section, a Function is not allowed to create a

dependency in which the acceptance of a posted transaction is dependent upon the transmission of a posted

transaction. Given Invalidate Requests and Invalidate Completions both are posted transactions, Functions must not make the acceptance of an Invalidate Request dependent upon the transmission of an Invalidate Completion. The method for achieving this is implementation specific.

A Function with an ATS capability in its configuration space must be able to accept Invalidate Requests and send Invalidate Completions even if ATS is not enabled.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**IMPLEMENTATION NOTE**

Invalidate Queue Depth

An ATC is only associated with a single TA. Each TA is limited to a total of 32 outstanding invalidations to any given

ATC. This limits the number of outstanding Invalidation Requests active to a single ATC to 32. To avoid a post-to-post dependency, an ATC is required to accept up to 32 Invalidation Requests.

An ATC may choose to implement a maximally sized input queue holding Invalidate Requests. Alternatively, an ATC may choose to implement a maximally sized output queue holding Invalidate Completions. Note that

queuing Invalidate Completions requires significantly less state per entry resulting in a potentially more efficient implementation than input queue buffering.

Note that the choice of whether to implement input queuing or output queuing (or a hybrid of both) has no impact on ensuring deadlock free behavior. But implementation choices with regard to queuing may have a significant impact on performance (see[Section 10.3.5)](#bookmark412).

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAEgCAYAAACJhzJkAAAARklEQVRoge3MoREAIAwEwYQmsZSGpcpQAROJ2be38xHNMtapZ90zR/cAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAADwBVyAyQU+MacGSgAAAABJRU5ErkJggg==)

**10.3.5 Invalidate Flow Control**

Due to the variety of caching architectures and queuing strategies, implementations may vary greatly with respect to

invalidation latency and throughput. It is possible that a TA may generate Invalidate Requests at a rate that exceeds the average ATC service rate. When this happens, the credit based flow control mechanisms will throttle theTA issue rate. A side effect of this is congestion spreading to other channels and Links through the credit based flow control mechanism. Depending on the frequency and duration of this congestion, performance may suffer. It is highly recommended that TA and its associated software implement higher level flow control mechanisms.

To assist with the implementation of Invalidate Flow Control, an ATC must publish the number of Invalidate Requests it can buffer before back pressuring the Link. This field applies to all invalidations serviced by the Function, independent of the size of the invalidation. This value is communicated in the Invalidate Queue Depth field in the ATS capability

structure (seeSection 7.8.8). A value of 0 0000b indicates that invalidate flow control is not necessary to this Function.

**IMPLEMENTATION NOTE**

Invalidate Flow Control

A Function may indicate that invalidate flow control is not required when one or more of the following is true:

1. The Function can handle invalidations at the maximum arrival rate of Invalidate Requests.

2. The Function will not or very rarely cause Link backpressure (performance loss is negligible).

3. The Function can fully buffer the maximum number of incoming invalidations without back pressuring the Link.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAADZCAYAAAAdbXEBAAAAPElEQVRYhe3KoQEAIAzAsI0nsZyG5cphsNOY1DYZ61R07ZmjnS8AAAAAAAAAAAAAAAAAAAAAAAAAAL6DC58vBLIXUGXwAAAAAElFTkSuQmCC)

**10.3.6 Invalidate Ordering Semantics**

Invalidate Requests and Translation Completions may be sent using different TC and are, therefore, unordered with respect to each other (from the Link’s perspective). An ATC must ensure that the proper invalidation behavior is

maintained when an Invalidate Request bypasses a Translation Completion to an overlapping region.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

An ATC must “snoop” its outstanding translation request queue against all arriving Invalidate Requests. When snooping a request for a N\*STU sized translation (Nis a power of 2), the ATC must snoop the range of addresses starting at the STU aligned region containing the specified address and ending (N-1) STU size pages later.

If an Invalidate Request overlaps the address range in an outstanding Translation Request, the Translation Request must be tagged as invalid and the results of its corresponding Translation Response must be discarded prior to transmission of the Invalidate Completion. If the Translation Response is received before the Invalidate Completion is sent, an

implementation is free to issue requests utilizing the translation result provided the Invalidate Completion Semantics (see [Section 10.3.3)](#bookmark411) are satisfied.

**IMPLEMENTATION NOTE**

Request Range Overlap in Invalidations

In the description above, N is the number of STU sized translations that were requested in the Translation Request. This is equal to (Length field in Translation Request)/2.

As an example:

STU is 00 0010b indicating 16384-byte pages.

An outstanding Translation Request has a Length field of 00 0000 0100b indicating two translations covering a range of 32768 bytes.

The high-order 48 bits of the Translation Request are 0000 0FFF FFFFh.

The low-order 16 bits of the address in the request are 11xx xxxx xxxx xxxxb indicating that the translation request covers a range that overlaps a 32768-byte boundary (in fact, the request crosses a 16-TB boundary).

If two translations are returned, they would cover the two STU sized regions at 0000 0FFF FFFF C000hand 0000 1000 0000 0000h.

An Invalidate Request is received with the high-order 48 bits of 0000 1000 0000hand the low-order 16 bits of 0001 1xxx xxxx xxxxb.

The ATC must detect that a translation associated with a portion of the Translation Request is now invalidated and the Translation Completion associated with the invalidated region must be discarded (for simplification, the ATC is allowed to discard all of the Translation Completion).

It should be noted that, processing of the Invalidate Requests is simplified if Translation Requests do not cross alignment boundaries of the request. The Translation Request from the above example is not aligned to a

32768-byte boundary. If it were broken into two requests, it would be simpler to associate the range of the

Invalidate Request with the address in the Translation Request. Breaking the Translation Requests into aligned requests is not a requirement.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAAIdCAYAAAAAizyKAAAAXUlEQVR4nO3KoREAIAwAsZYlsYyGZUow2B6ey9tPxlg7qmbPVs4bAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAADAf+DZAc2gBzgM5YKDAAAAAElFTkSuQmCC)

**10.3.7 Implicit Invalidation Events**

The following events will cause the invalidation of all ATC entries:

• Conventional Reset (all forms)

• Function Level Reset

• E field in ATS Capability changes from Clear to Set

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

The following events will cause the invalidation of all non-Global Mapping ATC entries that were requested using a specific PASID:

• Stopping the use of a PASID as defined inthis specification.

No explicit Invalidate Completion message is sent when these implied invalidate events occur.

**IMPLEMENTATION NOTE**

Implicit Invalidation and PASID

Software may not change any of the PASID enable bits when the E field in the ATS Capability is Set. The

invalidation that occurs when software Sets the E field also invalidates ATC entries with an associated PASID value.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAN0lEQVRYhe3KoREAIAwAsZYlsYyGZUow2B4DkLefjLF2VM2erZw3AAAAAAAAAAAAAAAAAPgNPDvTUQQsJUhN5wAAAABJRU5ErkJggg==)

**10.3.8 PASIDTLP Prefix and Global Invalidate**

The requirements in this section apply to Functions that support the PASIDTLP Prefix. For Invalidation Requests that have a PASIDTLP Prefix, the ATC shall:

• Optionally signal Unsupported Request (UR) if the associated PASID value is greater than or equal to 2Max PASID Width. This error may be signaled anytime an out of range PASID value is present, even when the PASID value is ignored (see below).

• Return an Invalidation Completion if PASID Enable is Clear.

• If the Function supports Global Invalidate (seeSection 7.8.8.2):

。 If the Global Invalidate bit in the Request is Set, invalidate Global and non-Global Mapping entries in the ATC within the indicated memory range associated with any PASID value and return an

Invalidation Completion. The PASID value in the PASIDTLP Prefix is ignored.

。 If the Global Invalidate bit in the Request is Clear, invalidate only non-Global Mapping entries in the ATC within the indicated memory range that were requested using the associated PASID value and return an Invalidation Completion.

Global Mapping entries in the ATC for some or all of the indicated memory range may be retained.

• If the Function does not support Global Invalidate (seeSection 7.8.8.2), invalidate entries in the ATC within the indicated memory range that were requested using the associated PASID value and return an Invalidation

Completion.

• If no matching entries are present in the ATC, invalidate no ATC entries and return an Invalidation Completion. For Invalidation Requests that do not have a PASIDTLP Prefix, the ATC shall:

• Invalidate ATC entries within the indicate memory range that were requested without a PASID value.

• Invalidate ATC entries at all addresses that were requested with any PASID value.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

**10.4 Page Request Services**

The general model for a page request is as follows:

1. A Function determines that it requires access to a page for which an ATS translation is not available.

2. The Function causes the associated Page Request Interface to send a Page Request Message to its RC. A Page Request Message contains a page address and a Page Request Group (PRG) index. The PRG index is used to identify the transaction and is used to match requests with responses.

3. When the RC determines its response to the request (which will typically be to make the requested page resident), it sends a PRG Response Message back to the requesting Function.

4. The Function can then employ ATS to request a translation for the requested page(s).

A Page Request Message is a PCIe Message Request that is Routed to the Root Complex with a Message Code of 4

(0000 0100b). The mechanism employed at the RC to buffer requests is implementation specific. The only requirement is that an RC not silently discard requests.

All Page Request Messages and PRG Response Messages travel in PCIe Traffic Class 0.A Page Request Message or PRG Response Message with a Traffic Class other than 0 shall be treated as Malformed TLPs by the RC or endpoint that

receives the same. Intermediate routing elements (e.g., Switches) shall not detect this error.

The Relaxed Orderingand ID-Based Orderingbits in the Attr field of Page Request Messages and PRG Response messages may be used. The No Snoop bit in the Attr field is reserved.

The page request service allows grouping of page requests into Page Request Groups (PRGs). A PRG can contain one or more page requests. All pages in a PRG are responded to en mass by the host. Individual pages within a PRG are

requested with independent Page Request Messages and are recognized as belonging to a common PRG by sharing the same PRG index. The last request of a PRG is marked as such within its Page Request Message. One request credit is

consumed per page request (not per PRG).

A PRG Response Message is a PCIe Message Request that is Routed by ID back to the requesting Function. It is used by

system software to alert a Function that the page request(s) associated with the corresponding PRG has (have) been

satisfied. The page request mechanism does not guarantee any request completion order and all requests are inherently independent of all other concurrently outstanding requests. If a Function requires that a particular request be

completed before another request, the initial request will need to complete before the subsequent request is issued. It is valid for a Function to speculatively request a page without ascertaining its residence state and/or to issue multiple

concurrently outstanding requests for the same page.

A Page Request Interface is allocated a specific number of page request message credits. An RC (system software) can

divide the available credits in any manner deemed appropriate. Any measures the host chooses to employ to ensure that credits are correctly metered by Page Request Interfaces (a Page Request Interface is not using more than its allocation) is an implementation choice. A Page Request Interface is not allowed to oversubscribe the available number of requests (doing so can result in the page request mechanism being disabled if the buffer limit is exceeded at the root). A Page

Request Interface’s page request allocation is static. It is determined when the Page Request Interface is enabled and can only be changed by disabling and then re-enabling the interface.

**10.4.1 Page Request Message**

A Function uses a Page Request Message to send page requests to its associated host. A page request indicates a page needed by the Function. The Page Request Interface associated with a Function is given a specific Page Request

allocation. A Page Request Interface shall not issue page requests that exceed its page request allocation.

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

A page request contains the untranslated address of the page that is needed, the access permissions needed for that

page, and a PRG index. A PRG Index is a 9-bit scalar that is assigned by the Function to identify the associated page

request. Multiple pages may be requested using a single PRG index. When more than a single page is to be associated

with a given PRG, the Last flag in the Page Request Record is cleared in all the requests except the last request associated with a given PRG (the flag is set in the last request). Page requests are responded to en mass. No response is possible

(except for a Response Failure error) until the last request of a PRG has been received by the root. The number of PRGs that a Function can have outstanding at any given time is less than or equal to the associated Page Request Interface’s Outstanding Page Request Allocation. It is valid for a request group to contain multiple requests for the same page and for multiple outstanding PRGs to request the same page.

The first two DWs of a Page Request Message contain a standard PCIe message header. The second two DWs of the message contain page request specific data fields.

00h

04h

08h

0Ch

|  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| +0 | | | | | | +1 | | | | | | | +2 | | | | | | | +3 | | | | | |
| 7 6 5 4 | | | | |  |  |  | | --- | --- | --- | | 3 | 2 | 1 | | 0 | 7 6 | | |  |  | | --- | --- | | 5 | 4 3 2 1 | | | | | 0 | 7 6 5 | | | |  |  | | --- | --- | | 4 3 | 2 1 | | | | 0 | 7 | |  |  |  |  |  |  | | --- | --- | --- | --- | --- | --- | | 6 | 5 | 4 | 3 | 2 | 1 | | | | | 0 |
| 0 | 0 | 1 | Type  (10000) | | | T 9 | TC | | T 8 | Attr | R | | T D | E P | Attr | | R | Length (0) | | | | | | | |
| Requestor ID | | | | | | | | | | | | | Tag | | | | | | | Message Code (0000 0100b) | | | | | |
| Page Address [63:32] | | | | | | | | | | | | | | | | | | | | | | | | | |
| Page Address [31:12] | | | | | | | | | | | | | | | | | Page Request Group Index | | | | | L | W | R | |

A-0737A

Figure 10-16 Page Request Message

Table 10-5 Page Request Message Data Fields

|  |  |
| --- | --- |
| Field | Meaning |
| R | **Read Access Requested**- This field, when Set, indicates that the requesting Function seeks read access to the associated page. When Clear, this field indicates that the requesting Function will not read the associated page.  The R field must be Set for Page Requests with a PASID TLP Prefixthat has the Execute Requested bit Set. If R and W are both Clear and L is Set, this is a Stop Marker (see[Section 10.4.1.2.1)](#bookmark413). |
| W | **Write Access Requested**- This field, when Set, indicates that the requesting Function seeks write access and/or  zero-length read access to the associated page. When Clear, this field indicates that the requesting Function will not write to the associated page.  Upon receiving a Page Request Message with the W field Set, the host is permitted to mark the associated page dirty. Thus, Functions must not issue such Requests unless the Function has been given explicit write permission.  If R and W are both Clear and L is Set, this is a Stop Marker (see[Section 10.4.1.2.1)](#bookmark414). |
| L | **Last Request in PRG** - This field, when Set, indicates that the associated page request is the last request of the  associated PRG. A PRG can have a single entry, in which case the PRG consists of a single request in which this field is Set. When Clear, this field indicates that additional page requests will be posted using this record’s PRG Index. |

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

|  |  |
| --- | --- |
| Field | Meaning |
|  | If R and W are both Clear and L is Set, this is a Stop Marker (see[Section 10.4.1.2.1)](#bookmark415). |
| Page  Request  Group  Index | **Page Request Group Index** - This field contains a Function supplied identifier for the associated page request. A  Function need not employ the entire available range of PRG index values. A host shall never respond with a PRG Index that has not been previously issued by the Function and that is not currently an outstanding request PRG Index (except when issuing a Response Failure, in which case the host need not preserve the associated request’s PRG Index value in the error response). |
| Page Address | **Page Address** - This field contains the untranslated address of the page to be loaded. For pages larger than 4096 bytes, the least significant bits of this field are ignored. For example, the least significant bit of this field is ignored when an 8096-byte page is being requested. |

**IMPLEMENTATION NOTE**

Last Bit and Relaxed Ordering

If multiple page requests are associated with a single PRG index, the last page request of a PRG should have the Relaxed Orderingattribute bit Clear in addition to having the Last flag Set. All other page request messages may have the Relaxed Orderingattribute bit set to any value.

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAgAAACXCAYAAAAoPxodAAAAOUlEQVRYhe3MMREAIAwEwQSTtEijRWVQwEQA++3tfESzjHXqWffM0T0AAAAAAAAAAAAAAAAAwG/gAknGBCzHRqDrAAAAAElFTkSuQmCC)

[**10.4.1.1**](10.4.1.1) **PASID TLP Prefix Usage**

The PASID Extended Capability indicates whether a Function supports PASIDTLP Prefixes and whether it is enabled to send and receive them.

Functions that support the PASID TLP Prefixare permitted to send a PASID TLP Prefixon Page Request Messages. The PASID field contains the process address space of the page being requested and the Execute Requestedand Privileged Mode Requested bits indicate the access being requested.

If one Page Request Message in a PRG has a PASID TLP Prefix, all Page Request Messages in that PRG must contain identical PASIDTLP Prefixes. Behavior is undefined when the PASIDTLP Prefixes are inconsistent.

Functions that support the PASID TLP Prefixand have the PRG Response PASID Required bit Set (seeSection 10.5.2.3), expect that PRG Response Messages will contain a PASID TLP Prefixif the associated Page Request Message had a PASID TLP Prefix. For such PRG Response Messages, the Execute Requestedand Privileged Mode Requested bits are reserved and the PASID field contains the PASID from the associated Page Request Message.

[**10.4.1.2**](10.4.1.2) **Managing PASID TLP Prefix Usage on PRG Requests**

There are rules for stopping and starting the use of a PASID.

This section describes additional rules that apply to Functions that have issued Page Request Messages in a PASID that is being stopped. No additional rules are required to start the usage of the Page Request Interface for a PASID.

When stopping the use of a particular PASID, a Stop Marker Message may be optionally used to avoid waiting for PRG Response Messages before the Function indicates that the stop request for a particular PASID has completed.

To stop without using a Stop Marker Message, the Functionshall:

5.0-1.0-PUB — PCI Express® Base Specification Revision 5.0 Version 1.0

1. Stop queueing new Page Request Messages for this PASID.

2. Finish transmitting any multi-page Page Request Messages for this PASID (i.e. send the Page Request Message with the L bit Set).

3. Wait for PRG Response Messages associated any outstanding Page Request Messages for the PASID.

4. Indicate that the PASID has stopped using a device specific mechanism. This mechanism must indicate that a Stop Marker Message will not be generated.

To stop with the use of a Stop Marker Message the Functionshall:

1. Stop queueing new Page Request Messages for this PASID.

2. Finish transmitting any multi-page Page Request Messages for this PASID (i.e. send the Page Request Message with the L bit Set).

3. Internally mark all outstanding Page Request Messages for this PASID as stale. PRG Response Messages associated with these requests will return Page Request Allocation credits and PRG Index values but are otherwise ignored.170

4. Indicate that the PASID has stopped using a device specific mechanism. This mechanism must indicate that a Stop Marker Message will be generated.

5. Send a Stop Marker Message to indicate to the host that all subsequent Page Request Messages for this PASID are for a new use of the PASID value.

Note: Steps 4 and 5 may be performed in either order, or in parallel.

**10.4.1.2.1 Stop Marker Messages**

A Stop Marker Message indicates that a Function has stopped using the Page Request Interface and has transmitted all

pending Page Request Messages for a specific PASID. Stop Marker Messages are strongly ordered with respect to Page

Request Messages and serve to push Page Request Messages toward the Host. When the Host receives the Stop Marker Message, this indicated that all Page Request Messages associated with the PASID being stopped have been delivered

and that any subsequent Page Request Message with the same PASID value are associated with a new incarnation of that PASID value.

Stop Marker Messages do not have a response. They do not have a PRG Index and do not consume Page Request allocation (see Section 10.5.2.5).

The Stop Marker Message bit layout is shown in Figure 10-17 .

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAKElEQVRYhe3OAQkAQAgEMO1f6pr9pxBBtgSrAgAAGNZJ3nYCAAC47QOcDgNBkGYMzwAAAABJRU5ErkJggg==)

![](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAx4AAAACCAYAAADWxgpDAAAAIklEQVRYhe3BAQ0AAAgDoNvUeqbUHG5ApWcDAAAAAADw2QHZqwIKU+HaKwAAAABJRU5ErkJggg==)170. Page Request Allocation is shared across all PASIDs of the Function (see Section 10.5.2.5). IfPRG Response PASID Requiredis Clear, PRG Index values are shared across all PASIDs of the Function (see Section 10.5.2.3).

Page 1200