# PERSISTENT MEMORY ARCHITECTURE

In this chapter, we provide an overview of the persistent memory architecture while focusing on the essentials of the hardware to emphasize requirements and decisions that developers need to know.

Applications that are designed to recognize the presence of persistent memory in a system can run much faster than using other storage devices because data doesn't have to transfer back and forth between the CPU and slower storage devices. Applications that only utilize persistent memory may be slower than DRAM. Therefore, applications should decide what data resides in DRAM, persistent memory, and storage.

In Chapter 1, we described the *memory-storage* hierarchy which introduces a persistent memory tier to fill the gap between fast, expensive, capacity limited DRAM and slower, cheap, seemingly limitless capacity traditional storage. The use of persistent memory opens new possibilities for application architectures and features. Applications can persistently store in-memory data structures natively without having to serialize and de-serialize the data into some other format, such as storing data in files or a database. Storing data structures in their native state can reduce code complexity and significantly expedite application startup times since the application need only open and memory map the file to access the data rather than having to read data in from external sources, and convert it to in-memory data. Data residing in persistent memory means applications no longer need to wait for caches to warm up as the data is immediately available when it starts.

With the capacity of persistent memory many times larger than DRAM, the volume of data that applications can potentially store and process in place is also much larger. This significantly reduces the number of disk I/Os which improves performance and reduces wear on the storage media.

On systems without persistent memory, large datasets that cannot fit into DRAM have to be processed in segments or streamed. This introduces processing delays as the application stalls waiting for data to be paged from disk or streamed from the network.

If the working dataset size fits within the capacity of persistent memory and DRAM, applications can perform in-memory processing without needing to checkpoint or page data to or from storage. This results in significant performance improvements.

### **Persistent Memory Characteristics**

As with every new technology, there are always new things to consider. Persistent memory is no exception. Below are some of these characteristics to consider when architecting and developing solutions.

- Performance (throughput, latency, and bandwidth) of persistent memory is much better than NAND, but potentially slower than DRAM.
- Persistent memory is durable like DRAM. Persistent memory usually has orders of magnitude better endurance than NAND and should exceed the lifetime of the server without wearing out.
- Persistent memory module capacities are typically much larger than DRAM DIMMs and can co-exist on the same memory channels.
- Persistent memory enabled applications can update data in-place without needing to serialize/deserialize the data.
- Persistent memory is byte-addressable like memory. Applications can update just the data needed without any read-modify-write overhead.
- Data is CPU Cache Coherent.
- Persistent memory provides DMA and RDMA operations.
- Data written to persistent memory is not lost when power is removed. Data may be retained for years without risk of data degradation or corruption.
- Data located on persistent memory is directly accessible from user space.
   No Kernel code, file system page caches, or interrupts are in the data path.
- Data on persistent memory is instantly available, ie:
  - Data is available as soon as power is applied to the system.
  - Applications don't need to spend time warming up caches. They can access the data immediately upon memory mapping it.

- Data residing on persistent memory has no DRAM footprint unless the application copies data to DRAM for faster access.
- Data written to persistent memory modules is local to the system.
   Applications are responsible for replicating data across systems.

#### **Platform Support for Persistent Memory**

Platform vendors such as Intel, AMD, and ARM will decide how persistent memory should be implemented at the lowest hardware levels. Throughout this book, we try to provide a vendor-agnostic perspective and will call out platform-specific details along the way.

For systems with persistent memory, failure atomicity guarantees systems can always recover to a consistent state following a power or system failure. Failure atomicity for applications can be achieved using logging, flushing, and memory store barriers that order such operations. Logging, either undo or redo, ensures atomicity when a failure interrupts the last atomic operation from completion. Cache flushing ensures that data held within volatile caches reach the persistence domain, so it won't be lost if a sudden failure occurs. Memory store barriers, such as an SFENCE operation on x86, help prevent potential reordering in the memory hierarchy, as caches and memory controllers may reorder memory operations. For example, a barrier ensures the undo log copy of the data gets persisted onto the persistent memory before the actual data is modified in-place. This guarantees the last atomic operation can be rolled-back should a failure occur. However, it is non-trivial to add such failure atomicity in user applications with low-level operations such as write logging, cache flushing, and barriers. The persistent memory development kit (PMDK) was developed to isolate developers from having to re-implement the hardware intricacies.

This should be a familiar concept as most file systems implement and perform journaling and flushing of dirty pages to non-volatile storage devices.

#### Cache Hierarchy

We use load and store operations to read and write to persistent memory rather than using block-based I/O to read and write to traditional storage. It is a good idea to read the CPU architecture documentation for an in-depth description since each

successive CPU generation may introduce new features, methods, and optimizations.

A CPU cache typically has three distinct levels, L1, L2, and L3. The hierarchy makes references to the distance from the CPU core, its speed and size of the cache. The L1 cache is closest to the CPU. It is extremely fast but very small. L2 and L3 caches are increasingly larger in capacity, but they are also relatively slower. Figure 2-1 shows a typical CPU microarchitecture with three levels of CPU cache and a memory controller with three memory channels. Each memory channel has a single DRAM and persistent memory (PMEM) attached. All three levels of CPU cache are volatile, so any content that has not been flushed to non-volatile storage will be lost if the system crashes or power is lost.



Figure 2-1: CPU Cache and Memory Hierarchy

An L1 (Level 1) cache is the fastest memory that is present in a computer system. In terms of priority of access, the L1 cache has the data the CPU is most likely to need while completing a specific task. The L1 cache is also usually split two ways - into the instruction cache (L1 I) and the data cache (L1 D). The instruction cache deals with the information about the operation that the CPU has to perform, while the data cache holds the data on which the operation is to be performed.

An L2 (Level 2) cache has a larger capacity than the L1 cache but it is slower. L2 cache holds data that is likely to be accessed by the CPU next. In most modern

CPUs, the L1 and L2 caches are present on the CPU cores themselves, with each core getting dedicated caches.

An L3 (Level 3) cache is the largest cache memory, but it is also the slowest of the three. Commonly, it is also a shared resource among all the cores on the CPU. Additionally, it may be internally partitioned to allow each core to have dedicated L3 resources.

Data read from DRAM or persistent memory is transferred through the memory controller into the L3 cache, then propagated into the L2 cache, and finally the L1 cache where the CPU core consumes it. When the processor is looking for data to carry out an operation, it first tries to find it in the L1 cache. If the CPU can find it, the condition is called a cache hit. If the CPU cannot find the data within the L1 cache, it then proceeds to search for it first within L2, then L3. If it cannot find the data in any of the three, it tries to access it from memory. Each failure to find data in a cache is called a *cache miss*. Failure to locate the data in memory requires the operating system to page the data into memory from a storage device.

When the CPU writes data, it is initially written to the L1 cache. Due to ongoing activity within the CPU, at some point in time, the data will be evicted from the L1 cache into the L2 cache. The data may be further be evicted from L2 and placed into L3 and eventually evicted from L3 into the memory controllers write buffers where it is then written to the memory device.

In a system that does not possess persistent memory, the data in DRAM is persisted to a non-volatile storage device such as an SSD, HDD, SAN, NAS, or a volume in the cloud. This protects data from application or system crashes. Critical data can be manually flushed using calls such as <code>msync()</code>, <code>fsync()</code> or <code>fdatasync()</code> which flushes uncommitted dirty pages from volatile memory to the non-volatile storage device. File systems protect applications from torn pages by frequently flushing dirty page data to non-volatile storage, and provide <code>recovery mechanisms</code> in the case of system crashes. File systems provide <code>fdisk</code> or <code>chkdsk</code> utilities to check and attempt repairs on damaged filesystems if required.

Persistent memory provides no such guarantees without software assistance. Applications memory map the persistent memory address range directly into its own memory address space. Therefore, the application must, assume responsibility for checking and guaranteeing data integrity. The rest of this chapter describes the responsibilities for application developers in a persistent memory environment and how to achieve data consistency and integrity.

#### **Power Fail Protected Domains**

A computer system may include one or more CPUs, volatile or persistent memory modules, and non-volatile storage devices such as solid-state drives or hard disk drives.

System platform hardware supports the concept of a *persistence domain*, also called *power fail protected domains*. Depending on the platform, a persistence domain may include the persistent memory controller and write queues, memory controller write queues, and CPU caches. Once data has reached the persistence domain, it may be recoverable during a process that results from a system restart. That is, if data is located within hardware write queues or buffers protected by power failure domain applications should assume it is safe. For example, if a power failure occurs, the data will be flushed from the power fail protected domain using stored energy guaranteed by the platform for this purpose. Data that has not yet made it into the protected domain will be lost. Recoverability depends on whether the pattern of failures affecting the system during the restart can be tolerated by the design and configuration of the persistence domain.

Multiple persistence domains may exist within the same system, for example on systems with more than one physical CPU. Systems may also provide a mechanism for partitioning the platform resources for isolation. It is an administrative requirement to align persistence domains with volumes and file systems. This must be done in such a way that SNIA NVM Programming Model behavior is assured from each compliant volume or file system. In Chapter 3, we describe the programming model as it applies to operating systems and file systems. The "Detecting Platform Capabilities" section within this chapter describes the logic that applications should perform to detect platform capabilities including power failure protected domains. Future chapters provide a more in-depth discussion into why, how, and when applications should flush data, if required, to guarantee the data is safely within the protected domain and persistent memory.

Volatile memory loses its contents when the computer system's power is interrupted. Just like non-volatile storage devices, persistent memory keeps its contents even in the absence of system power. Data that has been physically saved to the persistent memory media is called "data at rest." "Data in-flight" has the following meanings:

- Writes sent to the persistent memory device but have not yet been physically committed to the media.
- Any writes that are in progress, but not yet complete.

 Data that has been temporarily buffered or cached in either the CPU caches or memory controller.

When a system is gracefully rebooted or shut down, the system maintains power and can ensure all contents of the CPU caches and memory controllers are flushed such that any in-flight or uncommitted data is successfully written to persistent memory or non-volatile storage. In the situation where an unexpected power failure occurs and assuming no uninterruptable power supply is available, the system must have enough stored energy within the power supplies and capacitors dotted around the system to flush data before the power is completely exhausted. Any data that is not flushed is lost and not recoverable.

ADR stands for *Asynchronous DRAM Refresh*. ADR is a feature supported on Intel chipsets that triggers a hardware interrupt to the memory controller which flushes the write-protected data buffers and place the DRAM in self-refresh. This process is critical during a power loss event or system crash to ensure the data is in a safe and consistent state on persistent memory. By default, ADR does not flush the processor caches. A platform that supports ADR only includes persistent memory and the memory controller's write pending queues within the persistence domain. This is the reason data in the CPU caches must be flushed by the application using the CLWB, CLFLUSHOPT, CLFLUSH, Non-Temporal Stores, or WBINVD machine instructions.

Enhanced Asynchronous DRAM Refresh (eADR) requires that an NMI routine be called to flush the CPU caches before the ADR event can begin. Applications running on an eADR platform do not need to perform flush operations because the hardware should flush the data automatically, but they are still required to perform an SFENCE operation to maintain write order correctness. Stores should be considered persistent only when they are globally-visible, which the SFENCE guarantees.

Figure 2-2 shows both the ADR and eADR persistence domains.



Figure 2-2: ADR and eADR Power Fail Protection Domains

ADR is a mandatory platform requirement for persistent memory. The write pending queue (WPQ) within the memory controller acknowledges receipt of the data to the writer once all the data is received. Although the data has not yet made it to the persistent media, a platform supporting ADR guarantees that it will be successfully written should a power loss event occur. If ADR is not available or it has been disabled, the application is fully responsible for flushing data to the media. This is called a deep flush event and can incur significant performance penalties. During a crash or power failure, data that is in flight through the CPU caches can only be guaranteed to be flushed to persistent media if the platform supports eADR. It will be lost on platforms that only support ADR.

The challenge with extending the persistence domain to include the CPU caches is that the CPU caches are quite large and it would take considerably more energy than the capacitors in a typical power supply can practically provide. This means the platform would have to contain batteries or utilize an external uninterruptable power supply (UPS). Requiring a battery for every server supporting persistent memory is not generally practical or cost-effective. The lifetime of a battery is typically shorter than the server, which introduces additional maintenance routines that reduces server uptime. There is also an environmental impact when using batteries as they must be disposed of or recycled correctly. It is entirely possible for server or appliance OEM's to include a battery in their product.

Because some appliance and server vendors plan to use batteries, and because platforms will someday include the CPU caches in the persistence domain, a property is available within ACPI such that the BIOS can notify the operating system when the CPU flushes can be skipped. On platforms with eADR, there is no need for manual cache line flushing which allows the operating system to implement calls like <code>msync()</code> in the most optimal way.

# The Need for Flushing, Ordering, and Fencing

Except for WBINVD, which is a kernel-mode only operation, the machine instructions in Table 2-1 are supported in user space by Intel and AMD CPUs. Intel adopted the SNIA NVM Programming Model for working with persistent memory. This model allows for direct access (DAX) using byte-addressable operations (i.e., load/store). However, the persistence of the data in the cache is not guaranteed until it has entered the persistence domain. x86 provides a set of instructions for flushing cache lines in a more optimized way. In addition to existing x86 instructions such as nontemporal stores, CLFLUSH, and WBINVD, two new instructions were added. These are CLFLUSHOPT and CLWB. Both new instructions must be followed by an SFENCE to ensure all flushes are completed before continuing. Flushing a cache line using CLWB, CLFLUSHOPT, or CLFLUSH and using non-temporal stores are all supported from user space. Details for each machine instruction can be found in the software developer manuals for the architecture. On Intel platforms, for example, this detailed information can be found in the Intel® 64 and IA-32 Architectures Software Developer Manuals (https://software.intel.com/en-us/articles/intel-sdm).

Non-Temporal Stores implies the data being written is not going to be read again soon so we bypass the CPU caches. In other words, there is *temporal locality* so there is no benefit to keeping the data in the processor's cache(s), and there may be a penalty if the stored data displaces other useful data from the cache(s).

Flushing to persistent memory directly from user space negates calling into the kernel, which makes it highly efficient. The feature is documented in the SNIA persistent memory programming model specification as an "Optimized Flush". The specification document<sup>1</sup> describes Optimized Flush as optionally supported by the

<sup>&</sup>lt;sup>1</sup> SNIA NVM Programming Model Spec - https://www.snia.org/tech\_activities/standards/curr\_standards/npm

platform, depending on the hardware and operating system support. Despite the CPU support, it is essential for applications to use only optimized flushes when the operating system indicates it is safe to use. The operating system may require the control point provided by calls like msync() when, for example, there are changes to file system metadata that need to be written as part of the msync() operation.

To better understand instruction ordering, let's look at a very simple linked list example. Our pseudocode has three simple steps to add a new node into an existing list that already contains two nodes. Figure 2-3 depicts these steps.

- 1. Create the new node (Node 2)
- Update the node pointer (next pointer) to point to the last node in the list (Node 2 -> Node 1)
- 3. Update the head pointer to point at the new node (Head -> Node 2)



Figure 2-3: Adding a new node to an existing linked list without a store barrier

Figure 2-3 shows the head pointer was updated in the CPU cached version, but the node 2 to node 1 pointer has not yet been updated in persistent memory. This is because the hardware can choose which cache lines to commit and the order may not match the source code flow. If the system or application were to crash at this point, the persistent memory state would be inconsistent, and the data structure would no longer be usable.

The solution to this problem is to introduce a memory store barrier, also called a fence to ensure the order of the write operations is maintained. Starting from the same initial state, the pseudocode now looks like this:

- 1. Create the new node
- 2. Update the node pointer (next pointer) to point to the last node in the list and perform a store barrier/fence operation
- 3. Update the head pointer to point at the new node

Figure 2-4 shows the addition of the store barrier allows the code to work as expected and maintains a consistent data structure in the volatile CPU caches and on persistent memory.



Figure 2-4: Adding a new node to an existing linked list using a store barrier

The Persistent Memory Development Kit (PMDK) detects the platform, CPU, and persistent memory features when the memory pool is opened, then uses the optimal instructions and fencing to preserve write ordering. Memory pools are files that are memory-mapped into the process address space. We describe these in more detail in later chapters.

To insulate application developers from the complexities of the hardware, and to keep them from having to research and implement code specific to each platform or device, the <code>libpmem</code> library provides a function which tells the application when Optimized Flush is safe to use or failback to the standard way of flushing stores to memory-mapped files.

Developers are encouraged to use libraries, such as libpmem and others within the PMDK to simply programming. The libpmem library is also designed to detect the case of the platform with a battery which automatically converts flush calls into simple SFENCE instructions. Chapter 5 introduces and describes the core libraries within the PMDK in more detail. Future chapters take a more in-depth look into each of the libraries to assist developers understand their APIs and features.

#### **Data Visibility**

Understanding when data is visible to other processes or thread and when it is safely in the persistence domain is critical to understand when using persistent memory in applications. In the examples provided in Figures 2-2 and 2-3, updates made to data in the CPU caches could become visible to other processes or threads that run on the same CPU. Depending on which CPU cache the data is in - L1, L2, or L3 - will depend on the visibility of the data within the CPU. For example, if data is in either L1 or L2, it is only locally visible to threads running on that core. If the data resides in the L3 cache, any thread running on any core on the CPU could access it. Only when the data is written to persistent memory will it become globally visible across all CPUs in the system. This is one reason we use flushing and fencing operations.

A pseudo C code example may look like this:

```
open()  // Open a file on a file system
...
mmap()  // Memory map the file
...
```

```
strcpy() // Execute a store operation
... // Data is locally visible but not globally visible
msync() // Data is now persistent and globally visible
```

Developing for persistent memory follows this decades-old model. The difference is that stores may exist in different caches on the system. Applications have very little physical control over when data is evicted from the various hardware caches unless explicitly flushing using one of the user space machine instructions described in Table 2-1.

# **Intel Machine Instructions for Persistent Memory**

Applicable to Intel and AMD based ADR platforms, executing an Intel® 64 and IA-32 architecture store instruction is not enough to make data persistent since the data may be sitting in the CPU caches indefinitely and could be lost by a power failure. Additional cache flush actions are required to make the stores persistent. Importantly, these non-privileged cache flush operations can be called from user space, meaning applications decide when and where to fence and flush data. Table 2-1 summarizes each of these instructions. For more detailed information, the "Intel® 64 and IA-32 Architectures Software Developer Manuals" can be found online - https://software.intel.com/en-us/articles/intel-sdm.

As a developer, you should primarily focus on <code>CLWB</code> and <code>Non-Temporal</code> <code>Stores</code> if available, and fall back to the others as necessary. We list other opcodes for completeness.

Table 2-1: Intel Architecture Instructions for Persistent Memory

| OPCODE     | Description                                                                                                                                                                                                                       |
|------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| CLFLUSH    | This instruction, supported in many generations of CPU, flushes a single cache line. Historically, this instruction is serialized, causing multiple CLFLUSH instructions to execute one after the other, without any concurrency. |
| CLFLUSHOPT | This instruction, newly introduced for persistent                                                                                                                                                                                 |

(followed by an SFENCE)

memory support, is like CLFLUSH but without the serialization. To flush a range, the software executes a CLFLUSHOPT instruction for each 64-byte cache line in the range, followed by a single SFENCE instruction to ensure the flushes are complete before continuing. CLFLUSHOPT is optimized, hence the name, to allow some concurrency when executing multiple CLFLUSHOPT instructions back-to-back.

CLWB (followed by an SFENCE)

CLWB stands for "Cache Line Write Back." The effect is the same as CLFLUSHOPT except that the cache line may remain valid in the cache but no longer dirty since it was flushed. This makes it more likely to get a cache hit on this line if the data is accessed again later.

Non-Temporal stores (followed by an SFENCE)

Another feature that has been around for a while in x86 CPUs is the non-temporal store. These stores are "write combining" and bypass the CPU cache, so using them does not require a flush. A final SFENCE instruction is still required to ensure the stores have reached the persistence domain.

SFENCE

Performs a serializing operation on all store-tomemory instructions that were issued prior the SFENCE instruction. This serializing operation guarantees that every store instruction that precedes in program order the SFENCE instruction is globally visible before any store instruction that follows the SFENCE instruction is globally visible. The SFENCE instruction is ordered with respect to store instructions, other SFENCE instructions, any MFENCE instructions. and any serializing instructions (such as the CPUID instruction). It is not ordered with respect to load instructions or the LFENCE instruction.

WBINVD

This kernel-mode-only instruction flushes and invalidates every cache line on the CPU that executes it. After executing this on all CPUs, all stores to persistent memory are certainly in the persistence domain, but all cache lines are empty, impacting performance. Also, the overhead of sending a message to each CPU to execute this instruction can be significant. Because of this, WBINVD is only expected to be used by the kernel for flushing very large ranges, many megabytes at least.

### **Detecting Platform Capabilities**

Server platform, CPU, and persistent memory features and capabilities are exposed to the operating system through the BIOS and ACPI that can be queried by applications. Applications should not assume they are running on hardware with all the optimizations available. Even if the physical hardware supports it, virtualization technologies may or may not expose those features to the guests, or your operating system may or may not implement them. As such, developers are encouraged to use libraries such as the persistent memory development kit that perform the required feature checks or implement the checks within the application code base.

Figure 2-5 shows the flow implemented by libpmem which initially verifies the memory-mapped file, called a memory pool, resides on a filesystem that has the Direct Access (DAX) feature enabled and is backed by physical persistent memory. We describe DAX in more detail in chapter 3. On Linux, direct access is achieved by mounting an XFS or ext4 file system with the "-o dax" option. On Windows, NTFS automatically enables DAX when the volume is created and mounted on top of persistent memory. If the filesystem is not DAX enabled, applications should fall back the legacy approach of using msync(), fsync() FlushFileBuffers(). If the file system is DAX enabled, the next check is to determine whether the platform supports ADR or eADR by verifying whether the CPU caches are considered persistent or not. On an eADR platform where CPU caches are considered persistent, no further action is required. Any data written will be considered persistent, and thus there is no requirement to perform any flushes, which is a significant performance optimization. On an ADR platform, the next sequence of events identifies the most optimal flush operation based on Intel machine instructions previously described.



Figure 2-5: Flow chart showing how applications can detect platform features

#### **Application Startup and Recovery**

In addition to detecting platform features, applications should verify whether the platform was previously stopped and restarted gracefully or ungracefully. Figure 2-6 shows the checks performed by libpmem from the persistent memory development kit. libpmem provides low-level persistent memory support. In particular, support for the persistent memory instructions for flushing changes to persistent memory is provided. The libpmem library is discussed in more detail in Chapter 6.

Some persistent memory devices, such as Intel's Optane® DC persistent memory, provide SMART counters that can be queried to check the health and status. Several libraries such as <code>libpmemobj</code> query BIOS, ACPI, OS, and persistent memory module information then performs the necessary validation steps to decide which operation is most optimal to use.

We described earlier that if a system loses power there should be enough stored energy within the power supplies and platform to successfully flush the contents of the memory controllers write pending queue (WPQ) and the write buffers on the persistent memory devices. Data will be considered consistent upon successful completion. If this process fails, due to exhausting all the stored energy before all the data was successfully flushed, the persistent memory modules will report a "Dirty Shutdown." A dirty shutdown indicates data on the device may be inconsistent. If a dirty shutdown is detected, the operating system, virtualization technology, or hardware may mark the device as offline and request user intervention before the data can be accessed. This may or may not result in needing to restore the data from backups. You can find more information on this process and what errors and signals are sent in the RAS (Reliability Availability Serviceability) documentation for your platform and the persistent memory device.

Assuming the dirty shutdown counter is not set, indicating a clean system reboot or that the data was successfully flushed, the application should check to see if the persistent memory media is reporting any known poison blocks. Poisoned blocks are areas on the physical media that are known to be bad.



Figure 2-6: Application Startup and Recovery Flow

If an application were not to check these things at startup, due to the persistent nature of the media, the application could get stuck in an infinite loop such as this:

- 1. Application starts
- 2. Reads a memory address
- 3. Encounters poison
- 4. Application Crashes or System Crashes and reboots
- 5. Application starts and resumes operation from where it left off
- 6. Performs a read on the same memory address that triggered the previous restart
- 7. Application or System crashes
- 8. ...
- 9. Repeats infinitely until manual intervention

The ACPI specification defines an *Address Range Scrub* (ARS) operation which the operating system implements. This allows the operating system to perform a run time background scan operation across the memory address range of the persistent memory. System administrators may initiate an ARS scrub manually. The intent is to identify bad or potentially bad memory regions before the application does. If ARS identifies an issue, the hardware can provide a status notification to the operating system and application that can be consumed and handled gracefully. If the bad address range contains data, some method to reconstruct or restore the data needs to be implemented. We describe ARS in more detail in Chapter 17.

Developers are free to implement these features directly within the application code. However, the libraries contained within the PMDK handle these complex conditions and they will be maintained for each product generation while maintaining stable APIs. This gives application developers a future-proof option without needing to understand the intricacies of each CPU or persistent memory product.

#### What's Next?

In chapter 3 we continue to provide foundational information from the perspective of the kernel and user spaces. We describe how operating systems such as Linux and Windows have adopted and implemented the SNIA Non-Volatile Programming Model which defines recommended behavior between various user space and operating system kernel components supporting persistent memory. Future chapters build on the foundations provided in Chapters 1 through 3.

#### **Summary**

In this chapter, we have defined persistent memory and its characteristics, recapped how CPU caches work, and described why it is crucial for applications directly accessing persistent memory to assume responsibility for flushing CPU caches. This chapter was focused primarily on hardware implementations. User libraries such as those delivered with the persistent memory development kit take on the responsibilities for architecture and hardware-specific operations and allow developers to use simple APIs to implement them. In future chapters, we describe the PMDK libraries in more detail and show how to use them in your application.