Dirty Pipe Vulnerability

Executive summary

CVE-2022-0847, also known as the Dirty Pipe Vulnerability, affects the Linux Kernel and allows read-only files to be overwritten by users that normally do not have that permission.¹

This vulnerability is catastrophic. /etc/passwd is a read-only file that contains usernames and hashed passwords. ² An unprivileged user with the power to modify this file could set a new password for the root user and log in.

Context

Linux Memory Management

A program running in user space (i.e. everything that's not kernel code) uses virtual memory addresses. The set of virtual memory addresses available to each process is unique. Virtual memory addresses are mapped to physical addresses in page tables.

Pages are a fixed size: in order to find the physical address to access data stored in RAM, the system needs to know (1) which program's page table to look at, (2) which page to look at, and (3) where in the page to look (offset). The page and offset is encoded in the virtual memory address. ³ Below is a simplified example:

File I/O

When data is read from a file on a hard disk, the contents are copied to RAM.

At first this data is written to kernel memory, not the user program's virtual memory. This allows for the kernel to cache the data: if this data is being read often, the kernel can reduce the amount of load on the hardware device with caching. Caching is handled by the page cache subsytem.

In many cases, the data is copied over to the user program's virtual memory, but if a user wants reduce the amount of data being copied, there are optimizations available.

One such optimization exists when data is read from a file and sent to a pipe by program A (that will later be read by program B). In this case, copying the data to program A's userspace after reading the file is unnecessary. We can can circumvent doing this by directly passing around references to the page owned by page cache. ⁴

The splice Linux system call is built for developers that want to use this optimization:

splice() moves data between two file descriptors without copying between kernel address space and user address space. It transfers up to len bytes of data from the file descriptor fd_in to the file descriptor fd_out, where one of the file descriptors must refer to a pipe. ⁵

Pipe

Pipes are used for inter-process communication in Linux. Program A can push data to a pipe while Program B reads data from the pipe. A common example of using pipes in Linux on the command line is below:

cat test.txt | grep "james"

cat will read the contents of the file and dump them to the pipe, thengrep will read from the pipe, find any lines that contain "james", and dump them to stdout

Under the hood, data is written to a pipe this way:

On the first write to the pipe, a page is allocated to the pipe buffer
If the page isn't full, subsquent writes to the pipe can append to that page (except if it's the page cache, see note)
Once the page is full, a new page is allocated ⁴

Note: Because the page cache could contain the cached versions of important files, it's critical not to let users append arbitrary data to pages there.

A bug in the pipe buffer

The pipe buffer has different flags, one of which (PIPE_BUF_FLAG_CAN_MERGE) allows for writing to an existing page. Unfortunately, a bug in the Linux kernel does not initialize the pipe buffer's flags in some circumstances. Because of this, by writing specialized data to the pipe, this flag can be set in situations where it shouldn't be set.⁶

Exploitation

The key principles for exploitation

The page cache, from the operating system's perspective, is a trustworthy source of information about the contents of files stored on the hard disk.
splice() reads data from a file into the kernel's page cache. As long as this file stays in the cache, other programs trying to access this file will read this cached version of this file.
Then, splice() will create at least one pipe_buffer that points to the page (or pages) in the page cache.
Because flags are accidentally uninitialized in the pipe_buffer, the PIPE_BUF_FLAG_CAN_MERGE flag can be set if special data is written to the buffer.
If PIPE_BUF_FLAG_CAN_MERGE is somehow set, and the user quickly writes their own data into the pipe, their data will be merged into most recently written page in the page cache.
This means that the page cache can be modified by a user with no special permissions.

Step by step exploitation

Fill a pipe with data formatted in such a way that PIPE_BUF_FLAG_CAN_MERGE is set
Empty the pipe by reading from it
Use splice() to copy data from a target file to the pipe. (Don't copy the entire file: copy the file all the way up to one byte before you want to make changes.)
Write your own data to the pipe. Because PIPE_BUF_FLAG_CAN_MERGE has been set, the data you write to the pipe is merged into the page cache.

Impact

Severity

This allows non-privileged users to modify any file they have read access to. Ultimately, this allows these users to insert code into programs that are running as root: Privilege escalation!

The requirements to exploit this are met in many circumstances (there are some edge cases relating to data being modified on page boundaries that can make this attack impossible ⁴, but they are rare) and it has been described as "trivial to exploit". ⁷

The NIST CVSS base score is 7.8 (HIGH).

Affected devices

This vulnerability impacts Linux kernels released from Aug 2020 to Feb 2022.

Because Android devices ship with the Linux kernel, flagship devices from Google and Samsung were impacted. Dirty Pipe was demonstrated giving a user a root shell on a Pixel 6 Pro and Samsung S22 with a proof-of-concept sideloaded app. ⁷

A wide variety of other devices running Linux are also affected. While patches have been released by all major Linux distributions, those that do not update remain vulnerable. Systems running Linux that are rarely or never updated (IoT, Routers, NAS) are of special attention.

Is this being exploited?

This vulnerability was only disclosed March 7th, 2022. There have been no reports of this exploit being used in the wild. ⁷ However, due to the nature of this vulnerability (modifying caches), it can be difficult to detect as it can perform attacks without actually writing to the hard disk. ⁴

Conclusion

It is very interesting to see a real example of the consequences of not initializing variables. Exploits relying on this were demonstrated in this class, and it is indicative of a larger problem in systems programming that these mistakes continue to happen.

This exploit is yet another that shows the advantages of programming languages (like Rust) that offer strong memory safety guarantees. Perhaps that's why Microsoft recommends using Rust for safe systems programming. ⁸

Note: The sample exploit used in the video presentation is from GitHub user Arinerron

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
exploit-example.png		exploit-example.png
memory-management.png		memory-management.png
page-cache-splice.png		page-cache-splice.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

exploit-example.png

exploit-example.png

memory-management.png

memory-management.png

page-cache-splice.png

page-cache-splice.png

Repository files navigation

Dirty Pipe Vulnerability

Executive summary

Context

Linux Memory Management

File I/O

Pipe

A bug in the pipe buffer

Exploitation

The key principles for exploitation

Step by step exploitation

Impact

Severity

Affected devices

Is this being exploited?

Conclusion

About

Releases

Packages

jamesbrunet/dirtypipe-writeup

Folders and files

Latest commit

History

Repository files navigation

Dirty Pipe Vulnerability

Executive summary

Context

Linux Memory Management

File I/O

Pipe

A bug in the pipe buffer

Exploitation

The key principles for exploitation

Step by step exploitation

Impact

Severity

Affected devices

Is this being exploited?

Conclusion

Footnotes

About

Resources

Stars

Watchers

Forks