arch: xtensa: Update arch_user_string_nlen() #90033

peter-mitsis · 2025-05-15T18:14:31Z

On some platforms, address 0 is actually valid in the kernel, but we do not want it to be valid in userspace--so it is insufficient to only use mem_kernel_has_access(). However, arch_buffer_validate() does not always recover well, so it too is insufficient on its own. Thus, we use both.

andyross · 2025-05-15T18:18:43Z

Can you clarify how the predicates differ? I guess I'm unclear on the bug being fixed here. The function validates user-provided strings vs. a userspace memory domain that clearly shouldn't ever have null mapped, was that broken somehow?

Note that the Intel audio DSPs tend to have MMIO registers at the bottom of memory, so the hardware does allow access to the null page and maybe the defaults have it mapped (incorrectly) for userspace threads too?

peter-mitsis · 2025-05-20T16:22:08Z

Here's my understanding as to what is going on ...

Address 0x0 can be valid in kernel space on some Xtensa platforms; however, on those platforms test_null_dynamic_name() fails when called from userspace. The syscall's verification function calls k_usermode_string_copy() which relies on arch_user_string_nlen() to detect that when NULL is passed to detect an error. However, since NULL (or 0x0) is valid in the kernel space where this is called, the old code, which only called xtensa_mem_kernel_has_access() will initially conclude that this is perfectly fine. However, it is not.

Initially, I tried replacing xtensa_mem_kernel_has_access() with arch_buffer_validate(), and had some success. However, running twister showed that resulted in numerous errors on qemu_xtensa/dc233c/mmu. After some discussions with @dcpleung, I learned that xtensa_mem_kernel_has_access() was initially used as it allowed for better recovery from access exceptions. Some further discussions and experimentation indicated that first doing the xtensa_mem_kernel_has_access() and then arch_buffer_validate() results in something that seems to work everywhere. First we check for kernel access and ensure and where there are exceptions we can recover from them. If all that worked, then we check for userspace access and should there be an exception, it should be recoverable.

All that so that we can properly detect a NULL device name from userspace.

dcpleung · 2025-05-20T17:43:13Z

The issue here is that we only check if kernel has access, which it has by default since it is mapped to hardware registers. However, the test relies on the assumption that the first page is not mapped to catch NULL access exception. The kernel access check thus would return true. Adding the user access check is to workaround this issue where the first page is mapped in kernel mode.

peter-mitsis · 2025-06-03T17:32:46Z

@andyross - ping

peter-mitsis · 2025-06-13T22:17:44Z

Re-pushed to resolve sonarqubecloud code quality issue.

andyross

OK, I'm sort of getting it. But can we update the commit message to clarify what's happening here? This is a security fix, right? The code as written was checking string arguments to syscalls vs. the kernel MMU/MPU state and not the user mode setup, and that's obviously very wrong.

Also one note about needlessly calling the underlying mem_buffer_validate() twice.

andyross · 2025-06-17T12:10:25Z

arch/xtensa/core/syscall_helper.c

 	 */
-	if (!xtensa_mem_kernel_has_access((void *)s, maxsize, 0)) {
+	if (!xtensa_mem_kernel_has_access(s, maxsize, 0) ||
+	    arch_buffer_validate(s, maxsize, 0)) {


Why bother checking kernel_has_access() when you're going to do a proper/slow validation vs. the MMU configuration anyway? What are the circumstances where arch_buffer_validate() returns zero but xtensa_mem_kernel_has_access() returns non-zero?

For MMU systems, in fact (see ptables.c) these are exactly the same function, just with a different wrapper for return value convention.

Adds 'const' to address pointer as its memory contents do not change. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>

When calling device_get_binding(NULL) from userspace, this eventually funnels down to a call to arch_user_string_nlen() where it tried to verify that the kernel has access to this address (0x0). But since this originates from userspace, we really want to know if this is accessible from userspace, so using arch_buffer_validate() instead of xtensa_mem_kernel_has_access() is preferable. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>

sonarqubecloud · 2025-06-17T20:30:18Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

peter-mitsis · 2025-06-17T20:31:11Z

I could have sworn that I had previously encountered regression errors on an xtensa platform when using only arch_buffer_validate() instead of that and xtensa_mem_kernel_has_access(). However, that behavior has disappeared and I can no longer duplicate it even rewinding to previous commit points. As a result, I am going with just the arch_buffer_validate()--it seems to be doing the job.

peter-mitsis requested a review from dcpleung May 15, 2025 18:14

github-actions bot added the area: Xtensa Xtensa Architecture label May 15, 2025

github-actions bot requested review from andyross, ceolin and nashif May 15, 2025 18:15

github-actions bot assigned dcpleung May 15, 2025

peter-mitsis force-pushed the pmitsis-arch-buffer-validate branch from c5b231b to 813bad9 Compare June 13, 2025 22:16

dcpleung previously approved these changes Jun 16, 2025

View reviewed changes

andyross reviewed Jun 17, 2025

View reviewed changes

peter-mitsis added 2 commits June 17, 2025 13:11

arch: tweak xtensa_mem_kernel_has_access() API

1cb802d

Adds 'const' to address pointer as its memory contents do not change. Signed-off-by: Peter Mitsis <peter.mitsis@intel.com>

peter-mitsis dismissed dcpleung’s stale review via a9c8b9f June 17, 2025 20:26

peter-mitsis force-pushed the pmitsis-arch-buffer-validate branch from 813bad9 to a9c8b9f Compare June 17, 2025 20:26

dcpleung approved these changes Jun 17, 2025

View reviewed changes

nashif approved these changes Jun 18, 2025

View reviewed changes

kartben merged commit 2f2eaf7 into zephyrproject-rtos:main Jun 18, 2025
30 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

arch: xtensa: Update arch_user_string_nlen() #90033

arch: xtensa: Update arch_user_string_nlen() #90033

Uh oh!

peter-mitsis commented May 15, 2025

Uh oh!

andyross commented May 15, 2025

Uh oh!

peter-mitsis commented May 20, 2025

Uh oh!

dcpleung commented May 20, 2025

Uh oh!

peter-mitsis commented Jun 3, 2025

Uh oh!

peter-mitsis commented Jun 13, 2025

Uh oh!

andyross left a comment

Uh oh!

andyross Jun 17, 2025

Uh oh!

sonarqubecloud bot commented Jun 17, 2025

Uh oh!

peter-mitsis commented Jun 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

arch: xtensa: Update arch_user_string_nlen() #90033

arch: xtensa: Update arch_user_string_nlen() #90033

Uh oh!

Conversation

peter-mitsis commented May 15, 2025

Uh oh!

andyross commented May 15, 2025

Uh oh!

peter-mitsis commented May 20, 2025

Uh oh!

dcpleung commented May 20, 2025

Uh oh!

peter-mitsis commented Jun 3, 2025

Uh oh!

peter-mitsis commented Jun 13, 2025

Uh oh!

andyross left a comment

Choose a reason for hiding this comment

Uh oh!

andyross Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Jun 17, 2025

Quality Gate passed

Uh oh!

peter-mitsis commented Jun 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants