Can I disable the use of statx() syscall even though it's available on the build system? #172

Gei0r · 2021-01-11T11:52:56Z

During build, the test has_statx_syscall in Jamfile.v2 is perfomed to check if statx() is available. Because it is available on my build system, boost::filesystem uses the statx syscall.
However, my executable also has to run on older kernels which don't have this syscall yet (before 4.11).

How can I disable the use of the statx syscall, preferably in the b2 invocation while building boost::filesystem?

The text was updated successfully, but these errors were encountered:

Lastique · 2021-01-11T12:05:56Z

There is no switch to disable statx, or any other modern system APIs used in Boost.Filesystem. This is deliberate. The recommendation is to build Boost on the system that is equivalent to that which will run the executable. That means both libc and kernel headers must match (or be older than) the kernel that will run the executable. Another solution is to patch your version of Boost, of course.

luke-jr · 2021-03-30T01:34:13Z

The recommendation is to build Boost on the system that is equivalent to that which will run the executable. That means both libc and kernel headers must match (or be older than) the kernel that will run the executable.

But it doesn't mean that. It's pretty common to have kernel headers newer than the actual kernel, even with only one system involved (ie, build system itself now can't run it!). Boost is the first project I've heard of having a problem with this.

luke-jr · 2021-03-30T18:35:25Z

Furthermore, it isn't actually possible to use old linux-headers anymore. glibc build fails with:

configure: error: GNU libc requires kernel header files from
Linux 3.2.0 or later to be installed before configuring.

Lastique · 2021-03-30T18:58:24Z

If you're using an old kernel then you might have to use older userspace, including libc and Boost.

luke-jr · 2021-03-30T19:13:19Z

That's unreasonable.

Lastique · 2021-03-30T20:15:29Z

Not at all. Newer software is expected to require newer third party components. Not that Boost.Filesystem requires - it doesn't, as long as you use proper headers. From my POV, supporting inconsistent configurations is what's unreasonable. The fact that this is to support Linux kernel 2.6.32, which was released 9 years ago or more, depending on the patch version, is not a compelling argument either.

Boost.Filesystem uses a number of modern Linux syscalls, and that number will probably grow in the future. Introducing an option for each of them is unmaintainable in the long term. And it will not resolve the issue in general anyway, because some functionality may not involve a separate syscall, but a macro or a parameter value for example. So no, mismatched configs are not supported, period.

luke-jr · 2021-03-30T21:31:14Z

From my POV, supporting inconsistent configurations is what's unreasonable.

It is a normal configuration to have a newer headers than kernel.

Boost.Filesystem uses a number of modern Linux syscalls, and that number will probably grow in the future. Introducing an option for each of them is unmaintainable in the long term.

I agree. Generally, well-designed software can simply fall back to other (often standards-defined) methods when the optimal code path is unsupported.

For example (just one instance, optimised for smallest diff size):

diff --git a/operations.cpp b/operations.cpp
index fc853fb..0397fe6 100644
--- a/operations.cpp
+++ b/operations.cpp
@@ -1202,25 +1202,33 @@ bool copy_file(const path& from, const path& to, unsigned int options, error_cod
     break;
   }
 
+  mode_t from_mode;
 #if defined(BOOST_FILESYSTEM_HAS_STATX) || defined(BOOST_FILESYSTEM_HAS_STATX_SYSCALL)
   unsigned int statx_data_mask = STATX_TYPE | STATX_MODE | STATX_INO | STATX_SIZE;
   if ((options & static_cast< unsigned int >(copy_options::update_existing)) != 0u)
     statx_data_mask |= STATX_MTIME;
 
+{
   struct ::statx from_stat;
   if (BOOST_UNLIKELY(statx(infile.fd, "", AT_EMPTY_PATH | AT_NO_AUTOMOUNT, statx_data_mask, &from_stat) < 0))
   {
-  fail_errno:
-    err = errno;
-    goto fail;
+    if (errno == ENOSYS)
+    {
+      goto fallback:
+    }
+    goto fail_errno;
   }
 
-  if (BOOST_UNLIKELY((from_stat.stx_mask & statx_data_mask) != statx_data_mask))
+  if (BOOST_LIKELY((from_stat.stx_mask & statx_data_mask) == statx_data_mask))
   {
-    err = ENOSYS;
-    goto fail;
+    from_mode = get_mode(from_stat);
+    goto have_mode;
   }
-#else
+}
+
+fallback:
+#endif
+{
   struct ::stat from_stat;
   if (BOOST_UNLIKELY(::fstat(infile.fd, &from_stat) != 0))
   {
@@ -1228,9 +1236,11 @@ bool copy_file(const path& from, const path& to, unsigned int options, error_cod
     err = errno;
     goto fail;
   }
-#endif
 
-  const mode_t from_mode = get_mode(from_stat);
+  from_mode = get_mode(from_stat);
+}
+
+have_mode:
   if (BOOST_UNLIKELY(!S_ISREG(from_mode)))
   {
     err = ENOSYS;

Lastique · 2021-03-30T22:23:58Z

It is a normal configuration to have a newer headers than kernel.

I don't consider that normal. That's certainly not the case with the systems I work with.

Gei0r · 2021-03-31T20:10:51Z

I'm inclined to agree that mismatching headers and used kernel are not common. For example, with crosstool-NG, you can select which kernel headers to install to match the target system kernel.

However, I believe my original use case (build on a newer system than is used for running) is common. With regards to statx(), boost::filesystem already includes the fallback code, using regular old stat(). I believe making the fallback detection in the Jamfile configurable would not impact maintainability.

luke-jr · 2021-03-31T20:20:01Z

I'm inclined to agree that mismatching headers and used kernel are not common.

This is the standard behaviour for Gentoo... linux-headers is installed and updated independently from whatever kernel is being used (which users expect to be able to swap-and-go without needing changes to userspace, so building against the latest headers is needed to avoid a rebuild when you switch to a new kernel).

However, I believe my original use case (build on a newer system than is used for running) is common.

Indeed, that's another real-world issue that we're going to have to address in Bitcoin Core. We have some minimal glue to enable users to run with ancient glibcs (missing newer versions of standard functions), but didn't realise Boost has this problem. :/

Lastique · 2021-04-01T06:17:52Z

so building against the latest headers is needed to avoid a rebuild when you switch to a new kernel

This is backwards. You can compile against older headers and run on a newer kernel as the kernel maintains backward compatibility. You can't do the opposite because the kernel does not maintain forward compatibility.

If Gentoo does not provide a way to install headers matching the kernel then their setup is broken and you should report a bug in their issue tracker.

paresy · 2021-04-01T11:53:40Z

This also fails for any older docker builds: docker/for-linux#208

If you have an outdated docker version (e.g. you use Docker on Synology) upgrading to Boost 1.75 will fail miserably.

luke-jr · 2021-04-01T14:59:44Z

This is backwards. You can compile against older headers and run on a newer kernel as the kernel maintains backward compatibility. You can't do the opposite because the kernel does not maintain forward compatibility.

Backward compatibility is a job of the userspace application wishing to use headers directly. Using standard C/C++ puts this job on the stdlib (which it tends to handle fine).

If Gentoo does not provide a way to install headers matching the kernel then their setup is broken and you should report a bug in their issue tracker.

Point is that the kernel may be different without changing userspaces.

Lastique · 2021-04-01T16:42:04Z

Backward compatibility is a job of the userspace application wishing to use headers directly.

No. Compatibility (forward or backward) is a result of guarantees provided by the kernel. The kernel only guarantees backward compatibility. Some userspace software jumps through hoops to detect kernel features in runtime despite what was available at compile time, but that is definitely not how software normally works.

luke-jr · 2021-04-01T16:50:42Z

Software normally uses the standard libraries, not interacting with the kernel directly...

It's literally impossible for Linux itself to solve this. It can't know about syscalls that didn't exist when it was released.

So basically you're saying anyone providing precompiled binaries can't use Boost at all... (Building on an ancient system often isn't possible, and even if it was, would miss out on the new features provided by newer kernels)

Lastique · 2021-04-01T17:01:01Z

So basically you're saying anyone providing precompiled binaries can't use Boost at all...

No, I'm not saying that. But if someone is shipping precompiled binaries for system X, then he should presumably have built those binaries on that system X. Those binaries will then be compatible with system X and later. If that system X is ancient, well that's your choice to support it.

Lastique · 2021-04-01T17:03:34Z

on that system X

That should say "for that system X", meaning that the compilation should use headers and libraries from that system X. This may be done in chroot environment on a newer system, for example.

luke-jr · 2021-04-01T17:16:10Z

That ancient system X likely won't have a C++17 compiler (much less a recent-enough Boost)...

paresy · 2021-04-01T18:56:11Z

I'd like to reiterate on my comment. Docker 18.09 might seem old, but it is actively used on all Synology systems running the current DSM 6. Therefore, anyone using Boost 1.75 for building Docker images that shall run on Synology systems through Docker will be out of luck.

I can just recommend to use this patch to completely disable this optimization: https://github.com/cms-sw/cmsdist/blob/17b1cc0e73e4a640d81a7fc5a92835505a006a16/boost-1.75.0-disable-statx.patch

In my opinion this should be fixed inside Boost with a fallback to the old mechanism, when statx is not available. I always though about Boost as a solid incubator and polyfill library that maximizes compatibility, if a recent compiler/stdlib is not available on the specific platform. I am not sure why this issue seem to be the exception.

Lastique · 2021-04-03T07:03:12Z

Docker 18.09 might seem old, but it is actively used on all Synology systems running the current DSM 6.

@paresy I'm not a Docker or Synology user, but the same recommendation to use matching headers apply. If it cannot be implemented in your setup, please explain why.

paresy · 2021-04-03T09:11:00Z

@Lastique The headers are matching. We are building with Ubuntu 18.04. And the Docker Container is also running Ubuntu 18.04. The problem is, that Docker whitelisted the statx call just after Docker 18.09. Therefore statx is not available on any Docker installation running 18.09 and older regardless what is running inside the container.

I am using the mentioned patch and the problem is solved for me - but i don't think your stance on the problem is the right one and it will needlessly give a lot of headaches to anyone using Boost Filesystem.

Lastique · 2021-04-03T15:50:39Z

The headers are matching. We are building with Ubuntu 18.04. And the Docker Container is also running Ubuntu 18.04. The problem is, that Docker whitelisted the statx call just after Docker 18.09. Therefore statx is not available on any Docker installation running 18.09 and older regardless what is running inside the container.

I see, so this is an overly strict sandboxing then. I don't consider this a problem of Boost.Filesystem per se, but if those Docker versions are still popular (I have no idea if they are), this might be a better reason to add a workaround. I believe, in your case the only solution is to disable the use of newer system calls at compile time.

paresy · 2021-04-03T16:20:24Z

That's what i am doing now. I am not suggesting that Boost is at fault here - it is just a usability issue, which seems to be relevant when counting the created issues about this topic.

Docker 18.09 is unfortunately in use on all Synology NAS devices. The current DSM 6 version is still shipping it. See the release notes here: https://www.synology.com/de-de/releaseNote/Docker (DSM 7 is due sometime later this year)
QNAP seems to have just recently (March 2021) upgraded to a newer Docker version: https://www.qnap.com/en/app_releasenotes/list.php?app_choose=container-station

Gei0r · 2021-04-09T23:47:19Z

Trying to break the deadlock here -- would be too much to ask for boost to handle a statx() return with ENOSYS by falling back to regular stat() at runtime (probably caching the result)?

Lastique · 2021-04-10T00:20:52Z

I don't like the idea of constantly hitting ENOSYS. Caching the result has its own issues, thread safety for example. Compile time options seem like a better way.

sdarwin · 2021-05-07T12:09:35Z

It seems we have also encountered this problem. Any time you run docker containers, they depend on the host's kernel. As luke-jr wrote "It's pretty common to have kernel headers newer than the actual kernel". Keep in mind: running docker, and containers, is very common, out in the real world. The general principle is that you don't have to be overly careful about matching the host version, and the container version. Mix-and-match is typical.
if you are making programmatic decisions based on the kernel features, be sure the detection mechanism is foolproof. Ideally, verify that a kernel syscall is actually available and fallback gracefully.

Lastique · 2021-05-07T15:55:57Z

As I have said before, I do not consider a newer Docker image running on an older kernel a valid use case. If someone is doing this, he is doing it wrong. I will not support this use case, period. If that means no support for Docker, so be it.

I don't plan to add a runtime detection of the features for the purpose of compatibility with inconsistent runtime configs. However, there are some use cases when some syscalls fail for some filesystems and not the other. Those use cases I would like to support, and part of that may involve runtime detection (though, not wrt. statx).

By defining these new config macros the user can configure the library to avoid using some system APIs even if they are detected as available by the library build scripts. This can be useful in case if the API is known to consistently fail at runtime on the target system. Related to #172.

rcombs · 2021-06-09T07:11:14Z

Checking for ENOSYS is very common in these kinds of situations, as is doing runtime checks for newer libc routines (e.g. getrandom) using dlsym. Even glibc internally checks for ENOSYS and falls back on older and better-supported syscalls for routines like time() (to support kernels without the time64 syscall; this also applies to all other libc functions that use other syscalls taking 64-bit time on 32-bit kernels, such as sigtimedwait), preadv, fexecve (for SYS_execveat), fstatat64, getdents64, and various others.

Performance concerns can be addressed fairly trivially by storing an atomic (if multiple threads race past and hit ENOSYS at the same time, it's harmless).

I don't see a reason to require all builds to choose between supporting older kernels at all, or supporting newer features when available.

This can be useful if the syscall is present at compile time but fails with ENOSYS at run time (for example, in Docker containers that restrict the syscall, even if available on the host). Additionally, marked statx syscall wrappers with attributes to disable MSAN for them. It was reported that MSAN on clang 10 is showing errors accessing uninitialized data in stx_mask, which must be initialized by the syscall. Related to #172 Related to #185

paresy · 2021-08-14T07:27:59Z

Just FYI the issue is fixed in Boost 1.77 for me by using the new BOOST_FILESYSTEM_DISABLE_STATX define.

Usage (Correct me if i'm wrong):

./b2 ... define=BOOST_FILESYSTEM_DISABLE_STATX ...

Gei0r · 2021-08-15T17:30:18Z

@paresy As I understand it, that's not even necessary, because if statX() is not available, this is handled transparently at runtime.

Btw, thanks @Lastique for implementing the change!

o01eg · 2021-10-18T17:51:41Z

Runtime check could fail on Android because

signal 31 (SIGSYS), code 1 (SYS_SECCOMP), fault addr --------
Cause: seccomp prevented call to disallowed arm system call 397

o01eg · 2021-10-18T18:27:28Z

Just FYI the issue is fixed in Boost 1.77 for me by using the new BOOST_FILESYSTEM_DISABLE_STATX define.

Is it possible to write it in user-config.jam?

Lastique · 2021-10-18T18:35:14Z

You can set it in your compiler description, such as:

using clang : : clang-12 : <define>BOOST_FILESYSTEM_DISABLE_STATX ;

See Boost.Build docs.

o01eg · 2021-10-18T20:13:25Z

@Lastique Thank you.

hjmallon · 2022-01-06T14:51:52Z

Nvidia Jetson SDK (Nvidia Jetpack Linux 4.6, based on Ubuntu 18.04) seems to ship kernel 4.9 (4.9.253-tegra) and linux-libc-dev 4.15.0-166.174, so it hits this problem all the time. statx was introduced in 4.11 (torvalds/linux@a528d35)

I'm going to update Boost and use BOOST_FILESYSTEM_DISABLE_STATX as mentioned above but hopefully this allows someone else to get here from a search.

Gei0r changed the title ~~Can I disable the use of statx() syscall even though it's available on the build system?~~ Can I disable the use of statx() syscall even though it's available on the build system? Jan 11, 2021

Lastique closed this as completed Jan 11, 2021

Lastique mentioned this issue Jan 15, 2021

statx syscall on boost-1.75 #173

Closed

davidlange6 mentioned this issue Mar 4, 2021

Revert "Boost - update to 1.75.0" cms-sw/cmsdist#6698

Merged

smuzaffar mentioned this issue Mar 11, 2021

Boost - update to 1.75.0 cms-sw/cmsdist#6699

Merged

Lastique mentioned this issue Apr 2, 2021

boost::filesystem is broken on Android since 1.74.0 #183

Closed

Lastique mentioned this issue Jun 17, 2021

Functions fail when used inside docker container in Jetson modules (ARM) #196

Closed

elsamuko mentioned this issue Jun 21, 2021

`GLIBC_2.28' not found: Regression between 1.74.0 and 1.75.0? boostorg/boost#526

Closed

Lastique mentioned this issue Jul 23, 2021

"rotating text file" log example error boostorg/log#156

Closed

Lastique mentioned this issue Aug 30, 2021

Unexpected error code thrown by boost::filesystem::copy_file in Boost 1.75.0 #205

Closed

daira mentioned this issue Sep 14, 2021

Permission error running v4.2.0 under Docker, or v4.5.1+ under Docker v18 or lower [regression] zcash/zcash#4945

Open

Lastique mentioned this issue Sep 28, 2021

boost::filesystem::copy_file: Function not implemented (1.76) #207

Closed

o01eg mentioned this issue Oct 18, 2021

Build Freeorion SDK for Android freeorion/freeorion-sdk#86

Merged

o01eg mentioned this issue Oct 18, 2021

Bump boost version to 1.77 and disable statx syscall moritz-wundke/Boost-for-Android#226

Closed

Lastique mentioned this issue Dec 10, 2021

Error: boost::filesystem::create_directories: Function not implemented #222

Closed

innerlee mentioned this issue Dec 13, 2021

Error: boost::filesystem::create_directories: Function not implemented open-mmlab/denseflow#54

Open

charlesdunbar mentioned this issue Dec 23, 2021

2.16.0 worked - 2.16.1 is failing jlesage/docker-crashplan-pro#341

Closed

Phil25 mentioned this issue Feb 15, 2022

stat/statx usage on Android 9.0+ #229

Closed

JensUweUlrich mentioned this issue Feb 18, 2022

compilation error JensUweUlrich/ReadBouncer#36

Closed

sipa mentioned this issue Apr 28, 2022

guix: consolidate kernel headers to 5.15, specify 3.2.0 as minimum supported bitcoin/bitcoin#25006

Merged

mattcieslak mentioned this issue May 9, 2023

missing settings files QMICodeBase/TORTOISEV4#5

Closed

Can I disable the use of statx() syscall even though it's available on the build system? #172

Can I disable the use of statx() syscall even though it's available on the build system? #172

Comments

Gei0r commented Jan 11, 2021 • edited Loading

Lastique commented Jan 11, 2021

luke-jr commented Mar 30, 2021

luke-jr commented Mar 30, 2021

Lastique commented Mar 30, 2021

luke-jr commented Mar 30, 2021

Lastique commented Mar 30, 2021

luke-jr commented Mar 30, 2021

Lastique commented Mar 30, 2021 • edited Loading

Gei0r commented Mar 31, 2021

luke-jr commented Mar 31, 2021

Lastique commented Apr 1, 2021

paresy commented Apr 1, 2021

luke-jr commented Apr 1, 2021

Lastique commented Apr 1, 2021

luke-jr commented Apr 1, 2021

Lastique commented Apr 1, 2021

Lastique commented Apr 1, 2021

luke-jr commented Apr 1, 2021

paresy commented Apr 1, 2021

Lastique commented Apr 3, 2021

paresy commented Apr 3, 2021

Lastique commented Apr 3, 2021

paresy commented Apr 3, 2021

Gei0r commented Apr 9, 2021

Lastique commented Apr 10, 2021

sdarwin commented May 7, 2021

Lastique commented May 7, 2021

rcombs commented Jun 9, 2021

paresy commented Aug 14, 2021

Gei0r commented Aug 15, 2021 • edited Loading

o01eg commented Oct 18, 2021

o01eg commented Oct 18, 2021

Lastique commented Oct 18, 2021

o01eg commented Oct 18, 2021

hjmallon commented Jan 6, 2022 • edited Loading

Gei0r commented Jan 11, 2021 •

edited

Loading

Lastique commented Mar 30, 2021 •

edited

Loading

Gei0r commented Aug 15, 2021 •

edited

Loading

hjmallon commented Jan 6, 2022 •

edited

Loading