Java crashes due to hsperfdata file conflicts across sandboxes #3236

or-shachar · 2017-06-21T16:25:07Z

Running bazel build on a fat java/scala project (several thousands of targets) fails when working on linux debian with user namespace enabled.

Issue

Trying to run bazel build with user namespace enabled:

$ sysctl kernel.unprivileged_userns_clone=1

The build runs alright but at some point it crashes with weird memory issue:

ERROR: <target-path>/BUILD:35:1: error executing shell command: '
  rm -rf bazel-out/local-fastbuild/bin/<package>/<target>.jar_temp_resources_dir
  set -e
  mkdir -p bazel-out/local-fastbuild/bin/<target>' failed: Process terminated by signal 6 [sandboxed].
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGBUS (0x7) at pc=0x00007f094606874b, pid=5, tid=0x00007f09472e0700
#
# JRE version:  (8.0_131-b11) (build )
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.131-b11 mixed mode linux-amd64 compressed oops)
# Problematic frame:
# V  [libjvm.so+0x96874b]  PerfMemory::alloc(unsigned long)+0x7b
#
# Core dump written. Default location: /home/builduser/.cache/bazel/_bazel_builduser/bc0e462ab01ac9379d22ad058ca1cb1f/bazel-sandbox/4864102460254154064/execroot/__main__/core or core.5
#
# An error report file with more information is saved as:
# /home/builduser/.cache/bazel/_bazel_builduser/bc0e462ab01ac9379d22ad058ca1cb1f/bazel-sandbox/4864102460254154064/execroot/__main__/hs_err_pid5.log
#
# If you would like to submit a bug report, please visit:
#   http://bugreport.java.com/bugreport/crash.jsp
#

Environment info

The machine is docker container based on debian image

$ uname -a
Linux 167-docker99 3.16.0-4-amd64 #1 SMP Debian 3.16.43-2 (2017-04-30) x86_64 GNU/Linux
builduser@167-docker99:~/ws/bazel-port-isolation$ cat /etc/*-release
PRETTY_NAME="Debian GNU/Linux 8 (jessie)"
NAME="Debian GNU/Linux"
VERSION_ID="8"
VERSION="8 (jessie)"
ID=debian
HOME_URL="http://www.debian.org/"
SUPPORT_URL="http://www.debian.org/support"
BUG_REPORT_URL="https://bugs.debian.org/"Bazel version

The project is fat (several thousands of java / scala targets)
Bazel was built from a81264e
rules_scala - wix-incubator/rules_scala@d66c9d7 (basically bazelbuild/rules_scala@5d6ff51 with small changes around specs2 versions and test runner env preparation)

additional information

issue does not happen when unprivileged_userns_clone=0 (but clearly - that's not a solution)
with user namespace enabled, bazel 0.5.1 showed this issue . May also be related to Bazel HEAD build fails on Ubuntu 16.04.2 LTS VirtualBox VM #3064 .

The text was updated successfully, but these errors were encountered:

philwo · 2017-06-22T07:41:35Z

Can you please send the output of "cat /proc/mounts", "free -m", "df -h" and "df -i" from inside the Docker container where the SIGBUS happened?

I think this happens when /tmp does not have enough space or is a tmpfs and there's not enough free RAM to back all files that the JVM wants to create via mmap there, but I had a hard time tracking it down exactly.

or-shachar · 2017-06-22T09:03:21Z

@philwo - The machine should be crazy strong... Maybe even too strong? :-)

#cat /proc/mounts
rootfs / rootfs rw 0 0
none / aufs rw,relatime,si=bd2d63d83e471179,dio,dirperm1 0 0
proc /proc proc rw,nosuid,nodev,noexec,relatime 0 0
tmpfs /dev tmpfs rw,nosuid,mode=755 0 0
devpts /dev/pts devpts rw,nosuid,noexec,relatime,gid=5,mode=620,ptmxmode=666 0 0
sysfs /sys sysfs rw,nosuid,nodev,noexec,relatime 0 0
tmpfs /sys/fs/cgroup tmpfs rw,nosuid,nodev,noexec,relatime,mode=755 0 0
cgroup /sys/fs/cgroup/systemd cgroup rw,nosuid,nodev,noexec,relatime,xattr,release_agent=/lib/systemd/systemd-cgroups-agent,name=systemd 0 0
cgroup /sys/fs/cgroup/cpuset cgroup rw,nosuid,nodev,noexec,relatime,cpuset 0 0
cgroup /sys/fs/cgroup/cpu,cpuacct cgroup rw,nosuid,nodev,noexec,relatime,cpu,cpuacct 0 0
cgroup /sys/fs/cgroup/memory cgroup rw,nosuid,nodev,noexec,relatime,memory 0 0
cgroup /sys/fs/cgroup/devices cgroup rw,nosuid,nodev,noexec,relatime,devices 0 0
cgroup /sys/fs/cgroup/freezer cgroup rw,nosuid,nodev,noexec,relatime,freezer 0 0
cgroup /sys/fs/cgroup/net_cls,net_prio cgroup rw,nosuid,nodev,noexec,relatime,net_cls,net_prio 0 0
cgroup /sys/fs/cgroup/blkio cgroup rw,nosuid,nodev,noexec,relatime,blkio 0 0
cgroup /sys/fs/cgroup/perf_event cgroup rw,nosuid,nodev,noexec,relatime,perf_event 0 0
mqueue /dev/mqueue mqueue rw,nosuid,nodev,noexec,relatime 0 0
/dev/xvda2 /etc/resolv.conf ext4 rw,relatime,data=ordered 0 0
/dev/xvda2 /mnt/tmp ext4 rw,relatime,data=ordered 0 0
/dev/xvda2 /tmp/config ext4 rw,relatime,data=ordered 0 0
/dev/xvda2 /opt/intel ext4 rw,relatime,data=ordered 0 0
tmpfs /dev/shm tmpfs rw,nosuid,nodev 0 0
/dev/xvda2 /opt/bin ext4 rw,relatime,data=ordered 0 0
/dev/xvda2 /etc/hostname ext4 rw,relatime,data=ordered 0 0
/dev/xvda2 /opt/private ext4 rw,relatime,data=ordered 0 0
/dev/xvda2 /etc/hosts ext4 rw,relatime,data=ordered 0 0
/dev/xvda2 /home/builduser/.npmrc ext4 ro,relatime,data=ordered 0 0
tmpfs /run/docker.sock tmpfs rw,nosuid,relatime,size=52934720k,mode=755 0 0
/dev/xvda2 /home/builduser/.pypirc ext4 ro,relatime,data=ordered 0 0
/dev/xvda2 /home/builduser/.ssh/google_compute_engine ext4 rw,relatime,data=ordered 0 0
/dev/xvda2 /home/builduser/.jfrog/jfrog-cli.conf ext4 ro,relatime,data=ordered 0 0
/dev/xvda2 /home/builduser/.ssh/google_compute_engine.pub ext4 rw,relatime,data=ordered 0 0
devpts /dev/console devpts rw,nosuid,noexec,relatime,gid=5,mode=620,ptmxmode=000 0 0

#free -m
             total       used       free     shared    buffers     cached
Mem:        258470      43545     214924         32       4771      28253
-/+ buffers/cache:      10520     247950
Swap:            0          0          0

# df -h
Filesystem      Size  Used Avail Use% Mounted on
none             99G   36G   60G  38% /
tmpfs           127G     0  127G   0% /dev
tmpfs           127G     0  127G   0% /sys/fs/cgroup
/dev/xvda2       99G   36G   60G  38% /mnt/tmp
tmpfs           127G   16M  127G   1% /dev/shm
tmpfs            51G   17M   51G   1% /run/docker.sock

# df -i
Filesystem       Inodes   IUsed    IFree IUse% Mounted on
none            6553600 1186838  5366762   19% /
tmpfs          33084198     135 33084063    1% /dev
tmpfs          33084198      14 33084184    1% /sys/fs/cgroup
/dev/xvda2      6553600 1186838  5366762   19% /mnt/tmp
tmpfs          33084198      16 33084182    1% /dev/shm
tmpfs          33084198     966 33083232    1% /run/docker.sock

philwo · 2017-06-22T09:11:36Z

Whoah, yes, resources are not an issue on that machine. :)

Still, the only thing I could ever find about JVMs crashing with SIGBUS is this: http://bugs.java.com/view_bug.do?bug_id=6563308

I wonder if it's the same issue. Do you use any --sandbox_* flags for your build? Maybe --sandbox_tmpfs_path=...?

or-shachar · 2017-06-22T09:36:31Z

No... I have some test targets where I set the java_flags with -Djava.io.tmpdir=/tmp but it fails on the build phase.

ittaiz · 2017-06-23T11:46:58Z

@philwo any ideas? the problem is that network-isolation on our machine (docker container) doesn't work without enabling user namespace, fails for a different reason on 0.5.1 when it's enabled and doesn't build on head when it's enabled.
We were able to verify on a very small java repo that on HEAD with user namespace enabled that network-isolation works for us.
If we could just get past this we could parallel our tests...

philwo · 2017-06-23T12:18:52Z

I'm trying to find out why this is happening now.

philwo · 2017-06-23T12:46:49Z

Can you please try using the flag "--sandbox_tmpfs_path=/tmp" in your "bazel build" (or "bazel test") command and see if the error still happens? This will mount an empty tmpfs on /tmp for each running action. It's generally not a bad idea, because it increases hermeticity (otherwise /tmp is a writable directory shared between all actions of a build, so they could create conflicting files or accidentally keep state there.)

An alternate idea might be to mount a tmpfs on /tmp inside the Docker container before running the bazel command, but I'd like to try the first one first, because I remember that it helped a different user.

It would be interesting to see if this makes the problem disappear.

philwo · 2017-06-23T12:56:21Z

A process gets SIGBUS in one of two conditions:

It tries to read an address that no longer exists from an mmap'd file (e.g. because the file was truncated by a different process).
It tries to write more bytes to an mmap'd file than the underlying device can hold.

The first thing might happen when multiple JVM processes try to use the same file via mmap and one of them truncates it, I guess. Is the failing action always the same kind, e.g. a Scala compilation?

The second possibility is more likely, but you have 60G free on / and /tmp is not a mount point, so it's hard to see how you can run out of disk space during a build... OTOH you have almost four times as much RAM as free disk space on that machine - maybe some process uses a heuristic like "let's allocate a temp file with the size of 1/4 the RAM, this should always be a reasonable number"?

I also don't understand why this only happens when you use the linux-sandbox and not when you use the standalone strategy. Do I understand correctly that the build works fine then?

ittaiz · 2017-06-23T20:39:45Z

@philwo using --sandbox_tmpfs_path=/tmp indeed passes the build, thanks!
It seems that it slows down the build a bit. Am I right? Didn't have time yet to run multiple benchmarks.

ittaiz · 2017-06-23T20:40:50Z

Also, this should be solved right? Shouldn't it be a release blocker?

philwo · 2017-06-23T20:53:38Z

using --sandbox_tmpfs_path=/tmp indeed passes the build, thanks!

Nice! :)

It seems that it slows down the build a bit.

It shouldn't.. actually it should make things a bit faster, because /tmp is now backed by RAM instead of disk. On the other hand, it might be possible that mounting the tmpfs incurs some overhead, too.

If you measure a noticeable difference, I'd be quite interested in it.

this should be solved right? Shouldn't it be a release blocker?

Absolutely! The problem is, I cannot reproduce this on my machine and the cause is completely unknown. :( It also doesn't seem to affect many people. If we had a clear repro case or someone more familiar with JVM internals, it might be easier to get to the bottom of this issue.

philwo · 2017-06-23T20:58:06Z

What I'd be really interested in is if simply mounting a tmpfs on /tmp and not using the --sandbox_tmpfs_path flag also helps, or if the issue then happens again. What I mean is:

# I'm in my Docker container!
$ mount -t tmpfs tmpfs /tmp
$ bazel build //...

If the issue reoccurs, I believe that we're seeing a race condition here, maybe related to this: https://stackoverflow.com/questions/76327/how-can-i-prevent-java-from-creating-hsperfdata-files

Maybe the JVMs in a highly parallel build accidentally create hsperfdata files with the same name and when one truncates the file of another running JVM, that JVM gets a SIGBUS because the file underlying its mmap went away. But this is really just a guess.

More info: http://www.evanjones.ca/jvm-mmap-pause.html

philwo · 2017-06-23T21:15:23Z

OMG, wait, I got it

philwo · 2017-06-23T21:21:24Z

@aehlig @ulfjack FYI.

root@ubuntu:~# strace -f -- java HelloWorld
[pid 1432] open("/tmp/hsperfdata_root", O_RDONLY|O_NONBLOCK|O_DIRECTORY|O_CLOEXEC) = 4
[pid 1432] fchdir(4) = 0
[pid 1432] open("1431", O_RDWR|O_CREAT|O_NOFOLLOW, 0600) = 5
[pid 1432] ftruncate(5, 0) = 0
[pid 1432] mmap(NULL, 32768, PROT_READ|PROT_WRITE, MAP_SHARED, 5, 0) = 0x7f54cad93000

Every JVM creates a temporary performance instrumentation file in /tmp/hsperfdata_$USERNAME/$PID. When we use sandboxing, we use PID namespaces, which means that the PIDs are virtualized and all running JVMs believe they are PID 2.

This means that they all open/ftruncate/mmap the same file and that gives you SIGBUS eventually, due to case 1 I mentioned above: "It tries to read an address that no longer exists from an mmap'd file".

When you use --sandbox_tmpfs_path=/tmp, each running sandbox gets its own /tmp, so the files don't conflict.

This means the solution is quite simple and I'll come up with something on Monday. For now, I'd recommend to use the --sandbox_tmpfs_path=/tmp flag.

ittaiz · 2017-06-23T21:27:52Z

Well done!

…

On Sat, Jun 24, 2017 at 12:21 AM Philipp Wollermann < ***@***.***> wrote: @aehlig <https://github.com/aehlig> @ulfjack <https://github.com/ulfjack> FYI. ***@***.***:~# strace -f -- java HelloWorld [pid 1432] open("/tmp/hsperfdata_root", O_RDONLY|O_NONBLOCK|O_DIRECTORY|O_CLOEXEC) = 4 [pid 1432] fchdir(4) = 0 [pid 1432] open("1431", O_RDWR|O_CREAT|O_NOFOLLOW, 0600) = 5 [pid 1432] ftruncate(5, 0) = 0 [pid 1432] mmap(NULL, 32768, PROT_READ|PROT_WRITE, MAP_SHARED, 5, 0) = 0x7f54cad93000 Every JVM creates a temporary performance instrumentation file in /tmp/hsperfdata_$USERNAME/$PID. When we use sandboxing, we use PID namespaces, which means that the PIDs are virtualized and all running JVMs believe they are PID 2. This means that they all open/ftruncate/mmap the same file and that gives you SIGBUS eventually, due to case 1 I mentioned above: "It tries to read an address that no longer exists from an mmap'd file". When you use --sandbox_tmpfs_path=/tmp, each running sandbox gets its own /tmp, so the files don't conflict. This means the solution is quite simple and I'll come up with something on Monday. For now, I'd recommend to use the --sandbox_tmpfs_path=/tmp flag. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#3236 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABUIFwM1GwYHur29dB7Ex-QEaStV-72Dks5sHCxXgaJpZM4OBNWF> .

ulfjack · 2017-06-24T19:27:53Z

Does it help if we set TMPDIR to a unique path for each action?

ittaiz · 2017-06-24T19:57:43Z

Is that targeted to me? If so how can I do that?

…

On Sat, Jun 24, 2017 at 10:27 PM Ulf Adams ***@***.***> wrote: Does it help if we set TMPDIR to a unique path for each action? — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#3236 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABUIF53PNfGvdHgfy4dVVDdUlJnl_4Egks5sHWM9gaJpZM4OBNWF> .

philwo · 2017-06-24T20:21:27Z

I don't think there is any way to change this. "/tmp" seems to be the hard-coded location.

philwo@ubuntu:~$ java -version
openjdk version "1.8.0_131"
OpenJDK Runtime Environment (build 1.8.0_131-8u131-b11-0ubuntu1.16.04.2-b11)
OpenJDK 64-Bit Server VM (build 25.131-b11, mixed mode)

philwo@ubuntu:~$ strace -f \
  -E TMP=/home/philwo/tmp \
  -E TMPDIR=/home/philwo/tmp -- \
  java -Djava.io.tmpdir=/home/philwo/tmp HelloWorld 2>&1 | \
  fgrep hsperfdata
[pid  1589] open("/tmp/hsperfdata_philwo", O_RDONLY|O_NOFOLLOW) = 3
[pid  1589] open("/tmp/hsperfdata_philwo", O_RDONLY|O_NONBLOCK|O_DIRECTORY|O_CLOEXEC) = 4
[pid  1589] mkdir("/tmp/hsperfdata_philwo", 0755) = -1 EEXIST (File exists)
[pid  1589] lstat("/tmp/hsperfdata_philwo", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
[pid  1589] open("/tmp/hsperfdata_philwo", O_RDONLY|O_NOFOLLOW) = 3
[pid  1589] open("/tmp/hsperfdata_philwo", O_RDONLY|O_NONBLOCK|O_DIRECTORY|O_CLOEXEC) = 4
[pid  1589] unlink("/tmp/hsperfdata_philwo/1588") = 0

This is consistent with what I found on the Java bugtracker:

Initial bug report: http://bugs.java.com/view_bug.do?bug_id=6447182
Fix: http://bugs.java.com/view_bug.do?bug_id=6938627
Rollback: http://bugs.java.com/view_bug.do?bug_id=7009828

No further action or intend to change this behavior since then.

philwo · 2017-06-24T20:32:25Z

Some ideas:

Always mount tmpfs on /tmp. Problem: Some users might not want this behavior, e.g. on memory constrained environments or if they store huge temporary files in /tmp.
Make /tmp/hsperfdata_$USERNAME read-only inside the sandbox. This will prevent the JVM from creating the conflicting file. Problem: Adds JVM specific knowledge to our sandbox implementation.
Call Java with -XX:-UsePerfData or -XX:+PerfDisableSharedMem flag. Problem: Unclear how we can make sure that this will always be used, even when users manually launch Java from their own Skylark actions or something.

Any other ideas? I think I like 2) the best and it would be simple to implement.

jmmv · 2017-06-26T13:51:54Z

I don't know what the constraints are here, but instead of mounting a tmpfs instance, could you mount --bind a fresh temporary directory onto /tmp? Should avoid the memory issues, though you'd still be paying the cost to mount the extra fs. (Given that this is a problem caused by PID namespaces, which are Linux-specific, I'm assuming this unportable solution would be acceptable.)

philwo · 2017-06-26T13:53:53Z

@jmmv In theory yes, but this will unfortunately break the people who put their workspace or output base inside /tmp again, because then the tmp mounted on top of /tmp will hide your input respectively output files / dirs.

I still have no clue why anyone would do that, but it comes up every single time I accidentally break it.

ulfjack · 2017-06-27T07:27:01Z

We could detect that case and construct a sequence of bind mounts that make the workspace / output base visible even though we're bind mounting an empty dir to /tmp. Alternatively, we could bind mount an empty directory to /tmp/hsperfdata_/.

philwo · 2017-06-27T07:49:35Z

I'll give the "empty dir on /tmp" idea a try today and if that doesn't work out go for the make "/tmp/hsperfdata_$USERNAME read-only in sandboxes" version.

ittaiz · 2017-07-07T05:24:51Z

Any update?

…

On Tue, 27 Jun 2017 at 10:49 Philipp Wollermann ***@***.***> wrote: I'll give the "empty dir on /tmp" idea a try today and if that doesn't work out go for the make "/tmp/hsperfdata_$USERNAME read-only in sandboxes" version. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#3236 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABUIF9vrOZAO8A97DnTuoowCBdVmfXlyks5sILQSgaJpZM4OBNWF> .

philwo · 2017-07-08T09:06:45Z

@ittaiz Unfortunately I got sick just after writing that comment and was out of office the entire week :| I'm fine again now and will be back on Monday. Current plan is to mkdir that directory and then make it read-only unless a tmpfs is mounted on /tmp (because that also solves the problem in a different way).

It should be a rather simple fix that I can get easily done on Monday.

ittaiz · 2017-07-08T10:14:56Z

Thanks! Glad to hear you're better.

…

On Sat, 8 Jul 2017 at 12:06 Philipp Wollermann ***@***.***> wrote: @ittaiz <https://github.com/ittaiz> Unfortunately I got sick just after writing that comment and was out of office the entire week :| I'm fine again now and will be back on Monday. Current plan is to mkdir that directory and then make it read-only unless a tmpfs is mounted on /tmp (because that also solves the problem in a different way). It should be a rather simple fix that I can get easily done on Monday. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#3236 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABUIF0V7f0PAG0mofhq3OzmKHvD3QHwgks5sL0apgaJpZM4OBNWF> .

JaredNeil · 2023-07-24T20:37:15Z

Interestingly, HotSpot recently fixed this problem, too: openjdk/jdk@84f2314

Unfortunately fixing the problem was accompanied by a warning message printed to STDOUT, which is breaking some of our build actions that write to STDOUT (https://github.com/google/google-java-format), and filling our build logs with hundreds of those warnings for all other JVM-tool actions. I'm not sure how we were not affected by the crash, but are now affected by the logging, but hopefully --incompatible_sandbox_hermetic_tmp will fix that new problem for us. We're still on Bazel 5.3, but we'll be sure to try this flag when we can.

larsrc-google · 2023-07-25T06:42:29Z

Here's a thing to try for the warning message: Add the following to the JVM options for the actions:

-Xlog:disable
-Xlog:all=warning:stderr:uptime,level,tags

Fixes spurious Java compile action failures when the action is sandboxed. Work towards #3236 Closes #18892. PiperOrigin-RevId: 551818900 Change-Id: I3aee9a6d20a265c476c94be36be8974da2d4079b

Conhokis · 2023-09-07T14:30:57Z

Quick chime in, looking at --incompatible_hermetic_sandbox_tmp

This is fundamentally not a fix, it's a workaround for not using secure temp file creation. Enabling this workaround globally as done in #18892 smells like a "bruteforce" attempt that could actually mask real issues in tools (such as the one in the JDK) that are not really prepared for parallelization, regardless whether you're running it in a isolated environment.
So seems to me it's indeed fixing a java specific problem, but pretending it's a new feature while in reality adding extra complexity to the users.

And of course flag is marked as incompatible since users are surely relying already on the assumption that /tmp is shared.

So I think that the solution is definitely going in the right direction, but it has a couple issues:

Unlike --sandbox_tmpfs_path, I don't actually have the flexibility to provide extra paths. This is a bit of a smell that it's trying to solve a very specific problem here instead of providing an extra feature for isolation.
I strongly believe it should be opt-in, not opt-out. We give the flexibility for users to set this on their builds or fix the problems in their tools.

fmeum · 2023-09-07T15:00:05Z

This is fundamentally not a fix, it's a workaround for not using secure temp file creation. Enabling this workaround globally as done in #18892 smells like a "bruteforce" attempt that could actually mask real issues in tools (such as the one in the JDK) that are not really prepared for parallelization, regardless whether you're running it in a isolated environment.
So seems to me it's indeed fixing a java specific problem, but pretending it's a new feature while in reality adding extra complexity to the users.

It is both a fix and a workaround:

The JVM used to (until openjdk/jdk@84f2314) rely on the PID to generate the hsperfdata file. I fully agree with you that this is not a good practice and it should probably use some other mechanism to prevent collisions among these files.
The JVM uses a shared socket in a well-known location under /tmp that depends on the PID to handle certain remote access features, such as agent installation (see Java crashes due to hsperfdata file conflicts across sandboxes #3236 (comment)). This seems to be fully appropriate for this use case and I don't immediately see how this could be set up in other ways - "well-known location" and "collision-free" are pretty much at odds by definition. Since this scheme only breaks if two processes share /tmp but not the PID namespace, not sharing /tmp seemed to be the only way if we want to preserve isolation.

I think that 2) by itself is a reason for having and flipping --incompatible_hermetic_sandbox_tmp. I searched for but couldn't find tools that have similar issues and don't rely on /tmp, which is why I haven't thought about making this handling more configurable. If such tools exist, this could definitely be helpful.

Conhokis · 2023-09-07T15:31:15Z

Agreed for point 2, still would like to see the flexibility there but I understand now better this issue is beyond the hsperfdata file only.

Also understand that from bazel maintainers POV having out of the box support for common JVMs is a must-have.

Then the note on flexibility is done, consider it a possible minor improvement for some more esoteric tools / setups.

Appreciate the swift the reply 😄

See bazelbuild/bazel#3236 (comment)

…M crashes. See bazelbuild/bazel#3236 (comment)

The switch to Java-11 has brought instability in Bazel execution and random crashes (see [1]). Bazel doesn't have a fix at the moment, the only workaround is to add the --sandbox_tmpfs_path=/tmp option after the build/test commands. [1] bazelbuild/bazel#3236 Change-Id: I3aaabe756808a7da4ae51042922f9094c3759e22

Fixes bazelbuild#3236 Fixes bazelbuild#19915

Fixes bazelbuild#3236 Closes bazelbuild#19915 RELNOTES[INC]: `--incompatible_sandbox_hermetic_tmp` is enabled by default. See bazelbuild#19915 for migration advice. Closes bazelbuild#19943. PiperOrigin-RevId: 581165770 Change-Id: I0d98102f10b1e47c1d8fcf32fb1f7dee5ae0788c

Fixes #3236 Closes #19915 RELNOTES[INC]: `--incompatible_sandbox_hermetic_tmp` is enabled by default. See #19915 for migration advice. Closes #19943. Commit e2c0276 PiperOrigin-RevId: 581165770 Change-Id: I0d98102f10b1e47c1d8fcf32fb1f7dee5ae0788c Co-authored-by: Fabian Meumertzheim <fabian@meumertzhe.im>

iancha1992 · 2023-12-01T19:03:37Z

A fix for this issue has been included in Bazel 7.0.0 RC5. Please test out the release candidate and report any issues as soon as possible. Thanks!

The Bazel flag `--incompatible_sandbox_hermetic_tmp` is added to fix some issues with `mockito-core` and `byte-buddy`, e.g. see https://github.com/EngFlow/bazel_invocation_analyzer/actions/runs/7067252322/job/19240379595?pr=151 See mockito/mockito#1879 for a similar issue reported, and bazelbuild/bazel#3236 for the "fix" by specifying `--incompatible_sandbox_hermetic_tmp`. Note that this flag will be set to true by default with Bazel 7, which is expected to be released next week. --------- Signed-off-by: Sara Adams <sara.e.adams@gmail.com>

…chainCompileBootClasspath Copybara Import from #149 BEGIN_PUBLIC Disable perfdata when running JavaToolchainCompileClasses or JavaToolchainCompileBootClasspath (#149) A [recent JDK update](openjdk/jdk@84f2314#diff-7313eb3d328797a7720fa1b2b73cd159934506593443e45534baad80cb1382b7R924-R927) started printing the following warning message from the JVM ``` [warning][perf,memops] Cannot use file /tmp/hsperfdata_username/2 because it is locked by another process (errno = 11) ``` on linux hosts. Also referenced from bazelbuild/bazel#3236 This PR disables the perfdata generation when running JavaToolchainCompileClasses or JavaToolchainCompileBootClasspath so this warning message won't be printed. Closes #149 END_PUBLIC COPYBARA_INTEGRATE_REVIEW=#149 from cheister:no-perfdata 601c3e5 PiperOrigin-RevId: 614679249 Change-Id: I7e90c0ec0b93ac57763d3a5af546c4af9f7c9dc0

philwo self-assigned this Jun 22, 2017

philwo added category: sandboxing P2 We'll consider working on this in future. (Assignee optional) type: bug labels Jun 22, 2017

brettchabot added a commit to android/android-test that referenced this issue Sep 9, 2023

Move sandbox_tmpfs to .bazelrc.

f131221

See bazelbuild/bazel#3236 (comment)

brettchabot added a commit to android/android-test that referenced this issue Sep 9, 2023

Set sandbox_tmpfs_path for bazel build and test to attempt to stem JV…

d669b6d

…M crashes. See bazelbuild/bazel#3236 (comment)

shahms mentioned this issue Oct 18, 2023

java indexer tests fail with --config=java20 kythe/kythe#5743

Closed

fmeum mentioned this issue Oct 21, 2023

incompatible_sandbox_hermetic_tmp #19915

Closed

fmeum added a commit to fmeum/bazel that referenced this issue Oct 25, 2023

Enable --incompatible_sandbox_hermetic_tmp by default

ddadf98

Fixes bazelbuild#3236 Fixes bazelbuild#19915

fmeum mentioned this issue Oct 25, 2023

Enable --incompatible_sandbox_hermetic_tmp by default #19943

Closed

fmeum added a commit to fmeum/bazel that referenced this issue Oct 25, 2023

Enable --incompatible_sandbox_hermetic_tmp by default

92ec681

Fixes bazelbuild#3236 Fixes bazelbuild#19915

fmeum added a commit to fmeum/bazel that referenced this issue Oct 25, 2023

Enable --incompatible_sandbox_hermetic_tmp by default

3b9cc81

Fixes bazelbuild#3236 Fixes bazelbuild#19915

This was referenced Oct 26, 2023

Disable JVM perfdata when stamping jars bazel-contrib/rules_jvm_external#988

Merged

Disable perfdata when running JavaToolchainCompileClasses or JavaToolchainCompileBootClasspath bazelbuild/rules_java#149

Closed

fmeum added a commit to fmeum/bazel that referenced this issue Oct 30, 2023

Enable --incompatible_sandbox_hermetic_tmp by default

88fd6fb

Fixes bazelbuild#3236 Fixes bazelbuild#19915

fmeum added a commit to fmeum/bazel that referenced this issue Nov 6, 2023

Enable --incompatible_sandbox_hermetic_tmp by default

0a95ee2

Fixes bazelbuild#3236 Fixes bazelbuild#19915

copybara-service bot closed this as completed in e2c0276 Nov 10, 2023

bazel-io mentioned this issue Nov 10, 2023

[7.0.0] Enable --incompatible_sandbox_hermetic_tmp by default #20136

Merged

saraadams mentioned this issue Dec 2, 2023

Update external dependencies mockito and byte-buddy EngFlow/bazel_invocation_analyzer#151

Merged

rdesgroppes mentioned this issue Apr 24, 2024

"dangling symbolic link" flakes after upgrading to Bazel 7 #20886

Closed

ryanmacdonald mentioned this issue Jul 30, 2024

SIGBUS JVM error seen with bazel version 6.3.0 #23146

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Java crashes due to hsperfdata file conflicts across sandboxes #3236

Java crashes due to hsperfdata file conflicts across sandboxes #3236

or-shachar commented Jun 21, 2017 •

edited

Loading

philwo commented Jun 22, 2017

or-shachar commented Jun 22, 2017 •

edited

Loading

philwo commented Jun 22, 2017

or-shachar commented Jun 22, 2017

ittaiz commented Jun 23, 2017

philwo commented Jun 23, 2017

philwo commented Jun 23, 2017 •

edited

Loading

philwo commented Jun 23, 2017 •

edited

Loading

ittaiz commented Jun 23, 2017

ittaiz commented Jun 23, 2017

philwo commented Jun 23, 2017 •

edited

Loading

philwo commented Jun 23, 2017 •

edited

Loading

philwo commented Jun 23, 2017

philwo commented Jun 23, 2017

ittaiz commented Jun 23, 2017 via email

ulfjack commented Jun 24, 2017

ittaiz commented Jun 24, 2017 via email

philwo commented Jun 24, 2017 •

edited

Loading

philwo commented Jun 24, 2017 •

edited

Loading

jmmv commented Jun 26, 2017

philwo commented Jun 26, 2017 •

edited

Loading

ulfjack commented Jun 27, 2017

philwo commented Jun 27, 2017

ittaiz commented Jul 7, 2017 via email

philwo commented Jul 8, 2017

ittaiz commented Jul 8, 2017 via email

JaredNeil commented Jul 24, 2023

larsrc-google commented Jul 25, 2023

Conhokis commented Sep 7, 2023

fmeum commented Sep 7, 2023

Conhokis commented Sep 7, 2023

iancha1992 commented Dec 1, 2023

Java crashes due to hsperfdata file conflicts across sandboxes #3236

Java crashes due to hsperfdata file conflicts across sandboxes #3236

Comments

or-shachar commented Jun 21, 2017 • edited Loading

Issue

Environment info

additional information

philwo commented Jun 22, 2017

or-shachar commented Jun 22, 2017 • edited Loading

philwo commented Jun 22, 2017

or-shachar commented Jun 22, 2017

ittaiz commented Jun 23, 2017

philwo commented Jun 23, 2017

philwo commented Jun 23, 2017 • edited Loading

philwo commented Jun 23, 2017 • edited Loading

ittaiz commented Jun 23, 2017

ittaiz commented Jun 23, 2017

philwo commented Jun 23, 2017 • edited Loading

philwo commented Jun 23, 2017 • edited Loading

philwo commented Jun 23, 2017

philwo commented Jun 23, 2017

ittaiz commented Jun 23, 2017 via email

ulfjack commented Jun 24, 2017

ittaiz commented Jun 24, 2017 via email

philwo commented Jun 24, 2017 • edited Loading

philwo commented Jun 24, 2017 • edited Loading

jmmv commented Jun 26, 2017

philwo commented Jun 26, 2017 • edited Loading

ulfjack commented Jun 27, 2017

philwo commented Jun 27, 2017

ittaiz commented Jul 7, 2017 via email

philwo commented Jul 8, 2017

ittaiz commented Jul 8, 2017 via email

JaredNeil commented Jul 24, 2023

larsrc-google commented Jul 25, 2023

Conhokis commented Sep 7, 2023

fmeum commented Sep 7, 2023

Conhokis commented Sep 7, 2023

iancha1992 commented Dec 1, 2023

or-shachar commented Jun 21, 2017 •

edited

Loading

or-shachar commented Jun 22, 2017 •

edited

Loading

philwo commented Jun 23, 2017 •

edited

Loading

philwo commented Jun 23, 2017 •

edited

Loading

philwo commented Jun 23, 2017 •

edited

Loading

philwo commented Jun 23, 2017 •

edited

Loading

philwo commented Jun 24, 2017 •

edited

Loading

philwo commented Jun 24, 2017 •

edited

Loading

philwo commented Jun 26, 2017 •

edited

Loading