Make Docker images non-root, by default, and OpenShift compliant #13376

michaeljmarshall · 2021-12-17T06:13:09Z

Master Issue: #11269

Motivation

To increase the overall security of our Pulsar docker images, they should run as a non-root user by default. While updating these permissions, I made sure to comply with the OpenShift spec so that the docker image can run on that platform out of the box.

Once we finalize these changes, we will need to update the Apache Pulsar Helm chart to make sure that deployments take advantage of this feature. We'll use the fsGroup to make sure that k8s sets the appropriate file system permissions for the zookeeper, bookkeeper, and function pods.

Modifications

Default to run as UID 10000. As noted in the Dockerfile, this UID is arbitrary. No logic should rely on this id.
Update filesystem permissions so that the group has sufficient write permission. The group is GID 0 (root).
Remove unnecessary write access.
- The /pulsar/{conf,data,logs} directories and their members must be writable by the root group. I don't know of any other directories that need to be written to. Note that the bin/pulsar-admin tool creates a log file in the /pulsar/logs directory. Please let me know if there are any additional directories.
- Note also that the executable file permissions are already set in our git repo. Those permissions are inherited by the docker image when we run the COPY directive in the Dockerfile.
There are no changes to the function worker in the k8s runtime. We do not need them because we already merged 04b5da0.
Add note to conf/bkenv.sh, as it is a .sh script that is not executable (and doesn't need to be).
Update test docker image and supervisord configuration.
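The permission model in the list above can be sketched against a scratch directory instead of the real image. This is only an illustration under stated assumptions: `demo_root` and the extra `lib` directory are hypothetical, the `chmod -R g+w` calls mirror the ones in this PR's Dockerfile diff, and `stat -c` assumes a Linux (GNU or busybox) `stat`. In the real image the owning group is root (GID 0); here it is whatever group the current user has.

```shell
# Build a throwaway tree that mimics /pulsar/{conf,data,logs}.
# demo_root is a hypothetical scratch location, not a path from the PR.
demo_root="$(mktemp -d)"
umask 022
mkdir "$demo_root/conf" "$demo_root/data" "$demo_root/logs" "$demo_root/lib"

# Grant group write only where the processes need to write.
for dir in conf data logs; do
    chmod -R g+w "$demo_root/$dir"
done

# Read back the octal modes: conf gained group write, lib did not.
conf_mode="$(stat -c '%a' "$demo_root/conf")"
lib_mode="$(stat -c '%a' "$demo_root/lib")"
echo "conf=$conf_mode lib=$lib_mode"
```

Because an arbitrary UID on OpenShift is still a member of the root group, group write on these directories is what lets the process write without owning the files.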

Note: it's unclear to me how the OpenShift spec handles restarts. I know that the UID is arbitrary. It's possible that the umask needs to be switched from 022 to 002. Setting the umask in the docker image does not persist for consumers of the image, so this would need to be set in a helm chart.
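To make the umask concern concrete, here is a small sketch that creates one file under each umask and compares the resulting modes. The scratch directory and file names are illustrative assumptions, and `stat -c` assumes a Linux `stat`. It says nothing about how OpenShift itself behaves on restart; it only shows why `002` matters when a later process shares the group but not the UID.

```shell
# Create one file under each umask; subshells keep the umask change local.
scratch="$(mktemp -d)"

( umask 022; : > "$scratch/with_022" )
( umask 002; : > "$scratch/with_002" )

# With 022 the group loses write access; with 002 the group keeps it.
mode_022="$(stat -c '%a' "$scratch/with_022")"
mode_002="$(stat -c '%a' "$scratch/with_002")"
echo "umask 022 -> $mode_022, umask 002 -> $mode_002"
```

If a container restarts under a different arbitrary UID, files written with umask `022` would be read-only to the new UID, while files written with `002` would remain writable through the shared root group.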

Verifying this change

You can access a test image built with these changes here: michaelmarshall/pulsar:2.10.0-SNAPSHOT. I have already run some manual tests like bin/pulsar standalone in the container. I still need to deploy an actual cluster to verify that all of the unique components work correctly. Because we already merged 04b5da0, the upgrade scenarios are already simplified. If this change is in 2.10.0, that means 2.8 and 2.9 will be compatible for certain function worker upgrade scenarios.

I wrote test criteria in #11269. I'll need to follow up on those criteria using my newly built image. I should be able to look closer at this tomorrow.

We'll also need tests to pass, as I modified some tests with this PR.

References

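The following links were useful in understanding how to make these changes:
- https://engineering.bitnami.com/articles/running-non-root-containers-on-openshift.html
- https://cloud.redhat.com/blog/a-guide-to-openshift-and-uids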

Does this pull request potentially affect one of the following parts:

This PR updates our Docker images in a breaking way. It could result in bookkeepers, zookeepers, or functions with insufficient permissions. We will mitigate these permission issues by updating the helm chart. These changes are easily overridden by extending the docker image. In k8s, you can use the pod's securityContext to override the user or group.

Documentation

doc-required

I need to add docs for this PR. I'll do that in a follow-up commit tomorrow.

github-actions · 2021-12-17T06:14:12Z

@michaeljmarshall: Thanks for your contribution. For this PR, do we need to update docs?
(The PR template contains info about doc, which helps others know more about the changes. Can you provide doc-related info in this and future PR descriptions? Thanks)

michaeljmarshall · 2021-12-17T06:35:06Z

Mailing list thread: https://lists.apache.org/thread/lrt5ycdbms69npblp4j1jp75138wvn62

michaeljmarshall · 2021-12-17T07:03:03Z

Looks like the test failures are related to containers failing to start. I'll take a closer look in the morning.

docker/pulsar/Dockerfile

nicoloboschi · 2021-12-17T13:26:20Z

conf/bkenv.sh

@@ -18,6 +18,8 @@
 # under the License.
 #

+# NOTE: this script is intentionally not executable. It is only meant to be sourced for environment variables.


What about removing the execute permission? That way you can run source conf/bkenv.sh, but you can't run it using ./conf/bkenv.sh.

By the way, I'm not sure it's worth doing that.

The conf/bkenv.sh file does not currently have execute permissions. It is used in a few places with commands like:

source $BK_HOME/conf/bkenv.sh

and

# Check bookkeeper env and load bkenv.sh
if [ -f "$PULSAR_HOME/conf/bkenv.sh" ]
then
    . "$PULSAR_HOME/conf/bkenv.sh"
fi

I added this note because it surprised me that a .sh file was in the conf directory. If the note is obvious/confusing, I can remove it.
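The difference between sourcing and executing can be shown with a stand-in file. This is a sketch only: `bkenv_demo.sh` and `BK_DEMO_VALUE` are hypothetical names, not anything in the Pulsar repository. Sourcing needs read permission, while direct execution needs the execute bit.

```shell
# Hypothetical stand-in for conf/bkenv.sh: sets a variable, has no execute bit.
scratch="$(mktemp -d)"
env_file="$scratch/bkenv_demo.sh"
printf 'BK_DEMO_VALUE=loaded\n' > "$env_file"
chmod 644 "$env_file"

# Sourcing only reads the file, so mode 644 is enough.
. "$env_file"
echo "sourced: $BK_DEMO_VALUE"

# Direct execution requires the execute bit, so this attempt is expected to fail.
if "$env_file" 2>/dev/null; then
    exec_result="ran"
else
    exec_result="refused"
fi
echo "executed directly: $exec_result"
```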

michaeljmarshall · 2021-12-17T22:41:58Z

The Tiered FileSystem test failed because the broker couldn't write to the location. Since it is unrealistic for brokers to persist to a local file system, I am going to update the test to write to the /pulsar/data directory. In practice, even if the broker were writing to the local file system, the file system owner would just need to update the host's file system permissions.

2021-12-17T17:22:31,911+0000 [offloader-OrderedScheduler-0-0] WARN  org.apache.bookkeeper.mledger.impl.ManagedLedgerImpl - [offload-test-deletion-lag-tayt/ns1/persistent/topic1] Exception occurred for ledgerId 2 timestamp 1639761751911 during offload
java.util.concurrent.CompletionException: java.io.IOException: Mkdirs failed to create directory offload-test-deletion-lag-tayt/ns1/persistent/topic1/2-97ea8cf8-c5d4-4296-a42a-a09944c49b4f
	at java.util.concurrent.CompletableFuture.encodeRelay(CompletableFuture.java:367) ~[?:?]
	at java.util.concurrent.CompletableFuture.completeRelay(CompletableFuture.java:376) ~[?:?]
	at java.util.concurrent.CompletableFuture$UniRelay.tryFire(CompletableFuture.java:1019) ~[?:?]
	at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506) [?:?]
	at java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2088) [?:?]
	at org.apache.bookkeeper.mledger.offload.filesystem.impl.FileSystemManagedLedgerOffloader$LedgerReader.run(FileSystemManagedLedgerOffloader.java:220) [sdEcYt3w0SKH4TJGsfGbrg/:?]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
	at com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListenableFutureTask.java:125) [com.google.guava-guava-30.1-jre.jar:?]
	at com.google.common.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:69) [com.google.guava-guava-30.1-jre.jar:?]
	at com.google.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:78) [com.google.guava-guava-30.1-jre.jar:?]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
	at java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) [?:?]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
	at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30) [io.netty-netty-common-4.1.72.Final.jar:4.1.72.Final]
	at java.lang.Thread.run(Thread.java:829) [?:?]
Caused by: java.io.IOException: Mkdirs failed to create directory offload-test-deletion-lag-tayt/ns1/persistent/topic1/2-97ea8cf8-c5d4-4296-a42a-a09944c49b4f
	at org.apache.hadoop.io.MapFile$Writer.<init>(MapFile.java:269) ~[?:?]
	at org.apache.bookkeeper.mledger.offload.filesystem.impl.FileSystemManagedLedgerOffloader$LedgerReader.run(FileSystemManagedLedgerOffloader.java:187) ~[?:?]
	... 11 more

michaeljmarshall · 2021-12-18T07:09:44Z

Still failing with the same error. Need to dig into the test a bit more. In my opinion, we should keep the docker image as locked down as possible, so I'd prefer to modify the failing test instead of the docker image.

2021-12-18T06:38:38,025+0000 [offloader-OrderedScheduler-0-0] ERROR org.apache.bookkeeper.mledger.offload.filesystem.impl.FileSystemManagedLedgerOffloader - Exception when get CompletableFuture<LedgerEntries> : ManagerLedgerName: offload-test-deletion-lag-ataq/ns1/persistent/topic1, LedgerId: 0, UUID: da175fe9-809c-4064-982b-f7b39fac16e7 
java.io.IOException: Mkdirs failed to create directory offload-test-deletion-lag-ataq/ns1/persistent/topic1/0-da175fe9-809c-4064-982b-f7b39fac16e7
	at org.apache.hadoop.io.MapFile$Writer.<init>(MapFile.java:269) ~[hadoop-common-3.3.0.jar:?]
	at org.apache.bookkeeper.mledger.offload.filesystem.impl.FileSystemManagedLedgerOffloader$LedgerReader.run(FileSystemManagedLedgerOffloader.java:187) [rm6oA3cNO6zD4km9765wTA/:?]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
	at com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListenableFutureTask.java:125) [com.google.guava-guava-30.1-jre.jar:?]
	at com.google.common.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:69) [com.google.guava-guava-30.1-jre.jar:?]
	at com.google.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:78) [com.google.guava-guava-30.1-jre.jar:?]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
	at java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) [?:?]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
	at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30) [io.netty-netty-common-4.1.72.Final.jar:4.1.72.Final]
	at java.lang.Thread.run(Thread.java:829) [?:?]
2021-12-18T06:38:38,093+0000 [offloader-OrderedScheduler-0-0] WARN  org.apache.bookkeeper.mledger.impl.ManagedLedgerImpl - [offload-test-deletion-lag-ataq/ns1/persistent/topic1] Exception occurred for ledgerId 0 timestamp 1639809518093 during offload
java.util.concurrent.CompletionException: java.io.IOException: Mkdirs failed to create directory offload-test-deletion-lag-ataq/ns1/persistent/topic1/0-da175fe9-809c-4064-982b-f7b39fac16e7
	at java.util.concurrent.CompletableFuture.encodeRelay(CompletableFuture.java:367) ~[?:?]
	at java.util.concurrent.CompletableFuture.completeRelay(CompletableFuture.java:376) ~[?:?]
	at java.util.concurrent.CompletableFuture$UniRelay.tryFire(CompletableFuture.java:1019) ~[?:?]
	at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:506) [?:?]
	at java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:2088) [?:?]
	at org.apache.bookkeeper.mledger.offload.filesystem.impl.FileSystemManagedLedgerOffloader$LedgerReader.run(FileSystemManagedLedgerOffloader.java:220) [rm6oA3cNO6zD4km9765wTA/:?]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
	at com.google.common.util.concurrent.TrustedListenableFutureTask$TrustedFutureInterruptibleTask.runInterruptibly(TrustedListenableFutureTask.java:125) [com.google.guava-guava-30.1-jre.jar:?]
	at com.google.common.util.concurrent.InterruptibleTask.run(InterruptibleTask.java:69) [com.google.guava-guava-30.1-jre.jar:?]
	at com.google.common.util.concurrent.TrustedListenableFutureTask.run(TrustedListenableFutureTask.java:78) [com.google.guava-guava-30.1-jre.jar:?]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) [?:?]
	at java.util.concurrent.FutureTask.run(FutureTask.java:264) [?:?]
	at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) [?:?]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) [?:?]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) [?:?]
	at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30) [io.netty-netty-common-4.1.72.Final.jar:4.1.72.Final]
	at java.lang.Thread.run(Thread.java:829) [?:?]
Caused by: java.io.IOException: Mkdirs failed to create directory offload-test-deletion-lag-ataq/ns1/persistent/topic1/0-da175fe9-809c-4064-982b-f7b39fac16e7
	at org.apache.hadoop.io.MapFile$Writer.<init>(MapFile.java:269) ~[?:?]
	at org.apache.bookkeeper.mledger.offload.filesystem.impl.FileSystemManagedLedgerOffloader$LedgerReader.run(FileSystemManagedLedgerOffloader.java:187) ~[?:?]
	... 11 more

freeznet

I have tested with michaelmarshall/pulsar:2.10.0-SNAPSHOT, please check my comment, seems the download dir cannot be created.

I have no name!@d48234a40a05:/pulsar$ mkdir download
mkdir: cannot create directory 'download': Permission denied

freeznet · 2021-12-20T02:48:51Z

docker/pulsar/Dockerfile

+RUN chmod -R g+w /pulsar/conf
+RUN mkdir /pulsar/data && chmod -R g+w /pulsar/data
+RUN mkdir /pulsar/logs && chmod -R g+w /pulsar/logs
+RUN chmod -R u-w /pulsar


The /pulsar directory no longer has write permission, so functions will fail to download with the default functions_worker config: downloadDirectory is set to download/pulsar_functions, and /pulsar/download/pulsar_functions cannot be created by the pulsar user.
We should also check the permissions of other directories used by functions, such as narExtractionDirectory.

Thanks for testing this @freeznet. I hadn't come across that error yet. We could create the /pulsar/download/pulsar_functions directory as part of this docker build so that we can ensure the build has the right permissions. That solution won't help anyone who has overridden narExtractionDirectory, though.

I would like to avoid giving the user unnecessary permissions. The permission to write to the /pulsar directory seems like more permission than the user needs.
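One way to reason about the option discussed above (pre-creating the download directory with group write, while leaving other directories locked down) is a small check of the group-write bit. The sketch below uses a scratch tree; `is_group_writable` and the `lib` directory are hypothetical helpers for illustration, and `stat -c '%A'` assumes a Linux `stat`.

```shell
# Succeeds when a path carries the group-write bit.
# The symbolic mode looks like drwxrwxr-x; the sixth character is the group "w" slot.
is_group_writable() {
    case "$(stat -c '%A' "$1")" in
        ?????w*) return 0 ;;
        *) return 1 ;;
    esac
}

# Scratch tree standing in for /pulsar; only download/ is opened to the group.
scratch="$(mktemp -d)"
umask 022
mkdir -p "$scratch/download/pulsar_functions" "$scratch/lib"
chmod -R g+w "$scratch/download"

if is_group_writable "$scratch/download/pulsar_functions"; then
    download_status="group-writable"
else
    download_status="not group-writable"
fi

if is_group_writable "$scratch/lib"; then
    lib_status="group-writable"
else
    lib_status="not group-writable"
fi

echo "download/pulsar_functions: $download_status"
echo "lib: $lib_status"
```

A check like this only inspects mode bits. It does not cover a narExtractionDirectory or downloadDirectory that an operator has overridden to some other location.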

freeznet · 2021-12-20T02:50:03Z

@timmyyuan Please help to review this PR when you have time, thanks.

docker/pulsar/Dockerfile

michaeljmarshall · 2021-12-20T17:33:21Z

I removed the chmod -R u-w /pulsar from the builder image. I didn't fix the tests yet, so this latest commit will still fail tests.

michaeljmarshall · 2021-12-20T21:06:15Z

@freeznet - I discovered that our default hdfs configuration also requires arbitrary writing to the /pulsar directory.

pulsar/conf/filesystem_offload_core_site.xml

Lines 27 to 30 in f7abada

    
<property>
    <name>hadoop.tmp.dir</name>
    <value>pulsar</value>
</property>

I don't think this is a very good default, but I don't think we can change it without breaking the current implementation. With the latest commit, I gave the root group write permission to /pulsar.

michaeljmarshall · 2021-12-20T21:09:05Z

@congbobo184 - is it possible to update the default hadoop.tmp.dir without breaking already running pulsar clusters? I'm not familiar with hdfs. It would be really helpful to move it to /pulsar/data or some subdirectory of the /pulsar directory.

pulsar/conf/filesystem_offload_core_site.xml

Lines 27 to 30 in f7abada

    
<property>
    <name>hadoop.tmp.dir</name>
    <value>pulsar</value>
</property>

michaeljmarshall · 2021-12-20T23:24:30Z

The Pulsar SQL tests have been failing because presto writes some logs to /pulsar/lib/presto/ (technically, it writes the logs to PRESTO_HOME defined in bin/pulsar). Since we're working to remove presto in 2.10, which is also when this change would go live, we don't need a long term solution for this permission issue.

michaeljmarshall · 2021-12-21T20:37:26Z

New docker image built from the most recent commit: michaelmarshall/pulsar:2.10.0-SNAPSHOT-1dec8b9.

Note that I ran the dive tool (https://github.com/wagoodman/dive) against the 2.9.0 image and the 1dec8b9 image. Below are screenshots of the results. It looks like our command to install utilities introduces some inefficiency. My changes don't affect image size, though.

RUN apt-get update \
     && apt-get -y dist-upgrade \
     && apt-get -y install openjdk-11-jdk-headless netcat dnsutils less procps iputils-ping \
                 python3 python3-dev python3-setuptools python3-yaml python3-kazoo \
                 libreadline-gplv2-dev libncursesw5-dev libssl-dev libsqlite3-dev tk-dev libgdbm-dev libc6-dev libbz2-dev \
                 curl \
     && apt-get -y --purge autoremove \
     && apt-get autoclean \
     && apt-get clean \
     && rm -rf /var/lib/apt/lists/*

freeznet

lgtm, but I do have some more questions about non-root image, please check my comments, thanks.

freeznet · 2021-12-22T01:40:33Z

tests/docker-images/latest-version-image/Dockerfile

+USER root
+
+# We need to define the user in order for supervisord to work correctly
+# We don't need a user defined in the public docker image, though.


Could you please give more context about why we don't need a user defined for the public image? Since the test image is built from the pulsar image, and both use UID 10000, why not add the user in the public image?

The UID needs to be a defined user in order to work with supervisord. Before I defined the user, the tests failed with this error:

Error: Invalid user id 10000 in section 'program:bookie' (file: '/etc/supervisord/conf.d/bookie.conf')

The public docker image does not need the user defined because we don't use supervisord to run the different pulsar processes. Further, when the docker image is used in OpenShift, it will receive a random UID that is guaranteed to be a member of the root group. My design here is similar. I don't think we should name the pulsar user because we don't want any logic tied to the UID. For reference, all bitnami docker images behave this way. They are well known for making non-root docker images. Here is one of their blogs that I consulted while preparing this change: https://engineering.bitnami.com/articles/running-non-root-containers-on-openshift.html.
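The distinction between a numeric UID and a defined user can be checked directly. The sketch below is an assumption-laden illustration: `passwd_name_for_uid` is a hypothetical helper that reads only /etc/passwd and ignores other name services. A UID without an entry is also what produces the "I have no name!" shell prompt seen earlier in this thread.

```shell
# Print the user name for a numeric UID, or nothing if /etc/passwd has no entry.
# Only /etc/passwd is consulted; other name services (LDAP, etc.) are ignored.
passwd_name_for_uid() {
    awk -F: -v uid="$1" '$3 == uid { print $1; exit }' /etc/passwd
}

# Compare UID 0 with the UID this shell is currently running as.
for uid in 0 "$(id -u)"; do
    name="$(passwd_name_for_uid "$uid")"
    if [ -n "$name" ]; then
        echo "UID $uid is defined as '$name'"
    else
        echo "UID $uid has no passwd entry"
    fi
done
```

Run inside the public image as UID 10000, a check like this would be expected to report no entry, which is fine for processes that never look the user up by name.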

freeznet · 2021-12-22T01:42:40Z

docker/pulsar/Dockerfile

+
+# The UID must be non-zero. Otherwise, it is arbitrary. No logic should rely on its specific value.
+USER 10000


What are the best practices for debugging the system when the container is in non-root mode?

Debugging can be challenging, especially because a non-root user does not have the privilege to download packages. The bitnami blog I referenced has some documentation on their experience debugging non-root containers: https://engineering.bitnami.com/articles/running-non-root-containers-on-openshift.html.

Depending on your environment, there are several concrete options. In Kubernetes, you can set the pod's securityContext to runAsUser: 0 if you really need to run it as root. If you have access to the docker container's host, you should be able to exec into the container as the root user. We could also produce a "debug" image that contains additional debug tools.

I can add some documentation about this, too. My initial focus for this PR was to get the technical part done first. Once we have solidified the design, I'll add docs.

eolivelli · 2021-12-22T07:26:49Z

After committing this patch, we should take #10815 into consideration, which is about adding tools to manage/inspect/debug the system.

eolivelli

Lgtm

michaeljmarshall · 2022-02-10T03:56:35Z

The conflict was only on white space. No code/configuration was changed.

michaeljmarshall · 2022-02-10T05:50:01Z

@codelipenghui - I haven't yet been able to test the function upgrade scenario that @sijie requested. It'd be great to get this included in branch-2.10, but I want to be clear about the status of testing before this gets merged.

codelipenghui · 2022-02-10T12:51:02Z

@michaeljmarshall Ok, I see. Thanks for the update. I will also ask @freeznet for testing this case if he has time to confirm it. Thanks again for the great work!

freeznet · 2022-02-11T02:22:34Z

@michaeljmarshall this brings me to one more question: do we need to add a securityContext to the Kubernetes runtime to prevent possible insufficient-permission issues when upgrading to 2.10?

michaeljmarshall · 2022-02-12T00:34:57Z

@freeznet - yes, it'll be important to set the securityContext. That will ensure mounted volumes have the correct fs group write permissions. It'll also take care of upgrading existing volumes.
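As a rough illustration of the kind of pod-level securityContext being discussed, the sketch below writes a fragment to a scratch file and prints it. UID 10000 and group 0 come from this PR; the exact field choices and layout are assumptions and not the helm chart's actual values. On OpenShift the platform normally assigns the UID, so `runAsUser` would likely be omitted there.

```shell
# Write a hypothetical pod-level securityContext fragment to a scratch file.
# UID 10000 and group 0 come from this PR; everything else is illustrative.
fragment="$(mktemp)"
cat > "$fragment" <<'EOF'
securityContext:
  runAsNonRoot: true
  runAsUser: 10000
  fsGroup: 0
EOF

# Show the fragment that would sit under the pod spec of a statefulset or deployment.
cat "$fragment"
```

With `fsGroup` set, Kubernetes applies that group to supported mounted volumes, which is the mechanism the follow-up helm chart change is expected to rely on.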

freeznet · 2022-02-13T10:23:10Z

@michaeljmarshall thanks for the reply, so will this PR add the securityContext to kubernetes runtime?

michaeljmarshall · 2022-02-14T17:21:15Z

@freeznet - no. The securityContext is set in the pod spec, which is configured in the statefulset or the deployment. I plan to follow up with a PR to the helm chart.

freeznet · 2022-02-15T01:21:21Z

@michaeljmarshall but the Pulsar Functions Kubernetes runtime creates a statefulset for each function, so a securityContext is needed for the function's statefulsets/pods as well, right?

michaeljmarshall · 2022-02-15T03:33:17Z

@michaeljmarshall but the Pulsar Functions Kubernetes runtime creates a statefulset for each function, so a securityContext is needed for the function's statefulsets/pods as well, right?

@freeznet - why does the function's pod need a securityContext? The secrets are currently readable by the root group, and there are no persistent volumes that would need to have updated file system permissions.

lhotari · 2022-02-15T10:51:19Z

@codelipenghui @freeznet @michaeljmarshall Can we merge this PR?

lhotari · 2022-02-15T10:51:58Z

@freeznet Are you planning to do some testing before this is merged? Please test this asap. Thank you /cc @codelipenghui

freeznet · 2022-02-16T01:32:25Z

@lhotari yes, I am working on it, and it should be done this week.

lhotari · 2022-02-17T10:46:22Z

@freeznet I noticed your mailing list post that you have completed verification. Thanks for verifying.

Ready to merge?

) Master Issue: #11269

### Motivation

In order to increase the overall security of our Pulsar docker images, they should default to run as the non-root user. While updating these permissions, I make sure to comply with the OpenShift spec so the docker image can run on that platform out of the box.

Once we finalize these changes, we will need to update the Apache Pulsar Helm chart to make sure that deployments take advantage of this feature. We'll use the `fsGroup` to make sure that k8s sets the appropriate file system permissions for the zookeeper, bookkeeper, and function pods.

### Modifications

* Default to run as UID 10000. As noted in the `Dockerfile`, this UID is arbitrary. No logic should rely on this id.
* Update filesystem permissions so that the group user has sufficient write permission. The group user is 0 (root).
* Remove unnecessary write access.
  * The `/pulsar/{conf,data,logs}` directories and their members must be writable by the root group. I don't know of any other directories that need to be written to. Note that the `bin/pulsar-admin` too creates a log file in the `/pulsar/logs` directory. Please let me know if there are any additional
  * Note also that the executable file permissions are already set in our git repo. Those permissions are inherited by the docker image when we run the `COPY` directive in the `Dockerfile`.
* There are no changes to the function worker in the k8s runtime. We do not need them because we already merged 04b5da0.
* Add note to `conf/bkenv.sh`, as it is a `.sh` script that is not executable (and doesn't need to be).
* Update test docker image and `supervisord` configuration.

Note: it's unclear to me how the OpenShift spec handles restarts. I know that the UID is arbitrary. It's possible that the umask needs to be switched from `022` to `002`. Setting the umask in the docker image does not persist for consumers of the image, so this would need to be set in a helm chart.

### Verifying this change

You can access a test image built with these changes here: `michaelmarshall/pulsar:2.10.0-SNAPSHOT`. I have already run some manual tests like `bin/pulsar standalone` in the container. I still need to deploy an actual cluster to verify that all of the unique components work correctly. Because we already merged 04b5da0, the upgrade scenarios are already simplified. If this change is in 2.10.0, that means 2.8 and 2.9 will be compatible for certain function worker upgrade scenarios.

I wrote test criteria in #11269. I'll need to follow up on that criteria using my newly build image. I should be able to look closer at this tomorrow.

We'll also need tests to pass, as I modified some tests with this PR.

### References

The following links were useful in understanding how to make these changes:

* https://engineering.bitnami.com/articles/running-non-root-containers-on-openshift.html
* https://cloud.redhat.com/blog/a-guide-to-openshift-and-uids

### Does this pull request potentially affect one of the following parts:

This PR updates our Docker images in a breaking way. It could result in bookkeepers, zookeepers, or functions with insufficient permissions. We will mitigate these permissions by updating the helm chart. These changes are easily overridden by extending the docker image. In k8s, you can use the pod's `securityContext` to override the user or group.

(cherry picked from commit f7f8619)


michaeljmarshall · 2022-02-20T04:49:52Z

@freeznet - thanks for testing this feature out and helping get this merged in time for the 2.10 release!


michaeljmarshall requested review from lhotari and merlimat December 17, 2021 06:13

michaeljmarshall added this to the 2.10.0 milestone Dec 17, 2021

github-actions bot assigned michaeljmarshall Dec 17, 2021

github-actions bot added the doc-required label Dec 17, 2021

michaeljmarshall added the area/security and type/feature labels and removed the doc-required label Dec 17, 2021

github-actions bot added the doc-label-missing label Dec 17, 2021

lhotari reviewed Dec 17, 2021

nicoloboschi reviewed Dec 17, 2021

freeznet requested changes Dec 20, 2021

timmyyuan reviewed Dec 20, 2021

michaeljmarshall added the release/note-required label Dec 21, 2021

michaeljmarshall requested a review from freeznet December 21, 2021 21:14

freeznet reviewed Dec 22, 2021

freeznet previously approved these changes Jan 6, 2022

eolivelli previously approved these changes Jan 7, 2022

congbobo184 dismissed stale reviews from lhotari, eolivelli, and freeznet via fda6ae7 February 16, 2022 02:58

codelipenghui approved these changes Feb 17, 2022

lhotari approved these changes Feb 17, 2022

codelipenghui merged commit f7f8619 into apache:master Feb 17, 2022

nodece mentioned this pull request Feb 19, 2022

ci: use root streamnative/pulsarctl#617

Merged

michaeljmarshall deleted the non-root-docker-for-pulsar branch February 20, 2022 04:49

codelipenghui added a commit to codelipenghui/incubator-pulsar that referenced this pull request Feb 21, 2022

Add apache#13376 to update notice section

c4e1520

michaeljmarshall mentioned this pull request Apr 5, 2022

[Docker Images] Default Pulsar docker images to run as a non-root user #11269

Closed

lhotari mentioned this pull request Dec 7, 2023

[improve][sec] Add group pulsar and add user pulsar to it instead of root #21084

Merged

5 tasks

guimou mentioned this pull request Feb 23, 2024

Cannot run Milvus in our Openshift cluster with (runsAsUser, runAsGroup, fsGroup) - How to remove them?? zilliztech/milvus-helm#43

Open

haorenfsa mentioned this pull request Feb 26, 2024

[Enhancement]: Upgrade Pulsar to 3.0.1 milvus-io/milvus#30812

Open

1 task


		# The UID must be non-zero. Otherwise, it is arbitrary. No logic should rely on its specific value.
		USER 10000
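Because the container runs as an arbitrary non-zero UID in group 0, the writable directories need their group-write bit set. The following shell sketch shows that pattern on a temporary directory standing in for `/pulsar`. It is an assumption about the general approach, not a copy of the PR's `Dockerfile`. The matching `chgrp` to group 0 is omitted because it needs root.

```shell
#!/bin/sh
# Sketch only: a temporary directory stands in for /pulsar in the image.
pulsar_home=$(mktemp -d)
mkdir -p "$pulsar_home/conf" "$pulsar_home/data" "$pulsar_home/logs"

# Create a sample file without group write, as a default umask of 022 would.
( umask 022; : > "$pulsar_home/conf/example.conf" )

# Grant group write recursively. In an image build this would be paired with
# a chgrp to group 0, which requires root and is therefore left out here.
chmod -R g+w "$pulsar_home/conf" "$pulsar_home/data" "$pulsar_home/logs"

# Count entries that still lack the group-write bit; 0 means every entry is writable.
not_writable=$(find "$pulsar_home/conf" "$pulsar_home/data" "$pulsar_home/logs" ! -perm -020 | wc -l | tr -d ' ')
echo "entries missing group write: $not_writable"
```

Any UID that belongs to the owning group can then write to these directories, which is why no logic needs to depend on the specific value `10000`.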

michaeljmarshall commented Dec 17, 2021

github-actions bot commented Dec 17, 2021

michaeljmarshall commented Dec 17, 2021

michaeljmarshall commented Dec 17, 2021

michaeljmarshall commented Dec 17, 2021

michaeljmarshall commented Dec 18, 2021

freeznet left a comment

freeznet commented Dec 20, 2021

michaeljmarshall commented Dec 20, 2021

michaeljmarshall commented Dec 20, 2021

michaeljmarshall commented Dec 20, 2021

michaeljmarshall commented Dec 20, 2021

michaeljmarshall commented Dec 21, 2021

freeznet left a comment

eolivelli commented Dec 22, 2021

eolivelli left a comment

michaeljmarshall commented Feb 10, 2022

michaeljmarshall commented Feb 10, 2022

codelipenghui commented Feb 10, 2022

freeznet commented Feb 11, 2022

michaeljmarshall commented Feb 12, 2022

freeznet commented Feb 13, 2022

michaeljmarshall commented Feb 14, 2022

freeznet commented Feb 15, 2022 (edited)

michaeljmarshall commented Feb 15, 2022

lhotari commented Feb 15, 2022

lhotari commented Feb 15, 2022

freeznet commented Feb 16, 2022

lhotari commented Feb 17, 2022

michaeljmarshall commented Feb 20, 2022
