[WIP]: Remote Cluster #1

Shrinjay · 2023-01-24T23:14:49Z

No description provided.

DerekTBrown · 2023-01-25T22:31:12Z

enterprise_gateway/services/external/k8s_client_factory.py

+                kubeconfig_path = os.environ.get('EG_REMOTE_CLUSTER_KUBECONFIG_PATH', '/etc/kube/config')
+                context = os.environ.get('EG_REMOTE_CLUSTER_CONTEXT', None)


Nit: I would be consistent with the method used for env fetching. I don't think there is a big difference, but I think it is wise to pick one or another.

Suggested change

kubeconfig_path = os.environ.get('EG_REMOTE_CLUSTER_KUBECONFIG_PATH', '/etc/kube/config')

context = os.environ.get('EG_REMOTE_CLUSTER_CONTEXT', None)

kubeconfig_path = os.getenv('EG_REMOTE_CLUSTER_KUBECONFIG_PATH', '/etc/kube/config')

context = os.getenv('EG_REMOTE_CLUSTER_CONTEXT', None)

DerekTBrown · 2023-01-25T22:31:45Z

enterprise_gateway/services/external/k8s_client_factory.py

+        if os.getenv("KUBERNETES_SERVICE_HOST"):
+            # Running inside cluster
+            if os.getenv('EG_USE_REMOTE_CLUSTER') and get_remote_if_available:
+                kubeconfig_path = os.environ.get('EG_REMOTE_CLUSTER_KUBECONFIG_PATH', '/etc/kube/config')


Did you pick the /etc/kube/config default, or did this come from something existing?

I picked that, honestly by random, I realize I forgot to set this as the default in the helm chart as well, but the idea was for the path to be the same in the helm chart and in code

DerekTBrown · 2023-01-25T22:32:08Z

enterprise_gateway/services/external/k8s_client_factory.py

+        super().__init__()
+
+    def get_kubernetes_client(self, get_remote_if_available=True) -> client.ApiClient:
+        """Get kubernetes api client with appropriate configuration"""


Can you document what this does and also what the effect of get_remote_if_available is?

DerekTBrown · 2023-01-25T22:32:44Z

enterprise_gateway/services/external/k8s_client_factory.py

+            config.load_kube_config(client_configuration=kubernetes_config)
+
+        self.log.debug(
+            "Created kubernetes client for host {host}".format(host=kubernetes_config.host)


Nit: f-strings for the win

Suggested change

"Created kubernetes client for host {host}".format(host=kubernetes_config.host)

f"Created kubernetes client for host {kubernetes_config.host}"

DerekTBrown · 2023-01-25T22:35:17Z

enterprise_gateway/services/processproxies/k8s.py

+        service_account_list_in_namespace: client.V1ServiceAccountList = client\
+            .CoreV1Api(api_client=kubernetes_client)\
+            .list_namespaced_service_account(namespace=namespace)


(More of a meta point than a specific set of lines)

Slash continuations in Python are pretty evil. Was this something you put in manually, or something the formatter did for you?

You might want to put these on one (really long) line, and then use black to find a solution that doesn't involve slashes. I believe the project uses black anyway, so it is a good idea to run that over your code from the getgo.

PyCharm did it when I tried to chain the invocations over multiple lines like I usually do in js/ts, honestly wasn't aware slash continuations were a problem until I just googled it, so thanks for pointing this one out

DerekTBrown · 2023-01-25T22:50:35Z

enterprise_gateway/services/processproxies/k8s.py

+        service_account_names_in_namespace: List[str] = list(map(lambda svcaccount: svcaccount.metadata.name, service_accounts_in_namespace))
+
+        if service_account_name not in service_account_names_in_namespace:


Super super good on you for using functional programming here. This is a senior dev move.

We can make this an even more big-headed by using a generator comprehension. The list/map method is on the order of n in terms of memory complexity and roughly 2n in terms of time complexity (we build a list of length n which takes n time, then we scan the list which takes n time). The lambda also requires constructing a function context, which can be expensive. The generator method is on the order of o(1) in terms of memory complexity and takes only n in terms of time complexity (we just perform a single scan of the other list without actually constructing anything).

Suggested change

service_account_names_in_namespace: List[str] = list(map(lambda svcaccount: svcaccount.metadata.name, service_accounts_in_namespace))

if service_account_name not in service_account_names_in_namespace:

service_account_names_in_namespace = (svcaccount.metadata.name for svcaccount in service_accounts_in_namespace)

if service_account_name not in service_account_names_in_namespace:

At this point, you are probably thinking "why do we care about performance, this is not that performance intensive?"

This has less to do with performance and more to do with code quality; we want the Juypter reviewers to think "damn, thats some slick code, they must know what they are doing."

Oh I love finding out about optimizations like this, I honestly don't know enough about Python for how much I use it, so thanks for teaching me about this 😄

Oh it's list comprehension... darn I should've remembered about that, I keep defaulting to the list-map combo, probably because I use .map a lot in TS

DerekTBrown · 2023-01-25T22:53:49Z

enterprise_gateway/services/processproxies/k8s.py

+        remote_cluster_role_names = list(map(lambda role: role.metadata.name, remote_cluster_roles.items))
+
+        if kernel_cluster_role not in remote_cluster_role_names:


Same as above RE: generator comprehension

DerekTBrown · 2023-01-25T23:23:26Z

enterprise_gateway/services/processproxies/k8s.py

+
+            self.log.info(f"Created service account {service_account_name} in namespace {namespace}")
+
+    def _forward_role_to_remote(self) -> None:


A few thoughts:

I don't think we (Lacework) want to grant a ClusterRole to JEG. At most, I think we want to grant roles to a handful of namespaces.

I don't think we should be copying a role from one place to another, this seems non-transparent.

I would recommend adding a static rolebinding to a specific namespace.

I would recommend adding an option to not create the service account, allowing the user to supply their own service account. This lets people customize as they choose.

DerekTBrown · 2023-01-25T23:26:15Z

enterprise_gateway/services/external/k8s_client_factory.py

+        kubernetes_config: client.Configuration = client.Configuration()
+        if os.getenv("KUBERNETES_SERVICE_HOST"):
+            # Running inside cluster
+            if os.getenv('EG_USE_REMOTE_CLUSTER') and get_remote_if_available:


I am not sure this works as expected. This will evaluate to true if EG_USE_REMOTE_CLUSTER is set to anything (including EG_USE_REMOTE_CLUSTER=False).

We should confirm this has the same semantics as other env options defined in JEG. Do they look for True/False or just Set/Unset. If True/False, we should add a more rigorous check. If set/unset, we should use EG_USE_REMOTE_CLUSTER=1 in the helm chart to denote this.

Shrinjay · 2023-01-25T23:41:46Z

The reason I added the copying was more as a first pass than anything. The issue is we want the role created in the remote cluster to be configurable via Helm. The only way to inject configurations from helm into runtime is environment variables. I’m not quite sure if we want to configure the role for the remote using environment variables, hence the copying. The role in the local space is defined by the helm chart so we just copy that over.

…

On Wed, Jan 25, 2023 at 4:23 PM Derek Brown ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In enterprise_gateway/services/processproxies/k8s.py <#1 (comment)> : > + if service_account_name not in service_account_names_in_namespace: + service_account_metadata = { + "name": service_account_name + } + service_account_to_create: client.V1ServiceAccount = client.V1ServiceAccount( + kind="ServiceAccount", + metadata=service_account_metadata + ) + + client\ + .CoreV1Api(api_client=kubernetes_client)\ + .create_namespaced_service_account(namespace=namespace, body=service_account_to_create) + + self.log.info(f"Created service account {service_account_name} in namespace {namespace}") + + def _forward_role_to_remote(self) -> None: A few thoughts: 1. I don't think we (Lacework) want to grant a ClusterRole to JEG. At most, I think we want to grant roles to a handful of namespaces. 2. I don't think we should be copying a role from one place to another, this seems non-transparent. 3. I would recommend adding a static rolebinding to a specific namespace. 4. I would recommend adding an option to *not* create the service account, allowing the user to supply their own service account. This lets people customize as they choose. — Reply to this email directly, view it on GitHub <#1 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AO5NQAWJ6YWC2OZ6RNLSOOTWUGYXTANCNFSM6AAAAAAUFVOHUI> . You are receiving this because you authored the thread.Message ID: ***@***.***>

Shrinjay · 2023-01-26T15:38:31Z

I realized I'm an idiot and configmaps exist... we can mount the local role resource definitions as a configmap and then create it in the remote rather than copying the role.

for more information, see https://pre-commit.ci

* add more linting * Address codeQL alert --------- Co-authored-by: Kevin Bates <kbates4@gmail.com>

…enterprise_gateway into feature/remote-cluster

for more information, see https://pre-commit.ci

updates: - [github.com/charliermarsh/ruff-pre-commit: v0.0.253 → v0.0.254](astral-sh/ruff-pre-commit@v0.0.253...v0.0.254) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Set restart strategy to never for spark operator * Disable eviction for driver pod * Disable eviction on Kubernetes python kernels as well

* Bump ruff from 0.0.253 to 0.0.254 Bumps [ruff](https://github.com/charliermarsh/ruff) from 0.0.253 to 0.0.254. - [Release notes](https://github.com/charliermarsh/ruff/releases) - [Changelog](https://github.com/charliermarsh/ruff/blob/main/BREAKING_CHANGES.md) - [Commits](astral-sh/ruff@v0.0.253...v0.0.254) --- updated-dependencies: - dependency-name: ruff dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * Update ruff version in pre-commit config --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Kevin Bates <kbates4@gmail.com>

…ver#1271) * Properly report failures on Spark Operator kernels * Update helper methods to get initial and error states * Try to use detect_launch_failure as an option to identify failed CRD submission * Fix small typos on lower conversion * Update logic to better handle kernel connection parameters * Fix restart issues * Remove detect_launch_failure unnecessary code * Remove debug code * Simplify get_container_status signature * Remove lower after the signature change * Move get_initial_states to CustomResourceProcessProxy * Conditionally display get_container_status debug message * Move get_container_status to CRD process proxy * Update get_initial_states function doc to emphasize lowercase contract * Reword to remove spark operator references * Add log updates * Remove obsolete code * Add more fields on the log statement * Update log logic * More logging fix-ups --------- Co-authored-by: Kevin Bates <kbates4@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

…jupyter-server#1277)

…rver#1279)

* Bump ruff from 0.0.254 to 0.0.255 Bumps [ruff](https://github.com/charliermarsh/ruff) from 0.0.254 to 0.0.255. - [Release notes](https://github.com/charliermarsh/ruff/releases) - [Changelog](https://github.com/charliermarsh/ruff/blob/main/BREAKING_CHANGES.md) - [Commits](astral-sh/ruff@v0.0.254...v0.0.255) --- updated-dependencies: - dependency-name: ruff dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * Additional changes necessary for ruff-0.0.255 --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Kevin Bates <kbates4@gmail.com>

) * GatewayClient changes * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Initialize variables before websocket.create_connection in KernelClient.__init__() * Refactor fail on WS connection * Avoid need for attribute checks --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Kevin Bates <kbates4@gmail.com>

* Bump ruff from 0.0.255 to 0.0.257 Bumps [ruff](https://github.com/charliermarsh/ruff) from 0.0.255 to 0.0.257. - [Release notes](https://github.com/charliermarsh/ruff/releases) - [Changelog](https://github.com/charliermarsh/ruff/blob/main/BREAKING_CHANGES.md) - [Commits](astral-sh/ruff@v0.0.255...v0.0.257) --- updated-dependencies: - dependency-name: ruff dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * Address changes for ruff upgrade --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Kevin Bates <kbates4@gmail.com>

* Bump ruff from 0.0.257 to 0.0.259 Bumps [ruff](https://github.com/charliermarsh/ruff) from 0.0.257 to 0.0.259. - [Release notes](https://github.com/charliermarsh/ruff/releases) - [Changelog](https://github.com/charliermarsh/ruff/blob/main/BREAKING_CHANGES.md) - [Commits](astral-sh/ruff@v0.0.257...v0.0.259) --- updated-dependencies: - dependency-name: ruff dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * Ignore S311 - only used for port selection within range and suffix --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Kevin Bates <kbates4@gmail.com>

* Bump ruff from 0.0.259 to 0.0.260 Bumps [ruff](https://github.com/charliermarsh/ruff) from 0.0.259 to 0.0.260. - [Release notes](https://github.com/charliermarsh/ruff/releases) - [Changelog](https://github.com/charliermarsh/ruff/blob/main/BREAKING_CHANGES.md) - [Commits](astral-sh/ruff@v0.0.259...v0.0.260) --- updated-dependencies: - dependency-name: ruff dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * Update ruff version in pre-commit config --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Kevin Bates <kbates4@gmail.com>

* Bump black[jupyter] from 23.1.0 to 23.3.0 Bumps [black[jupyter]](https://github.com/psf/black) from 23.1.0 to 23.3.0. - [Release notes](https://github.com/psf/black/releases) - [Changelog](https://github.com/psf/black/blob/main/CHANGES.md) - [Commits](psf/black@23.1.0...23.3.0) --- updated-dependencies: - dependency-name: black[jupyter] dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * Update black version in pre-commit config --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Kevin Bates <kbates4@gmail.com>

updates: - [github.com/python-jsonschema/check-jsonschema: 0.21.0 → 0.22.0](python-jsonschema/check-jsonschema@0.21.0...0.22.0) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Bump ruff from 0.0.260 to 0.0.261 Bumps [ruff](https://github.com/charliermarsh/ruff) from 0.0.260 to 0.0.261. - [Release notes](https://github.com/charliermarsh/ruff/releases) - [Changelog](https://github.com/charliermarsh/ruff/blob/main/BREAKING_CHANGES.md) - [Commits](astral-sh/ruff@v0.0.260...v0.0.261) --- updated-dependencies: - dependency-name: ruff dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> * Update ruff version in pre-commit config --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Kevin Bates <kbates4@gmail.com>

Co-authored-by: Kevin Bates <kbates4@gmail.com>

for more information, see https://pre-commit.ci

Shrinjay and others added 5 commits January 24, 2023 16:14

add remote cluster functionality

49985f5

Add context selection to helm and passthrough

d491508

Remove label

686ed39

Put back comments

7e90611

Fix names

2daf2cc

DerekTBrown reviewed Jan 25, 2023

View reviewed changes

Shrinjay added 2 commits January 25, 2023 16:11

Fix docstrings, generator, and various other

ff5aab5

black fixes

20a5c56

DerekTBrown reviewed Jan 25, 2023

View reviewed changes

Shrinjay and others added 12 commits January 26, 2023 16:04

Add remote autoconfiguration and role

0d43bec

Fix env var checking

875fafa

move to namespaced roles for remote clusters

5d1d732

Formatting and comments

19f92c7

Add documentation and cleanup

e0df4e2

[pre-commit.ci] auto fixes from pre-commit.com hooks

5e23382

for more information, see https://pre-commit.ci

Add more linting (jupyter-server#1245)

08d6060

* add more linting * Address codeQL alert --------- Co-authored-by: Kevin Bates <kbates4@gmail.com>

Fix doc requirements

c9b90ce

Merge branch 'feature/remote-cluster' of https://github.com/Shrinjay/…

0e0ae6f

…enterprise_gateway into feature/remote-cluster

Merge branch 'main' into feature/remote-cluster

9a9416e

[pre-commit.ci] auto fixes from pre-commit.com hooks

9ba9ab7

for more information, see https://pre-commit.ci

Change to external cluster naming

0a4c6fb

pre-commit-ci bot and others added 30 commits March 6, 2023 16:20

[pre-commit.ci] pre-commit autoupdate (jupyter-server#1273)

ec10b51

updates: - [github.com/charliermarsh/ruff-pre-commit: v0.0.253 → v0.0.254](astral-sh/ruff-pre-commit@v0.0.253...v0.0.254) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Set restart strategy to never for spark operator (jupyter-server#1269)

7effcb5

* Set restart strategy to never for spark operator * Disable eviction for driver pod * Disable eviction on Kubernetes python kernels as well

Fix docker swarm not being to start due to missing inherited function (…

de6e3f6

…jupyter-server#1277)

Updating base image tag to latest available tag (jupyter-server#1280)

584882b

Filter sensitive values from being logged in ProcessProxy (jupyter-se…

a09cfb7

…rver#1279)

Bump to 3.2.2

fabfe93

Prepare release 3.2.2

12fae0a

Bump to 3.3.0.dev0

d2f6c9c

Prepare for next development interaction 3.3.0.dev0

5f986b9

[pre-commit.ci] pre-commit autoupdate (jupyter-server#1286)

03369b3

updates: - [github.com/python-jsonschema/check-jsonschema: 0.21.0 → 0.22.0](python-jsonschema/check-jsonschema@0.21.0...0.22.0) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Update docs/source/operators/deploy-kubernetes.md

4fbe8b7

Co-authored-by: Kevin Bates <kbates4@gmail.com>

Update docs/source/operators/deploy-kubernetes.md

51256b9

Co-authored-by: Kevin Bates <kbates4@gmail.com>

Update enterprise_gateway/services/processproxies/crd.py

8107dac

Co-authored-by: Kevin Bates <kbates4@gmail.com>

Apply suggestions from code review

26cba1a

Co-authored-by: Kevin Bates <kbates4@gmail.com>

Update docs/source/operators/deploy-kubernetes.md

55065ca

Co-authored-by: Kevin Bates <kbates4@gmail.com>

Update etc/kubernetes/helm/enterprise-gateway/templates/deployment.yaml

3282e14

Co-authored-by: Kevin Bates <kbates4@gmail.com>

progress

047538e

throw warning

7c06f2b

merge latest

c909106

Merge branch 'main' into feature/remote-cluster

a1a9e46

[pre-commit.ci] auto fixes from pre-commit.com hooks

8554343

for more information, see https://pre-commit.ci

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP]: Remote Cluster #1

[WIP]: Remote Cluster #1

Shrinjay commented Jan 24, 2023

DerekTBrown Jan 25, 2023

DerekTBrown Jan 25, 2023

Shrinjay Jan 25, 2023

DerekTBrown Jan 25, 2023

DerekTBrown Jan 25, 2023

DerekTBrown Jan 25, 2023

Shrinjay Jan 25, 2023 •

edited

Loading

DerekTBrown Jan 25, 2023

DerekTBrown Jan 25, 2023

Shrinjay Jan 25, 2023

Shrinjay Jan 25, 2023

DerekTBrown Jan 25, 2023

DerekTBrown Jan 25, 2023

DerekTBrown Jan 25, 2023

Shrinjay commented Jan 25, 2023 via email

Shrinjay commented Jan 26, 2023

		kubeconfig_path = os.environ.get('EG_REMOTE_CLUSTER_KUBECONFIG_PATH', '/etc/kube/config')
		context = os.environ.get('EG_REMOTE_CLUSTER_CONTEXT', None)

	"Created kubernetes client for host {host}".format(host=kubernetes_config.host)
	f"Created kubernetes client for host {kubernetes_config.host}"

		service_account_names_in_namespace: List[str] = list(map(lambda svcaccount: svcaccount.metadata.name, service_accounts_in_namespace))

		if service_account_name not in service_account_names_in_namespace:

		remote_cluster_role_names = list(map(lambda role: role.metadata.name, remote_cluster_roles.items))

		if kernel_cluster_role not in remote_cluster_role_names:


		self.log.info(f"Created service account {service_account_name} in namespace {namespace}")

		def _forward_role_to_remote(self) -> None:

[WIP]: Remote Cluster #1

Are you sure you want to change the base?

[WIP]: Remote Cluster #1

Conversation

Shrinjay commented Jan 24, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Shrinjay Jan 25, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Shrinjay commented Jan 25, 2023 via email

Shrinjay commented Jan 26, 2023

Shrinjay Jan 25, 2023 •

edited

Loading