Integrating Dataset functionality with job scheduling and Docker image for ngen worker container #148

robertbartel · 2022-03-04T17:36:25Z

Work related to preparing the Docker image itself to handle data via Datasets, in particular object store datasets that have the backing data bucket mounted directly into the container file system. Also updating scheduler code to integrate this functionality.

Note that this should remain a draft PR until #147 is complete.

Creating new package for (pending) relocation of core types that will need to be migrated here to avoid circular dependencies.

Updating the Dockerfile for ngen-deps to also install the expected dependencies for the Python test BMI package via pip, to help optimize the main ngen build.

Updating the Dockerfile for ngen-deps to install s3fs-fuse as a system dependency, as it will be used to mount object store buckets in the local file system.

Updating Dockerfile instruction building the (now) noah-owp-modular submodule.

Small optimization to combine two RUN statements into one and reduce layers.

Updating to have image create the parent directories that will contain DMOD dataset directories during execution.

Updating entrypoint script to account for Dataset functionality as the means to move data around within the system, like config, forcing, and hydrofabric data; also updating script to be able to mount object store datasets into the container's file system via s3fs.

Adding support for holding SecretReferences and environment variable values within helper DockerServiceParameters type.
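As a rough illustration of the idea (not the actual DockerServiceParameters code), a parameters helper that carries secret references and environment variables alongside the other service settings could look something like the sketch below. The `SecretRef` and `ServiceParams` names, their fields, and the example values are all hypothetical stand-ins; the real type holds Docker SDK SecretReferences and has other properties not shown in this PR text.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class SecretRef:
    """Hypothetical stand-in for a Docker secret reference (not the Docker SDK type)."""
    secret_id: str
    secret_name: str


@dataclass
class ServiceParams:
    """Hypothetical, simplified analogue of a service-parameters helper."""
    image_tag: str
    # Both collections default to empty so callers that need neither are unaffected.
    secrets: List[SecretRef] = field(default_factory=list)
    env_vars: Dict[str, str] = field(default_factory=dict)

    def env_list(self) -> List[str]:
        """Render the environment variables as sorted 'KEY=value' strings."""
        return [f"{key}={value}" for key, value in sorted(self.env_vars.items())]


# Example values are made up for illustration only.
params = ServiceParams(
    image_tag="ngen:latest",
    secrets=[SecretRef("abc123", "object_store_access_key")],
    env_vars={"EXAMPLE_PROXY_HOST": "minio_proxy"},
)
print(params.env_list())
```

Defaulting both collections to empty keeps the helper backward compatible with services that use neither secrets nor extra environment variables.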

Updating Launcher.create_service() to have it utilize the newly added properties to DockerServiceParameters for secrets and environment variables.

Adding new _generate_docker_cmd_args function and several other helper functions for its use, in order to support new needs for generating Docker entrypoint CMD args appropriately after recent changes for dataset utilization.

Adjusting Launcher.start_service() to use the new function for generating Docker CMD arg values, and ensuring a new set of CMD args is generated for each individual allocation/worker, since these args in part reflect dataset needs and thus could differ between workers.
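To make the per-worker point concrete, here is a minimal sketch of generating a separate CMD arg list for each worker rather than sharing one list across all of them. The function names, the flag names, and the `datasets_by_worker` mapping are assumptions made for illustration; the real `_generate_docker_cmd_args` signature and argument layout are not shown in this PR text.

```python
from typing import Dict, List


def generate_cmd_args(job_id: str, worker_index: int,
                      datasets_by_worker: Dict[int, List[str]]) -> List[str]:
    """Build a fresh argument list for one worker (hypothetical flag layout)."""
    args = ["--job-id", job_id, "--worker-index", str(worker_index)]
    # Only the datasets this particular worker needs are appended.
    for name in datasets_by_worker.get(worker_index, []):
        args.extend(["--dataset", name])
    return args


def cmd_args_for_all_workers(job_id: str, num_workers: int,
                             datasets_by_worker: Dict[int, List[str]]) -> List[List[str]]:
    """Return one independent argument list per worker."""
    return [generate_cmd_args(job_id, index, datasets_by_worker)
            for index in range(num_workers)]


# Made-up example: worker 0 needs two datasets, worker 1 needs one.
needs = {0: ["forcing-a", "hydrofabric-1"], 1: ["forcing-b"]}
all_args = cmd_args_for_all_workers("job-42", 2, needs)
for worker_args in all_args:
    print(worker_args)
```

Building the list inside the loop, instead of once up front, is what allows the args to differ when workers have different dataset needs.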

Updating Launcher.create_service() to have the DockerServiceParameters used be created (when appropriate) with Docker secrets for object store user access and MinIO-deployment-related environment variables.

docker/main/ngen/entrypoint.sh

hellkite500 · 2022-03-16T20:22:37Z


+    _MOUNT_DIR="${ALL_DATASET_DIR}/${2}/${1}"
+    # TODO (later): this is a non-S3 implementation URL; add support for S3 directly also
+    # This is based on the nginx proxy config (hopefully)
+    _URL="http://minio_proxy:9000/"


I think I see what is going on here??? But not 💯 sure...

Again, not sure why exactly I was doing things the way I was planning, and then seemingly temporarily not doing them that way ...

I've cleaned this up a bit, but the _URL is based on the proxy hostname. I also fixed a problem (i.e., just now) where the proxy hostname and service name hadn't been consistent with this in the HA config.
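For anyone following along, the mount this thread is discussing boils down to an s3fs invocation pointed at the proxy URL. The sketch below assembles such a command in Python purely for illustration; the entrypoint itself is a shell script. The mapping of `$1` to the dataset (bucket) name and `$2` to a category subdirectory, the base directory, and the credentials file path are assumptions, not taken from the script. The `passwd_file`, `url`, and `use_path_request_style` options are standard s3fs-fuse mount options, the last being commonly needed for non-AWS stores such as MinIO.

```python
from pathlib import PurePosixPath
from typing import List

# Proxy URL as it appears in the diff under review.
MINIO_PROXY_URL = "http://minio_proxy:9000/"


def s3fs_mount_command(all_dataset_dir: str, dataset_name: str,
                       category_dir: str, passwd_file: str,
                       url: str = MINIO_PROXY_URL) -> List[str]:
    """Assemble an s3fs command mirroring ${ALL_DATASET_DIR}/${2}/${1}.

    Assumes (hypothetically) that $1 is the dataset/bucket name and $2 is
    a per-category subdirectory.
    """
    mount_dir = PurePosixPath(all_dataset_dir) / category_dir / dataset_name
    return [
        "s3fs", dataset_name, str(mount_dir),
        "-o", f"passwd_file={passwd_file}",
        "-o", f"url={url}",
        # Path-style requests, since the bucket is not addressed as a subdomain.
        "-o", "use_path_request_style",
    ]


# Made-up paths and names for illustration only.
cmd = s3fs_mount_command("/dmod/datasets", "my-forcing-data", "forcing",
                         "/run/secrets/example_passwd")
print(" ".join(cmd))
```

Keeping the URL as a single default tied to the proxy hostname matches the intent here: every mount goes through the proxy regardless of data category.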

python/lib/scheduler/dmod/scheduler/scheduler.py

Updating non-desktop configuration of object_store stack to have proxy service name and its hostname be 'minio_proxy' to be consistent with the desktop config and avoid any unexpected collision problems with any other generic 'nginx' service.

Removing function (and usage) that would determine what minio URL to use based on the data category, which was not always using the proxy.

robertbartel added enhancement (New feature or request) and maas (MaaS Workstream) labels Mar 4, 2022

robertbartel added this to the 1.0.0 (AGU FIH) milestone Mar 4, 2022

robertbartel self-assigned this Mar 4, 2022

robertbartel mentioned this pull request Mar 8, 2022

Complete initial internal storage infrastructure and workflows #128

Closed

6 tasks

robertbartel force-pushed the launch_jobs/forcings_5 branch 2 times, most recently from 7a5f7c7 to 574eef6 Compare March 16, 2022 18:45

robertbartel added 19 commits March 16, 2022 14:42

Add minio package as a project requirement.

facd6e8

Remove TODO comment for completed task.

bdce172

Creating dmod.core package for core types.

aedc112

Creating new package for (pending) relocation of core types that will need to be migrated here to avoid circular dependencies.

Update ngen-deps Dockerfile with testing deps.

f5e34ea

Updating the Dockerfile for ngen-deps to also install the expected dependencies for the Python test BMI package via pip, to help optimize the main ngen build.

Update ngen-deps Dockerfile with s3fs-fuse deps.

68c5ef2

Updating the Dockerfile for ngen-deps to install s3fs-fuse as a system dependency, as it will be used to mount object store buckets in the local file system.

Update ngen Dockerfile from noah-mp to noah-owp.

e63285a

Updating Dockerfile instruction building the (now) noah-owp-modular submodule.

Combine layers in ngen Dockerfile.

8c1806e

Small optimization to combine two RUN statements into one and reduce layers.

Update ngen Dockerfile to create datasource dirs.

a8d1f4b

Updating to have image create the parent directories that will contain DMOD dataset directories during execution.

Add secrets, env vars to DockerServiceParameters.

6eff182

Adding support for holding SecretReferences and environment variable values within helper DockerServiceParameters type.

Update create_service() for service param updates.

aca457d

Updating Launcher.create_service() to have it utilize the newly added properties to DockerServiceParameters for secrets and environment variables.

Add Launcher helper functions for Docker CMD args.

1de2f4d

Adding new _generate_docker_cmd_args function and several other helper functions for its use, in order to support new needs for generating Docker entrypoint CMD args appropriately after recent changes for dataset utilization.

Update create_service params for object store use.

97166c7

Updating Launcher.create_service() to have the DockerServiceParameters used be created (when appropriate) with Docker secrets for object store user access and MinIO-deployment-related environment variables.

Bump dmod-scheduler version, reflecting elsewhere.

e91587c

Add CONFIG DataCategory value.

c0a7bc7

Add simple NGEN_OUTPUT DataFormat value.

ea25296

Turn off secure in object store client.

134cb3e

Fix type hint problem with Job in scheduler.py.

71390bd

robertbartel force-pushed the launch_jobs/forcings_5 branch from 574eef6 to 71390bd Compare March 16, 2022 19:42

robertbartel marked this pull request as ready for review March 16, 2022 19:44

robertbartel requested a review from hellkite500 March 16, 2022 19:44

hellkite500 reviewed Mar 16, 2022


robertbartel mentioned this pull request Mar 16, 2022

Reworking message structure as needed for conveying data requirements #152

Merged

robertbartel added 5 commits March 17, 2022 08:27

Consistent name of minio proxy service.

2980850

Updating non-desktop configuration of object_store stack to have proxy service name and its hostname be 'minio_proxy' to be consistent with the desktop config and avoid any unexpected collision problems with any other generic 'nginx' service.

Ensure worker entrypoint uses minio proxy.

5e54315

Removing function (and usage) that would determine what minio URL to use based on the data category, which was not always using the proxy.

Fix inverted boolean test condition.

1768905

Remove unnecessary env var add to workers.

6ce35f2

Fix syntax problem after env_var removal.

91600f6

hellkite500 approved these changes Mar 17, 2022


hellkite500 merged commit 7889731 into NOAA-OWP:master Mar 17, 2022

robertbartel deleted the launch_jobs/forcings_5 branch March 17, 2022 17:16
