Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix: Onboard HRRR processes in NOAA ETL #549

Open
wants to merge 24 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
24 commits
Select commit Hold shift + click to select a range
ec09554
Feat: Added GHCN_M.
nlarge-google Nov 29, 2022
ad1d0fe
fix: Flake.
nlarge-google Nov 29, 2022
d097786
feat: Adding HRRR Failover pipeline.
nlarge-google Dec 5, 2022
38f35ac
fix: Resolution to graph path.
nlarge-google Dec 5, 2022
0a0e860
fix: Attempting to make url_directory_list recurse through directorie…
nlarge-google Dec 5, 2022
4c6c275
fix: Still Testing HRRR Failover. Not Production Ready.
nlarge-google Dec 6, 2022
f0d92ef
fix: Still Testing HRRR Failover. Not Production Ready.
nlarge-google Dec 6, 2022
c6f3235
fix: Still Testing HRRR Failover.
nlarge-google Dec 6, 2022
685e7a6
fix: Pre-testing.
nlarge-google Dec 7, 2022
c0a287b
fix: Configure quiet mode when processing HRRR Failover files.
nlarge-google Dec 7, 2022
9d20b3d
Merge branch 'main' into noaa
nlarge-google Dec 7, 2022
5847669
fix: Configure quiet mode in upload to GCS function when processing H…
nlarge-google Dec 7, 2022
5aa24d2
fix: Flake issues.
nlarge-google Dec 7, 2022
b6a5861
fix: Added count of files transferred.
nlarge-google Dec 7, 2022
3c200d8
fix: Modified HRRR Failover transfer process to NOT recurse into http…
nlarge-google Dec 7, 2022
4fb550b
fix: Implement code-regions.
nlarge-google Dec 7, 2022
5a6d644
fix: Added step to remove staged source file from pod after transferr…
nlarge-google Dec 8, 2022
b31e510
fix: Clean up code.
nlarge-google Dec 8, 2022
9a029e5
fix: Testing NOAA HRRR ARL Formatting in AF.
nlarge-google Dec 9, 2022
5f03f4b
fix: Cleanup
nlarge-google Dec 9, 2022
5f2cf16
fix: Extended ephemeral storage.
nlarge-google Dec 9, 2022
8e89384
fix: ARL Formattiong - attempting to incorporate gsutil.
nlarge-google Dec 13, 2022
1651fef
Merge remote-tracking branch 'origin' into noaa
nlarge-google Dec 13, 2022
362e69a
fix: Modified Dockerfile to install gsutil.
nlarge-google Dec 14, 2022
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
32 changes: 32 additions & 0 deletions datasets/noaa/pipelines/_images/run_csv_transform_kub/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -12,10 +12,42 @@
# See the License for the specific language governing permissions and
# limitations under the License.


# The base image for this build
# FROM gcr.io/google.com/cloudsdktool/cloud-sdk:slim
FROM python:3.8

# Allow statements and log messages to appear in Cloud logs
ENV PYTHONUNBUFFERED True

# Copy the requirements file into the image
COPY requirements.txt ./

# Install the packages specified in the requirements file
RUN python3 -m pip install --no-cache-dir -r requirements.txt

# Upgrade pip
RUN python3 -m pip install --upgrade pip

# Install gcloud
RUN apt-get -y update && apt-get install -y apt-transport-https ca-certificates gnupg && \
echo "deb https://packages.cloud.google.com/apt cloud-sdk main" | tee -a /etc/apt/sources.list.d/google-cloud-sdk.list && \
curl https://packages.cloud.google.com/apt/doc/apt-key.gpg | apt-key add - && \
apt-get -y update && apt-get install -y google-cloud-sdk && \
apt-get install google-cloud-sdk-gke-gcloud-auth-plugin && \
apt-get install kubectl

# Return gcloud installation folder
RUN gcloud info --format="value(installation.sdk_root)"

# The WORKDIR instruction sets the working directory for any RUN, CMD,
# ENTRYPOINT, COPY and ADD instructions that follow it in the Dockerfile.
# If the WORKDIR doesn’t exist, it will be created even if it’s not used in
# any subsequent Dockerfile instruction
WORKDIR /custom

# Copy the specific data processing script/s in the image under /custom/*
COPY ./csv_transform.py .

# Command to run the data processing script when the container is run
CMD ["python3", "csv_transform.py"]