Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pangeo-forge test #1

Closed
thodson-usgs opened this issue Nov 13, 2023 · 0 comments
Closed

Pangeo-forge test #1

thodson-usgs opened this issue Nov 13, 2023 · 0 comments

Comments

@thodson-usgs
Copy link
Owner

thodson-usgs commented Nov 13, 2023

@rsignell-usgs,

Trying to test pangeo-forge-runner by writing a zarr to disk using the direct runner on Denali, but I'm running into issues.
The recipe is on the us-ssebop branch of this repo. Would you take a look before I bug their dev?

Here's my config.py


BUCKET_PREFIX = "test_run/"

c.Bake.bakery_class = "pangeo_forge_runner.bakery.local.LocalDirectBakery"
c.Bake.feedstock_subdir = 'recipes/us-ssebop'

c.TargetStorage.fsspec_class = "fsspec.implementations.local.LocalFileSystem"

c.TargetStorage.root_path = f"{BUCKET_PREFIX}/test_run/output"
c.TargetStorage.fsspec_args = {}


c.InputCacheStorage.fsspec_class = c.TargetStorage.fsspec_class
c.InputCacheStorage.fsspec_args = c.TargetStorage.fsspec_args
# Input data cache should *not* be partitioned by job id, as we want to get the datafile
# from the source only once
c.InputCacheStorage.root_path = f"{BUCKET_PREFIX}/cache/input"

c.MetadataCacheStorage.fsspec_class = c.TargetStorage.fsspec_class
c.MetadataCacheStorage.fsspec_args = c.TargetStorage.fsspec_args
# Metadata cache should be per job, as kwargs changing can change metadata
#c.MetadataCacheStorage.root_path = f"{BUCKET_PREFIX}/{{job}}/cache/metadata"
c.MetadataCacheStorage.root_path = f"{BUCKET_PREFIX}/test_run/cache/metadata"

I run the following

#!/bin/bash

REPO=./staged-recipes
JOB_NAME=test
RECIPE_ID=us-ssebop
CONFIG_FILE=config.py

pangeo-forge-runner bake --repo=$REPO -f=$CONFIG_FILE --Bake.recipe_id=$RECIPE_ID --Bake.job_name=$JOB_NAME --prune

which generates lots of messages ending in

 File "/home/thodson/.local/miniconda3/envs/pangeo-forge/lib/python3.11/site-packages/apache_beam/runners/worker/sdk_worker.py", line 194, in __init__
    grpc.channel_ready_future(self._control_channel).result(timeout=60)
  File "/home/thodson/.local/miniconda3/envs/pangeo-forge/lib/python3.11/site-packages/grpc/_utilities.py", line 162, in result
    self._block(timeout)
  File "/home/thodson/.local/miniconda3/envs/pangeo-forge/lib/python3.11/site-packages/grpc/_utilities.py", line 106, in _block
    raise grpc.FutureTimeoutError()
grpc.FutureTimeoutError
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant