Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BQ dataset used in dataflow example doesn't exist #527

Open
dylanmpeck opened this issue Jul 30, 2020 · 7 comments
Open

BQ dataset used in dataflow example doesn't exist #527

dylanmpeck opened this issue Jul 30, 2020 · 7 comments

Comments

@dylanmpeck
Copy link

The python-dataflow-example.example_data BigQuery dataset used in the cookbook batch-examples of dataflow-examples code is returning a 404 and all operations with code that references it are failing.

Did this dataset change in some way recently?

Link to two files where I know dataset is referenced:
https://github.com/GoogleCloudPlatform/professional-services/blob/master/examples/dataflow-python-examples/batch-examples/cookbook-examples/pipelines/data_enrichment.py
https://github.com/GoogleCloudPlatform/professional-services/blob/master/examples/dataflow-python-examples/batch-examples/cookbook-examples/pipelines/data_lake_to_mart.py

@prathapreddy123
Copy link
Member

We recently moved the code around (without any logic changes) to different directory as part of restructuring. Can you verify if call is happening to right path.

@dylanmpeck
Copy link
Author

We are using a forked version of repo that is still in the state of before the directories were restructured. This is for a Qwiklabs lab that uses this code.

I believe the path is right, but here's a picture with error message:
Job failed

To add more info, we only started getting reports of this error since yesterday.

@prathapreddy123
Copy link
Member

Based on error message project python-dataflow-example seems to be missing. Can you cross check if project and corresponding BQ table exists

@dylanmpeck
Copy link
Author

dylanmpeck commented Jul 31, 2020

I don't think so. In BigQuery within my "qwiklabs..." project, I'm unable to locate the table and I'm unable to do simple queries on the python-dataflow-example project + table. Always getting a "doesn't exist" related error.

Is the table publicly hosted somewhere? Or, is the table supposed to be generated by the example code?

I had thought both examples were using a public table as a 2nd source based on how the table is referenced.

@empz
Copy link

empz commented Aug 19, 2020

The data doesn't not exist or is not set as public.

gsutil ls -r gs://python-dataflow-example/data_filesreturns BucketNotFoundException: 404

@tommo245
Copy link
Contributor

Hi - Also not finding the necessary template, getting this error.

File "/path/to/lib/python3.8/site-packages/google/cloud/_http.py", line 438, in api_request
    raise exceptions.from_http_response(response)
google.api_core.exceptions.NotFound: 404 GET https://storage.googleapis.com/storage/v1/b/python-dataflow-example?projection=noAcl&prettyPrint=false: Not Found

@agold-rh
Copy link
Contributor

@joshmcginley @lcaggio Do you have any input here, or should this be closed as stale?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants