Skip to content

Conversation

max-ostapenko
Copy link
Contributor

@max-ostapenko max-ostapenko commented Aug 27, 2024

Automation with Workflows:

  • before crawl (polling BQ tables)
  • after crawl

For a list of supported pipelines and a corresponding triggers see README.

Please also read through notes in the pipeline doc.

Closes HTTPArchive/cwv-tech-report#36

TODO before prod ready:

  1. Dataform service account has only read-only access to BigQuery (to avoid messing with tables before tested). Needs to be updated with write access to particular datasets.
  2. FYI Dataform repo using PAT, so all the changes done via Dataform console will be committed on behalf of @max-ostapenko. Do we need any action?

@max-ostapenko max-ostapenko mentioned this pull request Aug 27, 2024
14 tasks
@max-ostapenko max-ostapenko changed the title all tables publish Finalize the pipelines automation v2 Aug 27, 2024
@pmeenan pmeenan removed their request for review September 3, 2024 20:12
@pmeenan
Copy link
Member

pmeenan commented Sep 3, 2024

Just pulling myself off the reviewers list since I'm not all that familiar with the back-end data pipelines (other than writing data to them).

@max-ostapenko
Copy link
Contributor Author

The pipeline automation PR is complete.
The crawl starts next Tuesday (so should the CWV tech report pipeline).

@tunetheweb Can I help so we get this reviewed by then and see the first pipeline results?

Co-authored-by: Barry Pollard <barrypollard@google.com>
Copy link
Member

@tunetheweb tunetheweb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Really excited about automating this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Automate populating tech report SQL
3 participants