Getting run_id from Pipeline class or InteractiveContext and using it in components #2773
Comments
Specifically, we would like to access it from … Any alternative approaches? Use case: we regularly generate embedding vectors as part of one pipeline. One of our push destinations is BigQuery/DB, but we would like to avoid replacing existing embeddings so that users can still query older versions of the embedding vectors. We currently use datetime and a few hacks to generate a unique id so that they are not replaced, but this is not ideal. Our idea was that if we could pull the run_id for the current run from inside the decorated function, we could use it for versioning ourselves. Something like apache/airflow#8058
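To make the versioning idea above concrete, here is a minimal sketch that does not depend on TFX at all. It assumes the run identifier is created once by whatever launches the pipeline and is then handed to the code that pushes embeddings. The names make_run_id, create_table, push_embeddings and versions_for are hypothetical helpers, and an in-memory SQLite table stands in for the BigQuery/DB destination.

```python
import json
import sqlite3
import uuid
from datetime import datetime, timezone
from typing import Dict, List


def make_run_id() -> str:
    """Hypothetical helper: one sortable, collision-resistant id per run."""
    timestamp = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%S")
    return f"{timestamp}-{uuid.uuid4().hex[:8]}"


def create_table(conn: sqlite3.Connection) -> None:
    # The composite primary key keeps one row per (run, item), so a new run
    # adds rows instead of replacing the ones written by earlier runs.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS embeddings ("
        "run_id TEXT NOT NULL, item_id TEXT NOT NULL, vector TEXT NOT NULL, "
        "PRIMARY KEY (run_id, item_id))"
    )


def push_embeddings(
    conn: sqlite3.Connection, run_id: str, embeddings: Dict[str, List[float]]
) -> None:
    # Vectors are stored as JSON text purely to keep the sketch stdlib-only.
    rows = [(run_id, item_id, json.dumps(vec)) for item_id, vec in embeddings.items()]
    with conn:  # commits on success, rolls back on error
        conn.executemany("INSERT INTO embeddings VALUES (?, ?, ?)", rows)


def versions_for(conn: sqlite3.Connection, item_id: str) -> List[str]:
    # List every run that wrote an embedding for this item, oldest id first.
    cursor = conn.execute(
        "SELECT run_id FROM embeddings WHERE item_id = ? ORDER BY run_id",
        (item_id,),
    )
    return [row[0] for row in cursor.fetchall()]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_table(conn)
    for vector in ([0.1, 0.2], [0.3, 0.4]):  # two simulated pipeline runs
        push_embeddings(conn, make_run_id(), {"item-1": vector})
    print(versions_for(conn, "item-1"))
```

This only shows the storage side. Whether the id comes from a helper like this or from the orchestrator's own run_id is exactly the open question in this thread.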
When defining the pipeline's components, you can use … Example: …
This issue has been marked stale because it has had no recent activity for 7 days. It will be closed if no further activity occurs. Thank you.
This issue was closed due to lack of activity after being marked stale for the past 7 days.
For a Pipeline class, the run_id is generated when BeamDagRunner().run() is called, and for InteractiveContext, the run_id is generated each time context.run() is called and is not exposed as an instance attribute.

Is there a recommended way to use run_id within the components in the pipeline? We would like to use it as an identifier for some Artifacts (e.g. embeddings) that we push to a DB.