Spark: Add SparkApplicationDetailsFacet #2688
Merged
Problem
Discussion: #2589 (comment)
Solution
Add a run facet containing Spark application information:
master: string
appName: string
applicationId: string
deployMode: string
userName: string
driverHost: string
webUiUrl: string | null
- URL of the Spark driver WebUI. It can be null if spark.ui.enabled is false.
proxyUrl: string | null
- URL of the Spark driver if it is served behind a reverse proxy (e.g. K8s ingress, YarnUI proxy), if any.
historyUrl: string | null
- URL of the Spark history server, if any. It can be set only on Yarn, because K8s and standalone Spark instances do not populate an option with the Spark History address, only with the event logs location.
One-line summary:
Add SparkApplicationDetailsFacet to runEvents emitted on Spark application start.
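For illustration, the facet's shape could be sketched roughly as follows. This is not the integration's actual code, only a minimal Python model of the fields listed in this description. The facet key `spark_applicationDetails`, the helper `to_run_facets`, and all sample values are assumptions made for the example; the three nullable URLs default to `None`.

```python
import json
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class SparkApplicationDetails:
    """Illustrative model of the fields listed in the PR description."""
    master: str
    appName: str
    applicationId: str
    deployMode: str
    userName: str
    driverHost: str
    # The three URLs are nullable per the description.
    webUiUrl: Optional[str] = None    # null when spark.ui.enabled is false
    proxyUrl: Optional[str] = None    # set only behind a reverse proxy
    historyUrl: Optional[str] = None  # per the description, only on Yarn


def to_run_facets(details: SparkApplicationDetails,
                  facet_key: str = "spark_applicationDetails") -> dict:
    """Wrap the details under a facet key (the key name is an assumption)."""
    return {facet_key: asdict(details)}


# Hypothetical sample values, for illustration only.
example = SparkApplicationDetails(
    master="local[*]",
    appName="example-app",
    applicationId="local-0000000000000",
    deployMode="client",
    userName="spark",
    driverHost="localhost",
)

facets = to_run_facets(example)
print(json.dumps(facets, indent=2))
```

In this sketch the nullable fields serialize as JSON `null`, which matches the `string | null` types given above.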
Checklist
SPDX-License-Identifier: Apache-2.0
Copyright 2018-2023 contributors to the OpenLineage project