feat: allow persistence of runtime generated job metadata to the DB and allow addition of config-driven job attributes by ShivangNagta · Pull Request #113 · patterninc/heimdall

Shivang Nagta (ShivangNagta) · 2026-06-23T10:48:23Z

Description

Adds a generic, config-driven mechanism to attach extra metadata to a job and surface it in the UI, plus persistence of that metadata to the DB via a new extra_job_attributes column. The first use case is a "Spark History" link on the job details page for Spark-on-EKS jobs.

Rather than a plugin-specific column, attributes are defined in config on a cluster and rendered by core after the job runs:

clusters:
  - name: spark-eks
    context:
      spark_history_url: https://spark-history.example.com
    attributes:
      Spark History:
        kind: link
        value: "{{ .Cluster.spark_history_url }}/history/{{ .Outputs.spark_application_id }}/jobs/"

Each attribute is label → { kind, value }. kind (link/text) tells the UI how to render, and value is a Go text/template rendered over four namespaces: .Job, .Command, .Cluster (context maps), and .Outputs (runtime values published by the plugin).

Flow:

Plugins publish raw runtime values to a transient outputs channel with one call - the sparkeks plugin captures Status.SparkApplicationID during monitoring and emits job.SetOutput("spark_application_id", id).
Core renders cluster.Attributes after Execute into job.ExtraJobAttributes and persists it to the new extra_job_attributes jsonb not null default '{}' column.
UI renders each attribute by kind on the job details page.
This means new attributes (static, or from existing plugin outputs) can be added by editing config alone - devs don't touch plugin code per parameter unless very specific runtime generated metadata needs to be stored. The outputs channel is transient (not persisted); only the rendered extra_job_attributes is stored.

Test

Tested locally (migration, persistence, UI);
Haven't done e2e testing in sandbox as this adds no extra API call for the id. The monitor loop
already fetches Status (for AppState), and SparkApplicationID is just another field on that same object, so reading it adds no new call or failure mode.

Confirmed the spark-operator populates Status.SparkApplicationID at runtime by running the operator's spark-pi example on a local kind cluster and reading the field back.

Manual seeding for testing (for spark and non-spark job)

Button Rendering in Job Details Page (for a spark job)

Some Notes (open for comments)

spark_application_id is the first plugin-specific column on jobs (the other columns are generic). It looks a little bit odd to me but Claude's reasoning for it was - "there's no generic home for plugin runtime metadata as of now", which seems to be true, because our use case is to store a runtime generated data (spark_application_id) in the heimdall database. I could not find in any other plugins, doing something like this.
Some other options could be:
a. add a separate table for storing spark_application_id with a foreign key reference to the original job table. This separates the spark specific data from generic job table but that adds an extra API/read call, and also does not avoid the fact that we would still have to add spark specific table update somewhere in updateAsyncJobStatus function.
b. If more plugins need to store runtime metadata, a generic metadata column may be preferable to per-plugin columns. But this seems like an early abstraction.

EDIT

Option b styled approach was chosen after discussions

Copilot

Copilot was unable to review this pull request because the user who requested the review has reached their quota limit.

prasadlohakpure · 2026-06-24T14:32:11Z

+                      <Button
+                        styleType='text-blue'
+                        as='externalLink'
+                        href={`https://spark-history.data-platform.aws.pattern.com/history/${jobData?.spark_application_id}/jobs/`}


Shivang Nagta (@ShivangNagta) this is going to be shipped with oss docker image. Lets find another way to inject this in UI

Yash Shrivastava (alephys26)

If this is going to the jobs table, add something more generic, maybe a json that stores key-value and the column is extra_job_attributes. And the frontend then uses the key as display text and the value as the hyperlink for all the attributes that exist for that column.
Or some better approach, anyway, a specific column for spark application ID is not what I would like to see.

Shivang Nagta (ShivangNagta) · 2026-06-28T12:50:37Z

If this is going to the jobs table, add something more generic, maybe a json that stores key-value and the column is extra_job_attributes. And the frontend then uses the key as display text and the value as the hyperlink for all the attributes that exist for that column. Or some better approach, anyway, a specific column for spark application ID is not what I would like to see.

I have added an extra_job_attributes column in job table instead of spark_app_id which was plugin specific. It is a key value pair as you said, but for json value I have preferred using a nested object containing kind and value instead of just value. The kind field allows to store both hyperlinks and non-hyperlinks which would be a more flexible change for the future.

....
ExtraJobAttributes map[string]Attribute
....

const (
	AttributeKindLink = "link"
	AttributeKindText = "text"
)

type Attribute struct {
	Kind  string `yaml:"kind,omitempty" json:"kind,omitempty"`
	Value string `yaml:"value,omitempty" json:"value,omitempty"`
}

Shivang Nagta (ShivangNagta) · 2026-06-28T12:51:25Z

Also, the spark history server URL is now added in the cluster context instead

Copilot

Copilot was unable to review this pull request because the user who requested the review has reached their quota limit.

Shivang Nagta (ShivangNagta) · 2026-06-30T17:43:28Z

I have moved the logic for defining the template for extra job attribute values to config itself(cluster). Now the runtime metadata is stored in-memory (output field in job struct), and is rendered based on what was passed in the template. It is finally persisted to the DB column - extra_job_atttributes as it was previously
Example cluster config

  - name: spark-eks
    attributes:
      Spark History:
        kind: link
        value: "{{ .Cluster.spark_history_url }}/history/{{ .Outputs.spark_application_id }}/jobs/"
    status: active
    ....

prasadlohakpure

Nice work, LGTM

Shivang Nagta (ShivangNagta) · 2026-07-01T10:20:46Z

Rendering of config driven links and tests (they can be separated if needed)

feat: add Spark History Server link to job details

0776c4c

Shivang Nagta (ShivangNagta) marked this pull request as ready for review June 24, 2026 09:49

Copilot AI review requested due to automatic review settings June 24, 2026 09:49

Copilot AI reviewed Jun 24, 2026

prasadlohakpure reviewed Jun 24, 2026

View reviewed changes

prasadlohakpure requested changes Jun 24, 2026

View reviewed changes

Yash Shrivastava (alephys26) requested changes Jun 24, 2026

View reviewed changes

Add a generic extra_job_attributes column in jobs

a022cc5

Shivang Nagta (ShivangNagta) requested a review from Copilot June 28, 2026 13:05

Copilot AI reviewed Jun 28, 2026

Shivang Nagta (ShivangNagta) requested review from Yash Shrivastava (alephys26) and prasadlohakpure June 28, 2026 13:08

prasadlohakpure previously approved these changes Jun 30, 2026

View reviewed changes

make extra_job_attributes config-driven via cluster attributes

2caf7cd

Shivang Nagta (ShivangNagta) dismissed prasadlohakpure’s stale review via 2caf7cd June 30, 2026 17:13

prasadlohakpure previously approved these changes Jun 30, 2026

View reviewed changes

Shivang Nagta (ShivangNagta) changed the title ~~feat: add Spark History Server link to job details~~ feat: allow persistence of runtime generated job metadata to the DB via extra_job_attributes and allow config-driven job attributes Jul 1, 2026

Refactor and Readme update

9df3cab

Shivang Nagta (ShivangNagta) dismissed prasadlohakpure’s stale review via 9df3cab July 1, 2026 07:43

Shivang Nagta added 2 commits July 1, 2026 13:36

Improve attributes' value meaning in local.yaml

3c35806

Add divider b/w hardcoded links and config generated links

584c8d1

Shivang Nagta (ShivangNagta) force-pushed the feat/spark-history-link branch from c51e2f7 to 584c8d1 Compare July 1, 2026 10:14

rename extra_job_attributes to job_attributes

932a046

Yash Shrivastava (alephys26) approved these changes Jul 1, 2026

View reviewed changes

Merge branch 'main' into feat/spark-history-link

6553008

Yash Shrivastava (alephys26) merged commit 8a43a12 into patterninc:main Jul 1, 2026
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: allow persistence of runtime generated job metadata to the DB and allow addition of config-driven job attributes#113

feat: allow persistence of runtime generated job metadata to the DB and allow addition of config-driven job attributes#113
Yash Shrivastava (alephys26) merged 8 commits into
patterninc:mainfrom
ShivangNagta:feat/spark-history-link

Shivang Nagta (ShivangNagta) commented Jun 23, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

prasadlohakpure Jun 24, 2026

Uh oh!

Yash Shrivastava (alephys26) left a comment

Uh oh!

Shivang Nagta (ShivangNagta) commented Jun 28, 2026

Uh oh!

Shivang Nagta (ShivangNagta) commented Jun 28, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Shivang Nagta (ShivangNagta) commented Jun 30, 2026

Uh oh!

prasadlohakpure left a comment

Uh oh!

Shivang Nagta (ShivangNagta) commented Jul 1, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

Shivang Nagta (ShivangNagta) commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Test

Some Notes (open for comments)

EDIT

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

prasadlohakpure Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Yash Shrivastava (alephys26) left a comment

Choose a reason for hiding this comment

Uh oh!

Shivang Nagta (ShivangNagta) commented Jun 28, 2026

Uh oh!

Shivang Nagta (ShivangNagta) commented Jun 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Shivang Nagta (ShivangNagta) commented Jun 30, 2026

Uh oh!

prasadlohakpure left a comment

Choose a reason for hiding this comment

Uh oh!

Shivang Nagta (ShivangNagta) commented Jul 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Shivang Nagta (ShivangNagta) commented Jun 23, 2026 •

edited

Loading

Shivang Nagta (ShivangNagta) commented Jun 28, 2026 •

edited

Loading

Shivang Nagta (ShivangNagta) commented Jul 1, 2026 •

edited

Loading