Skip to content

Worker: sometimes sends events without a step id #1072

@josephjclark

Description

@josephjclark

Strange sequence of events in worker logs:

[SRV] ✘ cf698f95-4b16-4a75-9ac4-98e1729f7017 :: step:complete :: ERR: [step:complete] {"step_id":["This field can't be blank."]}
[SRV] ℹ cf698f95-4b16-4a75-9ac4-98e1729f7017 :: run:log :: OK
[SRV] ✘ cf698f95-4b16-4a75-9ac4-98e1729f7017 :: step:start :: ERR: [step:start] {"job_id":["does not exist"]}
[SRV] ✘ cf698f95-4b16-4a75-9ac4-98e1729f7017 :: run:log :: ERR: [run:log] {"step_id":["must be associated with the run"]}

How is step_id blank?

This should lead to lost. I think this run might recover, but interestingly there is a lost run (ea330df2-6b88-4cb3-84b9-4619e5d3c319) at almost the same time as this. Coincidence?

GCP logs for the run

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

Status

Ready

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions