
Jobs with multiple files don't complete when backend fails #5359

Closed
bmasonrh opened this issue Jul 27, 2018 · 3 comments

@bmasonrh

When the following conditions are met:

  • A print job is submitted with multiple files (e.g. lp -d pq /etc/services /etc/services)
  • The print job uses at least one filter (i.e. not a raw queue)
  • The backend fails (due to, for example, a Broken Pipe error, which is a common occurrence)

Then the job stays in the queue: the backend fails, but the filter never receives SIGPIPE and never exits, and the scheduler doesn't kill the job.

If the print job has only one file, then the filter gets SIGPIPE when the backend fails and the job is aborted (or retried depending on the Error Policy).

I've reproduced this in CUPS 1.4.2, 1.6.3 and 2.2.6.

I've been trying (unsuccessfully) to discover a difference in the way the scheduler starts filters for multi-file jobs vs. single-file jobs that would account for this behavior. Suggestions for where to look in the code would be appreciated (as would a patch to fix this, but I'm happy to work on the patch myself if I can get a push in the right direction).

Or would it make sense to call stop_job() somewhere when the backend fails? There's not much point in continuing to process a job after the backend has failed, is there?

Thanks.

@michaelrsweet (Collaborator) commented Jul 27, 2018

@bryan-mason "Broken pipe" should not be a common occurrence for a backend, particularly when all of the standard backends block/handle it.

The difference with single- vs. multi-document jobs is that the backend (and its associated pipes) remains active for all of the documents in the job; basically we reuse them for every document, e.g.:

filters for document 1 \
filters for document 2  | backend
...                     |
filters for document N /

Anyways, we should probably be closing the pipes when the backend fails, which will allow the filters to see that the backend has gone away (rather than block on IO) and allow the job to abort.

@bmasonrh (Author)

> "Broken pipe" should not be a common occurrence for a backend, particularly when all of the standard backends block/handle it.

It's been my experience supporting enterprise customers that:

D [25/Jul/2018:17:24:05 -0700] [Job 42] Error reading back-channel data: Connection reset by peer
E [25/Jul/2018:17:24:05 -0700] [Job 42] Unable to write print data: Broken pipe

is one of the more common failure modes that people report. The cause always seems to be some sort of network equipment problem, and the socket backend handles it gracefully and exits cleanly, but it's still a problem for the customer (usually because ErrorPolicy is stop-printer and the customer wants to know why their print queue stopped).

@michaelrsweet (Collaborator)

[master 72a2134] Fix stuck multi-file jobs (Issue #5359, Issue #5413)

[branch-2.2 e7e33bf] Fix stuck multi-file jobs (Issue #5359, Issue #5413)
