fix: Set up listeners first, in case of long write to stdin. #520

Jordan-Alloy · 2022-09-02T15:38:19Z

We discovered a bug where large payloads can cause unhandled errors. This is a fix for that bug.

Root cause of bug

In the exec method defined in src/exec.js, the child process is spawned, then the JSON data is sent to stdin, and then listeners are added. The problem here is that the child process can exit while we are still writing to its stdin. If this happens, its stdin will also close, and will generate a low-level EPIPE error event, as we will be writing to a closed pipe. Nothing will handle that error event.

const process = childProcess.spawn(command, args, spawnOptions)

if (stdin) {
  process.stdin.setEncoding('utf-8')
  process.stdin.write(stdin) // <- This line can take a while to complete, during which the child process can exit.
  process.stdin.end()
}
// ... other listeners
process.on('close', code => {
  // ... further code

Steps to reproduce

The following code should reproduce the bug:

const jq = require('node-jq');

const generateRandomString = (n) => {
  var result = "";
  var characters = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789";
  for (var i = 0; i < n; i++) {
    result += characters.charAt(Math.floor(Math.random() * characters.length));
  }
  return result;
}

async function demo() {
  // The big JSON payload is key, as it will take a long time to write to stdin.
  // Depending on the speed of your machine, you may need to increase its size further in order to trigger the bug.
  var bigPayload = { attribute: generateRandomString(10000) };
  try {
    // "Hello world" is an invalid filter, and will cause jq to exit early (without waiting for stdin)
    await Jq.run("Hello world!", bigPayload, { input: "json" })
  } catch (e) {
    console.log(`Successfully caught jq error: \n----------\n${e}`);
  }
}

demo();

Expected result:

With a big enough payload, we should get and catch the EPIPE:

Successfully caught jq error:
----------
Error: write EPIPE

Actual result:

But without the bugfix, you get this:

node:events:491
      throw er; // Unhandled 'error' event
      ^

Error: write EPIPE
    at WriteWrap.onWriteComplete [as oncomplete] (node:internal/stream_base_commons:94:16)
Emitted 'error' event on Socket instance at:
    at emitErrorNT (node:internal/streams/destroy:157:8)
    at emitErrorCloseNT (node:internal/streams/destroy:122:3)
    at processTicksAndRejections (node:internal/process/task_queues:83:21) {
  errno: -32,
  code: 'EPIPE',
  syscall: 'write'
}

Notes

With a smaller payload, the bug is not triggered, and we see this:

Caught error: Error: jq: error: syntax error, unexpected IDENT, expecting $end (Unix shell quoting issues?) at <top-level>, line 1:
Hello world!
jq: 1 compile error

Node versioning note

On older versions of Node, this sort of unhandled error event may have been ignored. On Node v16, it will crash the application.

Jordan-Alloy · 2022-09-02T16:40:03Z

Closed as this does not seem to actually fix the bug. Investigating further.

Jordan-Alloy · 2022-09-02T17:06:52Z

Re-opened: demo code now passes on this branch; fails on main.

eaviles · 2022-09-02T18:20:53Z

Hi @davesnx! do you mind reviewing this? The same issue might be affecting others trying to upgrade to Node 16. Thanks in advance! 🙏🏽

davesnx

Thanks for the PR, those cases are great to fix!

Let a few comments for having a better status of the code.

I don't think is a good idea to merge this without any test case. Let me know If you need help making it.

Thanks again!

davesnx · 2022-09-04T11:37:33Z

src/exec.js

+    process.stdin.on('error', err => {
+      if (!promiseIsClosed) {
+        promiseIsClosed = true
+        return reject(err)
+      }
+    })


This onError should be only attached to process.stdin when there's stdin, right?

Fair enough, yes. If there is not stdin, there will be no error.

davesnx · 2022-09-04T11:40:53Z

src/exec.js

+    // All of these handlers can close the Promise, so guard closing it twice.
+    let promiseIsClosed = false
+
+    process.on('error', err => {


I'm confused about this. Reading your comments seems like the error happens when process is closed, not when it "errors". When this happens?

I'm not sure this process.on('error' is actually necessary; I can try removing it. What is necessary is the next handler, the process.stdin.on('error.

The error happens when the process is closed, but is actually generated by the process's stdin. The chain of events is:

We are still writing to stdin.

Child process closes for some reason (such as an invalid jq filter).

When the child process closes, its stdin closes as well.

We continue to try to write to its stdin.

Its stdin generates an error, because we're trying to write to it but it's been closed.

davesnx · 2022-09-04T13:08:01Z

src/exec.js

@@ -12,6 +12,33 @@ const exec = (command, args, stdin, cwd) => {

    const process = childProcess.spawn(command, args, spawnOptions)

+    // All of these handlers can close the Promise, so guard closing it twice.
+    let promiseIsClosed = false


Could we rename this to express better what it means?

isStdinClosed or stdinClosedEarly?

Maybe promiseIsRejected? I should've used better wording here; this is about making sure we don't call reject() on the Promise twice.

Jordan-Alloy · 2022-09-07T18:46:40Z

Hi @davesnx, can you review again? I've refactored some things per your comments, and have added a unit test. If you comment out the process.stdin.on('error'... listener in exec.js, you should see the unit test fail.

davesnx · 2022-09-12T13:41:58Z

I allowed the CI to run again (this isn't ideal :()

If you need more help, DM me on Twitter https://twitter.com/davesnx

davesnx · 2022-09-20T09:01:02Z

src/jq.test.js

+    const largeJsonString = JSON.parse(readFileSync(PATH_LARGE_JSON_FIXTURE))
+    run(FILTER_INVALID, largeJsonString, { input: 'json' })
+      .then(result => {
+        done('Expected an error to be thrown from child process stdin')


This shouldn't be here otherwise the test always pass green. Can you remove the done call, and maybe throw or handle this in another fashion?

Calling done without any arguments passes a test, but calling it with arguments fails the test. Per the Mocha documentation, "This callback accepts both an Error instance (or subclass thereof) or a falsy value; anything else is invalid usage and throws an error (usually causing a failed test)."

I've tried this out locally - making the run(...) Promise resolve so that this done call is hit, and the test does fail in that case. It would be more proper to wrap this error message in an Error, but either way this done call does fail the test.

davesnx · 2022-09-26T17:06:37Z

🎉 This PR is included in version 2.3.4 🎉

The release is available on:

Your semantic-release bot 📦🚀

fix: Set up listeners first, in case of long write to stdin.

5aa1f25

Jordan-Alloy marked this pull request as ready for review September 2, 2022 15:54

Jordan-Alloy closed this Sep 2, 2022

fix: We need to catch errors from stdin as well.

8450553

Jordan-Alloy reopened this Sep 2, 2022

davesnx requested changes Sep 4, 2022

View reviewed changes

Jordan-Alloy closed this Sep 6, 2022

Jordan-Alloy reopened this Sep 6, 2022

Jordan-Alloy added 3 commits September 7, 2022 03:17

refactor: Rename variable; only listen to stdin if writing to it.

3179616

refactor: combine if blocks

9c60176

feat: Add unit test for EPIPE error.

48794ac

fix: unit test; Windows has different error code.

6053141

davesnx requested changes Sep 20, 2022

View reviewed changes

davesnx merged commit df6d580 into sanack:main Sep 26, 2022

davesnx added the released label Sep 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Set up listeners first, in case of long write to stdin. #520

fix: Set up listeners first, in case of long write to stdin. #520

Jordan-Alloy commented Sep 2, 2022 •

edited

Loading

Jordan-Alloy commented Sep 2, 2022

Jordan-Alloy commented Sep 2, 2022

eaviles commented Sep 2, 2022

davesnx left a comment

davesnx Sep 4, 2022

Jordan-Alloy Sep 6, 2022

davesnx Sep 4, 2022

Jordan-Alloy Sep 6, 2022

davesnx Sep 4, 2022

Jordan-Alloy Sep 6, 2022

Jordan-Alloy commented Sep 7, 2022

davesnx commented Sep 12, 2022

davesnx Sep 20, 2022

Jordan-Alloy Sep 26, 2022 •

edited

Loading

davesnx commented Sep 26, 2022

fix: Set up listeners first, in case of long write to stdin. #520

fix: Set up listeners first, in case of long write to stdin. #520

Conversation

Jordan-Alloy commented Sep 2, 2022 • edited Loading

Root cause of bug

Steps to reproduce

Expected result:

Actual result:

Notes

Node versioning note

Jordan-Alloy commented Sep 2, 2022

Jordan-Alloy commented Sep 2, 2022

eaviles commented Sep 2, 2022

davesnx left a comment

Choose a reason for hiding this comment

davesnx Sep 4, 2022

Choose a reason for hiding this comment

Jordan-Alloy Sep 6, 2022

Choose a reason for hiding this comment

davesnx Sep 4, 2022

Choose a reason for hiding this comment

Jordan-Alloy Sep 6, 2022

Choose a reason for hiding this comment

davesnx Sep 4, 2022

Choose a reason for hiding this comment

Jordan-Alloy Sep 6, 2022

Choose a reason for hiding this comment

Jordan-Alloy commented Sep 7, 2022

davesnx commented Sep 12, 2022

davesnx Sep 20, 2022

Choose a reason for hiding this comment

Jordan-Alloy Sep 26, 2022 • edited Loading

Choose a reason for hiding this comment

davesnx commented Sep 26, 2022

Jordan-Alloy commented Sep 2, 2022 •

edited

Loading

Jordan-Alloy Sep 26, 2022 •

edited

Loading