Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

childprocess kill() results in -4048 EPERM error on windows since update to v21.6.2 #51766

Open
Uzlopak opened this issue Feb 15, 2024 · 14 comments · Fixed by libuv/libuv#4341 or netlify/build#5932
Labels
child_process Issues and PRs related to the child_process subsystem. libuv Issues and PRs related to the libuv dependency or the uv binding. windows Issues and PRs related to the Windows platform.

Comments

@Uzlopak
Copy link
Contributor

Uzlopak commented Feb 15, 2024

Version

v21.6.2

Platform

Microsoft Windows NT 10.0.22621.0 x64

Subsystem

child_process

What steps will reproduce the bug?

Suddenly in latest undici the tests in test/node-test/debug.js fail if run under windows with node v21.6.2.

It fails under PowerShell and cmd.

It seems that there is suddenly a permission issue when trying to kill a child_process. With node v21.6.1 .kill() doesnt throw.

How often does it reproduce? Is there a required condition?

Always reproducable

What is the expected behavior? Why is that the expected behavior?

Should not throw (and close the child process)

What do you see instead?

PS C:\Users\Aras\Desktop\undici> node .\test\node-test\debug.js
✖ debug#websocket (175.6675ms)
  Error: kill EPERM
      at ChildProcess.kill (node:internal/child_process:512:26)
      at TestContext.<anonymous> (C:\Users\Aras\Desktop\undici\test\node-test\debug.js:48:9)
      at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
      at async Test.run (node:internal/test_runner/test:632:9)
      at async startSubtest (node:internal/test_runner/harness:218:3) {
    errno: -4048,
    code: 'EPERM',
    syscall: 'kill'
  }

✖ debug#fetch (174.8844ms)
  Error: kill EPERM
      at ChildProcess.kill (node:internal/child_process:512:26)
      at TestContext.<anonymous> (C:\Users\Aras\Desktop\undici\test\node-test\debug.js:83:9)
      at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
      at async Test.run (node:internal/test_runner/test:632:9)
      at async Test.processPendingSubtests (node:internal/test_runner/test:374:7) {
    errno: -4048,
    code: 'EPERM',
    syscall: 'kill'
  }

✖ debug#undici (150.1489ms)
  Error: kill EPERM
      at ChildProcess.kill (node:internal/child_process:512:26)
      at TestContext.<anonymous> (C:\Users\Aras\Desktop\undici\test\node-test\debug.js:121:9)
      at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
      at async Test.run (node:internal/test_runner/test:632:9)
      at async Test.processPendingSubtests (node:internal/test_runner/test:374:7) {
    errno: -4048,
    code: 'EPERM',
    syscall: 'kill'
  }

ℹ tests 3
ℹ suites 0
ℹ pass 0
ℹ fail 3
ℹ cancelled 0
ℹ skipped 0
ℹ todo 0
ℹ duration_ms 511.956

✖ failing tests:

test at test\node-test\debug.js:11:1
✖ debug#websocket (175.6675ms)
  Error: kill EPERM
      at ChildProcess.kill (node:internal/child_process:512:26)
      at TestContext.<anonymous> (C:\Users\Aras\Desktop\undici\test\node-test\debug.js:48:9)
      at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
      at async Test.run (node:internal/test_runner/test:632:9)
      at async startSubtest (node:internal/test_runner/harness:218:3) {
    errno: -4048,
    code: 'EPERM',
    syscall: 'kill'
  }

test at test\node-test\debug.js:51:1
✖ debug#fetch (174.8844ms)
  Error: kill EPERM
      at ChildProcess.kill (node:internal/child_process:512:26)
      at TestContext.<anonymous> (C:\Users\Aras\Desktop\undici\test\node-test\debug.js:83:9)
      at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
      at async Test.run (node:internal/test_runner/test:632:9)
      at async Test.processPendingSubtests (node:internal/test_runner/test:374:7) {
    errno: -4048,
    code: 'EPERM',
    syscall: 'kill'
  }

test at test\node-test\debug.js:86:1
✖ debug#undici (150.1489ms)
  Error: kill EPERM
      at ChildProcess.kill (node:internal/child_process:512:26)
      at TestContext.<anonymous> (C:\Users\Aras\Desktop\undici\test\node-test\debug.js:121:9)
      at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
      at async Test.run (node:internal/test_runner/test:632:9)
      at async Test.processPendingSubtests (node:internal/test_runner/test:374:7) {
    errno: -4048,
    code: 'EPERM',
    syscall: 'kill'
  }

Additional information

I assume that this bug is coming from libuv.

@targos
Copy link
Member

targos commented Feb 15, 2024

@nodejs/libuv

@richardlau richardlau added the windows Issues and PRs related to the Windows platform. label Feb 15, 2024
@Uzlopak
Copy link
Contributor Author

Uzlopak commented Feb 19, 2024

@targos

Should I open another issue in libuv?

@targos
Copy link
Member

targos commented Feb 20, 2024

Maybe? I don't know if that would be relevant without a repro using libuv directly.

FWIW this commit changed the implementation of the uv_kill function on Windows: libuv/libuv@129362f

@vtjnash

@targos targos added the child_process Issues and PRs related to the child_process subsystem. label Feb 20, 2024
@mcollina
Copy link
Member

mcollina commented Mar 4, 2024

@santigimeno could you take a look?

@santigimeno
Copy link
Member

So, I think there's a regression in libuv which might be solved in libuv/libuv#4341. Regardless, just curious why the need to call child.kill() in those tests as the child processes should exit gracefully.

@santigimeno santigimeno added the libuv Issues and PRs related to the libuv dependency or the uv binding. label Mar 5, 2024
@Uzlopak
Copy link
Contributor Author

Uzlopak commented Mar 5, 2024

@santigimeno

It spins up the server and then kills it. Was this way for quiete a while. i could have patched it, but then I could not have report this bug ;)

@santigimeno
Copy link
Member

This is fixed in libuv, so it'll be fixed once a new version reaches node.

@santigimeno santigimeno reopened this Mar 5, 2024
@Uzlopak
Copy link
Contributor Author

Uzlopak commented Mar 5, 2024

great job, thanks

@SimenB
Copy link
Member

SimenB commented Apr 26, 2024

Is it correct that the update in libuv did not land in Node 22 either? Is there any timeline for this?

@Uzlopak
Copy link
Contributor Author

Uzlopak commented Apr 26, 2024

I dont know. I just know that we patched our test in undici accordingly to not run into the issue again. So probably the bug exists still. Would need to check on a windows machine.

@bukowa
Copy link

bukowa commented Jul 5, 2024

Happens for me only when killed from inside this event handler:

process.on('SIGINT', () => {
    killChildProcess();
})

When in main loop killChildProcess is called it works as expected.

@DeyanTodorov17
Copy link

Have someone found the issue ? I'm still receiving the same error with node version 21.7.3 :
Error: kill EPERM
at ChildProcess.kill (node:internal/child_process:512:26)
at ChildProcessWorker.killChild (C:\dev\dogs\node_modules\jest-worker\build\workers\ChildProcessWorker.js:411:17)
at ChildProcessWorker.forceExit (C:\dev\dogs\node_modules\jest-worker\build\workers\ChildProcessWorker.js:416:33)
at Timeout._onTimeout (C:\dev\dogs\node_modules\jest-worker\build\base\BaseWorkerPool.js:137:16)
at listOnTimeout (node:internal/timers:573:17)
at process.processTimers (node:internal/timers:514:7)
Emitted 'error' event on ChildProcess instance at:
at ChildProcess.kill (node:internal/child_process:512:12)
at ChildProcessWorker.killChild (C:\dev\dogs\node_modules\jest-worker\build\workers\ChildProcessWorker.js:411:17)
[... lines matching original stack trace ...]
at process.processTimers (node:internal/timers:514:7) {
errno: -4048,
code: 'EPERM',
syscall: 'kill'
}

@twilson90
Copy link

I'm getting this error whenever I try and SIGKILL an ffmpeg child process during encoding.

kodiakhq bot pushed a commit to vercel/vercel that referenced this issue Oct 23, 2024
Reverts #12267

This test was skipped in Windows because it was flakey. This PR unskips it and resolves the flakiness by adjusting some timeout values.

---

This turned out to be a specific issue with [Node 22 and Windows](nodejs/node#51766) and timers.

The way our use of `waitUntil` works with the `devServer`'s simulations of serverless functions creates an environment where we can kill the dev server at the same time that the `waitUntil` promise expires. I didn't nail down the exact flow that happens here, but this causes the Node 22 + Windows `EPREM` error to bubble up to the test failure.

Ultimately, the specific flow isn't critical to figure out here. This is a test setup issue, not a production issue.

---

In an attempt to prove this is no longer flakey, here are several consecutive successful runs:
- https://github.com/vercel/vercel/actions/runs/11469305192/job/31916358922?pr=12333
- https://github.com/vercel/vercel/actions/runs/11469447637/job/31916775908?pr=12333
- https://github.com/vercel/vercel/actions/runs/11469447637/job/31917078929?pr=12333
- https://github.com/vercel/vercel/actions/runs/11469447637/job/31917252353?pr=12333
@alicanertop
Copy link

Not constant but sometimes still gives error, also need to check v22.11.0
Screenshot 2024-12-04 at 13 14 13

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
child_process Issues and PRs related to the child_process subsystem. libuv Issues and PRs related to the libuv dependency or the uv binding. windows Issues and PRs related to the Windows platform.
Projects
None yet
10 participants