[node-core-library] Gracefully handle irregular LockFile.tryAcquire fail on macOS/Linux by andrew--r · Pull Request #4497 · microsoft/rushstack

andrew--r · 2024-01-29T14:21:03Z

Fixes #4491

Summary and details

See #4497 (comment)

There are two fixes in this PR:

Fixed the root cause of the unexpected ENOENT error
Added getStatistics method to FileWriter as requested

How it was tested

I’ve added a corresponding test case to LockFile tests suite

…ail on macOS/Linux Fixes #4491

andrew--r · 2024-01-29T16:39:26Z

@microsoft-github-policy-service agree company="Joom Unipessoal LDA"

libraries/node-core-library/src/LockFile.ts

…Handle for getting stats

libraries/node-core-library/src/FileWriter.ts

libraries/node-core-library/src/LockFile.ts

andrew--r · 2024-01-31T01:18:28Z

I’ve dug a bit deeper into the problem, and I think I’ve managed to find the root cause.

The problem occurs when at least two processes constantly try to acquire the same lock, e.g. when processing a task queue inside each process.

Reproduction steps:

The first process acquires lock and starts executing something (e.g. current task).
The second process simultaneously tries to acquire the same lock, writes it’s lockfile, …, and scans directory in search for other lockfiles.
Immediately after it, the first process lock is released, which leads to the first lockfile being deleted
The second lock processes each file found on the step 2. The first lockfile doesn’t exist anymore, so it fails to retrieve info about the first process lock, and the error is silently swallowed
The first process takes the next task from the queue and starts acquiring new lock, writing a new lockfile
The second process uses obsolete data about the first process lockfile and deletes it with an assumption that the first process is no longer executing, but the lockfile still exists for some reason
The first process tries to retrieve stats for it’s new lockfile, which was deleted on the previous step by the second process, which finally leads to unexpected ENOENT error

To be sure, in the next few days I’ll write some test cases covering this behaviour, and if it confirms, I’ll update the PR with a fix for the root cause.

…mStats

…file

andrew--r · 2024-02-07T14:05:42Z

@iclanton could you please take a look at the updates when you’ll have time?

[node-core-library] Gracefully handle irregular LockFile.tryAcquire f…

a9d8c9a

…ail on macOS/Linux Fixes #4491

andrew--r marked this pull request as ready for review January 29, 2024 16:39

andrew--r requested review from D4N14L, apostolisms, dmichon-msft, iclanton and octogonz as code owners January 29, 2024 16:39

Restore mocks after each test

169037b

dmichon-msft reviewed Jan 29, 2024

View reviewed changes

libraries/node-core-library/src/LockFile.ts Outdated Show resolved Hide resolved

[node-core-library] Add getStatistics() to FileWriter, reuse lockFile…

eb1297a

…Handle for getting stats

andrew--r requested a review from dmichon-msft January 29, 2024 21:25

iclanton reviewed Jan 30, 2024

View reviewed changes

libraries/node-core-library/src/FileWriter.ts Outdated Show resolved Hide resolved

libraries/node-core-library/src/LockFile.ts Outdated Show resolved Hide resolved

andrew--r marked this pull request as draft January 31, 2024 01:19

andrew--r added 3 commits February 4, 2024 00:47

[node-core-library] Remove FileWriterStats type in favor of FileSyste…

4e9901c

…mStats

[node-core-library] Prevent accidentaly deleting freshly created lock…

880906a

…file

Cleanup

b534ea7

andrew--r marked this pull request as ready for review February 4, 2024 20:10

andrew--r requested a review from iclanton February 4, 2024 20:15

Refactor currentBirthTimeMs definition

37b5aa4

dmichon-msft approved these changes Feb 5, 2024

View reviewed changes

iclanton approved these changes Feb 7, 2024

View reviewed changes

iclanton enabled auto-merge February 7, 2024 18:32

iclanton merged commit 0ce9231 into microsoft:main Feb 7, 2024

andrew--r deleted the feature/lockfile-handle-failure-gracefully branch February 7, 2024 22:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

[node-core-library] Gracefully handle irregular LockFile.tryAcquire fail on macOS/Linux#4497

[node-core-library] Gracefully handle irregular LockFile.tryAcquire fail on macOS/Linux#4497
iclanton merged 7 commits intomicrosoft:mainfrom
andrew--r:feature/lockfile-handle-failure-gracefully

andrew--r commented Jan 29, 2024 •

edited

Loading

Uh oh!

andrew--r commented Jan 29, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andrew--r commented Jan 31, 2024 •

edited

Loading

Uh oh!

andrew--r commented Feb 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

andrew--r commented Jan 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary and details

How it was tested

Uh oh!

andrew--r commented Jan 29, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andrew--r commented Jan 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

andrew--r commented Feb 7, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

andrew--r commented Jan 29, 2024 •

edited

Loading

andrew--r commented Jan 31, 2024 •

edited

Loading