Grammar caching causes EOFError and race condition when used as pre-commit hook #1164

a-gardner1 · 2023-10-10T23:07:12Z

Thanks to pre-commit#851, hooks can now be executed in parallel for each file. This feature is enabled by default.

When a large number of files need to be formatted at once, this concurrency introduces a race condition in wherein one process can conclude that the pickled grammar file does not exist and start to create it. Meanwhile, another process sees the newly created pickle file before it is done being written, concludes that it is newer than the raw unpickled grammar, and then attempts to load it.
The result is the following stack trace:

Traceback (most recent call last):
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/bin/yapf", line 5, in <module>
    from yapf import run_main
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/lib/python3.11/site-packages/yapf/__init__.py", line 41, in <module>
    from yapf.yapflib import yapf_api
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/lib/python3.11/site-packages/yapf/yapflib/yapf_api.py", line 38, in <module>
    from yapf.pyparser import pyparser
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/lib/python3.11/site-packages/yapf/pyparser/pyparser.py", line 44, in <module>
    from yapf.yapflib import format_token
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/lib/python3.11/site-packages/yapf/yapflib/format_token.py", line 23, in <module>
    from yapf.pytree import pytree_utils
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/lib/python3.11/site-packages/yapf/pytree/pytree_utils.py", line 30, in <module>
    from yapf_third_party._ylib2to3 import pygram
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/lib/python3.11/site-packages/yapf_third_party/_ylib2to3/pygram.py", line 29, in <module>
    python_grammar = driver.load_grammar(_GRAMMAR_FILE)
                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/lib/python3.11/site-packages/yapf_third_party/_ylib2to3/pgen2/driver.py", line 252, in load_grammar
    g.load(gp)
  File "~/.cache/pre-commit/repo73fphdkg/py_env-python3.11/lib/python3.11/site-packages/yapf_third_party/_ylib2to3/pgen2/grammar.py", line 95, in load
    d = pickle.load(f)
        ^^^^^^^^^^^^^^
EOFError: Ran out of input

Since the process writing the file is not impacted by this error, the pickled grammar will get cached. Thus, subsequent runs will succeed. However, if one is using yapf in a continuous integration context, this would cause a failed pipeline with potentially high probability (depending on the number of files) with no obvious recourse.

A workaround is to add require_serial to the yapf hook in your project's .pre-commit-config.yaml, but this comes at the cost of losing the advantages of pre-commit#851. I believe the logic in _load_grammar could be refactored to avoid the race condition with some more careful checks.

The text was updated successfully, but these errors were encountered:

kamahen · 2023-10-10T23:24:37Z

The problem seems to be in Grammar.dump() in file yapf/third_party/yapf_third_party/_ylib2to3/pgen2/grammar.py - it opens the file directly instead of creating a temporary name, writing to the file, then renaming it (this works on Unix, where rename is an atomic operation (as are open, close, linke, etc.), so it's safe if two processes do the same thing at the same time). I don't know if this technique works on Windows.

Something like this:

def dump(self, filename):
  """Dump the grammar tables to a pickle file."""
  with tempfile.NamedTemporaryFile(mode='wb') as f:
    pickle.dump(self.__dict__, f, pickle.HIGHEST_PROTOCOL)
    os.link(f.name, filename)

a-gardner1 · 2023-10-11T14:35:17Z

Agreed, I think that would work on Unix systems. To my knowledge, there is no documented, guaranteed atomic write operation on Windows. The python-atomicwrites library provides a best-effort atomic write operation for Windows; there may or may not be something better out there.

kamahen · 2023-10-11T15:11:12Z

There's an alternative method that runs a small risk of leaving a temporary file lying around, but might work better on Windows [this code is untested; and it probably should have a try/except to deal with cleanup on error]:

  with tempfile.NamedTemporaryFile(mode='wb', delete=False, delete_on_close=False) as f:
    tmp_file_name = f.name
    pickle.dump((self.__dict__, f, pickle.HIGHEST_PROTOCOL)
  os.rename(tmp_file_name, filename)

jwwangchn · 2024-01-05T13:21:31Z

Same problem, how to solve it?

See google/yapf#1164.

* GitHub workflow for new runners * Ignore first yapf failure See google/yapf#1164. * run test_line_info.py separately * triton-runner-base:0.0.2 * Include cmake/llvm-hash.txt to a cache key for packages * Run pre-commit checks in parallel * Use pip cache for pre-commit checks * Redirect first failing yapf to /dev/null * Use jobs.<job_id>.defaults.run to initialize oneapi

We have to run yapf twice because in a clean environment the first run most likely fails because of the race condition in yapf, see google/yapf#1164. If, however, the first run was successful and yapf has modified files, then we need to reset the changes, so the second run can detect them again. Note that the whole block (ignore the first run and reset the tree) will not be necessary and will be removed after fixed google/yapf#1164.

whlook · 2024-02-27T05:55:25Z

Same problem, how to solve it?

similar issue and the way to fix it: #1204

pbchekin added a commit to intel/intel-xpu-backend-for-triton that referenced this issue Jan 7, 2024

Ignore first yapf failure

355bf95

See google/yapf#1164.

pbchekin added a commit to intel/intel-xpu-backend-for-triton that referenced this issue Jan 8, 2024

Ignore first yapf failure

a4df81a

See google/yapf#1164.

pbchekin mentioned this issue Jan 10, 2024

Reset git tree after first yapf failure intel/intel-xpu-backend-for-triton#230

Merged

DeclK mentioned this issue Jan 24, 2024

[Bug] config to import yapf causes 'EOFError: Ran out of input' when distributed training open-mmlab/mmengine#1480

Closed

2 tasks

whlook mentioned this issue Feb 27, 2024

[Bug] [Crash][Reproducible] EOFError: Ran out of input when import yapf with multiprocess #1204

Closed

kehemo mentioned this issue Feb 28, 2024

Add type inference for constants exo-lang/exo#581

Merged

hartwork mentioned this issue Oct 3, 2024

Fix pickle related race condition (fixes #1164, fixes #1204) #1243

Merged

bwendling closed this as completed in #1243 Oct 7, 2024

hartwork mentioned this issue Oct 12, 2024

New release YAPF v0.40.3 needed #1248

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Grammar caching causes EOFError and race condition when used as pre-commit hook #1164

Grammar caching causes EOFError and race condition when used as pre-commit hook #1164

a-gardner1 commented Oct 10, 2023

kamahen commented Oct 10, 2023

a-gardner1 commented Oct 11, 2023

kamahen commented Oct 11, 2023

jwwangchn commented Jan 5, 2024

whlook commented Feb 27, 2024

Grammar caching causes EOFError and race condition when used as pre-commit hook #1164

Grammar caching causes EOFError and race condition when used as pre-commit hook #1164

Comments

a-gardner1 commented Oct 10, 2023

kamahen commented Oct 10, 2023

a-gardner1 commented Oct 11, 2023

kamahen commented Oct 11, 2023

jwwangchn commented Jan 5, 2024

whlook commented Feb 27, 2024