Collaborator

@Chillee Chillee commented Apr 14, 2021

^^

@facebook-github-bot
Contributor

facebook-github-bot commented Apr 14, 2021

💊 CI failures summary and remediations

As of commit 5336870 (more details on the Dr. CI page):



🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See CircleCI build pytorch_linux_xenial_cuda10_2_cudnn7_py3_gcc7_test2 (1/1)

Step: "Run tests"

Apr 14 03:09:08 [E request_callback_no_python.cpp:656] Received error while processing request type 256: The following operation failed in the TorchScript interpreter.
Apr 14 03:09:08 
Apr 14 03:09:08 [E request_callback_no_python.cpp:656] Received error while processing request type 256: The following operation failed in the TorchScript interpreter.
Apr 14 03:09:08 Traceback of TorchScript (most recent call last):
Apr 14 03:09:08   File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/distributed/rpc/rpc_test.py", line 334, in raise_func_script
Apr 14 03:09:08 @torch.jit.script
Apr 14 03:09:08 def raise_func_script(expected_err: str) -> torch.Tensor:
Apr 14 03:09:08     raise ValueError(expected_err)
Apr 14 03:09:08     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ <--- HERE
Apr 14 03:09:08 RuntimeError: Expected error
Apr 14 03:09:08 
Apr 14 03:09:08 [E request_callback_no_python.cpp:656] Received error while processing request type 256: The following operation failed in the TorchScript interpreter.
Apr 14 03:09:08 Traceback of TorchScript (most recent call last):
Apr 14 03:09:08   File "/opt/conda/lib/python3.6/site-packages/torch/testing/_internal/distributed/rpc/rpc_test.py", line 334, in raise_func_script
Apr 14 03:09:08 @torch.jit.script
Apr 14 03:09:08 def raise_func_script(expected_err: str) -> torch.Tensor:
Apr 14 03:09:08     raise ValueError(expected_err)
Apr 14 03:09:08     ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ <--- HERE
Apr 14 03:09:08 RuntimeError: Expected error
Apr 14 03:09:08 
Apr 14 03:09:08 ok (1.631s)
Apr 14 03:09:10   test_wait_all_multiple_call (__main__.TensorPipeRpcTestWithSpawn) ... ok (1.631s)

1 failure not recognized by patterns:

| Job | Step | Action |
| --- | --- | --- |
| CircleCI pytorch_bazel_build | Bazel Build | 🔁 rerun |

1 job timed out:

  • pytorch_bazel_build

❄️ 1 failure tentatively classified as flaky

but reruns have not yet been triggered to confirm:

See CircleCI build pytorch_linux_bionic_py3_8_gcc9_coverage_test2 (1/1)

Step: "Run tests" ❄️

Apr 14 03:35:00 RuntimeError: Process 0 terminated or timed out after 100.07156300544739 seconds
Apr 14 03:35:00 ======================================================================
Apr 14 03:35:00 ERROR [100.142s]: test_py_tensors_multi_async_call (__main__.TensorPipeRpcTestWithSpawn)
Apr 14 03:35:00 ----------------------------------------------------------------------
Apr 14 03:35:00 Traceback (most recent call last):
Apr 14 03:35:00   File "/opt/conda/lib/python3.8/site-packages/torch/testing/_internal/common_distributed.py", line 322, in wrapper
Apr 14 03:35:00     self._join_processes(fn)
Apr 14 03:35:00   File "/opt/conda/lib/python3.8/site-packages/torch/testing/_internal/common_distributed.py", line 515, in _join_processes
Apr 14 03:35:00     self._check_return_codes(elapsed_time)
Apr 14 03:35:00   File "/opt/conda/lib/python3.8/site-packages/torch/testing/_internal/common_distributed.py", line 563, in _check_return_codes
Apr 14 03:35:00     raise RuntimeError('Process {} terminated or timed out after {} seconds'.format(i, elapsed_time))
Apr 14 03:35:00 RuntimeError: Process 0 terminated or timed out after 100.07156300544739 seconds
Apr 14 03:35:00 
Apr 14 03:35:01 ----------------------------------------------------------------------
Apr 14 03:35:01 Ran 356 tests in 1290.218s
Apr 14 03:35:01 
Apr 14 03:35:01 FAILED (errors=1, skipped=6)
Apr 14 03:35:01 
Apr 14 03:35:01 Generating XML reports...
Apr 14 03:35:01 Generated XML report: test-reports/python-unittest/distributed.rpc.test_tensorpipe_agent/TEST-TensorPipeDdpComparisonTestWithSpawn-20210414031330.xml
Apr 14 03:35:01 Generated XML report: test-reports/python-unittest/distributed.rpc.test_tensorpipe_agent/TEST-TensorPipeDdpUnderDistAutogradTestWithSpawn-20210414031330.xml
Apr 14 03:35:01 Generated XML report: test-reports/python-unittest/distributed.rpc.test_tensorpipe_agent/TEST-TensorPipeDistAutogradTestWithSpawn-20210414031330.xml

This comment was automatically generated by Dr. CI (expand for details). Follow this link to opt out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

@facebook-github-bot
Contributor

@Chillee has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@Chillee merged this pull request in 3c4e1cd.

krshrimali pushed a commit to krshrimali/pytorch that referenced this pull request May 19, 2021
Summary:
^^

Pull Request resolved: pytorch#55982

Reviewed By: mruberry

Differential Revision: D27776380

Pulled By: Chillee

fbshipit-source-id: 22b3a8de73416821bed56b75b68dca1c33a21250

@github-actions github-actions bot deleted the removeAnnoyingWarnings branch February 11, 2024 01:54