Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[iOS][OSS][BE] Add simulator tests for Metal #64852

Closed
wants to merge 15 commits into from

Conversation

xta0
Copy link
Contributor

@xta0 xta0 commented Sep 10, 2021

@facebook-github-bot
Copy link
Contributor

facebook-github-bot commented Sep 10, 2021

🔗 Helpful links

💊 CI failures summary and remediations

As of commit b2fb7d9 (more details on the Dr. CI page):


  • 3/3 failures introduced in this PR

🕵️ 2 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See GitHub Actions build linux-xenial-py3.6-gcc5.4 / test (distributed, 1, 1, linux.2xlarge) (1/2)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-09-14T00:02:21.9242727Z test_udf_remote_...yUniqueId(created_on=0, local_id=0) to be created.
2021-09-14T00:01:42.8798014Z frame #12: c10::ThreadPool::main_loop(unsigned long) + 0x2a3 (0x7f5fa229b263 in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-09-14T00:01:42.8799644Z frame #13: <unknown function> + 0xc92bd (0x7f5fa21cc2bd in /opt/conda/lib/libstdc++.so.6)
2021-09-14T00:01:42.8801359Z frame #14: <unknown function> + 0x76ba (0x7f5fb75316ba in /lib/x86_64-linux-gnu/libpthread.so.0)
2021-09-14T00:01:42.8803225Z frame #15: clone + 0x6d (0x7f5fb726751d in /lib/x86_64-linux-gnu/libc.so.6)
2021-09-14T00:01:42.8804032Z 
2021-09-14T00:01:43.0628790Z ok (3.317s)
2021-09-14T00:01:57.9979826Z   test_rpc_builtin_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (14.935s)
2021-09-14T00:02:06.9234916Z   test_rpc_script_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (8.925s)
2021-09-14T00:02:10.3401498Z   test_rref_to_here_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (3.417s)
2021-09-14T00:02:17.7625560Z   test_udf_remote_message_delay_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (7.422s)
2021-09-14T00:02:21.9242727Z   test_udf_remote_message_delay_timeout_to_self (__main__.FaultyFaultyAgentRpcTest) ... [E request_callback_no_python.cpp:559] Received error while processing request type 261: falseINTERNAL ASSERT FAILED at "/var/lib/jenkins/workspace/torch/csrc/distributed/rpc/rref_context.cpp":387, please report a bug to PyTorch. Expected OwnerRRef with id GloballyUniqueId(created_on=0, local_id=0) to be created.
2021-09-14T00:02:21.9245409Z Exception raised from getOwnerRRef at /var/lib/jenkins/workspace/torch/csrc/distributed/rpc/rref_context.cpp:387 (most recent call first):
2021-09-14T00:02:21.9247403Z frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x69 (0x7fc8e9b3c429 in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-09-14T00:02:21.9249181Z frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xd2 (0x7fc8e9b389d2 in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-09-14T00:02:21.9251387Z frame #2: c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0x4e (0x7fc8e9b3a36e in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-09-14T00:02:21.9253324Z frame #3: torch::distributed::rpc::RRefContext::getOwnerRRef(torch::distributed::rpc::GloballyUniqueId const&, bool) + 0x4a4 (0x7fc8edd8b844 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-09-14T00:02:21.9255815Z frame #4: torch::distributed::rpc::RequestCallbackNoPython::assignOwnerRRef(torch::distributed::rpc::GloballyUniqueId const&, torch::distributed::rpc::GloballyUniqueId const&, c10::intrusive_ptr<c10::ivalue::Future, c10::detail::intrusive_target_default_null_type<c10::ivalue::Future> >) const + 0x71 (0x7fc8edd7bd61 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-09-14T00:02:21.9258714Z frame #5: torch::distributed::rpc::RequestCallbackImpl::processPythonRemoteCall(torch::distributed::rpc::RpcCommandBase&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0xc8 (0x7fc8f6190d68 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_python.so)
2021-09-14T00:02:21.9261222Z frame #6: torch::distributed::rpc::RequestCallbackNoPython::processRpc(torch::distributed::rpc::RpcCommandBase&, torch::distributed::rpc::MessageType const&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0x194 (0x7fc8edd80554 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-09-14T00:02:21.9263804Z frame #7: torch::distributed::rpc::RequestCallbackImpl::processRpcWithErrors(torch::distributed::rpc::RpcCommandBase&, torch::distributed::rpc::MessageType const&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0x65 (0x7fc8f6190365 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_python.so)
2021-09-14T00:02:21.9265553Z frame #8: <unknown function> + 0x400710a (0x7fc8edd7d10a in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)

See CircleCI build pytorch_ios_12_5_1_x86_64_metal_build (2/2)

Step: "Run Simulator Tests" (full log | diagnosis details | 🔁 rerun)

[!] Tests have failed�[0m
[23:58:03]: ▸ �[35mLoading...�[0m
[00:00:58]: �[31mExit status: 65�[0m
+--------------------+---+
|      Test Results      |
+--------------------+---+
| Number of tests    | 1 |
| Number of failures | �[31m1�[0m |
+--------------------+---+

�[31m
[!] Tests have failed�[0m

#######################################################################
# fastlane 2.193.1 is available. You are on 2.185.1.
# You should use the latest version.
# Please update using `gem install fastlane`.
#######################################################################

�[32m2.193.1 Improvements�[0m
* [gradle] add output context with "tasks" options (#19312) via David Kuo
* [match] ignore force_for_new_devices for developer_id provisioning type (#19314) via Vitalii Budnik

1 failure not recognized by patterns:

Job Step Action
GitHub Actions Lint / quick-checks Ensure correct trailing newlines 🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

xta0 added a commit that referenced this pull request Sep 10, 2021
Differential Revision: [D30877961](https://our.internmc.facebook.com/intern/diff/D30877961/)

ghstack-source-id: 137826522
Pull Request resolved: #64852
xta0 added a commit that referenced this pull request Sep 11, 2021
Pull Request resolved: #64852


ghstack-source-id: 137849299

Differential Revision: [D30877961](https://our.internmc.facebook.com/intern/diff/D30877961/)
xta0 added a commit that referenced this pull request Sep 11, 2021
Pull Request resolved: #64852


ghstack-source-id: 137856491

Differential Revision: [D30877961](https://our.internmc.facebook.com/intern/diff/D30877961/)
xta0 added a commit that referenced this pull request Sep 11, 2021
Pull Request resolved: #64852


ghstack-source-id: 137863752

Differential Revision: [D30877961](https://our.internmc.facebook.com/intern/diff/D30877961/)
xta0 added a commit that referenced this pull request Sep 11, 2021
Pull Request resolved: #64852


ghstack-source-id: 137865205

Differential Revision: [D30877961](https://our.internmc.facebook.com/intern/diff/D30877961/)
xta0 added a commit that referenced this pull request Sep 12, 2021
Pull Request resolved: #64852


ghstack-source-id: 137868541

Differential Revision: [D30877961](https://our.internmc.facebook.com/intern/diff/D30877961/)
xta0 added a commit that referenced this pull request Sep 12, 2021
Pull Request resolved: #64852


ghstack-source-id: 137877480

Differential Revision: [D30877961](https://our.internmc.facebook.com/intern/diff/D30877961/)
xta0 added a commit that referenced this pull request Sep 13, 2021
Pull Request resolved: #64852


ghstack-source-id: 137970230

Differential Revision: [D30877961](https://our.internmc.facebook.com/intern/diff/D30877961/)
@github-actions
Copy link

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
Stale pull requests will automatically be closed 30 days after being marked Stale

@github-actions github-actions bot added the Stale label Nov 13, 2021
@github-actions github-actions bot closed this Apr 13, 2022
@facebook-github-bot facebook-github-bot deleted the gh/xta0/129/head branch May 14, 2022 14:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants