Set RAY_FORCE_DIRECT=1 for run_rllib_tests, test_basic #6171
Can one of the admins verify this patch?
Test FAILed.
Test FAILed.
Test FAILed.
jenkins retest this please
Test FAILed.
jenkins retest this please
Test FAILed.
Test FAILed.
Test FAILed.
Test PASSed.
Test FAILed.
Test FAILed.
Test FAILed.
    name = "test_actor_direct",
    size = "medium",
    srcs = ["test_actor_direct.py", "test_actor.py"],
    tags = ["exclusive", "manual"],
test_actor_load_balancing is flaky, so setting it to MANUAL for now
python/ray/tests/BUILD
Outdated
    srcs = ["test_basic.py"],
    tags = ["exclusive"],
    deps = ["//:ray_lib"],
)

py_test(
    name = "test_advanced",
    size = "large",
    srcs = ["test_advanced.py"],
Should this have been removed?
Oops, fixed.
@@ -42,7 +42,7 @@ void DoInlineObjectValue(const ObjectID &obj_id, std::shared_ptr<RayObject> valu
  RAY_CHECK(found) << "obj id " << obj_id << " not found";
}

void LocalDependencyResolver::ResolveDependencies(const TaskSpecification &task,
Why no more const?
Since this mutates the task, it is weird to call it const.
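To illustrate the point about constness, here is a minimal sketch, not Ray's code: `SketchTask` and `InlineResolvedArgs` are hypothetical stand-ins for the task specification and the inlining step. A helper that rewrites by-reference arguments in place has to take the task by non-const reference, which is why a `const` parameter would be misleading.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical simplified task; NOT Ray's TaskSpecification.
struct SketchTask {
  struct Arg {
    bool is_reference;    // true: payload is an object id; false: payload is a value
    std::string payload;  // the object id or the inlined value
  };
  std::vector<Arg> args;
};

// Takes a non-const reference because matching arguments are rewritten in place:
// every by-reference argument equal to `id` becomes an inlined `value`.
void InlineResolvedArgs(SketchTask &task, const std::string &id,
                        const std::string &value) {
  assert(!id.empty());
  for (auto &arg : task.args) {
    if (arg.is_reference && arg.payload == id) {
      arg.is_reference = false;
      arg.payload = value;
    }
  }
}
```

Declaring the parameter `const SketchTask &` would not compile here, since the loop assigns to the task's arguments.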
return raylet_client_->SubmitTask(task_spec);
// TODO(ekl) if we moved actor creation to use direct call tasks, then we won't
// need to manually resolve direct call args here.
resolver_->ResolveDependencies(task_spec, [this, task_spec]() {
Just wondering, was there a test that was failing with the old version of this?
Yeah, it causes rllib ES to hang, since we pass an object id to an actor constructor there.
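For readers following the thread, the resolve-then-submit pattern in the snippet above can be sketched roughly as follows. This is an assumption-laden illustration: `SketchResolver`, `Resolve`, and `MarkAvailable` are hypothetical names, not Ray's `LocalDependencyResolver` API. The callback (standing in for the submit call) runs only once every dependency is available, so a task that takes an object id as an argument is not submitted with that argument unresolved.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Hypothetical resolver; NOT Ray's LocalDependencyResolver.
class SketchResolver {
 public:
  // Runs on_ready immediately if there are no dependencies, otherwise defers it
  // until MarkAvailable has been called for every id in deps.
  void Resolve(std::set<std::string> deps, std::function<void()> on_ready) {
    assert(on_ready != nullptr);
    if (deps.empty()) {
      on_ready();
      return;
    }
    pending_.push_back(Pending{std::move(deps), std::move(on_ready)});
  }

  // Records that object `id` is available and fires callbacks that became ready.
  void MarkAvailable(const std::string &id) {
    std::vector<std::function<void()>> ready;
    for (auto it = pending_.begin(); it != pending_.end();) {
      it->deps.erase(id);
      if (it->deps.empty()) {
        ready.push_back(std::move(it->on_ready));
        it = pending_.erase(it);
      } else {
        ++it;
      }
    }
    // Invoke after the loop so a callback may safely call back into the resolver.
    for (auto &callback : ready) callback();
  }

  std::size_t NumPending() const { return pending_.size(); }

 private:
  struct Pending {
    std::set<std::string> deps;
    std::function<void()> on_ready;
  };
  std::vector<Pending> pending_;
};
```

In this sketch the submit call would sit inside the `on_ready` lambda, mirroring how the diff wraps the submit path in the callback passed to `ResolveDependencies`.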
Test FAILed.
Test FAILed.
Test FAILed.
Interestingly, the RLlib test is failing with corrupted sample data returned.
@@ -873,7 +874,7 @@ Status CoreWorker::BuildArgsForExecutor(const TaskSpecification &task,
 metadata = std::make_shared<LocalMemoryBuffer>(
 const_cast<uint8_t *>(task.ArgMetadata(i)), task.ArgMetadataSize(i));
 }
-args->at(i) = std::make_shared<RayObject>(data, metadata);
+args->at(i) = std::make_shared<RayObject>(data, metadata, /*copy_data*/ true);
FYI @edoakes @stephanie-wang this fixes #6264
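One plausible reading of why `copy_data` matters here (an assumption, the thread does not spell out the mechanism): a buffer that only borrows a pointer into the task's memory sees whatever that memory holds later, while a copying buffer keeps its own bytes. The sketch below uses a hypothetical `SketchBuffer`, not Ray's `LocalMemoryBuffer` or `RayObject`, to show the difference.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical buffer; NOT Ray's LocalMemoryBuffer.
class SketchBuffer {
 public:
  // copy_data == false: borrow `data`; the caller must keep it alive and unchanged.
  // copy_data == true: copy the bytes so the buffer is independent of the source.
  SketchBuffer(const uint8_t *data, std::size_t size, bool copy_data)
      : size_(size), owns_(copy_data) {
    assert(data != nullptr || size == 0);
    if (copy_data) {
      owned_.assign(data, data + size);
      data_ = owned_.data();
    } else {
      data_ = data;
    }
  }
  // Copying would leave data_ pointing into another object's storage.
  SketchBuffer(const SketchBuffer &) = delete;
  SketchBuffer &operator=(const SketchBuffer &) = delete;

  const uint8_t *Data() const { return data_; }
  std::size_t Size() const { return size_; }
  bool OwnsData() const { return owns_; }

 private:
  std::vector<uint8_t> owned_;
  const uint8_t *data_ = nullptr;
  std::size_t size_ = 0;
  bool owns_ = false;
};

// Returns true if the copying buffer keeps the original bytes while the
// borrowing buffer observes the overwritten source.
bool CopySurvivesSourceReuse() {
  std::vector<uint8_t> source = {1, 2, 3, 4};
  SketchBuffer copied(source.data(), source.size(), /*copy_data=*/true);
  SketchBuffer borrowed(source.data(), source.size(), /*copy_data=*/false);
  std::fill(source.begin(), source.end(), 0);  // simulate reuse of the source memory
  return copied.Data()[0] == 1 && borrowed.Data()[0] == 0;
}
```

The source vector stays alive throughout, so the borrowed read is well defined; in the worse case where the source is freed, a borrowing buffer would dangle.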
Test PASSed.
Test PASSed.
Test PASSed.
Hm, //python/ray/tests:test_multi_node seems to fail consistently under Bazel, but passes locally.
The test_multi_node errors seem to be due to breakage on master.
Why are these changes needed?
Related issue number
Closes #6264
Checks
scripts/format.sh to lint the changes in this PR.