-
Notifications
You must be signed in to change notification settings - Fork 13.9k
[FLINK-938] Automatically configure the jobmanager address when using start-cluster.sh #48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
I guess the rewrite of the yaml file works well with this pull request. But I think this approach is not always working. |
|
I would like to rewrite at least the yaml file on the master node. Because when you run stratosphere, it also reads config file to get the jobmanager address. And |
|
The newest commit using cli invocation to pass jobmanager's hostname to |
|
I agree. Rewriting the config for the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Was the & removed on purpose?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes: https://issues.apache.org/jira/browse/FLINK-938?focusedCommentId=14036049&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14036049
It suppresses remote error messages.
|
The rewriting of the config by the script seems fragile. Why is that even needed, when the taskmanagers get the jobmanager host as a parameter? Also: What happens if someone put an address there (like a specific address to bind to? Seems that it will be just overwritten. That we cannot do. |
|
The rewriting of the config is because |
|
Then I think we need a bit of a different approach. Can we remove the jobmanager rpc address from the config file, and have the script check whether there is an entry for the jobmanager address. If not, it uses the local hostname. I would actually prefer to have something like a |
Conflicts: stratosphere-dist/src/main/stratosphere-bin/bin/config.sh stratosphere-dist/src/main/stratosphere-bin/bin/jobmanager.sh stratosphere-dist/src/main/stratosphere-bin/bin/start-cluster.sh stratosphere-dist/src/main/stratosphere-bin/bin/taskmanager.sh stratosphere-runtime/src/main/java/eu/stratosphere/nephele/jobmanager/JobManager.java stratosphere-runtime/src/main/java/eu/stratosphere/nephele/taskmanager/TaskManager.java
|
I am skeptical whether it is a good idea to modify the config file at all. I bet that many people make the implicit assumption that this file is constant. And modifying it when the host is "localhost" is hidden constraint that makes it tricky to understand why the script sometimes configures things for you and sometimes not. I vote for an approach to add this as a command line parameter to the ssh-transmitted call to start the taskmanager. The command line parser should check for that argument similarly as it checks for the config directory. |
|
I agree. Modifying the config file not sounds like a good approach. But otherwise user has to type like |
|
Why don't we comment out the jm address in the config by default? Then no entry in the config would mean that the parameter appended to the cli command should be taken... |
|
Ok. I have this in mind, will do it in another PR since most of the commits here would be irrelevant. Then this one could be closed. |
|
I think only the author of the pull request (you) can close it. |
|
Yeah. I meant it. |
|
Thanks for the PR. I've looked into the other approach (#248) and I think you can safely close this on now. |
Add 1.6.1 and 1.5.4
This closes apache#48.
…cordingly This closes apache#48.
No description provided.