Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HeartBeat vs. mapreduce.task.timeout doesn't consider "0 == infinite" case #426

Closed
rusnyder opened this issue Apr 19, 2015 · 4 comments
Closed

Comments

@rusnyder
Copy link

For the Hadoop config value "mapreduce.task.timeout" (or "mapred.task.timeout" for Hadoop 1), a value of 0 is the equivalent of disabling that timeout. However, the ES-Hadoop HeartBeat compares "mapred.task.timeout" to its own delay and throws an error if the heartbeat is greater than the task timeout.

I'm seeing this with 2.1.0.Beta3

org.elasticsearch.hadoop.mr.HeartBeat

class HeartBeat {
    ...
    HeartBeat(final Progressable progressable, Configuration cfg, TimeValue delay, final Log log) {
        Assert.notNull(progressable, "a valid progressable is required to report status to Hadoop");
        TimeValue tv = HadoopCfgUtils.getTaskTimeout(cfg);
        Assert.isTrue(tv.getSeconds() > delay.getSeconds(), "Hadoop timeout is shorter than the heartbeat");
        ...
    }
    ...
}

The assert should probably be or'ing w/ a "tv.getSeconds == 0" or something like that.

costin added a commit that referenced this issue Apr 19, 2015
costin added a commit that referenced this issue Apr 19, 2015
@costin
Copy link
Member

costin commented Apr 19, 2015

Hi,

Thanks for the report. I've fixed this in master and on the 2.x branch and published the artifacts (available in maven). Can you please check it out?
Besides handling the infinite use case, I've also fixed an issue with the scheduling as the heartbeat was only activating one (per task) but now it should run continuously.

Cheers,

@rusnyder
Copy link
Author

Nice! I just ran into that scheduling issue last night and hadn't yet diagnosed it, but looking at your commits now realize that you've fixed both of my problems in one swoop!

Thanks for the quick turnaround!

@costin
Copy link
Member

costin commented Apr 19, 2015

Glad to help! Please let me know if the latest build solves your issue(s).

On 4/19/15 4:39 PM, rusnyder wrote:

Nice! I just ran into that scheduling issue last night and hadn't yet diagnosed it, but looking at your commits now
realize that you've fixed both of my problems in one swoop!

Thanks for the quick turnaround!


Reply to this email directly or view it on GitHub
#426 (comment).

Costin

@rusnyder
Copy link
Author

Solved - things are running smoothly now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants