-
Notifications
You must be signed in to change notification settings - Fork 35
drat 'go' command stalls at reduce prompt #17
Comments
Hmm. The go function should wait for all mappers to finish before firing off the reducer. But, the reducer still checks for running mappers. In this case, it seems the go didn't find a mapper, but the reduce function did. My guess is that one unluckily fired off in the short time between the two. Regardless, we should give the command to rerun later in the confirmation message. Will send a PR tomorrow. |
We could also only wait for 10 or so seconds for user input if the reduce function finds running mappers, then simply trying to run reduce again. That way, asleep go users don't get an unpleasant surprise in the morning and reduce users can just say no then try again later. |
I'm wondering why it stalls - I thought the point of go was to be automated - I think it dynamically checks for the mappers to be done and then fires the reducer, right? Where does the prompt come in? I thought it comes in if you don't use "go" and if you ran each cycle (crawl, index, map) before running reduce? |
Correct, go waits for mappers to finish, then calls the reduce function. The reduce function checks for running mappers, then prompts the user for confirmation if there are still mappers running. This double-check is useful for when a user doesn't want to use In this issue, Unless, does this happen every time, Lewis? |
every time Tyler. It seems tobe built in. On Thu, Jul 24, 2014 at 8:57 PM, Tyler Palsulich notifications@github.com
Lewis |
Lewis that's totally expected (some tasks taking 20 mins, versus seconds). It has to do with the type of code it's looking at - usually Javascript or other MIME types, etc., take WAY longer than e.g., Java, for instance to check. I think this is more a functionality of RAT than anything else, but it could be that there is some difficulty RAT has in finding licenses in particular programming languages. But note, this is a KNOWN issue, (that Tika and DRAT helped to uncover, check out the DRAT docs for more info via the XNET presentation). |
Great. On Thu, Jul 24, 2014 at 10:42 PM, Chris Mattmann notifications@github.com
Lewis |
you are awesome! Chris Mattmann -----Original Message-----
|
WOW I totally missed all of this conversation and also @tpalsulich patch. |
BOOM XNET PRESSIE |
BTW the go command is dynamite. Really easy to get up and running.
I've recently been running DRAT on Apache Usergrid (Incubating) [0] in an attempt to add missing license headers, detect binaries and also set up up for a release as we have yet to release within the Incubator.
I see that after we've crawler, indexed and fired off map task(s), we are immediately prompted to fire of reducers e.g.
This therefore essentially stalls the 'go' process. I would imagine for a user it would be be a catch22 situation as to whether we answer yes or no to this prompt.
I think the behavhiour should find another mechanism for firing off reduce tasks which do not require user input to complete (or destroy) the current DRAT job.
I need ot delve more closely into the mechanics of getting this working but will try my best over the next wee while.
[0] https://git-wip-us.apache.org/repos/asf?p=incubator-usergrid.git;a=tree;h=refs/heads/master;hb=master
The text was updated successfully, but these errors were encountered: