-
Notifications
You must be signed in to change notification settings - Fork 173
Getting less responses than expected? #42
Comments
@agustintorres , do you mean the count does not match? Normally this happens with duplicated requests; as we deduplicate and use a hashmap to store the different requests. Could you try to execute just 100 of the clientIds and see if you miss any? And try to put into a hashset first for the clientIds and see if there is any duplicate? |
@jeffpeiyt Yes, that's what I mean. I have also confirmed that there are no duplicates. They're 19749 unique requests. If I make smaller requests(100 or even 8000), I do not miss any. It is only the 19,749 set for which I am missing results. |
Interesting. Could you please try
Some internal team has similar to single target usage and scale and they did not report this. I have not encountered this before either. |
@jeffpeiyt Here are some findings: The job steadily progresses and at some point it jumps from about 54% progress to 100%. It seems that it gets terminated mid-way. These are some things I see in the logs:
I am putting everything in the responseContext on the onCompleted method and the size of the map at the end is: 10718. Further, task.getParallelTaskResult().keySet().size() size is 19749. Keep in mind that the job runs for about 10 minutes before it gets cancelled. Why would it terminate? Maybe there is some sort of timeout? |
Sorry. My bad, we got internal users report the same issue. Very easy to fix. It is a global timeout that kills the whole job. |
#38 has been tracked this issue. Please set it to a larger value. Default is 600 seconds .... defaults: /**
* The command manager internal timeout and cancel itself time in seconds
* Note this may need to be adjusted for long polling jobs.
*/
public static long timeoutInManagerSec = 600;
/** The timeout the director send to the manager to cancel it from outside. */
public static long timeoutAskManagerSec = timeoutInManagerSec + 10;
|
@jeffpeiyt Thanks! That fixed my problem and it works great now. I'm actually planning to do something similar for around 300,000 clientIds, even if it takes 5+ hours. Do you foresee any problems with it running for this long? |
@agustintorres Great! I am updating the documents to be more clear on this. I do not see any problems. We do run jobs that are on 100,000+ hosts and it runs fine. Please let us know with any issues you encounter. |
Updated doc at : http://www.parallec.io/docs/configurations/#long-running-jobs |
I'm still seeing this.... 2018-06-23 00:46:03.401 INFO 20095 --- [ParallecActorSystem-akka.actor.default-dispatcher-10] io.parallec.core.actor.ExecutionManager : I have the following config Set as per the Doc
This is my task call
|
Will check on it later when i can access my laptop. . Saw you have http
request but set tcp timeout?
…On Fri, Jun 22, 2018, 9:48 PM harjitdotsingh ***@***.***> wrote:
I'm still seeing this....
2018-06-23 00:46:03.401 INFO 20095 ---
[ParallecActorSystem-akka.actor.default-dispatcher-10]
io.parallec.core.actor.ExecutionManager :
[4]__RESP_RECV_IN_MGR 4 (+0) / 4 (100.00%) AFT 14.133 S @ API_2 @
2018.06.23.00.46.03.401-0400 , TaskID : 0f39b86b-82a , CODE: NA,
RESP_BRIEF: EMPTY , ERR: java.util.concurrent.TimeoutException: No response
received after 14000
I have the following config Set as per the Doc
private ParallelTaskConfig genParallelTaskConfig() {
ParallelTaskConfig config = new ParallelTaskConfig();
config.setActorMaxOperationTimeoutSec(120);
config.setTimeoutInManagerSec(120);
config.setTimeoutAskManagerSec(710);
return config;
}
This is my task call
.setHttpHeaders(new ParallecHeader().addPair("x-user", env.getProperty("ifi.user")).addPair("x-password", env.getProperty("ifi.password"))).setProtocol(RequestProtocol.HTTPS)
.setHttpPort(443)
.setConfig(genParallelTaskConfig())
.setTcpConnectTimeoutMillis(100*120)
.async()
.setReplaceVarMapToSingleTargetSingleVar("QUERY", queryList, "cdws21.ificlaims.com")
.setResponseContext(returnMap)
.execute((res, responseContext) ->
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub
<#42 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AB9DfxBgXNGQITiK0UuUaR2GZbgx0gy2ks5t_ciGgaJpZM4KFYQT>
.
|
I am executing the following:
The size of the "clientIds" collection is 19749. However, after the task is done, I get the following results:
How is this possible?
The text was updated successfully, but these errors were encountered: