New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[doParallel/doAzureParallel] task 356 failed - "arguments imply differing number of rows: 0, 75" #869
Comments
I can't reproduce this without the |
Hi, The minimal dataset is Iris in R. X = iris[,1:3] as mentioned by Brian. |
Missed that! 😦 |
If I run it without parallelism, I get this:
The warnings are:
|
Hi Max, Could you please suggest a way to avoid this warning and the error ? As, in how to modify the subsampling rate while using caret or is it fixed ? Thanks |
Hi @topepo , Also, it is becoming really important for me to understand two issues that sprout up when I performed my runs. They are:
Could you please explain why these messages come up at the first place or when do they come up while running any algorithm for classification/regression ? |
I have had similar issues but the error messages are different depending on the tuning grid and whether it is in parallel or not. 2: predictions failed for Resample23: shrinkage=0.1, interaction.depth=2, n.minobsinnode=10, n.trees=150 Error in lvl[x] : invalid subscript type 'list' 3: predictions failed for Resample25: shrinkage=0.1, interaction.depth=3, n.minobsinnode=10, n.trees=150 Error in lvl[x] : invalid subscript type 'list' In parallel (using doParallel package) with the default tuning grid gives this error: Whereas in parallel with a custom tuning grid gives this error: Any insight you might have as to where this error is coming from or whether there is a way to avoid it would be much appreciated! |
In case it helps this is what the confusionMatrix results look like when I run the model with the gbm package directly: As you can see the classes are unbalanced but up sampling does not seem to solve the problem |
Hi, I'm having the same issue. It seems to me that reducing the amount of parallel cores reduces the likelihood that it crashes with that error. So maybe a memory issue? |
The issue is related to the warning message about
This is usually the result of a model making predictions that are constant across all of the samples. This results in
This is a general error when all of the models fail. @see24 I don't think that they are related. It's not good form to tack on "similar issues"; if you are using the same example and get the same results then please comment. Otherwise open a different issue with a small reproducible example. @ruizcrp I don't know which issue you are referring to. Please submit a new issue with a small reproducible example. |
We are getting 'arguments imply differing numbers of rows' when we are trying to do a multiclass gbm model. There's not much any information on why the GBM model is failing at iteration 60. Can any of you guys help us to debug and fix?
Adding @kchaitanyabandi for visibility
Based on one of the tasks logs,
Iter TrainDeviance ValidDeviance StepSize Improve
1 1.0986 -nan 0.2000 0.3625
2 0.8479 -nan 0.2000 0.1920
3 0.7156 -nan 0.2000 0.1080
4 0.6399 -nan 0.2000 0.0768
5 0.5824 -nan 0.2000 0.0516
6 0.5444 -nan 0.2000 0.0396
7 0.5144 -nan 0.2000 0.0341
8 0.4894 -nan 0.2000 0.0293
9 0.4680 -nan 0.2000 0.0228
10 0.4495 -nan 0.2000 0.0156
20 0.3630 -nan 0.2000 0.0025
40 0.3120 -nan 0.2000 0.0015
60 -nan -nan 0.2000 -nan
80 -nan -nan 0.2000 -nan
100 -nan -nan 0.2000 -nan
120 -nan -nan 0.2000 -nan
140 -nan -nan 0.2000 -nan
160 -nan -nan 0.2000 -nan
180 -nan -nan 0.2000 -nan
200 -nan -nan 0.2000 -nan
It appears that the GBM model is failing and when the results are getting collated. We received this error.
<simpleError in data.frame(pred = x, obs = y, stringsAsFactors = FALSE): arguments imply differing number of rows: 0, 2197>
Note: We are trying to run this with doAzureParallel, but it can also be reproduced on doParallel.
Adding @kchaitanyabandi for more information
update.packages(oldPkgs="caret", ask=FALSE)
sessionInfo()
Minimal, reproducible example:
Minimal dataset:
The error can be reproduced by the iris data set. However, @kchaitanyabandi might be able to share with you a sample data set.
Minimal, runnable code:
Session Info:
Thanks!
Brian
The text was updated successfully, but these errors were encountered: