Replies: 1 comment 1 reply
-
|
Yes, there were some changes that were made. First make sure that in the training forest you have set the importance to permute, Second, I believe we have made a change to the sub ratio value which is the size of the data set that is subsampled. This was made in conjunction with using the subsampled variance estimator as the default estimator as opposed to the jack-knife estimator. You should check to see if |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all.
I am currently using randomForestSRC version 3.2.3. I am re-running a survival forest, extracting permutation VIMPS and using the subsample function to obtain confidence intervals. I am using this code to extract the CI:
set.seed(12345)
oo_1 <- subsample(RFmodel)
vimpCI <- extract.subsample(oo_1)$var.jk.sel.Z
I had previously run the same forest on the same data set (and same seed) and extracted the VIMP confidence intervals in the same way (using the same seed). However, I was using an older version of R and older version of the package (and, unfortunately do not know the version---my computer had to be re-imaged and all software reloaded). The confidence intervals I am getting now are much wider and the list of "significant" variables is much shorter than those on my previous runs with an earlier version of the package.
I have read through the release notes and do not see why this would be. The performance of the forest is almost identical and the VIMPs extracted using RFmodel$importance are identical.
Was there a change to the subsample function that might explain this difference? And, if so, what is that change? And is there a way to replicate what I extracted previously so I understand the difference?
Thank you!
Mindy
Beta Was this translation helpful? Give feedback.
All reactions