Replies: 2 comments 1 reply
-
|
The option You should instead use the canonical call: |
Beta Was this translation helpful? Give feedback.
1 reply
-
|
From your output, the training performance as measured by the g-mean is
The performance using the trained forest on the test data has g-mean performance
Looks pretty close to me |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Dear Professor Ishwaran,
I have been working with an unbalanced classification model and have encountered an unexpected problem. Specifically, with different splitting schemes, I consistently observe that the test performance is significantly better than the training performance on all metrics. When i run a ranfom forest in the test data, the performance is very similar to the model in the train data, but the "predict" function returnes much better test performance results. Could you please share your opinion on this result and my code below? that would be much appreciated.
rf <- imbalanced(gender ~ ., data = traindata, ntree = 3000)
test_rf <- predict(rf, newdata = test_data, outcome = "test")
Beta Was this translation helpful? Give feedback.
All reactions